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£:w/erococaiJ fegcofo Polynucleotides and Polypeptides 



FIELD OF THE INVENTION 

5 The present invention relates to the field of molecular biology. In particular, it 

relates to, among other things, nucleotide sequences of Enterococcus faecalis, contigs, 
ORFs, fragments, probes, primers and related polynucleotides thereof, peptides and 
polypeptides encoded by the sequences, and uses of the polynucleotides and sequences 
thereof, such as in fermentation, polypeptide production, assays and pharmaceutical 

10 development, among others. 

BACKGROUND OF THE INVENTION 

Enterococci have been recognized as being pathogenic for humans since the turn 
of the century when they were first described by Thiercelin in 1988 as microscopic 

15 organisms. The genus Enterococcus includes the species Enterococcus faecalis or ^. 
faecalis which is the most common pathogen in the group, accounting for 80 - 90 
percent of all enterococcal infections. See Lewis et al. (1990) Eur J. Clin Microbiol 
Infect Dis. 9:111-117. 

The incidence of enterococcal infections has increased in recent years and 

20 enterococci are now the second most frequently reported nosocomial pathogens. 

Enterococcal infection is of particular concern because of its resistance to antibiotics. 
Recent attention has focused on enterococci not only because of their increasing role in 
nosocomial infections, but also because of their remarkable and increasing resistance to 
antimicrobial agents. These factors are mutually reinforcing since resistance allows 

25 enterococci to survive in an environment in which antimicrobial agents are heavily used; 
the hospital setting provides the antibiotics which eliminate or suppress susceptible 
bacteria, thereby providing a selective advantage for resistant organisms, and the hospital 
also provides the potential for dissemination of resistant enterococci via the usual routes 
of hand and environmental contamination. 

30 Antimicrobial resistance can be divided into two general types, inherent or 

intrinsic property and that which is acquired. The genes for intrinsic resistance, like 
other species characteristics, appear to reside on the chromosome. Acquired resistance 
results from either a mutation in the existing DNA or acquisition of new DNA. Tlie 
various inheroit traits expressed by enterococci include resistance to semisynthetic 

35 penicillinase-resistant praiidllins, cephalosporins, low levels of aminoglycosides, and low 
levels of clindamycin. Examples of acquired resistance include resistance to 
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chloramphenicol, erythromycin, high levels of clindamycin, tetracycline, high levels of 
aminoglycosides, penicillin by means of penicillinase, fluoroquinolones, and vancomycin. 
Resistance to high levels of penicillin without penicillinase and resistance to 
fluoroquinolones are not known to be plasmid or tran^oson mediated and presumably are 

5 due to mutation(s). 

Although the main reservoir for enterococci in humans is the gastrointestinal 
tract, the bacteria can also reside in the gallbladder, urethra and vagfaaa. 

E. faecalis has emerged as an important pathogen in endocarditis, bacteremia, 
urinary tract infections (UTIs), intraabdominal infections, soft tissue infections, and 

10 neonatal sq)sis (Lewis 1990, supra), in the 1970s and 1980s enterococci became firmly 
established as major nosocomial pathogens. They are now the fourth leading cause of 
hospital-acquired infection and the third leading cause of bacteremia in the United States. 
Fatality ratios for enterococcal bactermia range from 12% to 68%, with death due to 
enterococcal sepsis in 4 to 50% of these cases. See Emori, T.G. (1993) Clin. Microbiol. 

15 Rev. 6:428-442. 

The ability of enterococci to colonize the gastrointestinal tract, plus the many 
intrinsic and acquired resistance traits, means that these organisms, which usually seem to 
have relatively low intrinsic virulence, are given an excellent opportunity to become 
secondary invaders. Since nosocomial isolates of enterococci have displayed resistance to 

20 essentially every useiul antimicrobial agent, it will likely become increasingly difficult to 
successfully treat and control enterococcal infections. Particularly when the various 
resistance genes come together in a single strain, an event almost certain to occur at 
some time in the future. 

The etiology of diseases mediated or exacerbated by Enterococcus faecalis, 

25 involves the progranuned expression of E. faecalis genes, and that characterizing these 
genes and their patterns of expression would dramatically add to our understanding of the 
organism and its host interactions. Knowledge of the E, faecalis gene and genomic 
organization would improve our understanding of disease etiology and lead to improved 
and new ways of preventing, treating and diagnosing diseases. Thus, there is a need to 

30 characterize the genome of E. faecalis and for polynucleotides of this organism. 

SUMMARY OF THE INVENTION 

The present invention is based on the sequencing of fragments of the 
Enterococcus faecalis genome. The primary nucleotide sequences which were generated 
35 are provided in SEQ ID NOS: 1-982. 

The present invention provides the nucleotide sequence of hundreds of contigs of 
the Enterococcus faecalis genome, which are listed in tables below and set out in the 
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Sequence Listing submitted herewith^ and representative fragments thereof, in a form 
which can be readily used, analyzed, and interpreted by a skilled artisan. In one 
embodiment, the present invention is provided as contiguous strings of primary sequence 
information corresponding to the nucleotide sequences depicted in SEQ ID NOS: 1-982. 
5 The present invention further provides nucleotide sequences which are at least 

95%, 96%, 97%, 98%, and 99%, identical to the nucleotide sequences of SEQ ID N0S:1- 
982. 

The nucleotide sequence of SEQ ID NOS: 1-982, a representative fragment 
thereof, or a nucleotide sequence which is at least 95% identical to the nucleotide 

10 sequence of SEQ ID NOS: 1-982 may be provided in a variety of mediums to facilitate its 
use. In one application of this embodiment, the sequences of the present invention are 
recorded on computer readable media. Such media includes, but is not limited to: magnetic 
storage media, such as floppy discs, hard disc storage medium, and magnetic tape; optical 
storage media such as CD-ROM; electrical storage media such as RAM and ROM; and 

15 hybrids of these categories such as magnetic/optical storage media. 

The present invention further provides systems, particularly conq>uter-based . 
systems which contain the sequence information herein described stored in a data storage 
means. Such systems are designed to identify conmieicially important fragments of the 
Enterococcus faecalis genome. 

20 Another embodiment of the present invention is directed to fragments of the 

Enterococcus faecalis genome having particular structural or functional attributes. Such 
fragments of the Enterococcus faecalis genome of the present invention include, but are 
not limited to, fragments which encode pq)tides, hereinafter referred to as open reading 
frames or ORFs, fragments which modulate the expression of an operably linked ORF, 

25 hereinafter referred to as expression modulating fragments or EMFs, and fragments 
which can be used to diagnose the presence oi Enterococcus faecalis in a sample, 
hereinafter referred to as diagnostic fragments or DFs. 

Each of the ORFs in fragments of the Enterococcus faecalis genome disclosed in 
Tables 1-3, and the EMFs found 5' prime of the initiation codon, can be used in numerous 

30 ways as polynucleotide reagents. For instance, the sequences can be used as diagnostic 
probes or amplification primers for detecting or determining the presence of a specific 
microbe in a sample, to selectively control gene expression in a host and in the 
production of polypeptides, such as polypeptides encoded by ORFs of the present 
invention, particular those polypeptides that have a pharmacological activity. 

35 The present invention further includes recombinant constructs conqnising one or 

more fragments of the Enterococcus faecalis genome of the present invention. The 
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recombinant constructs of the present invention comprise vectors, such as a plasmid or 
viral vector, into which a fragment of the Enterococcus faecalis has been inserted. 

The present invention further provides host cells containing any of the isolated 
fragments of the Enterococcus faecalis genome of the present invention. The host cells 

5 can be a higher eukaryotic host cell, such as a mammalian cell, a lower eukaiyotic cell, 
such as a yeast cell, or a procaryotic cell such as a bacterial cell. 

The present invention is further directed to isolated polypeptides and proteins 
encoded by ORFs of the present invention. A variety of methods, well known to those of 
skill in the art, routinely may be utilized to obtain any of the polypeptides and proteins 

10 of the present invention. For instance, polypeptides and proteins of the present 

invention having relatively short, simple amino acid sequences readily can be synthesized 
using commercially available automated peptide synthesizers. Polypeptides and proteins 
of the present invention also may be purified from bacterial cells which naturally produce 
the protein. Yet another alternative is to pmiy polypq)tide and proteins of the present 

IS invention from cells which have been altered to express them. 

The invention further provides methods of obtaining homologs of the fragments 
of the Enterococcus faecalis genome of the present invention and homologs of the 
proteins encoded by the ORFs of the present invention. Specifically, by using the 
nucleotide and amino acid sequences disclosed herein as a probe or as primers, and 

20 techniques such as PCR cloning and colony/plaque hybridization, one skilled in the art can 
obtain homologs. 

The invention further provides antibodies which selectively bind polypeptides and 
proteins of the present invention. Such antibodies include both monoclonal and 
polyclonal antibodies. 

25 The invention further provides hybridomas which produce the above-described 

antibodies. A hybridoma is an immortalized cell line which is capable of secreting a 
specific monoclonal antibody. 

The present invention further provides methods of identifying test samples 
derived from cells which express one of the ORFs of the present invention, or a homolog 
30 thereof Such methods comprise incubating a test sample with one or more of the 

antibodies of the present invention, or one or more of the DFs of the present invention, 
under conditions which allow a skilled artisan to determine if the sample contains the 
ORF or product produced therefrom. 

In another embodiment of the present invention, kits are provided which contain 
35 the necessary reagents to carry out the above-described assays. 

Specifically, the invention provides a compartmentalized kit to receive, in close 
confinement, one » or more containers which comprises: (a) a first container comprising 
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one of the antibodies, or one of the DFs of the presmt invention; and (b) one or more 
other containers comprising one or more of the following: wash reagents, reagents 
capable of detectmg presence of bound antibodies or hybridized DFs. 

Using the isolated proteins of the present invention, the present invention 

5 further provides methods of obtaining and identiiying agents cs^able of binding to a 
polypeptide or protein encoded by one of the ORFs of the present invention. 
Specifically, such agrats include, as further described below, antibodies, peptides, 
carbohydrates, pharmaceutical agents and the like. Such methods comprise steps of: 
(a)contacting an agent with an isolated protein encoded by one of the ORFs of the 

10 present invention; and (b)dctermining whether the agent binds to said protein. 

The present genomic sequences of Enterococcus faecalis will be of great value to 
all laboratories working with this organism and for a variety of commercial purposes. 
Many fragments of the Enterococcus faecalis genome will be immediately identified by 
similarity searches against GenBank or protein databases and will be of immediate value to 

IS Enterococcus faecalis researchers and for immediate commercial value for the production 
of proteins or to control gene expression. 

The methodology and technology for elucidating extensive genomic sequences of 
bacterial and other genomes has and will greatly enhance the ability to analyze and 
understand chromosomal organization. In particular, sequenced contigs and genomes will 

20 provide the models for developing tools for the analysis of chromosome structure and 
function, including the ability to identify genes within large segments of genomic DNA, 
the structure, position, and spacing of regulatory elements, the identification of genes 
with potential industrial applications, and the ability to do comparative genomic and 
molecular phylogeny. 

25 

DESCRIPTION OF THE FIGURES 

FIGURE- 1 is a block diagram of a computer system (102) that can be used to 
implement computer-based systems of the present invention. 

30 FIGURE 2 is a schematic diagram dq)icting the data flow and computer programs 

used to collect, assemble, edit and aimotate the contigs of the Enterococcus faecalis 
genome of the present invention. Both Macintosh and Unix platforms are used to handle 
the AB 373 and 377 sequence data files, largely as described in Kerlavage et aL, 
Proceedings of the Twenty-Sixth Annual Hawaii International Conference on System 

35 Sciences, 585, IEEE Computer Society Press, Washington D.C. (1993). Factura (AB) is a 
Macintosh program designed for automatic vector sequence removal and end-trinmiing of 
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sequence files. The program Sequis runs on a Macintosh platform and parses the feature 
data extracted from the sequence files by Factora to the Unix based Enterococcus faecalis 
relational database. Assembly of contigs (and whole genome sequences) is acconqplished 
by retrieving a specific set of sequence files and their associated features using Extrseq, a 

5 Unix utility for retrieving sequences from an SQL database. The resulting sequence file is 
processed by seq_filter to trim portions of the sequences with more than 1% ambiguous 
nucleotides. The sequence files were assembled using TIGR Assembler, an assembly engine 
designed at The Institute for Genomic Research ( TIGR ) for rapid and accurate assembly 
of thousands of sequence fragments. The collection of contigs generated by the assembly 

10 step is loaded into the database with the lassie program. Identification of open reading 
frames (ORFs) is accomplished by processing contigs with GeneMark, described in 
Borodovsky, M, and Mclninch, J.D. (1993) CompuL Chem,, 17:123133. The ORFs are 
searched against E. faecalis sequences from GenBank and against all protein sequences 
using the BLASTN and BLASTP programs, described in Altschul et al, J. MoL BioL 225: 

15 403^10 (1990)). Results of the ORF determination and similarity searching steps were 
loaded into the database. As described below, some results of the determination and fte 
searches are set out in Tables 1-3. 

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

The present mvention is based on the sequencing of fragments of the 
Enterococcus faecalis genome and analysis of the sequences. The primary nucleotide 
sequences generated by sequencing the firagments are provided in SEQ ID NOS: 1-982. 
(As used herein, the "primary sequence" refers to the nucleotide sequence represented by 
the lUPAC nomenclature system.) 

in addition to the aforementioned Enterococcus faecalis polynucleotide and 
polynucleotide sequences, the present invention provides the nucleotide sequences of SEQ 
ID NOS: 1-982 , or representative fragments thereof, in a form which can be readily used, 
analyzed, and interpreted by a skilled artisan. 

As used herein, a "rq)resentative fragment of the nucleotide sequence depicted in 
SEQ ID NOS: 1-982" refers to any portion of die SEQ ID NOS: 1-982 which is not 
presently represented within a publicly available database. Preferred representative 
fragments of the present invention are Enterococcus faecalis open reading frames (ORFs 
), expression modulating fragment ( EMFs ) and fragments which can be used to diagnose 
the presence of Enterococcus faecalis in a sample (DFs ). A non-limiting identification 
of preferred representative fragments is provided in Tables 1-3. As discussed in detail 
below, the information provided in SEQ ID NOS: 1-982 and m Tables 1-3 together with 
routine cloning, synthesis, sequencing and assay methods will enable those skilled in the 



25 



30 
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art to clone and sequence all "representative fragments** of interest, including open 
reading frames encoding a large variety of Enterococcus faecalis proteins. 

The present invention is further directed to nucleic acid molecules encoding 
portions or fragments of the nucleotide sequences described herein. Fragmrats include 

5 portions of the nucleotide sequences of Table 1-3 and SEQ ID NOS: 1-982, at least 10 
contiguous nucleotides in length selected from any two integers, one of which 
representing a 5* nucleotide position and a second of which representing a 3' nucleotide 
position, where the first nucleotide for each nucleotide sequence in SEQ ID NOS: 1-982 is 
position 1 . That is, every combination of a 5* and 3* nucleotide position that a fragment 

10 at least 10 contiguous nucleotides in length could occupy is included in the invention. At 
least means a fragment may be 10 contiguous nucleotide bases in length or any integer 
between 10 and the length of an entire nucleotide sequence of SEQ ID NOS:l-982 minus 
1. Therefoie, included in the invention are contiguous fragments specified by any 5' and 
3' nucleotide base positions of a nucleotide sequences of SEQ ID NOS: 1-982 wherein the 

15 contiguous fragment is any integer between 10 and the length of an entire nucleotide 
sequence minus 1. 

Further, the invention includes polynucleotides comprising fragments specified by 
size, in nucleotides, rather than by nucleotide positions. The invention includes any 
fragment size, in contiguous nucleotides, selected from integers between 10 and the length 

20 of an entire nucleotide sequence minus 1 . Preferred sizes of contiguous nucleotide 

fragments include 20 nucleotides, 30 nucleotides, 40 nucleotides, 50 nucleotides. Other 
preferred sizes of contiguous nucleotide iragmraits, which may be useful as diagnostic 
probes and primers, include fragments 50-300 nucleotides in length which include, as 
discussed above, fragment sizes representing each integer between 50-300. Larger 

25 fragments are also useful according to the present invention corresponding to most, if not 
all, of the nucleotide sequences shown in SEQ ID NOS: 1-982. The preferred sizes are, of 
course, meant to exemplify not limit the present invention as all size fragments, 
rq>iesenting any integer between 10 and the length of an entire nucleotide sequence 
minus 1, of each SEQ ID NO:, are included in the invention. 

30 The present invention also provides for the exclusion of any fragment, specified 

by 5' and 3* base positions or by size in nucleotide bases as described above for any 
nucleotide sequence of SEQ ID NOS: 1-982. Any number of fragments of nucleotide 
sequences in SEQ ID NOS: 1-982, specified by 5* and 3' base positions or by size in 
nucleotides, as described above, may be excluded from the present invention. 

35 While the presently disclosed sequences of SEQ ID NOS: 1-982 are highly 

accurate, sequencing techniques are not perfect and, in relatively rare instances, fiuther 
investigation of a fragment or sequence of the invention may reveal a nucleotide 
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sequence error present in a nucleotide sequence disclosed in SEQ ID NOS: 1-982. 
However, once the present invention is made available (i.e., once the information in SEQ 
ID NOS: 1-982 and Tables 1-3 has been made available), resolving a rare sequencing error 
in SEQ ID NOS: 1 -982 will be well within the skill of the art. The present disclosure 

5 makes available sufficient sequence information to allow any of the described contigs or 
portions thereof to be obtained readily by straightforward application of routine 
techniques. Further sequencing of such polynucleotides may proceed in like manner using 
manual and automated sequencing methods which are employed ubiquitous in the art. 
Nucleotide sequence editing software is publicly available. For example. Applied 

10 Biosystem's (AB) AutoAssembler can be used as an aid during visual inspection of 

nucleotide sequences. By employing such routine techniques potential errors readily may 
be identified and the correct sequence then may be ascertained by targeting further 
sequencing effort, also of a routine nature, to the region containing the potential error. 
Even if all of the very rare sequencing errors in SEQ ID NOS: 1-982 were 

15 corrected, the resulting nucleotide sequences would still be at least 95% identical, nearly 
all would be at least 99% identical, and the great majority would be at least 99.9% 
identical to the nucleotide sequences of SEQ ID NOS: 1-982. 

As discussed elsewhere herein, polynucleotides of the present invention readily 
may be obtained by routine application of well known and standard procedures for cloning 

20 and sequencing DNA. A wide variety of Enterococcus faecalis strains that can be used to 
prepare E, faecalis genomic DNA for cloning and for obtaining polynucleotides of the 
present invention are available to the public from recognized depository institutions, such 
as the American Type Culture Collection (ATCC). While the present invention is 
enabled by the sequences and other information herein disclosed, die £. faecalis strain 

25 that provided the DNA of the present Sequence Listing, Strain V586, kindly provided by 
Dr. Michael Gilmore, University of Oklahoma, has been deposited in the ATCC, as a 
convenience to those of skill in the art The E, faecalis strain V586 was deposited 2 May 
1997 at the ATCC, 10801 University Blvd. Manassas, VA 201 10-2209, and given 
accession number 55969. The provision of the deposits is not a waiver of any rights of 

30 the inventors or their assignees in the present subject matter. 

The nucleotide sequences of the genomes from different strains of Enterococcus 
faecalis differ somewhat However, the nucleotide sequences of the genomes of all 
Enterococcus faecalis strains will be at least 95% identical, in corresponding part, to the 
nucleotide sequences provided in SEQ ID NOS: 1-982. Nearly all will be at least 99% 

35 identical and the great majority will be 99.9% identical. 

The present implication is further directed to nucleic acid molecules at least 90%, 
95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequence shown in SEQ ID NOS: 
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1-982. The above nucleic acid sequences are included irrespective of whether they encode 
a polypeptide having E, faecalis activity. This is because even where a particular nucleic 
acid molecule does not encode a polypeptide having E. faecalis activity, one of skill in 
the art would still know how to use the nucleic acid molecule, for instance, as a 

5 hybridization probe. Uses of the nucleic acid molecules of the present invention that do 
not encode a polypeptide having £. faecalis activity include, inter alia^ isolating an £. 
faecalis gene or allelic variants thereof from a DNA library, and detecting £. faecalis 
mRNA expression samples, environmental samples, suspected of containing E. faecalis by 
Northern Blot analysis. 

10 Preferred, are nucleic acid molecules having sequences at least 90%, 95%, 96%, 

97%, 98% or 99% identical to the nucleic acid sequence shown in SEQ ID NOS: 1-982, 
which do, in fact, encode a polypq>tide having E, faecalis protein activity By "a 
polypeptide having E, faecalis activity** is intended polypeptides exhibiting activity 
similar, but not necessarily identical, to an activity of the E. faecalis protein of the 

15 invention, as measured in a particular biological assay suitable for measuring activity of 
the specified protein. 

Due to the degeneracy of the genetic code, one of ordinary skill in the art will 
immediately recognize that a large number of the nucleic acid molecules having a 
sequence at least 90%, 95%, 96%, 97%, 98%, or 99% identical to the nucleic acid 

20 sequences shown in SEQ ID NOS: 1-982 will encode a polypeptide having E. faecalis 
protein activity. In fact, since degenerate variants of these nucleotide sequences all 
encode the same polypeptide, this will be clear to the skilled artisan even without 
performing the above described comparison assay. It will be iurther recognized in the art 
that, for such nucleic acid molecules that are not degenerate variants, a reasonable number 

25 will also encode a polypeptide having E. faecalis protein activity. This is because the 
skilled artisan is fiiUy aware of amino acid substitutions that are either less likely or not 
likely to significantiy effect protein function (e.g., replacing one aliphatic amino acid 
with a second aliphatic amino acid), as further described below. 

The biological activity or function of the polypeptides of the present invention 

30 are expected to be similar or identical to polypeptides from other bacteria that share a 
high degree of structural identity/similarity. Tables 1 and 2 lists accession numbers and 
descriptions for the closest matching sequences of polypeptides available through 
Genbank. It is therefore expected that the biological activity or function of the 
polypeptides of the present invention will be similar or identical to those polypeptides 

35 from other bacterial genuses, species, or strains listed in Tables 1 and 2. 

By a polynucleotide having a nucleotide sequence at least, for example, 95% 
''identical** to a reference nucleotide sequence of the present invention, it is intended that 
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the nucleotide sequence of the polynucleotide is identical to the reference sequence 
except that the polynucleotide sequence may include up to five point mutations per each 
100 nucleotides of the reference nucleotide sequence encoding the E, faecalis 
polypeptide. In other words, to obtain a polynucleotide having a nucleotide sequence at 

5 least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides in the 
ref^ence sequence may be deleted, inserted, or substituted with another nucleotide. The 
query sequence may be an entire sequence shown in SEQ ID NOS: 1-982, the ORF (open 
reading frame), or any fragment specified as described herein. 

As a practical matter, whether any particular nucleic acid molecule or polypeptide 

10 is at least 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide sequence of the 
presence invention can be determined conventionally using known computer programs. 
A preferred method for determining the best overall match between a query sequence (a 
sequence of the present invention) and a subject sequence, also referred to as a global 
sequence alignment, can be determined using the FASTDB computer program based on 

15 the algorithm of Brutlag et al. See Brutlag et al. (1990) Comp. App. Biosci. 6:237-245. 
In a sequence alignment the query and subject sequences are both DNA sequences. An 
RNA sequence can be compared by first converting U*s to T*s. The result of said global 
sequence alignmoit is in percent identity. Preferred parameters used in a FASTDB 
alignment of DNA sequences to calculate percent identity are: Matrix=Unitary, k- 

20 tuple=4, Mismatch Penalty=l, Joining Penalty=30, Randomization Group Length=0, 
Cutoff Score=l, Gap Penalty=5, Gap Size Penalty 0.05, Window Size=500 or the lenght 
of the subject nucleotide sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence because of 5' or 3' 
deletions, not because of internal deletions, a manual correction must be made to the 

25 results. This is because the FASTDB program does not account for 5* and 3* tnmcations 
of the subject sequence when calculating percent idoitity. For subject sequences truncated 
at the 5* or 3' ends, relative to flie query sequence, the percent identity is corrected by 
calculating the number of bases of the query sequence that are 5' and 3' of the subject 
sequence, which are not matched/aligned, as a percent of the total bases of the query 

30 sequence. Whether a nucleotide is matched/aligned is determined by results of the 
FASTDB sequence alignment. This percentage is then subtracted from the percent 
identity, calculated by the above FASTDB program using the specified parameters, to 
arrive at a final percent identity score. This corrected score is what is used for the 
purposes of the present invention. Only nucleotides outside the 5' and 3' nucleotides of 

35 the subject sequence, as displayed by the FASTDB alignment, which are not 

matched/aligned with the query sequence, are calculated for the purposes of manually 
adjustmg the percent identity score. 
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For example, a 90 nucleotide subject sequence is aligned to a 100 nucleotide query 
sequence to determine percent identity. The deletions occur at the 5' end of the subject 
sequence and therefore, the FASTDB alignment does not show a matched/alignment of 
the first 10 nucleotides at 5' end. The 10 unpaired nucleotides represent 10% of the 

5 sequence (number of nucleotides at the 5* and 3' ends not matched/total number of 
nucleotides in the query sequence) so 10% is subtracted from the percent identity score 
calculated by the FASTDB program. If the remaining 90 nucleotides were perfectly 
matched the final percent identity would be 90%. In another example, a 90 nucleotide 
subject sequence is compared with a 1 00 nucleotide query sequence. This time the 

10 deletions are internal deletions so that there are no nucleotides on the 5' or 3' of the 
subject sequence which are not matched/aligned with the query. In this case the percent 
identity calculated by FASTDB is not manually corrected. Once again, only nucleotides 
5* and 3* of the subject sequence which are not matched/aligned with the query sequence 
are manually corrected for. No other manual corrections are to made for the purposes of 

15 the present invention. 

COMPUTER RELATED EMBODIMENTS 

The nucleotide sequences provided in SEQ ID NOS: 1-982, a representative 
fragment thereof, or a nucleotide sequence at least 95%, preferably at least 99% and most 

20 preferably at least 99.9% identical to a polynucleotide sequence of SEQ ID NOS: 1-982 
may be "provided" in a variety of mediums to facilitate use thereof As used herein, 
provided refers to a manufacture, other than an isolated nucleic acid molecule, which 
contains a nucleotide sequence of the present invention; i.e., a nucleotide sequence 
provided in SEQ ID NOS: 1-982 , a representative fiagment thereof, or a nucleotide 

25 sequence at least 95%, preferably at least 99% and most preferably at least 99.9% 

identical to a polynucleotide of SEQ ID NOS: 1-982. Such a manufacture provides a large 
portion of the Enterococcus faecalis genome and parts thereof (e.g., a Enterococcus 
faecalis open reading frame (ORF)) in a form which allows a skilled artisan to examine 
the manufacture using means not directly applicable to examining the Enterococcus 

30 faecalis genome or a subset thereof as it exists in nature or in piirified form. 

In one application of this embodiment, a nucleotide sequence of the present 
invention can be recorded on coir^>uter readable media. As used herein, "computer 
readable media" refers to any medium which can be read and accessed directly by a 
computer. Such media include, but are not limited to: magnetic storage media, such as 

35 floppy discs, hard disc storage medium, and magnetic tape; optical storage media such as 
CD- ROM; electrical storage media such as RAM and ROM; and hybrids of these 
categories, such as magnetic/optical storage media. A skilled artisan can readily 
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appreciate how any of the presently known computer readable mediums can be used to 
create a manufacture comprising computer readable medium having recorded thereon a 
nucleotide sequence of the present invention. Likewise, it will be clear to those of skill 
how additional computer readable media that may be developed also can be used to create 
5 analogous manufactures having recorded thereon a nucleotide sequence of the present 
invention. 

As used herein, "recorded" refers to a process for storing information on 
computer readable medium. A skilled artisan can readily adopt any of the presently know 
methods for recording information on computer readable medium to generate 

10 manufactures comprising the nucleotide sequence information of the present invention. 
A variety of data storage structures are available to a skilled artisan for creating a 
computer readable medium having recorded thereon a nucleotide sequence of the present 
invention. The choice of the data storage structure will generally be based on the means 
chosen to access the stored information. In addition, a variety of data processor 

15 programs and formats can be used to store the nucleotide sequence information of the 
present invention on computer readable medium. The sequence information can be 
rqiresented in a word processing text file, formatted in commercially- available software 
such as WordPerfect and Microsoft Word, or represented in the form of an ASCII file, 
stored in a database ^plication, such as DB2, Sybase, Oracle, or the like. A skilled artisan 

20 can readily adapt any number of data-processor structuring formats (e.g., text file or 
database) in order to obtain computer readable medium having recorded thereon the 
nucleotide sequence information of the present invention. 

Computer software is publicly available which allows a skilled artisan to access 
sequence information provided in a computer readable medium. Thus, by providing in 

25 computer readable form the nucleotide sequences of SEQ ID NOS: 1-982, a rq^rcsentative 
fragment thereof, or a nucleotide sequence at least 95%^ preferably at least 99% and most 
preferably at least 99.9% identical to a sequence of SEQ ID NOS: 1-982 the present 
invention enables the skilled artisan routinely to access the provided sequence 
information for a wide variety of purposes. 

30 The examples which follow demonstrate how software which implements the 

BLAST (Altschul et aL,J. Mol Biol 275:403-410 (1990)) and BLAZE (Brutlag et al, 
Comp. Chem, 77:203-207 (1993)) search algorithms on a Sybase system was used to 
identify open reading frames (ORFs) within the Enterococcus faecalis genome which 
contain homology to ORFs or proteins from both Enterococcus faecalis and from other 

35 organisms. Among the ORFs discussed herein are protein encoding fragments of the 

Enterococcus faecalis genome useful in producing commercially inq)ortant proteins, such 
as enzymes used in fermentation reactions and in the production of commercially useful 
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metabolites, proteins to be used as vaccines or in the generation of immuno-therapeutic 
reagents, or as drug screening targets. 

The present invention further provides systems, particularly computer-based 
systems, which contain the sequence information described herein. Such systems are 
S designed to identify, among other things, commercially important fragments of the 
Enterococcus faecalis genome. 

As used hmin, "a computer-based system" refers to the hardware means, software 
means, and data storage means used to analyze the nucleotide sequence information of the 
present invention. The minimum hardware means of the computer-based systems of the 
10 present invention comprises a central processing unit (CPU), input means, output means, 
and data storage means. A skilled artisan can readily appreciate that any one of the 
currently available computer-based system are suitable for use in the present invention. 

As stated above, the computer-based systems of the present invention comprise a 
data storage means having stored therein a nucleotide sequence of the present invention 
15 and the necessary hardware means and software means for supporting and implementing a 
search means. 

As used herein, **data storage means** refers to memory which can store nucleotide 
sequence information of the present invention, or a memory access means which can 
access manufactures having recorded thereon the nucleotide sequence information of the 

20 present invention. 

As used herein, "search means" refers to one or more programs which are 
implemented on the computer- based system to compare a target sequence or target 
structural motif with the sequence information stored within the data storage means. 
Search means are used to identify fragments or regions of the present genomic sequences 

25 which match a particular target sequence or target motif. A variety of known algorithms 
are disclosed publicly and a variety of commercially available software for conducting 
search means are and can be used in the computer-based systems of the present invention. 
Examples of such software includes, but is not limited to, MacPattem (EMBL), BLASTN 
and BLASTX (NCBI). A skilled artisan can readily recognize that any one of the 

30 available algorithms or implementing software packages for conducting homology 
searches can be adapted for use in the present computer-based systems. 

As used herein, a "target sequence" can be any DNA or amino acid sequence of six 
or more nucleotides or two or more amino acids. A skilled artisan can readily recognize 
that the longer a target sequence is, the less likely a target sequence will be present as a 

35 random occurrence in the database. The most preferred sequence length of a target 
sequence is from about 10 to 100 amino acids or from about 30 to 300 nucleotide 
residues. However, it is well recognized that searches for commercially important 
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fragments, such as sequence fragments involved in gene expression and protein 
processing, may be of shorter length. 

As used herein, "a target structural motif,** or **target motif,** refers to any 
rationally selected sequence or combination of sequences in which the sequence(s) are 
5 chosen based on a three-dimensional configuration which is formed upon the folding of 
the target motif. There are a variety of target motifs known in the art Protein target 
motifs include, but are not limited to, enzymic active sites and signal sequences. Nucleic 
acid target motifs include, but are not limited to, promoter sequences, hairpin structures 
and inducible expression elements (protein binding sequences). 

10 A variety of structural formats for the input and output means can be used to 

input and output the information in the computer-based systems of the present 
invention. A preferred format for an output means ranks fragments of the Enterococcus 
faecalis genomic sequences possessing varying degrees of homology to the target sequence 
or target motif. Sudi presentation provides a skilled artisan with a raiiking of sequences 

15 which contain various amounts of the target sequoice or target motif and identifies the 
degree of homology contained in the identifled fragment. 

A variety of comparing means can be used to compare a target sequence or target 
motif with the data storage means to identify sequence fragments of the Enterococcus 
faecalis genome. In the present examples, implementing software which implement the 

20 BLAST algorithm, described in Altschul et ai (1990) J, Mol Biol. 215: 403-410, is used 
to identify open reading frames within the Enterococcus faecalis genome. A skilled 
artisan can readily recognize that any one of the publicly available homology search 
programs can be used as the search means for the computer-based systems of the present 
invention. Of course, suitable proprietary systems that may be known to those of skill 

25 also may be employed in tiiis regard. 

Figure 1 provides a block diagram of a computer system illustrative of 
embodiments of this aspect of present invention. The conq>uter system 102 includes a 
processor 106 connected to a bus 104. Also connected to the bus 104 are a main 
memory 108 Q)referably implemented as random access memory, RAM) and a variety of 

30 secondary storage devices 1 10, such as a hard drive 1 12 and a removable medium storage 
device 1 14. The removable medium storage device 1 14 may represent, for example, a 
floppy disk drive, a CD-ROM drive, a magnetic tape drive, etc. A removable storage 
medium 116 (such as a floppy disk, a compact disk, a magnetic tape, etc) containing 
control logic and/or data recorded therein may be inserted into the removable medium 

35 storage device 1 14. The computer system 102 inchides appropriate software for reading 
the control logic and/or the data from the removable medium storage device 1 14, once it 
is inserted into the removable medium storage device 1 14. 
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A nucleotide sequence of the present invention may be stored in a well known 
manner in the main memory 108, any of the secondary storage devices 1 10, and/or a 
ronovable storage medium 116. During execution, software for accessing and processing 
the genomic sequence (such as search tools, comparing tools, etc.) reside in main memory 
5 108, in accordance with the requirements and operating parameters of the operating 
system, the hardware system and the software program or programs. 

BIOCHEMICAL EMBODIMENTS 

Other embodiments of the present invention are directed to isolated fragments of 

10 the Enterococcus faecalis genome. The fragm^ts of the Enterococcus faecalis genome 
of the present invention include, but are not limited to fragments which encode peptides, 
hereinafter open reading frames (ORFs), fragments which modulate the expression of an 
operably linked ORF, hereinafter expression modulating fragments (EMFs) and fragments 
which can be used to diagnose the presence of Enterococcus faecalis in a sample, 

15 hereinafter diagnostic fragments (DFs). 

As used herein, an **isolated nucleic acid molecule" or an "isolated fragment of the 
Enterococcus faecalis genome" refers to a nucleic acid molecule possessing a specific 
nucleotide sequence which has been subjected to purification means to reduce, from the 
composition, the number of compounds which are normally associated with the 

20 composition. Particularly, the term refers to the nucleic acid molecules having the 

sequences set out in SEQ ID NOS: 1-982, to representative fragments thereof as described 
above, to polynucleotides at least 95%, preferably at least 99% and especially preferably 
at least 99.9% identical in sequence thereto, also as set out above. 

A variety of purification means can be used to generate the isolated fragments of 

25 the present invention. These include, but are not limited to methods vAnch separate 
constituents of a solution based on charge, solubility, or size. 

In one embodiment, Enterococcus faecalis DNA can be enzymatically sheared to 
produce fragments of 15-20 kb in length. Tbese fragments can then be used to genmte a 
Enterococcus faecalis library by inserting tihem into lambda clones as described in the 

30 Examples below. Primers flanking, for example, an ORF, such as those enumerated in 
Tables 1-3 can then be generated using nucleotide sequence information provided in SEQ 
ID NOS: 1-982. Well known and routine techiuques of PGR cloning then can be used to 
isolate the ORF from the lambda DNA library or Enterococcus faecalis genomic DNA. 
Thus, given the availability of SEQ ID NOS: 1-982, the information in Tables 1, 2 and 3, 

35 and the information that may be obtained readily by analysis of the sequences of SEQ ID 
NOS: 1-982 using methods set out above, those of skill will be enabled by the present 
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disclosure to isolate any ORF-containing or other nucleic acid fragment of the present 
invention. 

The isolated nucleic acid molecules of the present invention include, but are not 
limited to single stranded and double stranded DNA, and single stranded RNA. As used 
5 herein, an "open reading frame,** ORF, means a series of triplets coding for amino acids 
without any termination codons and is a sequence translatable into protein. Each 
sequCTce of SEQ ID NOS: 1-982, however, begins and ends with a termination codon. For 
purposes of numbering and reference to polynucleotide and polypeptide sequences the 
entire sequence of each sequence of SEQ ID NOS: 1-982 is included with the first 
10 nucleotide being positon 1. Therefore, for reference purposes the numbering used in the 
present invention is that provided in the sequence listing for SEQ ID NOS: 1-982. 

Tables 1, 2, and 3 list ORFs in the Enterococcus faecalis genomic contigs of the 
present invention that were identified as putative coding regions by the GeneMark 
software using organism-specific second-order Maricov probability transition matrices. It 
15 will be ^predated that other criteria can be used, in accordance with well known 
analytical methods, such as those discussed herein, to generate more inclusive, more 
restrictive, or more selective lists. 

Table 1 sets out ORFs in the Enterococcus faecalis contigs of the present 
invention that over a continuous region of at least 50 bases are 95% or more identical (by 
20 BLAST analysis) to a nucleotide sequence available through GenBank in March, 1997. 
Table 2 sets out ORFs in the Enterococcus faecalis contigs of the present 
invention that are not in Table 1 and match, with a BLASTP probability score of 0.01 or 
less, a polypeptide sequence available through GenBank in March, 1997. 

Table 3 sets out ORFs in the Enterococcus faecalis contigs of the present 
25 invention that do not match significantly, by BLASTP analysis, a polypeptide sequence 
available through GenBank in March, 1997. 

In each table, the first and second colunms identify the ORF by, respectively, 
contig number and ORF number within the contig; the third column indicates the 
coordinate of the first nucleotide of the ORF, counting from the 5' end of the contig 
30 strand; the fourth column indicates the coordinate of the final nucleotide of the ORF, 
counting from the 5' end of the contig strand. 

In Tables 1 and 2, column five lists the Reference for the closest matching 
sequence available through GenBank. These reference numbers are the database entry 
numbers commonly used by those of skill in the art, who will be familiar with their 
35 denominators. Descriptions of the nomenclature are available from die National Center 
for Biotechnology Information. Column six in Tables 1 and 2 provides the gene name of 
the matching sequence. 
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In Table 1, column seven provides the nucleotide BLAST percent identity score 
from the comparison of the ORF and the GenBank sequence, column eight indicates the 
length in nucleotides of the highest scoring segment pair identified by the BLAST identity 
analysis, and column nine provides the total length of the ORF in nucleotides. 
5 In Table 2, column seven provides the protein BLAST percent similarity of the 

highest scoring segment pair identified, column eight provides the percent identity of the 
highest scoring segment pair, and column nine provides the total length of the ORF in 
nucleotides. 

The concepts of percent identity and percent similarity of two polypeptide 

10 sequences is well understood in the art. For example, two polypeptides 10 amino acids in 
length which differ at three amino acid positions (e.g., at positions 1, 3 and 5) are said to 
have a percent identity of 70%. However, the same two polypeptides would be deemed to 
have a percent similarity of 80% if, for example at position S, the amino acids moieties, 
although not identical, were "similar** (i.e., possessed similar biochemical characteristics). 

15 Many programs for analysis of nucleotide or amino acid sequence similarity, such as fasta 
and BLAST specifically list percent identity of a matching region as an output parameter. 
Thus, for instance. Tables 1 and 2 herein enumerate the percent identity of the highest 
scoring segment pair in each ORF and its listed relative. Further details concerning the 
algorithms and criteria used for homology searches are provided below and are described in 

20 the pertinent literature highlighted by the citations provided below. 

It will be appreciated that other criteria can be used to generate more inclusive 
and more exclusive listings of the types set out in the tables. As those of skill will 
appreciate, narrow and broad searches both are useful. Thus, a skilled artisan can readily 
identify ORFs in contigs of the Enterococcus faecalis genome other than those listed in 

25 Tables 1-3, such as ORFs which are overlapping or encoded by the opposite strand of an 
identified ORF in addition to those ascertainable using the computer-based systems of the 
present invention. 

As used herein, an "expression modulating fragment," EMF, means a series of 
nucleotide molecules which modulates the expression of an operably linked ORF or EMF. 

30 As used herein, a sequence is said to "modxxlate the expression of an operably 

linked sequence" when the expression of the sequence is altered by the presence of the 
EMF. EMFs include, but are not limited to, promoters, and promoter modulating 
sequraces (inducible elements). One class of EMFs are fragments which induce the 
expression or an operably linked ORF in response to a specific regulatory factor or 

35 physiological event. 

EMF sequences can be identified wiflun the contigs of the Enterococcus faecalis 
genome by their proximity to the ORFs provided in Tables 1-3. An intergenic segment. 
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or a fragment of the intergenic segment, from about 10 to 200 nucleotides in length, 
taken from any one of the ORFs of Tables 1-3 will modulate the expression of an 
operably linked ORF in a fashion similar to that found with the naturally linked ORF 
sequence. As used herein^ an "intergenic segmrait" refers to fragments of &e 
5 Enterococcus faecalis genome which are between two ORF(s) herein described. EMFs also 
can be identified using known EMFs as a target sequence or target motif in the computer- 
based systems of the present invention. Further, the two methods can be combined and 
used together. 

The presence and activity of an EMF can be confirmed using an EMF trap vector. 

10 An EMF trap vector contains a cloning site linked to a marker sequence. A marker 
sequence encodes an identifiable phenotype, such as antibiotic resistance or a 
complementing nutrition auxotrophic factor, which can be identified or assayed when the 
EMF trap vector is placed within an appropriate host under appropriate conditions. As 
described above, a EMF will modulate the expression of an operably linked marker 

15 sequence. A more detailed discussion of various marker sequences is provided below. 

A sequence which is suspected as being an EMF is cloned in all three reading 
frames in one or more restriction sites upstream from the marker sequence in the EMF 
trap vector. The vector is then transformed into an appropriate host using known 
procedures and the phenotype of the transformed host in examined under appropriate 

20 conditions. As described above, an EMF will modulate the expression of an operably 
linked marker sequence. 

As used herein, a "diagnostic fragment," DF, means a series of nucleotide 
molecules which selectively hybridize to Enterococcus faecalis sequences. DFs can be 
readily identified by identifying unique sequences within contigs of the Enterococcus 

IS faecalis genome, such as by using well-known computer analysis software, and by 

generating and testing probes or amplification primers consisting of the DF sequence in 
an appropriate diagnostic format which determines amplification or hybridization 
selectivity. 

The sequences falling within the scope of the present invention arc not limited to 
30 the specific sequences herein described, but also include allelic and species variations 
thereof. Allelic and species variations can be routinely determined by comparing the 
sequences provided in SEQ ID NOS: 1-982, a representative fragment thereof, or a 
nucleotide sequence at least 99% and preferably 99.9% identical to SEQ ID NOS: 1-982, 
with a sequence from another isolate of the same species. Furthermore, to accommodate 
35 codon variability, the invention includes nucleic acid molecules coding for the same 
amino acid sequences as do the specific ORFs disclosed herein. In other words, in the 
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coding region of an ORF, substitution of one codon for another which encodes the same 
amino acid is expressly contemplated. 

Any specific sequence disclosed herein can be readily screened for errors by 
resequencing a particular fragment, such as an ORF, in both directions (i.e., sequence both 
5 strands). Alternatively, error screening can be performed by sequencing corresponding 
polynucleotides of Enterococcus faecalis origin isolated by using part or all of the 
fragments in question as a probe or primer. 

Each of the ORFs of the Enterococcus faecalis genome disclosed in Tables 1, 2 
and 3, and the EMFs found 5 to the ORFs, can be used as polynucleotide reagents in 

10 numerous ways. For example, the sequences can be used as diagnostic probes or diagnostic 
amplification primers to detect the presence of a specific microbe in a sample, 
particularly Enterococcus faecalis. Especially preferred in this regard are ORFs such as 
those of Table 3, which do not match previously characterized sequences from other 
organisms and thus are most likely to be highly selective for Enterococcus faecalis. Also 

15 particularly preferred are ORFs that can be used to distinguish between strains of 

Enterococcus faecalis, particularly those that distinguish medically important strain, such 
as drug-resistant strains. 

In addition, the fragments of the present invention, as broadly described, can be 
used to control gene expression through triple helix formation or antisense DNA or RNA, 

20 both of which methods are based on the binding of a polynucleotide sequence to DNA or 
RNA. Triple helix-formation optimally results in a shut-off of RNA transcription from 
DNA, while antisense RNA hybridization blocks translation of an mRNA molecule into 
polypq>tide. Information from the sequences of the present invention can be used to 
design antisense and triple helix-forming oligonucleotides. Polynucleotides suitable for 

25 use in these methods are usually 20 to 40 bases in length and are designed to be 
complementaiy to a region of the gene involved in transcription, for triple-helix 
formation, or to the mRNA itself, for antisense inhibition. Both techniques have been 
demonstrated to be effective in model systems, and the requisite techniques are well 
known and involve routine procedures. Triple helix techniques are discussed in, for 

30 example, Lee et ai, NucL Acids Res. 5:3073 (1979); Cooney et aL, Science 241:456 

(1988); and Dervan et al^ Science 257:1360 (1991). Antisense techniques in general are 
discussed in, for instance, Okano, / Neurochem. 56:560 (1991) and 
Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression^ CRC Press, Boca 
Raton, FL (1988)). 

35 The present invention further provides recombinant constructs comprising one or 

more fragments of the Enterococcus faecalis genomic fragments and contigs of the 
present invention. Certain preferred recombinant constructs of die present invention 
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comprise a vector, such as a plasmid or viral vector, into which a fragment of the 
Enterococcus faecalis genome has been inserted, in a forward or reverse orientation. In 
the case of a vector comprising one of the ORFs of the present invention, the vector 
may Airther comprise regulatory sequences, including for example, a promoter, op^ably 
5 linked to the ORF. For vectors comprising the EN4Fs of the present invention, the 
vector may further comprise a marker sequence or heterologous ORF operably linked to 
the EMF. 

Large numbers of suitable vectors and promoters are known to those of skill in 
the art and are commercially available for generating the recombinant constructs of the 

10 present invention. The following vectors are provided by way of example. Useful 

bacterial vectors inchide phagescript, PsiX174, pBS SK (+ or -), pBS KS (+ or -), pNHSa, 
pNH16a, pNH18a, pNH46a (available from Stratagene); pTrc99A, pKK223-3, pKK233- 
3, pDR540, pRIT5 (available from Pharmacia). Useful eukaryotic vectors include 
pWLneo, pSV2cat, pOG44, pXTl, pSG (available from Stratagene) pSVK3, pBPV, pMSG, 

IS pSVL (available from Pharmacia). 

Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. Two 
appropriate vectors are pKK232-8 and pCM7. Particular named bacterial promoters 
include lad, lacZ, T3, T7, gpi, lambda PR, and trc. Eukaryotic promoters include CMV 

20 immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and 
mouse metallothionein- I. Selection of the appropriate vector and promoter is well 
within the level of ordinary skill in the art. 

The present invention further provides host cells containing any one of the 
isolated fragments of the Enterococcus faecalis gmomic fragments and contigs of the 

25 present invention, wherein the fragment has been introduced into the host cell using 
known methods. The host cell can be a higher eukaryotic host cell, such as a 
manunalian cell, a lower eukaryotic host cell, such as a yeast cell, or a procaryotic cell, 
such as a bacterial cell. 

A polynucleotide of the present invention, such as a recombinant construct 

30 comprising an ORF of the present invention, may be introduced into the host by a 
variety of well established techniques that are standard in the art, such as calcium 
phosphate transfection, DEAE, dextran mediated transfection and electroporation, which 
are described in, for instance, Davis, L. et al, BASIC METHODS IN MOLECULAR 
BIOLOGY (1986). 

35 A host cell containing one of the fragments of the Enterococcus faecalis genomic 

fragments and contigs of the present invention, can be used in conventional manners to 
produce the gene product encoded by the isolated fragment (in the case of an ORF) or can 
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be used to produce a heterologous protein under the control of the EMF. The 
present invention further provides isolated polypeptides encoded by the nucleic acid 
fragments of the present invention or by degenerate variants of the nucleic acid 
fragments of the present invention. By "degenerate variant" is intended nucleotide 

5 fragments which differ from a nucleic acid fragment of the present invention an 
ORF) by nucleotide sequence but, due to the degeneracy of the Genetic Code, encode an 
identical polypeptide sequence. 

Preferred nucleic acid fragments of the present invention are the ORFs depicted 
in Tables 2 and 3 which encode proteins. 

10 A variety of methodologies known in the art can be utilized to obtain any one of 

the isolated polypeptides or proteins of the present invention. At the simplest level, the 
amino acid sequence can be synthesized using commercially available peptide synthesizers. 
This is particularly useful in producing small pq)tides and fragments of larger 
polypeptides. Such short fragments as may be obtained most readily by synthesis are 

15 useful, for example, in generating antibodies against the native polypeptide, as discussed 
further below. 

In an altmiative method, the polypeptide or protein is purified from bacterial 
cells which naturally produce the po]ypq)tide or protein. One skilled in the art can 
readily employ well-known methods for isolating polypeptides and proteins to isolate and 
20 purify polypeptides or proteins of the present invention produced naturally by a bacterial 
strain, or by other methods. Methods for isolation and purification that can be employed 
in this regard include, but are not limited to, immunochromatogr^hy, HPLC, size- 
exclusion chromatography, ion-exchange chromatography, and immuno-affinity 
chromatography. 

25 The polypeptides and proteins of the present invention also can be purified from 

cells which have been altered to express the desired polypeptide or protein. Preferred 
polypeptides and proteins of the present mvention are polypeptides and proteins coded 
for by the polynucleotides of SEQ ID NOS: 1-982, wherein the polypeptides and proteins 
are coded in the same frame as the termination codon at the end of each sequence of SEQ 

30 ID NOS: 1-982. As used herein, a cell is said to be altered to express a desired polypeptide 
or protein when the cell, through genetic manipulation, is made to produce a polypeptide 
or protein which it normally does not produce or which the cell normally produces at a 
lower level. Those skilled in the art can readily adapt procedures for introducing and 
expressing either recombinant or synthetic sequences into eukaiyotic or prokaryotic cells 

35 in order to generate a cell which produces one of the polypeptides or proteins of the 
present invention. 
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The polypeptides of the present mvention are preferably provided in an isolated 
form, and preferably are substantially purified. A recombinantly produced version of the 
£. faecalis polypeptide can be substantially purified by the one-stqp method described by 
Smith et al. (1988) Gene 67:31-40. Polypeptides of the invention also can be purified 
from natural or recombinant sources using antibodies directed against the polypeptides of 
the invention in methods which are well known in the art of protein purification. 

The invention further provides for isolated E. faecalis polypeptides comprising 
an amino acid sequence selected from the group including: (a) the amino acid sequence of 
a full-length E. faecalis polypq)tide having the complete amino acid sequence from the 
first methionine codon to the termination codon of each sequence listed in SEQ ID 
NOS: 1-982, wherein said termination codon is at the end of each S£Q ID NO: and said 
first methionine is the first methionine in frame with said termination codon; and (b) the 
amino acid sequence of a full-length E, faecalis polypq[>tide having the complete amino 
acid sequence in (a) excepting the N-termtnal methionine. 

The polypeptides of the present invention also include polypeptides having an 
amino acid sequence at least 80% identical, more preferably at least 90% identical, and 
still more preferably 95%, 96%, 97%, 98% or 99% identical to those described in (a) and 
(b) above. 

The present invention is further directed to polynucleotide encoding portions or 
fragments of the amino acid sequences described herein as well as to portions or fragments 
of the isolated amino acid sequences described herein. Fragments include portions of the 
amino acid sequences described herein, are at least 5 contiguous amino acid in length, are 
selected from any two integers, one of which representing a N-terminal position. The 
initiation codon of the polypeptides of the present inventions position 1. The initiation 
codon (positon 1) for purposes of the present invention is the first methionine codon of 
each sequence of SEQ ID NOS: 1-982 which is in frame with the termination codon at the 
end of each said sequence. Every combination of a N-tmninal and C-terminal position 
that a fragment at least 5 contiguous amino acid residues in length could occupy, on any 
given amino acid sequence encoded by a sequence of SEQ ID NOS: 1-982 is included in the 
invention, i.e., from initiation codon up to the termination codon. At least means a 
fragment may be 5 contiguous amino acid residues in length or any integer between 5 and 
the number of residues in a full length amino acid sequence minus 1 . Therefore, included 
in the invention are contiguous fragments specified by any N-terminai and C-terminal 
positions of amino acid sequence set forth in SEQ ID NOS: 1-982 wherein the contiguous 
fragment is any integer between 5 and the number of residues in a full length sequence 
minus 1. 
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Further, the invention includes po)ypq)tides comprising fragments specified by 
size, in amino acid residues, rather than by N-tenninal and C-tenninal positions. The 
invention includes any fragment size, in contiguous amino acid residues, selected from 
integers between 5 and the number of residues in a full length sequence minus 1 . Preferred 
5 sizes of contiguous polypeptide fragments include about 5 amino acid residues, about 10 
amino acid residues, about 20 amino acid residues, about 30 amino acid residues, about 40 
amino acid residues, about 50 amino acid residues, about 100 amino acid residues, about 
200 amino acid residues, about 300 amino acid residues, and about 400 amino acid 
residues. The preferred sizes are, of course, meant to exemplify, not limit, the present 

10 invention as all size fragments representing any integer between 5 and the number of 
residues in a full length sequence minus 1 are included in the invention. The present 
invention also provides for the exclusion of any fragments specified by N -terminal and C- 
terminal positions or by size in amino acid residues as described above. Any number of 
fragments specified by N-terminal and C-terminal positions or by size in amino acid 

1 5 residues as described above may be excluded. 

The above fragments need not be active since they would be useful, for exan^le, 
in immunoassays, in q)itope man>ing, epitope tagging, to generate antibodies to a 
particular portion of the protein, as vaccines, and as molecular weight markers. 

Further polypeptides of the present invention include polypeptides which have at 

20 least 90% similarity, more preferably at least 95% similarity, and still more preferably at 
least 96%, 97%, 98% or 99% similarity to those described above. 

A further embodiment of the invention relates to a polypeptide which comprises 
the amino acid sequence of a E. faecalis polypq)tide having an amino acid sequence which 
contains at least one conservative amino acid substitution, but not more than 50 

25 conservative amino acid substitutions, not more than 40 conservative amino acid 

substitutions, not more than 30 conservative amino acid substitutions, and not more than 
20 conservative amino acid substitutions. Also provided are polypeptides which conqirise 
the amino acid sequence of a £. faecalis polypeptide, having at least one, but not more 
than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 conservative amino acid substitutions. 

30 By a polypeptide having an amino acid sequence at least, for example, 95% 

"identical" to a query amino acid sequence of the present invention, it is intended that the 
amino acid sequence of the subject polypeptide is identical to the query sequence except 
that the subject polypq)tide sequence may include up to five amino acid alterations per 
each 1 00 amino acids of the query amino acid sequence. In other words, to obtain a 

35 polypq)tide having an amino acid sequence at least 95% identical to a query amino acid 
sequence, up to 5% of the amino acid residues in the subject sequence may be inserted, 
deleted, (indels) or substituted with another amino acid. These alterations of the 



wo 98/50555 



PCTAJS98/08985 



24 

reference sequence may occur at the amino or carboxy terminal positions of the 
reference amino acid sequence or anywhere between those termmal positions, interspersed 
either individually among residues in the reference sequence or in one or more contiguous 
groups within the reference sequence. 

S As a practical matter, whether any particular polypeptide is at least 90%, 95%, 

96%, 97%, 98% or 99% identical to the amino acid sequences encoded by the sequences 
of SEQ ID NOS: 1-982, as described hererin, can be determined conventionally using 
known computer programs. A preferred method for determining the best overall match 
between a queiy sequence (a sequence of the present invention) and a subject sequence, 

10 also referred to as a global sequence alignment, can be determined using the FASTDB 
computer program based on the algorithm of Brutlag et al., (1990) Comp. App. Biosci. 
6:237-245. In a sequence alignment the query and subject sequences are both amino acid 
sequences. The result of said global sequence alignment is in percent identity. Preferred 
parameters used in a FASTDB amino acid alignment are: Matrix==PAM 0, k'tuple=2, 

15 Mismatch Penalty=l, Joining Penalty=20, Randomization Group Length=0, Cutoff 

ScoTe=l, Window Size=sequence length. Gap Penalty=5, Gap Size Penalty=0.05, Window 
Size=500 or (he length of the subject amino acid sequoice, whichever is shorter. 

If the subject sequence is shorter than the query sequence due to N- or C-terminal 
deletions, not because of internal deletions, the results, in percent identity, must be 

20 manually corrected. This is because the FASTDB program does not account for N- and C- 
terminal truncations of the subject sequence when calculating global percent identity. For 
subject sequences truncated at the N- and Otermini, relative to the query sequence, the 
percent identity is corrected by calculating the number of residues of the query sequence 
that are N- and C-terminal of the subject sequence, which are not matched/aligned with a 

25 corresponding subject residue, as a percent of the total bases of the query sequence. 

Whether a residue is matched/aligned is determined by results of the FASTDB sequence 
alignment. This percentage is then subtracted firom the percent identity, calculated by 
the above FASTDB program using the specified parameters, to arrive at a final percent 
identity score. This final percent identity score is what is used for the purposes of the 

30 present invention. Only residues to the N- and C-termini of the subject sequence, which 
are not matched/aligned with the query sequence, are considered for the purposes of 
manually adjusting the percent identity score. That is, only query amino acid residues 
outside the farthest N- and C-terminal residues of the subject sequrace. 

For example, a 90 amino acid residue subject sequence is aligned with a 100 residue 

35 query sequence to determine percent identity. The deletion occurs at the N-terminus of 
the subject sequence and therefore, the FASTDB alignmrat does not match/align with the 
first 10 residues at the N-terminus. The 10 unpaired residues represent 10% of the 
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sequence (number of residues at the N- and C- termini not matched/total number of 
residues in the query sequence) so 10% is subtracted from the percent identity score 
calculated by the FASTDB program. If the remaining 90 residues were perfectly matched 
the final percent identity would be 90%. In another example, a 90 residue subject 
5 sequence is compared with a 100 residue queiy sequence. This time the deletions are 
internal so there are no residues at the N- or C-termini of the subject sequence which are 
not matched/aligned with the query. In this case the percent identity calculated by 
FASTDB is not manually corrected. Once again, only residue positions outside the and 
C-terminal.ends of the subject sequence, as displayed in the FASTDB alignment, which 

10 are not matched/aligned with the query sequence are manually corrected. No other 
manual corrections are to made for the purposes of the present invention. 

The above polypeptide sequences are included irrespective of whether they have 
their normal biological activity. This is because even where a particular polypeptide 
molecule does not have biological activity, one of skill in the art would still know how to 

IS use the polypeptide, for instance, as a vaccine or to generate antibodies. Other uses of 
the polypeptides of the present invention diat do not have E, faecalis activity include, 
inter alia^ as epitope tags, in q>itope msq>ping, and as molecular weight markers on 
SDS-PAGE gels or on molecular sieve gel filtration columns using methods known to 
those of skill in the art 

20 As described below, the polypeptides of the present invention can also be used to 

raise polyclonal and monoclonal antibodies, which are useful in assays for detecting E. 
faecalis protein expression or as agonists and antagonists enable of enhancing or 
inhibiting £. faecalis protein fimction. Further, such polypeptides can be used in the 
yeast two-hybrid system to ''capture** E, faecalis protein binding proteins which are also 

25 candidate agonists and antagonists according to the present invention. See, e,g.. Fields et 
al. (1989) Nature 340:245-246. 

Any host/vector system can be used to express one or more of the ORFs of the 
present invention. These include, but are not limited to, eukaryotic hosts such as HeLa 
cells, CV-1 cell, COS cells, and Sf9 cells, as well as prokaryotic host such as E. coli and B. 

30 subtilis. The most preferred cells are those which do not normally express the particular 
polypeptide or protein or which expresses the polypeptide or protein at low natural level. 

**Recombinant,** as used herein, means that a polypeptide or protein is derived 
fi'om recombinant (e.g., microbial or manunalian) expression systems. "Microbial" refers 
to recombinant polypeptides or proteins made in bacterial or fungal (e.g., yeast) 

35 expression systems. As a product, "recombinant miorobial" defines a polypq>tide or 

protein essentially free of native endogenous substances and unaccompanied by associated 
native glycosylation. Polypeptides or proteins expressed in most bacterial cultures, e.g.. 
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E. coli, will be free of glycosylation modifications; polypeptides or proteins expressed in 
yeast will have a glycosylation pattern different from that expressed in mammalian cells. 

*74ucleotide sequence" refers to a heteropolymer of deoxyribonucleotides. 
Generally, DNA segments encoding the polypeptides and proteins provided by this 
5 invention are assembled from fragments of the Enterococcus faecalis genome and short 
oligonucleotide linkers, or from a series of oligonucleotides, to provide a synthetic gene 
which is capable of being expressed in a recombinant transcriptional unit con4>rising 
regulatory elements derived fit)m a microbial or viral operon. 

Recombinant expression vehicle or **vector" refers to a plasmid or phage or virus 

10 or vector, for expressing a polypeptide from a DNA (RNA) sequence. The expression 
vehicle can comprise a transcriptional unit comprising an assembly of (1) a genetic 
regulatory elements necessary for gene expression in the host, including elements required 
to initiate and maintain transcription at a level suHicient for suitable expression of the 
desired polypeptide, including, for example, promoters and, where necessary, an enhancer 

15 and a polyadenylation signal; (2) a structural or coding sequence which is transcribed into 
mRNA and translated into protein, and (3) appropriate signals to initiate translation at 
the beginning of the desired coding region and terminate translation at its end. Structural 
units intended for use in yeast or eukaiyotic expression systems preferably include a 
leader sequence enabling extracellular secretion of translated protein by a host cell. 

20 Alternatively, where recombinant protein is expressed without a leader or transport 

sequence, it may include an N-terminal methionine residue. This residue may or may not 
be subsequently cleaved from the expressed recombinant protein to provide a final 
product. 

"Recombinant expression system" means host cells which have stably integrated a 
25 recombinant transcriptional unit into chromosomal DNA or carry the recombinant 
transcriptional unit extra chromosomally. The cells can be prokaryotic or eukaryotic. 
Recombinant expression systems as defined herein will express heterologous polypeptides 
or proteins iq>on induction of the regulatory elements linked to the DNA segment or 
synthetic gene to be expressed. 
30 Mature proteins can be expressed m mammalian cells, yeast, bacteria, or other 

cells under the control of appropriate promoters. Cell-free translation systems can also 
be en^loyed to produce such proteins using RNAs derived fix)m the DNA constructs of 
the present invention. Appropriate cloning and expression vectors for use with 
prokaryotic and eukaryotic hosts are described in Sambrook et al. Molecular Cloning: A 
35 Laboratory Mantiai 2nd Edition, Cold Spring Harbor Laboratory Press, Cold Spring 

Harbor, New York (1989), the disclosure of which is hereby incorporated by reference in 
its entirety. 
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Generally, recombinant expression vectors will include origins of rq)lication and 
selectable markers permitting transformation of the host cell, e.g., the anqiicillin 
resistance gene of £. coli and S. cerevisiae TRPl gene, and a promoter derived from a 
highly expressed gene to direct transcription of a downstream structural sequence. Such 

5 promoters can be derived from operons encoding glycolytic enzymes such as 3- 

phosphoglycerate kinase (PGK), alpha-factor, acid phosphatase, or heat shock proteins, 
among others. The heterologous structural sequence is assembled in appropriate phase 
with translation initiation and termination sequences, and preferably, a leader sequence 
capable of directing secretion of translated protein into the periplasmic space or 

10 extracellular medium. Optionally, the heterologous sequence can encode a ftision protein 
including an N-terminal idoitification peptide imparting desired characteristics, e.g., 
stabilization or simplified purification of expressed recombinant product 

Useful expression vectors for bacterial use are constructed by inserting a structural 
DNA sequoice encoding a desired protein together with suitable translation initiation and 

IS termination signals in operable reading phase with a functional promoter. The vector will 
comprise one or more phenotypic selectable markers and an origin of replication to 
ensure maintenance of the vector and, when desirable, provide amplification within the 
host. 

Suitable prokaryotic hosts for transformation include strains of E. coliy B, sitbtilis, 
20 Salmonella typhimurium and various species within the genera Pseudomonas and 
Streptomyces. Others may, also be employed as a matter of choice. 

As a representative but non-limiting example, useful expression vectors for 
bacterial use can comprise a selectable marker and bacterial origin of replication derived 
from commercially available plasmids comprising genetic elements of the well known 
25 cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example, 
pKK223-3 (available form Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM 1 
(available from Promega Biotec, Madison, Wl, USA). These pBR322 "backbone" 
sections are combined with an appropriate promoter and the structural sequence to be 
expressed. 

30 Following transformation of a suitable host strain and growth of the host strain to 

an ^propriate cell density, the selected promoter, where it is inducible, is derepressed or 
induced by appropriate means (e.g., temperature shift or chemical induction) and cells are 
cultured for an additional period to provide for expression of the induced gene product. 
Thereafter cells are typically harvested, generally by centrifiigation, disrupted to release 

35 expressed protein, generally by physical or chemical means, and the resulting crude 
extract is retained for further purification. 
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Various mammalian cell culture systems can also be employed to express 
recombinant protein. Examples of mammalian expression systems include the COS-7 
lines of monkey kidney fibroblasts, described in Gluzman, Cell 25:175 (1981), and other 
cell lines capable of expressing a con^atible vector, for example, the CI 27, 3T3, CHO, 
5 HeLa and BHK cell lines. 

Mammalian expression vectors will comprise an origin of replication, a suitable 
promoter and enhancer, and also any necessary ribosome binding sites, polyadenylation 
site, splice donor and acceptor sites, transcriptiona] termination sequences, and 5 
flanking nontranscribed sequences. DNA sequences derived from the SV40 viral genome, 
10 for example, SV40 origin, early promoter, enhancer, splice, and polyadenylation sites 
may be used to provide the required nontranscribed genetic elements. 

Recombinant polypeptides and proteins produced in bacterial culture is usually 
isolated by initial extraction from cell pellets, followed by one or more salting-out, 
aqueous ion exchange or size exclusion chromatography steps. Microbial cells employed 
15 in ^pression of proteins can be disrupted by any convenirat method, including freeze- 
thaw cycling, sonication, mechanical disn^tion, or use of cell lysing agents. Protein 
refolding steps can be used, as necessary, in completing configuration of the mature 
protein. Finally, high performance liquid chromatography (HPLC) can be employed for 
final purification steps. 

20 The present invention further includes isolated polypeptides, proteins and nucleic 

acid molecules which are substantially equivalent to those herein described. As used 
herein, substantially equivalent can refer both to nucleic acid and amino acid sequences, 
for example a mutant sequence, that varies from a reference sequence by one or more 
substitutions, deletions, or additions, the net effect of which does not result in an adverse 

25 functional dissimilarity between reference and subject sequences. For purposes' of the 
present invention, sequences having equivalent biological activity, and equivalent 
expression characteristics are considered substantially equivalent. For purposes of 
determining equivalence, truncation of the mature sequence should be disregarded. 

The invention further provides methods of obtaining homologs from other 

30 strains of Enterococcus faecaliSy of the fragments of the Enterococcus faecalis genome of 
the present invention and homologs of the proteins encoded by the ORFs of the present 
invention. As used herein, a sequence or protein of Enterococcus faecalis is defined as a 
homolog of a fragment of the Enterococcus faecalis fragments or contigs or a protein 
encoded by one of the ORFs of the present mvention, if it shares , significant homology to 

35 one of the fragments of the Enterococcus faecalis genome of the present invention or a 
protein encoded by one of the ORFs of the present invention. Specifically, by using the 
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sequence disclosed herein as a probe or as primers, and techniques such as PGR cloning and 
colony/plaque hybridization, one skilled in the art can obtain homologs. 

As used herein, two nucleic acid molecules or proteins are said to "share 
significant homology" if the two contain regions which possess greater than 85% 
5 sequence (amino acid or nucleic acid) homology. Preferred homologs in this regard are 
those with more than 90% homology. Especially preferred are those with 93% or more 
homology. Among especially preferred homologs those with 95% or more homology are 
particularly preferred. Very particularly preferred among these arc those with 97% and 
even more particularly preferred among those are homologs with 99% or more 

10 homology. The most preferred homologs among these are those with 99.9% homology 
or more. It will be understood that, among measures of homology, identity is particularly 
preferred in this regard. 

Region specific primers or probes derived from the nucleotide sequence provided 
in SEQ ID NOS: 1-982 or from a nucleotide sequence at least 95%, particularly at least 

15 99%, especially at least 99.5% identical to a sequence of SEQ ID NOS: 1-982 can be used 
to prime DNA synthesis and PGR amplification, as well as to identify colonies containing 
cloned DNA encoding a homolog. Methods suitable to this aspect of the present 
invention are well known and have been described in great detail in many publications 
such as, for example, Innis et al^ PCR Protocols ^ Academic Press, San Diego, CA 

20 (1990)). 

When using primers derived from SEQ ID NOS: 1-982 or from a nucleotide 
sequence having an aforemraitioned identity to a sequence of SEQ ID NOS: 1-982, one 
skilled in the art will recognize that by employing high stringency conditions (e.g., 
annealing at 50-60X in 6X SSPC and 50% formamide, and washing at 50- 65''C in 0.5X 

25 SSPC) only sequences which are greater than 75% homologous to the primer will be 

amplified. By employing lower stringency conditions (e.g., hybridizmg at 35-37**C in 5X 
SSPC and 40-45% formamide, and washing at 42''C in 0.5X SSPC), sequences which are 
greater than 40-50% homologous to the primer will also be amplified. 

When using DNA probes derived from SEQ ID NOS: 1-982, or from a nucleotide 

30 sequence having an aforementioned identity to a sequence of SEQ ID NOS: 1-982, for 
colony/plaque hybridization, one skilled in the art will recognize that by employing high 
stringency conditions (e.g., hybridizing at 50- 65**C m 5X SSPC and 50% formamide, and 
washing at 50- 65**C in 0.5X SSPC), sequences havmg regions which are greater than 90% 
homologous to the probe can be obtained, and that by employing lower stringency 

35 conditions (e.^,, hybridizing at 35-37**C in 5X SSPC and 40-45% formamide, and washing 
at 42^C in 0.5X SSPC), sequences having regions which are greater than 35-45% 
homologous to the probe will be obtained. 
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Any organism can be used as the source for homologs of the present invention so 
long as the organism naturally expresses such a protein or contains genes encoding the 
same. The most preferred organism for isolating homologs are bacteria which are closely 
related to Enterococcus faecalis. 

5 

ILLUSTRATIVE USES OF COMPOSITIONS OF THE INVENTION 

Each ORF provided in Tables 1 and 2 is identified with a function by homology to 
a known gene or polypeptide. As a result, one skilled in the art can use the polypeptides 
of the present invention for commercial, therapeutic and industrial purposes consistent 

10 with the type of putative identification of the polypeptide. Such identifications permit 
one skilled in the art to use the Enterococcus faecalis ORFs in a manner similar to the 
known type of sequences for which the identification is made; for example, to ferment a 
particular sugar source or to produce a particular metabolite. A variety of reviews 
illustrative of this aspect of the invention are available, including the following reviews 

15 on the industrial use of enzymes, for exan^)le, BIOCHEMICAL ENGINEERING AND 

BIOTECHNOLOGY HANDBOOK, 2nd Ed., MacMillan Publications, Ltd. NY (1991) and 
BIOCATALYSTS IN ORGANIC SYNTHESES, Tramper etai, Eds., Elsevier Science 
Publishers, Amsterdam, The Netherlands (1985). A variety of exemplary uses that 
illustrate this and similar aspects of the present invention are discussed below. 

20 

1. Biosynthetic Enzymes 

Open reading frames encoding proteins involved in mediating the catalytic 
reactions involved in intermediary and macromolecular metabolism, the biosynthesis of 
small molecules, cellular processes and other functions includes enzymes involved in the 

25 degradation of the intermediary products of metabolism, enzymes involved in central 
intermediary metabolism, enzymes involved in respiration, both aerobic and anaerobic, 
enzymes involved in fermentation, enzymes involved in ATP proton motor force 
conversion, enzymes involved in broad regulatory function, enzymes involved in amino 
acid synthesis, enzymes involved in nucleotide synthesis, enzymes involved in cofactor 

30 and vitamin synthesis, can be used for industrial biosynthesis. 

The various metabolic pathways present in Enterococcm faecalis can be identified 
based on absolute nutritional requirements as well as by examining the various enzymes 
identified m Table 1-3 and SEQ ID NOS: 1-982. 

Of particular interest are polypeptides involved in the degradation of 

35 intermediaiy metabolites as well as non-macromolecular metabolism. Such enzymes 
include amylases, glucose oxidases, and catalase. 
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Proteolytic enzymes are another class of commercially important enzymes. 
Proteolytic enzymes find use in a number of industrial processes including the processing 
of flax and other vegetable fibers, in the extraction, clarification and depectmization of 
fruit juices, in the extraction of vegetables' oil and in the maceration of fruits and 

5 vegetables to give unicellular fruits. A detailed review of the proteolytic enzymes used in 
the food industry is provided in Rombouts et al^ Symbiosis 21:19 (1986) and Voragen et 
ai. in Biocatalysts In Agricultural Biotechnology ^ Whitaker et ai^ Eds., American 
Chemical Society Symposium Series 389:93 (1989) . 

The metabolism of sugars is an important aspect of the primary metabolism of 

10 Enterococcus faecalis. Enzymes involved in the degradation of sugars, such as, 
particularly, glucose, galactose, fructose and xylose, can be used in industrial 
fermentation. Some of the important sugar transforming enzymes, from a commercial 
viewpoint, include sugar isomerases such as glucose isomerase. Other metabolic enzymes 
have found commercial use such as glucose oxidases which produces ketogulonic acid 

15 (KGA). KGA is an intermediate in the commercial production of ascorbic acid using the 
Reichstein's procedure, as described in Krueger et aly Biotechnolozy 6(A), Rhine et aL, 
Eds., Verlag Press, Weinheim, Germany (1984). 

Glucose oxidase (GOD) is commercially available and has been used in purified 
form as well as in an immobilized form for the deoxygenation of beer. See, for instance, 

20 Hartmeir et al.^ Biotechnology Letters 7:21 (1979). The most important application of 
GOD is the industrial scale fermentation of gluconic acid. Market for gluconic acids 
which are used in the detergent, textile, leather, photographic, pharmaceutical, food, feed 
and concrete industry, as described, for example, in Bigelis et aly begiiming on page 357 
in GENE MANIPULATIONS AND FUNGI; Benett et al, Eds., Academic Press, New 

25 York (1985). In addition to industrial applications, GOD has found applications in 
medicine for quantitative determination of glucose in body fluids recently in 
biotechnology for analyzing syrups from starch and cellulose hydrosylates. This 
application is described in Owusu et ai, Biochem, et Biophysica. Acta, 572:83 (1986), for 
instance. 

30 The main sweetener used in the world today is sugar which comes from sugar beets 

and sugar cane. In the field of industrial enzymes, the glucose isomerase process shows 
the largest expansion in the market today. Initially, soluble enzymes were used and later 
inmiobilized enzymes were developed (Krueger et al^ Biotechnology, The Textbook of 
Industrial Microbiology^ Sinauer Associated Incorporated, Sunderland, Massachusetts 

35 (1990)). Today, the use of glucose- produced high fructose synq>s is by far the largest 
industrial business using immobilized enzymes. A review of the industrial use of these 
enzymes is provided by Jorgensen, Starch 40:301 (1988). 
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Proteinases, such as alkaline serine proteinases, are used as detergent additives and 
thus represent one of the largest volumes of microbial enzymes used in the industrial 
sector. Because of their industrial importance, there is a large body of published and 
uiq>ublished information regarding the use of these enzymes in industrial processes. (See 
5 Faultman et aLy Acid Proteases Structure Function and Biology, Tang, J., ed.. Plenum 
Press, Mew Yoric (1977) and Godfrey et aL, Industrial Enzymes, MacMillan Publishers, 
Surrey, UK (1983) and Hepner et aL, Report Industrial Enzymes by 1990, Hel Hepner & 
Associates, London (1986)). 

Another class of commercially usable proteins of the present invention are the 

10 microbial lipases, described by, for instance, Macrae et aL, Philosophical Transactions of 
the Chiral Society of London 310:227 (1985) and Poserke, Journal of the American Oil 
Chemist Society 61 :\75% (1984). A major use of lipases is in the fat and oil industry for 
the production of neutral glycerides using lipase catalyzed inter-esterification of readily 
available triglycerides. Application of lipases include the use as a detergent additive to 

15 facilitate the removal of fats from fabrics in the course of the washing procedures. 

The use of enzymes, and in particular microbial enzymes, as catalyst for key steps 
in the synthesis of complex organic molecules is gaining popularity at a great rate. One 
area of great interest is the preparation of chiral intermediates. Preparation of chiral 
intermediates is of interest to a wide range of synthetic chemists particularly those 

20 scientists involved with the preparation of new pharmaceuticals, agrochemicals, 

fragrances and flavors. (See Davies et al. Recent Advances in the Generation of Chiral 
Intermediates Using Enzymes^ CRC Press, Boca Raton, Florida (1990)). The following 
reactions catalyzed by enzymes are of interest to organic chemists: hydrolysis of 
caiboxylic acid esters, phosphate esters, amides and nitriles, esterification reactions, 

25 trans-esterification reactions, synthesis of amides, reduction of alkanones and 

oxoalkanates, oxidation of alcohols to carbonyl compounds, oxidation of sulfides to 
sulfoxides, and carbon bond forming reactions such as the aldol reaction. 

When considering the use of an enzyme encoded by one of the ORFs of the 
present invention for biotransformation and organic synthesis it is sometimes necessary 

30 to consider the respective advantages and disadvantages of using a microorganism as 
opposed to an isolated enzyme. Pros alnd cons of using a whole cell system on the one 
hand or an isolated partially purified enzyme on the other hand, has been described in 
detail by Bud et aL, Chemistry in Britain (1987), p. 127. 

Amino transferases, enzymes involved in the biosynthesis and metabolism of 

35 amino acids, are usefiil in the catalytic production of amino acids. The advantages of 
using microbial based enzyme systems is that the amino transferase enzymes catalyze the 
stereo- selective synthesis of only L<>amino acids' and generally possess uniformly high 
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catalytic rates. A description of the use of amino transferases for amino acid production 
is provided by Roselle-David, Methods of Enzymology I3&A19 (1987). 

Another category of useful proteins encoded by the ORFs of the present 
invention include enzymes involved in nucleic acid synthesis, repair, and recombination. 

5 

2. Generation of Antibodies 

As described here, the proteins of the present invention, as well as homologs 
thereof, can be used in a variety of procedures and methods known in the art which are 
currently applied to other proteins. The proteins of the present invoition can further be 

10 used to generate an antibody which selectively binds the protein. 

E, faecalis protein-specific antibodies for use in the present invention can be 
raised against the intact E. faecalis protein or an antigenic polyp^tide fragment thereof, 
which may be presented together with a carrier protein, such as an albumin, to an animal 
system (such as rabbit or mouse) or, if it is long enougih (at least about 25 amino acids), 

15 without a carrier. 

As used herein, the term "antibody** (Ab) or "monoclonal antibody" (Mab) is 
meant to include intact molecules, smgle chain whole antibodies, and antibody fragments. 
Antibody fragments of the present invention include Fab and F(ab*)2 and other fragments 
includiag single-chain Fvs (scFv) and disullide-linked Fvs (sdFv). Also included in the 

20 present invention are chimeric and humanized monoclonal antibodies and polyclonal 
antibodies specific for the polypeptides of the present invention. The antibodies of the 
present invention may be prepared by any of a variety of methods. For example, cells 
eTqpressing a polypeptide of the present invention or an antigenic fragment thereof can 
be administered to an animal in order to induce the production of sera containing 

25 polyclonal antibodies. For example, a preparation of E, faecalis polypeptide or fragment 
thereof is prepared and purified to render it substantially free of natural contaminants. 
Such a prq>aration is then introduced into an animal in order to produce polyclonal 
antisera of greater specific activity. 

In a preferred method, the antibodies of the present invention are monoclonal 

30 antibodies or binding fragments thereof. Such monoclonal antibodies can be prepared 
using hybridoma technology. See, e.g., Harlow et al., ANTIBODIES: A LABORATORY 
MANUAL^ (Cold Spring Harbor Laboratory Press, 2nd ed. 1988); Hammeriing, et al., in: 
MONOCLONAL ANTIBODIES AND T-CELL HYBRIDOMAS 563-681 (BlscviCT, 
N.Y., 1981). Fab and F(ab')2 fragments may be produced by proteolytic cleavage, using 

35 enzymes such as papain (to produce Fab fragments) or pepsin (to produce F(ab*)2 
fragments). Alternatively, E. faecalis polypeptide-binding fragments, chimeric, and 
humanized antibodies can be produced through the application of recombinant DNA 
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technology or through synthetic chemistry using methods known in the art. 

Alternatively^ additional antibodies capable of binding to the polypq)tide antigen 
of the present invention may be produced in a two-stq> procedure through the use of 
anti'idiotypic antibodies. Such a method makes use of the fact that antibodies are 
5 themselves antigens, and that, therefore, it is possible to obtain an antibody which binds 
to a second antibody. In accordance with this method, E. faecalis polypeptide-specific 
antibodies are used to immunize an animal, preferably a mouse. The splenocytes of such 
an animal are then used to produce hybridoma cells, and the hybridoma cells are screened 
to identify clones which produce an antibody whose ability to bind to the E. faecalis 

10 polypeptidc-specific antibody can be blocked by the E, faecalis polypeptide antigen. 

Such antibodies comprise anti-idiotypic antibodies to the E. faecalis polypeptide-specific 
antibody and can be used to immunize an animal to induce formation of further E. 
faecalis polypeptide-^ecific antibodies. 

Antibodies and fragements thereof of the present invention may be described by 

15 the portion of a polypeptide of the present invention recognized or specifically bound by 
the antibody. Antibody binding fragements of a polypq)tide of the present invention 
may be described or specified in the same manner as for polypeptide fragements discussed 
above., i.e, by N-terminal and C-terminal positions or by size in contiguous amino acid 
residues. Any number of antibody binding fragments, of a polypeptide of the present 

20 invention, specified by N-teiminal and C-terminal positions or by size in amino acid 

residues, as described above, may also be excluded from the present invention. Therefore, 
the present invention includes antibodies the specifically bind a particuarlly discribed 
fiagement of a polypqitide of the present invention and allows for the exclusion of the 
same. 

25 Antibodies and fiagements thereof of the present invention may also be described 

or specified in terms of their cross-reactivity. Antibodies and fragements that do not bind 
polypeptides of any other species of Enterococcus other than E. faecalis are included in 
the present invention. Likewise, antibodies and fragements that bind only species of 
Enterococcus y i.e. antibodies and fragements that do not bind bacteria from any genus 

30 other than Enterococcus^ are included in the present invention. 

3. IMagnostic and Detection Assays and Kits 

The present invention further relates to methods for assaying enterococcal 
infection in an animal by detecting the expression of genes encoding enterococcal 
35 polypeptides of the present invention. The methods comprise analyzing tissue or body 
fluid from the animal for Enterococcus-speti&c antibodies, nucleic acids, or proteins. 
Analysis of nucleic acid specific to Enterococcus is assayed by PCR or hybridization 
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techniques using nucleic acid sequences of the present invention as either hybridization 
probes or primers. See, e.g., Sambrook et al. Molecular cloning: A Laboratory Manual 
(Cold Spring Harbor Laboratory Press, 2nd ed., 1989, page 54 reference); Eremeeva et al. 
(1994) J. Clin. Microbiol. 32:803-810 (describing difTerentiation among spotted fever 

5 group Rickettsiae species by analysis of restriction fragment length polymorphism of 
PCR-amplified DNA) and Chen et al. 1994 J. Clin. Microbiol. 32:589-595 (detecting 
B. burgdorferi nucleic acids via PCR). 

Where diagnosis of a disease state related to infection with Enterococcus has 
already been made, the present invention is useful for monitoring progression or 

10 regression of the disease state whereby patients exhibiting enhanced Enterococcus gene 
expression will experience a worse clinical outcome relative to patients expressing these 
gene(s) at a lower level. 

By "biological sample" is intended any biological sample obtained from an animal, 
cell line, tissue culture, or other source which contains Enterococcus polypeptide, mRNA, 

15 or DNA. Biological samples include body fluids (such as saliva, blood, plasma, urine, 
mucus, synovial fluid, etc.) tissues (such as muscle, skin, and cartilage) and any other 
biological source suspected of containing Enterococcus polypeptides or nucleic acids. 
Methods for obtaining biological samples such as tissue are well known in the art. 

The presCTit invention is useiul for detecting diseases related to Enterococcus 

20 infections in animals. Preferred animals include monkeys, apes, cats, dogs, birds, cows, 
pigs, mice, horses, rabbits and humans. Particularly preferred are humans. 

Total UNA can be isolated from a biological sample using any suitable technique 
such as the single-step guanidinium-thiocyanate-phenol-chloroform method described in 
Chomczynski et al. (1987) Anal. Biochem. 162:156-159. mRNA encoding Enterococcus 

IS polypeptides having sufficient homology to the nucleic acid sequences identified in SEQ 
ID NOS: 1-982 to allow for hybridization between con^)lementary sequences are then 
assayed using any appropriate method. These include Northmi blot analysis, SI nuclease 
mapping, the polymerase chain reaction (PCR), reverse transcription in combination 
with the polymerase chain reaction (RT-PCR), and reverse transcription in combination 

30 with the ligase chain reaction (RT-LCR). 

Northern blot analysis can be performed as described in Harada et al. (1990) Cell 
63:303-312. Briefly, total RNA is prepared from a biological sample as described above. 
For the Northern blot, the RNA is doiatured in an appropriate buffer (such as 
glyoxal/dimethyl sulfoxide/sodium phosphate buffer), subjected to agarose gel 

35 electrophoresis, and transferred onto a nitrocellulose filter. After the RNAs have been 
linked to the filter by a UV linker, the filter is prehybridized in a solution containing 
formamide, SSC, Daihardt's solution, denatured salmon sperm, SDS, and sodium 
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phosphate bufTer. A £. y^eca/ii polynucleotide sequence shown in SEQ ID NOS: 1-982 
labeled according to any appropriate method (such as the ^^P-muldprimed DNA labeling 
system (Amersham)) is used as probe. After hybridization overnight, the filter is washed 
and exposed to x-ray film. DNA for use as probe according to the present invention is 
5 described in the sections above and will preferably at least 15 nucleotides in length. 

SI mapping can be performed as described in Fujita et al. (1987) Cell 49:357-367. 
To prepare probe DNA for use in SI mapping, the sense strand of an above-described E, 
faecalis DNA sequence of the present invention is used as a template to synthesize labeled 
antisense DNA. The antisense DNA can then be digested using an appropriate restriction 

10 endonuclease to generate further DNA probes of a desired length. Such antisense probes 
are useful for visualizing protected bands corresponding to the target mRNA (i.e., mRNA 
encoding Enterococcus polypeptides). 

Levels of mRNA encoding Enterococcus polypeptides are assayed, for e,g,^ using 
the RT-PCR method described in Makmo et al. (1990) Technique 2:295-301. By this 

15 method, the radioactivities of the ''amplicons" in the polyacrylamide gel bands are 

linearly related to the initial concentration of the target mRNA. Briefly, this method 
involves adding total RNA isolated from a biological sample in a reaction mixture 
containing a RT primer and appropriate buffer. After incubating for primer annealing, 
the mixture can be supplemented with a RT bufTer, dNTPs, DTT, RNase inhibitor and 

20 reverse transcriptase. After incubation to achieve reverse transcription of the RNA, the 
RT products are then subject to PCR using labeled primers. Alternatively, rafter than 
labeling the primers, a labeled dNTP can be included in the PCR reaction mixture. PCR 
amplification can be performed in a DNA thermal cycler according to conventional 
techniques. After a suitable number of rounds to achieve amplification, the PCR reaction 

25 mixture is electrophoresed on a polyacrylamide gel. After drymg the gel, the 

radioactivity of the sqppropriate bands (corresponding to the mRNA encoding the 
Enterococcus polypeptides of the present invention) are quantified using an imaging 
analyzer. RT and PCR reaction ingredients and conditions, reagent and gel 
concentrations, and labeling methods are well known in the art. Variations on the 

30 RT-PCR method will be apparent to the skilled artisan. Other PCR methods that can 
detect the nucleic acid of the present invention can be found in PCR PRIMER: A 
LABORATORY MANUAL (C.W. Diefifenbach et al. eds., Cold Spring Harbor Lab Press, 
1995). 

The polynucleotides of the present invention, including both DNA and RNA, may 
35 be used to detect polynucleotides of the present invention or Bnterococcal species 

including E, faecalis using bio chip technology. The present invention includes both high 
density tbip arrays (>1000 oligonucleotides per cm^) and low density chip arrays (<1000 
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oligonucleotides per cm^). Bio chips comprising arrays of polynucleotides of the present 
invention may be used to detect Enterococcal species, including E, faecalis, in biological 
and environmental samples and to diagnose an animal, including humans, with an E. 
faecalis or other Enterococcal infection. The bio chips of the presrait invention may 

5 comprise polynucleotide sequences of other pathogens including bacteria, viral, parasitic, 
and iungal polynucleotide sequences, in addition to the polynucleotide sequences of the 
present invention, for use in rapid difTenertial pathogenic detection and diagnosis. The 
bio chips can also be used to monitor an £. faecalis or other Enterococcal infections and 
to monitor the genetic changes (deletions, insertions, mismatches, etc.) in response to 

10 drug therapy in the clinic and drug development in the laboratory. The bio chip 

technology comprising arrays of polynucleotides of the present invention may also be 
used to simultaneously monitor the expression of a multiplicity of genes, including those 
of the present invention. The polynucleotides used to comprise a selected array may be 
specified in the same manner as for the fragements, i.e, by their S' and 3' positions or 

15 length in contigious base pairs and include from. Methods and particular uses of the 
polynucleotides of the present invention to detect Enterococcal species, includiog E, 
faecalis^ using bio chip technology include those known in the art and those of: U.S. 
Patent Nos. 5510270, 5545531, 5445934, 5677195, 5532128, 5556752, 5527681, 
5451683, 5424186, 5607646, 5658732 and Worid Patent Nos. WO/9710365, 

20 WO/9511995, WO/9743447, WO/9535505, each mcorporated herein m their entireties. 

Biosensors using the polynucleotides of the present invention may also be used to 
detect, diagnose, and monitor E, faecalis or other Enterococcal species and infections 
thereof. Biosensors using the polynucleotides of the present invention may also be used 
to detect particular polynucleotides of the present invention. Biosensors using the 

25 polynucleotides of the present invention may also be used to monitor the genetic changes 
(deletions, insertions, mismatches, etc.) in response to drug therapy in the clinic and drug 
development in the laboratory. Methods and particular uses of the polynucleotides of the 
present invention to detect Enterococcal species, including E. faecalis, using biosenors 
mclude those known in the art and those of: U.S. Patent Nos 5721102, 5658732, 

30 5631170, and Worid Patent Nos. WO97/35011, WO/9720203, each incorporated herein 
in their entireties. 

Thus, the present invention includes both bio chips and biosensors comprising 
polynucleotides of Oie present mvention and methods of their use. 

Assaying Enterococcus polypeptide levels in a biological sample can occur using 
35 any art-known method, such as antibody-based techniques. For example, Enterococcus 
polypeptide expression in tissues can be studied with classical immunohistological 
methods, in these, the specific recognition is provided by the primary antibody 
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^ (polyclonal or monoclonal) but the secondaiy detection system can utilize fluorescent, 
enzyme, or other conjugated secondary antibodies. As a result, an immunohistological 
staining of tissue section for pathological examination is obtained. Tissues can also be 
extracted, with urea and neutral detergent, for the liberation of Enterococcus 
5 polypeptides for Western-blot or dot/slot assay. See, e.g., Jalkanen, M. et al. (1985) J. 
Cell. Biol. 101:976-985; Jalkanen, M. et al. (1987) J. Cell . Biol. 105:3087-3096. In this 
technique, which is based on the use of cationic solid phases, quantitation of a 
Enterococcus polypeptide can be accomplished using an isolated Enterococcus 
polypeptide as a standard. This technique can also be applied to body fluids. 

10 Other antibody-based methods useful for detecting Enterococcus polypeptide gene 

expression include immunoassays, such as the ELISA and the radioimmunoassay (RIA). 
For example, a Enterococcus polypeptide*specific monoclonal antibodies can be used 
both as an immunoabsoibent and as an enzyme-labeled probe to detect and quantify a 
Enterococcus polypeptide. The amount of a Enterococcus polypq)tide present in the 

15 sample can be calculated by reference to the amount present in a standard preparation 
using a linear regression computer algorithm. Such an ELISA is described in lacobelli et 
al. (1988) Breast Cancer Research and Treatment 11:19-30. In another ELISA assay, two 
distinct specific monoclonal antibodies can be used to detect Enterococcus polypeptides 
in a body fluid. In this assay, one of the antibodies is used as the immunoabsorbent and 

20 the other as the enzyme-labeled probe. 

The above techniques may be conducted essentially as a "one-step" or "two-step" 
assay. The "one-step** assay involves contacting the Enterococcus polypeptide with 
immobilized antibody and, without washing, contacting the mixture with the labeled 
antibody. The "two-step** assay involves washing before contacting the mixture with the 

25 labeled antibody. Other conventional methods may also be employed as suitable. It is 

usually desirable to immobilize one component of the assay system on a support, thereby 
allowing other components of the system to be brought mto contact with the component 
and readily removed from the sample. Variations of the above and other immunological 
methods included in the present invention can also be found in Harlow et al., 

30 ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd 
ed. 1988), 

Suitable enzyme labels include, for example, those from the oxidase group, which 
catalyze the production of hydrogen peroxide by reacting with substrate. Glucose oxidase 
is particularly preferred as it has good stability and its substrate (glucose) is readily 
35 available. Activity of an oxidase label may be assayed by measuring the concentration of 
hydrogen peroxide formed by the enzyme-labeled antibody/substrate reaction. Besides 
enzymes, other suitable labels include radioisotopes, such as iodine ('^I, ^^'l), carbon 
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('^CX sulphur (^*S), tritium (^H), indium (*"ln), and technetium C^c), and fluorescent 
labels, such as fluorescein and rhodamine, and biotin. 

Further suitable labels for the Enterococcus polypeptide-specific antibodies of the 
present invention are provided below. Examples of suitable enzyme labels include malate 
5 dehydrogenase, Enterococcal nuclease, delta-5-steroid isomerase, yeast-alcohol 

dehydrogenase, alpba-glycerol phosphate dehydrogenase, triose phosphate isomerase, 
peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase, 
ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase, and 
acetylcholine esterase. 

10 Examples of suitable radioisotopic labels include ^H, "'in, '^I, '^'l, ^^P, ^^S, '"^C, 

^•Cr, ^^To, ^«Co, ^^Fe, ^^Se, '"Eu, ^, ^'Cu, ^'^Ci, ^"At, ^'^Pb, ^^Sc, >^Pd, etc. "'in is a 
preferred isotope where in vivo imaging is used since its avoids the problem of 
dehalogenation of the '^^1 or '^*l-labeled monoclonal antibody by the liver. In addition, 
this radionucleotide has a more favorable gamma emission energy for imaging.. See, e.g,, 

15 Parkms et al. (1985) Eur, J. Nucl. Med. 10:296-301; Carasquillo et al. (1987) J. Nucl. 
Med. 28:281-287. For example, '"in coupled to monoclonal antibodies with 
l-(P-isothiocyanatobenzyl)-DPTA has shown little uptake in non-tumors tissues, 
particularly the liver, and therefore enhances speciflcity of tumor localization. See, 
Esteban et al. (1987) J. Nucl. Med. 28:861-870. 

20 Examples of suitable non-radioactive isotopic labels include '^^Gd, ^^Mn, '^^Dy, 

^^r, and "Fe. 

Exanqples of suitable fluorescent labels include an ^^^£u label, a fluorescein label, 
an isothiocyanate label, a rhodamine label, a phycoerythrin label, a phycocyanin label, an 
allophycocyanin label, an o-phthaldehyde label, and a fluorescamine label. 

25 Exanq^les of suitable toxin labels include, Pseudomonas toxin, diphtheria toxin, 

ricin, and cholera toxin. 

Examples of chemiluminescent labels include a luminal label, an isoluminal label, 
an aromatic acridinium est^ label, an imidazole label, an acridinium salt label, an oxalate 
ester label, a luciferin label, a hiciferase label, and an aequorin label. 

30 Examples of nuclear magnetic resonance contrasting agents include heavy metal 

nuclei such as Gd, Mn, and iron. 

Typical techniques for binding the above-described labels to antibodies axe 
provided by Kennedy et al. (1976) Qin. Chim. Acta 70:1-31, and Schuis et al. (1977) 
Clin. Qiim. Acta 81:1-40. Coupling techniques mentioned in the latter are the 

35 glutaraldehyde metibod, the periodate method, the dimaleimide method, the 

m-maleimidobenzyl-N-bydroxy-succinimide ester method, all of which methods are 
incorporated by reference herein. 
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In a related aspect, tbe invention includes a diagnostic kit for use in screening 
serum containing antibodies specific against E. faecalis infection. Such a kit may include 
an isolated £. faecalis antigen comprising an epitope which is specifically 
immunoreactive with at least one anti-£. faecalis antibody. Such a kit also includes 
5 means for detecting the bmding of said antibody to the antigen. In specific embodiments, 
fht kit may include a recombinantly produced or chemically synthesized peptide or 
polypq)tide antigen. The peptide or polypeptide antigen may be attached to a solid 
support. 

In a more specific embodiment, the detecting means of the above-described kit 

10 includes a solid support to which said peptide or polypeptide antigen is attached. Such a 
kit may also include a non-attached reporter-labeled anti-human antibody. In this 
embodiment, binding of the antibody to the E. faecalis antigen can be detected by binding 
of the rq)orter labeled antibody to the anti-£. faecalis polypeptide antibody. 

In a related aspect, die invention includes a method of detecting E, faecalis 

15 infection in a subject This detection method includes reacting a body fluid, preferably 
serum, from the subject with an isolated E. faecalis antigen, and examining the antigen 
for the presence of bound antibody. In a specific embodiment, the method includes a 
polypeptide antigen attached to a solid support, and serum is reacted with the support 
Subsequently, the support is reacted with a reporter-labeled anti-human antibody. The 

20 support is then examined for the presence of reporter-labeled antibody. 

The solid surface reagent employed in the above assays and kits is prq)ared by 
known , techniques for attaching protein material to solid support material, such as 
polymeric beads, dip sticks, 96-well plates or filter material. These attachment methods 
generally include non-specific adsorption of tbe protein to the siq)port or covalent 

25 attachment of the protein , typically through a free amine group, to a chemically 

reactive group on tbe solid support, such as an activated carboxyl, hydroxyl, or aldehyde 
groiq>. Alternatively, streptavidin coated plates can be used in conjunction with 
biotinylated antigen(s). 

The polypqitides and antibodies of the present invention, including fragments 

30 thereof, may be used to detect Enterococcal species including E, faecalis using bio chip 
and biosensor technology. Bio chip aiid biosensors of the present invention may 
conq>rise the polypeptides of the present invention to detect antibodies, which 
specifically recognize Enterococcal species, including E, faecalis. Bio chip and biosensors 
of the present invention may also comprise antibodies which specifically recognize the 

35 polypeptides of the present invention to detect Enterococcal species, including £. 
faecalis or specific polypeptides of the present invmtion. Bio chips or biosensors 
comprising polypeptides or antibodies of the present invention may be used to detect 
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Enterococcal species, including E. faecaliSy in biological and environmental samples and 
to diagnose an animal, including humans, with an E. faecalis or other Enterococcal 
infection. Thus, the present invention includes both bio chips and biosensors comprising 
polypeptides or antibodies of the present invention and methods of their use. 

5 The bio chips of the present invention may further comprise polypeptide 

sequences of other pathogens including bacteria, viral, parasitic, and fungal polypeptide 
sequences, in addition to the polypeptide sequences of the present invention, for use in 
rapid diffenertial pathogenic detection and diagnosis. The bio chips of the present 
invention may further comprise antibodies or firagements thereof specific for other 

10 pathogens including bacteria, viral, parasitic, and fungal polypeptide sequences, in addition 
to the antibodies or firagements thereof of the present invention, for use in ra^id 
difTenertial pathogenic detection and diagnosis. The bio chips and biosensors of the 
present invention may also be used to monitor an £. faecalis or other Enterococcal 
infection and to monitor Oie genetic changes (amio acid deletions, insertions, 

15 substitutions, etc.) in response to drug therapy in the clinic and drug development in the 
laboratory. The bio chip and biosensors comprising polypeptides or antibodies of the 
present invention may also be used to simultaneously monitor the expression of a 
multiplicity of polypeptides, including those of the present invention. The polypeptides 
used to comprise a bio chip or biosensor of the present invention may be specified in the 

20 same manner as for the fragements, i.e, by their N-terminal and C-terminal positions or 
length in contiguous amino acid residue. Methods and particular uses of the polypeptides 
and antibodies of the present invention to detect Enterococcal species, including E, 
faecalis, or specific polypeptides using bio chip and biosensor technology include those 
known in the art, those of the U.S. Patent Nos. and World Patent Nos. listed above for 

25 bio chips and biosensors using polynucleotides of the present invention, and those of: 
U.S. Patent Nos. 5658732, 5135852, 5567301, 5677196, 5690894 and World Patent 
Nos. W09729366, W09612957, each incorporated herein in their entireties. 

4. Screening Assay for Binding Agents 
30 Using the isolated proteins of the present invention, the present invention 

further provides methods of obtaining and identifying agents which bind to a protein 
encoded by one of the ORFs of the present invention or to one of the fragments and the 
Enterococcus faecalis fragment and contigs horein described. 
In general, such methods comprise steps of: 
35 (a) contacting an agent with an isolated protein encoded by one of the ORFs of 

the present invention, or an isolated firagment of the Enterococcus faecalis genome; and 
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(b) determining whether the agent binds to said protein or said fragment 
The agents screened in the above assay can be, but are not limited to, peptides, 
carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be 
selected and screened at random or rationally selected or designed using protein modeling 
5 techniques. 

For random screening, agents such as peptides, carbohydrates, pharmaceutical 
agents and the like are selected at random and are assayed for their ability to bind to the 
protein encoded by the ORF of the present invention. 

Alternatively, agents may be rationally selected or designed. As used herein, an 

10 agent is said to be ''rationally selected or designed"* when the agent is chosen based on the 
configuration of the particular protein. For example, one skilled in the art can readily 
adapt currently available procedures to generate peptides, pharmaceutical agents and the 
like capable of binding to a specific peptide sequence in order to generate rationally 
designed antipeptide peptides, for example see Hurby et al^ "Application of Synthetic 

15 Peptides: Antisense Peptides,** in Synthetic Peptides, A User's Guide^ W. H. Freeman, NY 
(1992), pp. 289-307, and Kaspczak et aL, Biochemistry 25:9230-8 (1989), or 
pharmaceutical agents, or the like. 

In addition to the foregoing, one class of agents of the present invention, as 
broadly described, can be used to control gene expression through binding to one of the 

20 ORFs or EMFs of the present invention. As described above, such agents can be 

randomly screened or rationally designed/selected. Targeting the ORF or EMF allows a 
skilled artisan to design sequence specific or element specific agents, modulating the 
expression of either a single ORF or multiple ORFs which rely on the same EMF for 
expression control. 

25 One class of DNA binding agents are agents which contain base residues which 

hybridize or form a triple helix by binding to DNA or RNA. Such agoits can be based on 
the classic phosphodiester, ribonucleic acid backbone, or can be a variety of sulfiiydryl or 
polymeric derivatives which have base attachment capacity. 

Agents suitable for use in these methods usually contain 20 to 40 bases and are 

30 designed to be complementary to a region of the gene involved in transcription (triple 
helix - see Lee e/ a/., Nucl Acids Res, d:3073 (1979); Cooney et al. Science 241:456 
(1988); and Dervan et al.. Science 257.1360 (1991)) or to the mRNA itself (antisense - 
Okano, J, Neurochem. 55:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of 
Gene Expression, CRC Press, Boca Raton, FL (1988)). Triple helbc- formation optimally 

35 results in a shut-off of RNA transcription from DNA, while antisense RNA hybridization 
blocks translation of an mRNA molecule into polypeptide. Both techniques have been 
demonstrated to be effective in model systems. Information contained in the sequences 
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of the present invention can be used to design antisense and triple helix-forming 
oligonucleotides, and other DNA binding agents. 

5. Pharmaceutical Compositions and Vaccines 

S The present invention further provides pharmaceutical agents which can be used 

to modulate the growth or pathogenicity of Enterococcus faecaliSy or another related 
organism, in vivo or in vitro. As used herein, a "pharmaceutical agent" is defined as a 
composition of matter which can be formulated using known techniques to provide a 
pharmaceutical compositions. As used herein, the "pharmaceutical agents of the present 

10 invention" refers the pharmaceutical agents which are derived from the proteins encoded 
by die ORFs of the present invention or are agents which are identified using the herein 
described assays. 

As used herein, a phannaceutical agent is said to **modulate the growth and/or 
pathogenicity of Enterococcus faecaiis or a related organism, in vivo or in vitro" when 

IS the agent reduces the rate of growth, rate of division, or viability of the organism in 

question. The pharmaceutical agents of the present invention can modulate the growth 
or pathogenicity of an organism in many fashions, although an understanding of the 
underlying mechanism of action is not needed to practice the use of the pharmaceutical 
agents of the present invention. Some agents will modulate the growth by binding to an 

20 important protein thus blocking the biological activity of the protein, while other agents 
may bind to a component of the outer surface of the organism blocking attachment or 
rendering the organism more prone to act the bodies nature immune system. 
Alternatively, the agent may comprise a protein encoded by one of the ORFs of the 
present invention and serve as a vaccine. The development and use of a vaccine based on 

25 outer membrane componrats are well known in the art 

As used herein, a ^'related organism** is a broad term which refers to any organism 
whose growth can be modulated by one of the pharmaceutical agents of the present 
invention. In general, such an organism will contain a homolog of the protein which is 
the target of the pharmaceutical agent or the protein used as a vaccine. As such, related 

30 organisms do not need to be bacterial but may be fungal or viral pathogens. 

The phannaceutical agents and compositions of the present invention may be 
administered in a convenient manner, such as by the oral, topical, intravenous, 
intraperitoneal, intramuscular, subcutaneous, intranasal or intradermal routes. The 
pharmac^tical compositions are administered in an amount which is effective for 

35 treating and/or prophylaxis of the specific indication. In general, they are administered 
in an amount of at least about 1 mg/kg body weight and in most cases they will be 
administered in an amount not in excess of about 1 g/kg body weight per day. In most 
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cases, the dosage is from about 0.1 mg/kg to about 10 gAcg body weight daily, taking into 
account the routes of administration, symptoms, etc. 

The agents of the present invention can be used in native form or can be modified 
to form a cbonical dmvative. As used herein, a molecule is said to be a "chemical 

5 derivative" of another molecule when it contains additional chemical moieties not 

normally a part of the molecule. Such moieties may improve the molecule's solubility, 
absorption, biological half life, etc. The moieties may alternatively decrease the toxicity 
of the molecule, eliminate or attenuate any undesirable side effect of the molecule, etc. 
Moieties capable of mediating such effects are disclosed in, among other sources, 

1 0 REMINGTON»S PHARMACEUTICAL SCIENCES (1 980) cited elsewhere herein. 

For example, such moieties may change an immunological character of the 
functional derivative, such as afimity for a given antibody. Such changes in 
immunomodulation activity are measured by the appropriate assay, such as a competitive 
type immunoassay. Modifications of such protem properties as redox or therma] 

IS stability, biological half-life, hydrophobicity, susceptibility to proteolytic degradation or 
the tendency to aggregate with cairiers or into multimers also may be effected in this way 
and can be assayed by methods well known to the skilled artisan. 

The therapeutic effects of the agents of the present invention may be obtained by 
providing the agent to a patient by any suitable means (e.g., inhalation, intravenously, 

20 intramuscularly, subcutaneously, enterally, or parenterally). It is preferred to administer 
the agent of the present invention so as to achieve an effective concentration within the 
blood or tissue in which the growth of the organism is to be controlled. To achieve an 
effective blood concentration, the preferred method is to administer the agent by 
injection. The administration may be by continuous infusion, or by single or multiple 

25 injections. 

In providing a patient with one of the agmXs of the present invention, the dosage 
of the administered agent will vary depending upon such factors as the patienfs age, 
weight, height, sex, general medical condition, previous medical history, etc. In general, 
it is desirable to provide the recipient with a dosage of agent which is in the range of from 

30 about 1 pg/kg to 10 mg/kg (body weight of patient), although a lower or higher dosage 
may be administered. The therapeutically effective dose can be lowered by using 
combinations of the agents of the present invention or another agent. 

As used herein, two or more compounds or agents are said to be administered "in 
combination" with each other when either (1) the physiological effects of each 

35 compound, or (2) the serum concentrations of each compound can be measured at the 
same time. The composition of the present invention can be administered concurrendy 
with, prior to, or following the administration of the other agent 
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The agents of the present invention are intended to be provided to recipient 
subjects in an amount sufTicioit to decrease the rate of grow& (as defined above) of the 
target organism. 

The administration of the agent(s) of the invention may be for either a 
5 "prophylactic" or **thenipeutic" purpose. When provided prophylactically, the agent(s) 
are provided in advance of any symptoms indicative of the organisms growth. The 
prophylactic administration of the agent(s) serves to prevent, attenuate, or decrease the 
rate of onset of any subsequent infection. When provided therapeutically, the agcnt(s) 
are provided at (or shortly after) the onset of an indication of infection. The therapeutic 

10 administration of the compound(s) serves to attenuate the pathological symptoms of the 
infection and to increase the rate of recovery. 

The agents of the present invention are administered to a subject, such as a 
mammal, or a patient, in a pharmaceutically acceptable form and in a therapeutically 
effective concentration. A composition is said to be "pharmacologically acceptable" if 

15 its administration can be tolerated by a recipient patient Sudi an agent is said to be 
administered in a "therapeutically effective amount" if the amount administered is 
physiologically significant An agent is physiologically significant if its presence results 
in a detectable change in the physiology of a recipient patient 

The agents of the present invention can be formulated according to known 

20 methods to prepare pharmaceutically useful compositions, whereby these materials, or 

their functional derivatives, are combined in a mixture with a pharmaceutically acceptable 
parrier vehicle. Suitable vehicles and their formulation, inclusive of other human 
proteins, e.g., human serum albumin, are described, for example, in REMINGTON'S 
PHARMACEUTICAL SCIENCES, 16th Ed., Osol, A., Ed., Mack Publishing, Easton PA 

25 (1980). In order to form a pharmaceutically acceptable composition suitable for 

effective administration, such compositions will contain an effective amount of one or 
more of the agents of the present invention, together with a suitable amount of carrier 
vehicle. 

Additional pharmaceutical methods may be employed to control the duration of 
30 action. Control release preparations may be achieved through the use of polymers to 
complex or absorb one or more of the agents of the present invention. The controlled 
delivery may be effectuated by a variety of well known techniques, including formulation 
with macromolecules such as, for example, polyesters, polyamino acids, polyvinyl, 
pyrrolidone, ethylenevinylacetate, methylcelhilose, carboxymethylcellulose, or 
35 protamine, sulfate, adjusting the concentration of the macromolecules and the agent in 
the formulation, and by sq^propriate use of methods of incorporation, which can be 
manipulated to effectuate a desired time course of release. Another possible method to 
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control the duration of action by controlled release preparations is to incorporate agents 
of the present invention into particles of a polymeric material such as polyesters, 
polyamino acids, hydrogels, poly(lactic acid) or ethylene vinylacetate copolymers. 
Alternatively, instead of incorporating these agents into polymeric particles, it is possible 

5 to entrap these materials in microcapsules prepared^ for example, by coacervation 

techniques or by interfacial polymerization with, for exan^le, hydroxymcthylcellulose or 
gelatiDe-microcq)sules and poly(methylmethacylate) microcapsules, respectively, or in 
colloidal drug delivery systems, for example, liposomes, albumin microspheres, 
microemulsions, nanoparticles, and nanocapsules or in macroemulsions. Such techniques 

10 are disclosed in REMINGTON'S PHARMACEUTICAL SCIENCES (1 980). 

The invention further provides a pharmaceutical pack or kit comprising one or 
more containers filled with one or more of the ingredients of the pharmaceutical 
compositions of the invention. Associated with such container(s) can be a notice in the 
form prescribed by a governmental agency regulating the manufacture, use or sale of 

15 pharmaceuticals or biological products, which notice reflects approval by the agency of 
manufacture, use or sale for human administration. 

In addition, the agents of the present invention may be employed in conjunction 
with other therapeutic compounds. 

The present invention also provides vaccines comprising one or more 

20 polypeptides of the present invention. Heterogeneity in the composition of a vaccine 
may be provided by combining E. faecalis polypeptides of the present invention. 
Multi-component vaccines of this type are desirable because they are likely to be more 
effective in eliciting protective immune responses against multiple species and strains of 
the Enterococcus genus than single polypeptide vaccines. 

25 Multi-component vaccines are known in the art to elicit antibody production to 

numerous immunogenic conqponents. See, e.g.. Decker et al. (1996) J. Infect Dis. 
174:S270-275. In addition, a hepatitis B, diphtheria, tetanus, pertussis tetravalent 
vaccine has recently been demonstrated to elicit protective levels of antibodies in human 
infants against all four pathogenic agents. See, e,g., Aristegui, J. et al. (1997) Vaccine 

30 15:7-9. 

The present invention in addition to single-component vaccines includes 
multi-component vaccines. These vaccines comprise more than one polypeptide, 
immunogen or antigen. Thus, a multi-conqwnent vaccine would be a vaccine comprising 
more than one of the £. faecalis polypeptides of the present invention. 
35 Farther within the scope of the invention are whole cell and whole viral vaccines. 

Such vaccines may be produced recombinantly and involve the expression of one or more 
of the E. faecalis polypq[>tides described in SEQ ID NOS: 1-982. For example, the E, 
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faecalis polypeptides of the present invention may be either secreted or localized 
intracellular, on the cell surface, or in the periplasmic space. Further, when a 
recombinant virus is used, the E. faecalis polypeptides of the present invention may, for 
exanq)le, be localized in the viral envelope, on the surface of the capsid, or intenially 

5 wiAin the capsid. Whole cells vaccines which employ cells expressing heterologous 
proteins are known in the art. See, e,g.y Robinson, K. et al. (1997) Nature Biotech. 
15:653-657; Sirard, J. et al. (1997) Infect. Immun. 65:2029-2033; Chabalgoity, J. et al. 
(1997) Infect. Immun. 65:2402-2412 . These cells may be administered live or may be 
killed prior to administration. Chabalgoity, J. et al., supra, for example, report the 

10 successful use in mice of a live attenuated Salmonella vaccine strain which expresses a 
portion of a platyhelminth fatty acid-binding protein as a fusion protein on its cells 
surface. 

A multi-component vaccine can also be prepared using techniques known in the - 
art by combining one or more £. faecalis polypeptides of the present invention, or 

IS fragments thereof, with additional non-Enterococcal conq>onents (e.g., diphtheria toxin 
or tetanus toxin, and/or other compounds known to elicit an immune response). Such 
vaccines are useful for eliciting protective immune req>onses to both members of the 
Enterococcus genus and non-Enterococcal pathogenic agents. 

The vaccines of the present invention also include DNA vaccines. DNA vaccines 

20 are currently being developed for a number of infectious diseases. See, et al, Boyer, et al. 
(1997) Nat. Med. 3:526-532; reviewed in Spier, R. (1996) Vaccine 14:1285-1288. Such 
DNA vaccines contain a nucleotide sequence encoding one or more E. faecalis 
polypeptides of the present invention oriented in a manner that allows for expression of 
the subject polypq>tide. For example, the direct administration of plasmid DNA 

25 encoding burgdorgeri OspA has been shown to elicit protective immunity in mice 
against bonelial challenge. See, Luke et al. (1997) J. Infect Dis. 175:91-97. 

The present invention also relates to the administration of a vaccine which is 
co-administered with a molecule capable of modulating immune responses. Kim et al. 
(1997) Nature Biotech. 15:641-646, for example, report the enhancement of immune 

30 responses produced by DNA immunizations when DNA sequences encoding molecules 
which stimulate the immune response are co-administered. In a similar fashion, the 
vaccines of the present invention may be co-administered with either nucleic acids 
encoding immune modulators or the immune modulators themselves. These immune 
modulators include granulocyte macrophage colony stimulating factor (GM-CSF) and 

35 CD86. 

The vaccines of the present invention may be used to confer resistance to 
Enterococcal infection by eithor passive or active immunization. When the vaccines of 
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the present invention are used to confer resistance to Enterococcal infection through 
active immunization, a vaccine of the present invention is administered to an animal to 
elicit a protective inmiune response which either prevents or attenuates a Enterococcal 
infection. When the vaccines of the present invention are used to confer resistance to 
5 Enterococcal infection through passive immunization, the vaccine is provided to a host 
animal {e.g., human, dog, or mouse), and the antisera elicited by this antisera is recovered 
and directly provided to a recipient suspected of having an infection caused by a member 
of the Enterococcus genus. 

The ability to label antibodies, or fragments of antibodies, with toxin molecules 

10 provides an additional method for treating Enterococcal infections when passive 

immunization is conducted. In this embodiment, antibodies, or fragments of antibodies, 
capable of recognizing the E.faecalis polypeptides disclosed herein, or fragments thereof, 
as well as other Enterococcus proteins, are labeled with toxin molecules prior to their 
administration to the patient When such toxin derivatized antibodies bind to 

15 Enterococcus cells, toxin moieties will be localized to these cells and will cause their 
death. 

The present invention thus concerns and provides a means for preventing or 
attenuating a Enterococcal infection resulting from organisms which have antigens that 
are recognized and bound by antisera produced in response to the polypeptides of the 

20 present invention. As used herein, a vaccine is said to prevent or attenuate a disease if its 
administration to an animal results either in the total or partial attenuation (i.e., 
suppression) of a symptom or condition of the disease, or in the total or partial immunity 
of the animal to the disease. 

The administration of the vaccine (or the antisera which it elicits) may be for 

25 either a "prophylactic" or "therapeutic** purpose. When provided prophylactically, the 
compound(s) are provided in advance of any symptoms of Entmcoccal infection. The 
prophylactic administration of the compound(s) serves to prevent or attenuate any 
subsequent infection. When provided therapeutically, the con^ound(s) is provided upon 
or after the detection of symptoms which indicate that an animal may be infected with a 

30 member of the Enterococcus genus. The therapeutic administration of the compound(s) 
serves to attenuate any actual infection. Thus, the E. faecalis polypeptides, and 
fragments thereof, of the present invention may be provided either prior to the onset of 
infection (so as to prevent or attenuate an anticipated infection) or after the initiation of 
an actual infection. 

35 The polypeptides of the invention, whether encoding a portion of a native 

protein or a functional derivative thereof, may be administered in pure form or may be 
coupled to a macromolecular carrier. Example of such carriers are proteins and 
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carbohydrates. Suitable proteins which may act as macromolecular carrier for enhancing 
the immunogenicity of the polypeptides of the present invention include keyhole limpet 
hemac^anin (KLH) tetanus toxoid, pertussis toxin, bovine serum albumin, and ovalbumin. 
Methods for coupling the polypeptides of the present invention to such macromolecular 

5 carriers are disclosed in Hariow et al., ANTIBODIES: A LABORATORY MANUAL, 
(Cold Spring Harbor Laboratory Press, 2nd ed. 1988), 

A composition is said to be "pharmacologically or physiologically acceptable'' if 
its administration can be tolerated by a recipient animal and is otherwise suitable for 
administration to that animal. Such an agent is said to be administered in a 

10 "therapeutically efTective amount" if the amount administered is physiologically 

significant An agent is physiologically significant if its presence results in a detectable 
change in the physiology of a recipient patient. 

While in all instances the vaccine of the present invention is administered as a 
pharmacologically acceptable compound, one skilled in the art would recognize that the 

IS composition of a pharmacologically acceptable compound varies with the animal to 
which it is administered. For example, a vaccine intended for human use will generally 
not be co-administered with Freund's adjuvant. Further, the level of purity of the E, 
faecalis polypeptides of the present invention will normally be higher when administered 
to a human than when administered to a non-human animal. 

20 As would be understood by one of ordinary skill in the art, when the vaccine of 

the present invention is provided to an animal, it may be in a composition which may 
contain salts, buffers, adjuvants, or other substances which are desirable for improving the 
efficacy of the composition. Adjuvants are substances that can be used to specifically 
augment a q>ecific immune response. These substances generally perform two functions: 

25 (1) they protect the antigen(s) firom being rapidly catabolized after administration and (2) 
they nonspecifically stimulate immune responses. 

Normally^ the adjuvant and the composition are mixed prior to presentation to 
the inmiune system, or presented sq>arately, but into the same site of the animal being 
immunized. Adjuvants can be loosely divided into several groups based upon their 

30 composition. These groups include oil adjuvants (for example, Freund's complete and 
irKomplete), mineral salts (for example, A1K(S04)2, AlNa(S04)2, AINH4(S04), silica, 
kaolin, and carbon), polynucleotides (for example, poly IC and poly AU acids), and 
certain natural substances (for example, wax D from Mycobacteriwn tuberculosis^ as well 
as substances found in Corynebacterium panmnty or Bordetella pertussis^ and members of 

35 the genus Brucella, Other substances useful as adjuvants are the ssqionins such as, for 
example, Quil A. (Superfos A/S, Denmark). Preferred adjuvants for use in the present 
invention inchide ahmunum salts, such as A1K(S04)2, AINa(S04)2, and AINH4(S04). 
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Examples of materials suitable for use in vaccine compositions are provided in 
REMINGTON'S PHARMACEUTICAL SCIENCES 1324-1341 (A. Osol, ed, Mack 
Publishing Co, Easton, PA, (1980) (incorporated herein by reference). 

The therapeutic compositions of the present invention can be administered 
5 parenterally by injection, rapid infusion, nasophaiyngeal absorption 

(intranasopharangeally), dermoabsorption, or orally. The compositions may 
alternatively be administered intramuscularly, or intravenously. Compositions for 
parenteral administration include sterile aqueous or non-aqueous solutions, suspensions, 
and emulsions. Examples of non-aqueous solvents are propylene glycol, polyethylene 

10 glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. 
Carriers or occlusive dressings can be used to increase skin permeability and enhance 
antigen absoiption. Liquid dosage forms for oral administration may generally comprise a 
liposome solution containing the liquid dosage form. Suitable forms for suspending 
liposomes include emulsions, suspensions, solutions, syrups, and elixirs containing inert 

15 dihients commonly used in the art, such as purified water. Besides the inert diluents, such 
conq)05itions can also include adjuvants, wetting agents, emulsifying and suspending 
agents, or sweetening, flavoring, or perfuming agents. 

Therapeutic compositions of the present invention can also be administered in 
encapsulated form. For example, intranasal immunization using vaccines encapsulated in 

20 biodegradable microsphere composed of poly(DL-lactide-co-glycolide). See^ Shahin, R. et 
al. (1995) Infect. Immun. 63:1195-1200. Similarly, orally administered encapsulated 
Salmonella typhimurium antigens can also be used. Allaoui-Attaiki, K. et al. (1997) 
Infect Immun. 65:853-857. Encapsulated vaccines of the present invention can be 
administered by a variety of routes including those involving contacting the vaccine with 

25 mucous membranes (eg., intranasally, intracolonicly, intraduodoially). 

Many different techniques exist for the timing of the immunizations when a 
multiple administration regimen is utilized. It is possible to use the compositions of the 
invention more than once to increase the levels and diversities of expression of the 
immunoglobulin repertoire expressed by the immunized animal. Typically, if multiple 

30 immunizations are given, they will be given one to two months apart 

According to the present invention, an "effective amoimt" of a ther^eutic 
conqposition is one which is sufficient to achieve a desired biological effect Generally, 
the dosage needed to provide an effective amount of the composition will vary depending 
upon such factors as the animal's or human*s age, condition, sex, and extent of disease, if 

35 any, and other variables vdiich can be adjusted by one of ordinary skill in the art 

The antigenic preparations of the invention can be administered by either single 
or multiple dosages of an effective amount. Effective amounts of the compositions of 
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the invention can vary from 0.01-1,000 )ig/ml per dose, more preferably 0.1-500 (ig/ml 
per dose, and most preferably 10-300 (ig/ml per dose. 

6, Sbot-Gun Approach to Megabase DNA Sequencing 

5 The present mvention fuither demonstrates that a large genome can be sequenced 

using a random shotgun approach. This procedure, described in detail in the examples 
that follow, has eliminated the up front cost of isolating and ordering overlapping or 
contiguous subclones prior to the start of the sequencing protocols. 

Certain aspects of the present invention are described in greater detail in the 

10 examples that follow. The examples are provided by way of illustration. Other aspects 
and embodiments of the present invention are contemplated by the inventors, as will be 
clear to those of skill in the art from reading the present disclosure. 

ILLUSTRATIVE EXAMPLES 

15 

LIBRARIES AND SEQUENCING 

L Shotgun Sequencing Probability Analysis 

The overall strategy for a shotgun approach to whole goiome sequencing follows 
from the Lander and Waterman (Landerman and Waterman, Genomics 2:231 (1988)) 

20 application of the equation for the Poisson distribution. According to this treatment, the 
probability, PO, that any given base in a sequence of size L, in nucleotides, is not 
sequenced after a certain amount, n, in nucleotides, of random sequence has been 
determined can be calculated by the equation PO - e-m, where m is L/n, the fold 
coverage. For instance, for a genome of 2.8 Mb, m=l when 2.8 Mb of sequence has been 

25 randomly generated (IX coverage). At that point, PO = e-1 = 0.37. The probability that 
any given base has not been sequenced is the same as the probability that any region of 
the whole sequence L has not been determined and, dierefore, is equivalent to the fraction 
of the whole sequence that has yet to be deteimined. Thus, at one-fold coverage, 
approximately 37% of a polynucleotide of size L, in nucleotides has not been sequenced. 

30 When 14 Mb of sequence has been generated, coverage is SX for a 2.8 Mb and the 

unsequenced fraction drops to .0067 or 0.67%. 5X coverage of a 2.8 Mb sequence can be 
attained by sequencing ^proximately 17,000 random clones from both insert ends with 
an average sequence read length of 410 bp. 

Similarly, the total gap length, G, is detemuned by the equation G = Le-m, and the 

35 average gap size, g, follows the equation, g = Un, Thus, 5X coverage leaves about 240 
gaps averaging about 82 bp in size in a sequence of a polynucleotide 2.8 Mb long. 
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The treatment above is essentially that of Lander and Waterman, Genomics!: 
231 (1988). 

2. Random Library Construction 

S In order to approximate the random model described above during actual 

sequencing, a nearly ideal library of cloned genomic fragments is required. The following 
libraiy construction procedure was developed to achieve this end. 

Enterococcus faecalis DNA is prepared by phenol extraction. A mixture 
containing 200 jig DNA in 1.0 ml of 300 mM sodixim acetate, 10 mM Tris-HCl, 1 mM 

10 Na-EDTA, 50% glycerol is processed through a nebulizer (IPI Medical Products) with a 
stream of nitrogen adjusted to 35 Kpa for 2 minutes. The sonicated DNA is ethanol 
precipitated and redissolved in 500 TE buffer. 

To create blunt-ends, a 100 pi aliquot of the resuspended DNA is digested with 5 
units of BAL31 nuclease (New England BioLabs) for 10 min at SO^'C in 200 ^l BAL31 

15 buffer. The digested DNA is phenol-extracted, ethanol-precipitated, redissolved in 100 pi 
TE buffer, and then size-fractionated by electrophoresis through a 1.0% low melting 
tenq>erature agarose gel. The section containing DNA fragments 1.6-2.0 kb in size is 
excised from the gel, and the LGT agarose is melted and the resulting solution is extracted 
with phenol to separate the agarose from the DNA. DNA is ethanol precipitated and 

20 redissolved in 20 pi of TE buffer for ligation to vector. 

A two-step ligation procedure is used to produce a plasmid library with 97% 
inserts, of which >99% were single inserts. The first ligation mixture (50 ul) contains 2 
pg of DNA fragments, 2 p,g pUC18 DNA (Pharmacia) cut with Smal and 
dephosphoiylated with bacterial alkaline phosphatase, and 10 units of T4 ligase 

25 (GIBCO/BRL) and is incubated at 14^C for 4 hr. The ligation mixture tiien is phenol 
extracted and ethanol precipitated, and the precipitated DNA is dissolved in 20 ^1 TE 
buffer and electrophoresed on a 1.0% low melting agarose gel. Discrete bands in a ladder 
are visualized by ethidium bromide-staining and UV illumination and identified by size as 
insert (I), vector (v), v+I, v+2i, v+3i, etc. The portion of the gel containing v+I DNA is 

30 excised and the v+I DNA is recovered and resuspended into 20 pi TE. The v+I DNA then 
is blunt-ended by T4 polymerase treatment for 5 min. at 37**C in a reaction mixture (50 
ul) containing the v+I linears, 500 pM each of the 4 dNTPs, and 9 units of T4 
polymerase (New England BioLabs), under recommended buffer conditions. After phenol 
extraction and ethanol precipitation the repaired v+I linears are dissolved in 20 pi TE. 

35 The final Ugation to produce circles is carried out in a 50 pJ reaction containing 5 pi of 
v+I Unears and 5 units of T4 ligase at M'^C overnight. After 10 min. at 70**C the 
following day, the reaction mixture is stored at -20*'C. 
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This two-stage procedure results in a molecularly random collection of single- 
insert plasmid recombinants with minimal contamination from double-insert chimeras 
(<1%) or free vector (<3%). 

Since deviation from randomness can arise from propagation the DNA in the 

5 host, £. coli host cells deficient in all recombination and restriction functions (A. 

Greener, Strategies 3 (1):S (1990)) are used to prevent rearrangements, deletions, and loss 
of clones by restriction. Furthermore, transformed cells are plated directly on antibiotic 
diffusion plates to avoid the usual broth recovery phase which allows multiplication and 
selection of the most rapidly growing cells. 

10 Plating is carried out as follows. A 100 fil aliquot of Epicurian Coli SURE II 

Supercompetent Cells (Stratagene 200152) is thawed on ice and transferred to a chilled 
Falcon 2059 tube on ice. A 1.7 |il aliquot of 1.42 M beta-mercaptoethanol is added to the 
aliquot of cells to a final coxK:entration of 25 mM. Cells are incubated on ice for 10 min. 
A 1 |il aliquot of the final ligation is added to the cells and incubated on ice for 30 min. 

15 The cells are heat pulsed for 30 sec. at 42'^C and placed back on ice for 2 min. The 
outgrowth period in liquid culture is eliminated from diis protocol in order to minimize 
the preferential growth of any given transformed cell. Instead the transformation 
mixture is plated directly on a nutrient rich SOB plate containing a 5 ml bottom layer of 
SOB agar (5% SOB agar 20 g tryptone, 5 g yeast extract, 0.5 g NaCl, 1.5% Difco Agar 

20 per liter of media). The 5 ml bottom layer is supplemented with 0.4 ml of 50 mg/ml 

ampicillin per 100 ml SOB agar. The 15 ml top layer of SOB agar is supplemented with 1 
ml X-Gal (2%), 1 ml MgC12 (1 M), and 1 ml MgSO4/100 ml SOB agar. The 15 ml top 
layer is poured just prior to plating. Our titer is approximately 100 colonies/10 (il aliquot 
of transformation. 

25 All colonies are picked for template prq)aration regardless of size. Thus, oidy 

clones lost due to ''poison'* DNA or deleterious gene products are deleted from the library, 
resulting in a slight increase in gap number over that ^ected. 

3. Random DNA Sequencing 

30 High quality double stranded DNA plasmid templates are prepared using a "boiling 

bead" method developed in collaboration with Advanced Genetic Technology Corp. 
(Gaithersburg, MD) (Adams et aJ,, Science 252:1651 (1991); Adams et al.. Nature 
355:632 (1992)). Plasmid preparation is poformed in a 96-well format for all stages of 
DNA preparation from bacterial growth through final DNA purification. Template 

35 concentration is determined using Hoechst Dye and a Millipore Cytofhior. DNA 

concentrations are not adjusted, but low-yielding templates are idraitified where possible 
and not sequenced. 
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Templates are also prepared from an Enterococcus faecalis lambda genomic 
library in the vector DASH n (Stratagene). In particular, Enterococcus faecalis DNA (> 
100 kb) is partially digested in a reaction mixture (200 ul) containing SO \ig DNA, IX 
Sau3AI buffer, 20 units Sau3AI for 6 min. at 23''C. The digested DNA was phraiol- 

S extracted and fractionated by sucrose density gradient centriiugation. Fractions of the 
sucrose gradient containing 15 to 25 kb are recovered in a final volume of 6 ul. One )jJ 
of fragments is used with 1 p.1 of lambda DASHII vector (Stratagene) in the recommended 
ligation reaction. One ^1 of the ligation mixture is used per packaging reaction following 
the recommended protocol with the Gig^ack II XL Packaging Extract (Stratagene, 

10 #22771 1). Phage are plated directly without amplification from the packaging mixture 
(after dilution with SOO )il of recommended SM buffer and chloroform treatment). Yield 
is about 2.5x103 pfu/ul. An amplified library is prepared by infecting restructure NM539 
host £. coli cells eitn approximately 1x104 phage particles and recovering the progeny 
phages particles. The recovered phage is stored frozen in 7% dimethylsulfoxide. The 

IS phage titer is approximately 1x109 pfii/ml. 

For high throughput sequencing of individual lambda phage clones, liquid lysates 
(100 pJ) are prepared from randomly selected plaques (from the unamplified library) and 
template is prepared by long-range PCR using T7 and T3 vector-specific primers. 

Sequencing reactions are carried out on plasmid and/or PCR templates using the 

20 AB Catalyst LabStation with Applied Biosystems PRISM Ready Reaction Dye Primer 
Cycle Sequencing Kits for the M13 forward (M13-21) and the M13 reverse (M13RP1) 
primers (Adams et al.^ Nature 368:474 (1994)). Dye terminator sequencing reactions are 
carried out on the lambda ten^lates on a Peikin-Elmer 9600 Thermocycler using the 
Applied Biosystems Ready Reaction Dye Terminator Cycle Sequencing kits. T7 and T3 

25 primers are used to sequence the ends of the inserts from the Lambda DASH II library. 
Sequencing reactions are performed by eight individuals using an average of fourteen AB 
373 DNA Sequencers per day. All sequencing reactions are analyzed using the Stretch 
modification of the AB 373, primarily using a 34 cm well-to-read distance. The overall 
sequencing success rate very ^proximately is about 85% for M13-21 and M13RP1 

30 sequences and 65% for dye-terminator reactions. The average usable read length is 485 
bp for M13-21 sequences, 445bp for M13RP1 sequences, and 375 bp for dye-terminator 
reactions. 

Ridiards et aL Chapter 28 in AUTOMATED DNA SEQUENCING AND 
ANALYSIS, M. D. Adams, C. Fields, J. C. Venter, Eds., Academic Press, London, (1994) 
35 described the value of using sequence from both ends of sequencing templates to facilitate 
ordering of contigs in shotgun assembly projects of lambda and cosmid clones. We 
balance the desirability of both-end sequencing (including the reduced cost of lower total 
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number of templates) against shorter read-lengths for sequencing reactions performed 
with the M13RP1 (reverse) primer compared to the M13-21 (forward) primer. 
Approximately one-half of the templates are sequenced from both ends. Random reverse 
sequencing reactions are done based on successful forward sequencing reactions. Some 
5 M13RP1 sequences are obtained in a semi-directed fashion: Ml 3-21: sequences pointing 
outward at the ends of contigs are chosen for M13RP1 sequencing in an efTort to 
specifically order contigs. 

4. Protocol for Automated Cycle Sequencing 

10 . The sequencing was carried out using ABI Catalyst robots and AB 373 Automated 

DNA Sequencers. The Catalyst robot is a publicly available sophisticated pipetting and 
temperature control robot which has been developed specifically for DNA sequencing 
reactions. The Catalyst combines pre-aliquoted templates and reaction mixes consisting 
of deoxy- and dideoxynucleotides, the thermostable Taq DNA polymerase, fluorescently- 

15 labelled sequencing primers, and reaction buffer. Reaction mixes and templates are 

combined in the wells of an aluminum 96-well thermocycling plate. Thirty consecutive 
cycles of linear amplification (i.e.., one primer synthesis) steps are performed including 
denaturation, annealing of primer and template, and extension; i.e., DNA synthesis. A 
heated lid with rubber gaskets on the thermocycling plate prevents evaporation without 

20 the need for an oil overiay. 

Two sequencing protocols are used: one for dye-labelled primers and a second for 
dye-labelled dideoxy chain terminators. The shotgun sequencing involves use of four dye- 
labelled sequencing primers, one for each of die four terminator nucleotide. Each dye- 
primer is labelled with a different fluorescent dye, permitting the four individual reactions 

25 to be combined into one lane of the 373 DNA Sequencer for electrophoresis, detection, 
and base-calling. ABI currently supplies pre-mixed reaction mixes in bulk packages 
containing all the necessary non-template reagents for sequencing. Sequencing can be 
done with both plasmid and PCR- generated templates with both dye-primers and dye- 
terminators with approximately equal fidelity, although plasmid templates generally give 

30 longer usable sequences. 

Thirty-two reactions are loaded per AB373 Sequencer each day, for a total of 960 
samples. Electrophoresis is run overnight following the manufacturer's protocols, and the 
data is collected for twelve hours. Following electrophoresis and fluorescence detection, 
the ABI 373 performs automatic lane tracking and base-caltmg. The lane-tracking is 

35 confirmed visually. Each sequence electropherogram (or fluorescence lane trace) is 

inspected visually and assessed for quality. Trailing sequences of low quality are removed 
and the sequence itself is loaded via software to a Sybase database (archived daily to Snmi 
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tape). Leading vector polylinker sequence is removed automatically by a software 
program. Average edited lengths of sequences from the standard ABI 373 are around 400 
bp and depend mostly on the quality of the template used for the sequencing reaction. 
ABI 373 Sequencers converted to Stretch Liners provide a longer electrophoresis path 
5 prior to fluorescence detection and increase the average number of usable bases to 500- 
600 bp. 

INFORMATICS 

1. Data Management 

10 A number of information management systems for a large-scale sequencing lab 

have been developed. (For review see, for instance, Kerlavage et al.y Proceedings of the 
Twenty-Sixth Annual Hawaii International Conference on System Sciences, IEEE 
Computer Society Press, Washington D. C, 585 (1993)) The system used to collect and 
assemble the sequence data was developed using the Sybase relational database 

15 management system and was designed to automate data flow wherever possible and to 
reduce user error. The database stores and correlates all information collected during the 
entire operation from template preparation to final analysis of the genome. Because the 
raw ou^ut of the ABI 373 Sequencers was based on a Macintosh platform and the data 
management system chosen is based on a Unix platform, it was necessary to design and 

20 implement a variety of multi- user, client-server applications which allow the raw data as 
well as analysis results to flow seamlessly into the database with a minimum of user efl'ort 

2. Assembly 

An assembly engine (TIGR Assembler) developed for the rapid and accurate 
25 assembly of thousands of sequence fragments is enq;>loyed to generate contigs. The TIGR 
assembler simultaneously chisters and assembles fragments of the genome. In order to 
obtain the speed necessaiy to assemble more than 104 fragments, the algorithm builds a 
hash table of 10 bp oligonucleotide subsequences to generate a list of potential sequence 
fragment overlaps. The number of potential ovo'laps for each fragment determines 
30 which fragments are likely to fall into repetitive elements. Beginning with a single seed 
sequence fragment, TIGR Assembler extends the current contig by attempting to add the 
best matching fragment based on oligonucleotide content The contig and candidate 
fragment are aligned using a modifled version of the Smith- Waterman algorithm which 
provides for optimal g^ped alignments (Waterman, M. S., Methods in Enzymology 
35 164:165 (1988)). The contig is extended by the fragment only if strict criteria for the 
quality of the match are met The match criteria include the minimum length of overlap, 
the ma^Kimiim length of an unmatched end, and the minimum percentage match. These 
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criteria are automatically lowered by the algorithm in regions of minimal coverage and 
raised in regions with a possible repetitive element. The numb^ of potential overlaps for 
each fragment determines which fragments are likely to fall into repetitive elements. • 
Fragments representing the boundaries of repetitive elements and potentially chimeric 

5 fragments are often rejected based on partial mismatches at the ends of alignments and 
excluded from the current contig. TIGR Assembler is designed to take advantage of clone 
size information coupled with sequencing from both ends of each template. It enforces 
the constraint that sequence fragments from two ends of the same template point toward 
one another in the contig and are located within a certain range of base pairs (defmable 

10 for each clone based on the known clone size range for a given library). 

The process resulted in 982 contigs as represented by SEQ ID NOs: 1-982. 

3. Identifying Genes 

The predicted coding regions of the Enterococcus faecalis genome were initially 
15 defined with the program GeneMaik, indiiich finds ORFs using a probabilistic classification 
technique. The predicted coding region sequences were used in searches against a database 
of all Enterococcus faecali nucleotide sequences from GenBank (March, 1997), using the 
BLASTN search method to idoitify overlaps of 50 or more nucleotides Avith at least a 
95% identity. Those ORFs with nucleotide sequence matches are shown in Table 1 . The 
20 ORFs without such matches were translated to protein sequences and compared to a non- 
redundant database of known proteins generated by combining the Swiss-prot, PIR and 
GenPept databases. ORFs tiuit matched a database protein with BLAST? probability less 
than or equal to 0.01 are shown in Table 2. The table also lists assigned functions based 
on the closest match in the databases. ORFs that did not match protein or nucleotide 
25 sequences in the databases at these levels are shown in Table 3. 

ILLUSTRATIVE APPLICATIONS 

1. Production of an Antibody to a Enterococcus faecalis Protein 
Substantially pure protein or polypeptide is isolated from the transfected or 
30 transformed cells using any one of the methods known in the art. The protein can also 
be produced in a recombinant prokaryotic egression system, such as E, coliy or can be 
chemically synthesized. Concentration of protein in the final prq)aration is adjusted, for 
example, by concentration on an Amicon filter device, to the level of a few 
micrograms/ml. Monoclonal or polyclonal antibody to the protein can then be prepared 
35 as follows. 
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2. Monoclonal Antibody Production by Hybridoma Fusion 
Monoclonal antibody to epitopes of any of the peptides identified and isolated as 

described can be prepared from murine hybridomas according to the classical method of 
Kohler, G. and Milstein, C, Nature 256:495 (1975) or modifications of the methods 

5 thereof. Briefly, a mouse is repetitively inoculated with a few micrograms of the selected 
protein over a period of a few weeks. The mouse is then sacrificed, and the antibody 
producing cells of the spleen isolated. The spleen cells are fused by means of 
polyethylene glycol with mouse myeloma ceUs, and the excess unfused cells destroyed by 
growth of the system on selective media comprising aminopterin (HAT media). The 

10 successfully fused cells are diluted and aliquots of the dilution placed in wells of a 

microtiter plate where growth of the culture is continued. Antibody-producing clones are 
identified by detection of antibody in the supernatant fluid of the wells by immunoassay 
procedures, such as ELISA, as originally described by Engvall, E., Meth. EnzymoL 70:419 
(1980), and modified methods thereof. Selected positive clones can be expanded and 

15 their monoclonal antibody product harvested for use. Detailed procedures for monoclonal 
antibody production are described in Davis, L. et ah, Basic Methods in Molecular Biology, 
Elsevier, New York. Section 21-2 (1989). 

3. Polyclonal Antibody Production by Immunization 

20 Polyclonal antisemm containing antibodies to heterogenous epitopes of a single 

protein can be prepared by inmiunizing suitable animals with the expressed protein 
described above, which can be unmodified or modified to enhance immunogenicity. 
Effective polyclonal antibody production is affected by many factors related both to the 
antigen and the host ^>6cies. For example, small molecules tend to be less immunogenic 

25 than others and may require the use of carriers and adjuvant Also, host animals vary in 
response to site of inoculations and dose, with both inadequate or excessive doses of 
antigen resulting in low titer antisera. Small doses (ng level) of antigen administered at 
multiple intradermal sites appears to be most reliable. An effective immunization 
protocol for rabbits can be found in Vaitukaitis, J. et a/., J, Clin, Endocrinol. Metab. 

30 33:988-991 (1971). 

Booster injections can be given at regular intervals, and antiserum harvested when 
antibody titer thereof, as determined semi-quantitatively, for example, by double 
immunodiffusion in agar against known concentrations of the antigen, begins to fall. See, 
for example, Oucbterlony, O. et al^ Chap. 19 in: Handbook of Ejq>erimental 

35 Immunology, Wier, D., ed, Blackwell (1973). Plateau concentration of antibody is 
usually in the range of 0. 1 to 0. 2 mg/ml of serum (about 12M). Affinity of the 
antisera for the antigen is detmnined by prqjaring competitive binding curves, as 
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described, for example, by Fisher, D., Chap. 42 in: Manual of Clinical Immunology, 
second edition, Rose and Friedman, eds., Amer. Soc. For Microbiology, Washington, D. C. 
(1980) 

Antibody preparations prepared according to either protocol are useful in 
S quantitative immunoassays which determine concentrations of antigen-bearing substances 
in biological samples; they are also used semi- quantitatively or qualitatively to identify 
the presence of antigen in a biological sample. In addition, antibodies are useful in various 
animal models of enterococcal disease as a means of evaluating the protein used to make 
the antibody as a potential vaccine target or as a means of evaluating the antibody as a 
10 potential immunotherapeutic or inununoprophylactic reagent. 

4. Preparation of PCR Primers and Amplification of DNA 

Various fragments of the Enterococcus faecalis genome, such as those of Tables 
1-3 and SEQ ID NOS: 1-982 can be used, in accordance with the present invention, to 
15 prepare PCR primers for a variety of uses. The PCR primers are preferably at least 15 
bases, and more preferably at least 18 bases in length. When selecting a primer sequence, 
it is preferred that the primer pairs have approximately the same G/C ratio, so that 
melting temperatures are {^proximately the same. The PCR primers and amplified DNA 
of this Example find use in the Examples that follow. 

20 

5. Isolation of a Selected DNA Clone From the Deposited 
Sample of E. faecalis 

Three approaches can be used to isolate a E, faecalis clone comprising a 
polynucleotide of the present invention from any £. faecalis genomic DNA library. The 
25 E, faecalis strain V586 has been deposited as a convienent source for obtaining a E. 
faecalis strain although a wide varity of strains E, faecalis strains can be used which are 
known in the art 

£. faecalis genomic DNA is prepared using the following method. A 20ml 
overnight bacterial culture grown in a rich medium (e.g., Trypticase Soy Broth, Brain 

30 Heart Infusion broth or Super broth), pelleted, ished two times with TES (30mM Tris-pH 
8.0, 25mM EDTA, 50mM NaQ), and resuspended in 5ml high salt TES (2.5M NaCl). 
Lysostaphin is added to final concentration of approx 50ug/ml and the mixture is rotated 
slowly I hour at 37C to make protoplast cells. The solution is then placed in incubator 
(or place in a shaking water bath) and wanned to 55C. Five hundred micro liter of 20% 

35 sarcosyl in TES (final concentration 2%) is then added to lyse the cells. Next, guanidine 
HCl is added to a final concentration of 7M (3.69g in 5.5 ml). The mixture is swirled 
slowly at 55C for 60-90 min (solution should clear). A CsCl gradient is then set up in 
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gradient is carefully overlayed with the DNA-contaming GuHCl sohition. The gradient is 
spun at 30,000 ipm, 20C for 24 hr and the lower DNA band is collected. The volume is 
increased to 5 ml with TE buffer. The DNA is then treated with protease K (10 ug^il) 
5 overnight at 37 and precipitated with ethanol. The precipitated DNA is resuspended in 
a desired bufTer. 

In the first method, a plasmid is directly isolated by screening a plasmid E. 
faecalis genomic DNA library using a polynucleotide probe corresponding to a 
polynucleotide of the present invention. Particularly, a specific polynucleotide with 30- 

10 40 nucleotides is synthesized using an Applied Biosystems DNA synthesizer according to 
the sequence reported. The oligonucleotide is labeled, for instance, with ^P-y-ATP using 
T4 polynucleotide kinase and purified according to routine methods. {See, e.g., Maniatis 
et al.. Molecular Qontng: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring, 
NY (1982).) The library is transformed into a suitable host, as indicated above (such as 

IS XL-1 Blue (Stratagene)) using techniques known to those of skill in the art See, e.g,y 
Sambrook et al. MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring 
Harbor, N.Y. 2nd cA 1989); Ausubel et al., CURRENT PROTOCALS IN MOLECULAR 
BIOLOGY (John Wiley and Sons, N.Y. 1989). The transformants are plated on 1.5% 
agar plates (containing the appropriate selection agent, e.g., ampicillin) to a density of 

20 about 150 transformants (colonies) p^ plate. These plates are screened using Nylon 
membranes according to routine methods for bacterial colony screening. See, 
Sambrook et aL MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring 
Harbor, N.Y. 2nd ed. 1989); Ausubel et al., CURRENT PROTOCALS IN MOLECULAR 
BIOLOGY (John Wiley and Sons, N.Y. 1989) or other techniques known to those of skill 

25 in the art 

Alternatively, two primers of 15-25 nucleotides derived from the 5* and 3' ends of 
a polynucleotide of SEQ ID NOS: 1-982 are synthesized and used to amplify the desired 
DNA by PCR using a E. faecalis genomic DNA prep as a template. PCR is carried out 
under routine conditions, for instance, in 25 ^ of reaction mixture with 0.5 ug of the 

30 above DNA template. A convenient reaction mixture is 1.5-5 mM MgCIa, 0.01% (wA^) 
gelatin, 20 \iM each of dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 0.25 
Unit of Taq polymerase. Thirty five cycles of PCR (denaturation at 94^C for 1 min; 
annealing at 55**C for 1 min; elongation at 72**C for 1 min) are performed with a Perkin- 
Elmer Cetus automated thermal cycler. The amplified product is analyzed by agarose gel 

35 electrophoresis and the DNA band with expected molecular weight is excised and purified. 
The PCR product is verified to be the selected sequence by subcloning and sequencing the 
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DNA product. 

Finally, overlapping oligos of the DNA sequences of SEQ ID NOS: 1-982 can be 
chemically synthesized and used to generate a nucleotide sequence of desired length using 
PCR methods known in the art 

5 

6(a). Ei[pression and Purification Enterococcal polypeptides 
in E. coii 

The bacterial expression vector pQE60 was used for bacterial expression of some 
of the polypeptide fragements of the present invention which were used in the soft tissue 

10 and systemic infection models discussed below. (QIAGEN, Inc., 9259 Eton Avaiue, 

Chatsworth, CA, 91311). pQE60 encodes ampicillin antibiotic resistance ("Ampr") and 
contains a bacterial origin of replication ("ori"), an IPTG inducible promoter, a ribosome 
bindmg site (**RBS**), six codons encoding histidine residues that allow affinity purification 
using nickel-nitrilo-tri-acetic acid ("Ni-NTA**) affinity resin (QIAGEN, Inc., supra) and 

15 suitable single restriction enzyme cleavage sites. These elements are arranged such that 
an inserted DNA fragment encoding a polypeptide expresses that polypeptide with the six 
His residues (i.e., a **6 X His tag**) covalently linked to the carboxyl terminus of that 
polypeptide. 

The DNA sequence encoding the desired portion of a E. faecalis protein of the 

20 present invention was amplified from E, faecalis genomic DNA using PCR 

oligonucleotide primers which anneal to the 5' and 3' sequences coding for the portions 
of the E. faecalis polynucleotide shown in SEQ ID NOS: 1-982. Additional nucleotides 
containing restriction sites to facilitate cloning in the pQE60 vector are added to the 5' 
and 3' sequences, respectively. 

25 For cloning the mature protein, the 5' primer has a sequence containing an 

appropriate restriction site followed by nucleotides of the amino terminal coding sequence 
of the desired £. faecalis polynucleotide sequence in SEQ ID NOS: 1-982. One of ordinary 
skill in the art would sq[)preciate that the point in the protein coding sequence where the 5' 
and 3* primers begin may be varied to aii^>liiy a DNA segment encoding any desired 

30 portion of the complete protein shorter or longer than the mature form. The 3' primer 
has a sequence containing an appropriate restriction site followed by nucleotides 
complementary to the 3* end of the polypeptide coding sequence of SEQ ID NOS: 1-982, 
excluding a stop codon, with the coding sequence aligned with the restriction site so as to 
maintain its reading frame with that of the six His codons in the pQE60 vector. 

35 The amplified E, faecalis DNA fragment and the vector pQE60 were digested 

with restriction en^mes which recognize the sites in the primers and the digested DNAs 
were then ligated together. The E. faecalis DNA was inserted into the restricted pQE60 
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vector in a manner which places the £. faecalis protein coding region downstream from 
the IPTG-inducible promoter and in-frame with an initiating AUG and the six histidine 
codons. 

The ligation mixture was transformed into competoit E. coli cells using standard 

5 procedures such as those described by Sambrook et al., supra,, E, coli strain Ml5/rep4, 
containing multiple copies of the plasmid pREP4, which expresses the lac repressor and 
confers kanamycin resistance ("Kanr"), was used in carrying out the illustrative example 
described herein. This strain, which was only one of many that are suitable for expressing 
a E. faecalis polypeptide, is available commercially (QIAGEN, Inc., supra). 

10 Transformants were identified by their ability to grow on LB agar plates in the presence 
of ampicillin and kanamycin. Plasmid DNA was isolated from resistant colonies and the 
identity of the cloned DNA confirmed by restriction analysis, PGR and DNA sequencing. 

Clones containing the desired constructs were grown overnight CO/N") in liquid 
culture in LB media supplemented with both ampicillin (100 ^g^l) and kanamycin (25 

15 |ig/ml). The 0/N culture was used to inoculate a large culture, at a dilution of 

approximately 1 :25 to 1 :250. The cells were grown to an optical density at 600 nm 
("OD600'*) of between 0.4 and 0.6. Isopropyl-p-D-thiogalactopyranoside ("IPTG**) was 
then added to a final concentration of 1 mM to induce transcription from the lac 
repressor sensitive promoter, by inactivating the lad repressor. Cells subsequently were 

20 incubated further for 3 to 4 hours. Cells then were harvested by centrifugation. 

The cells were then stirred for 3-4 hours at 4**C in 6M guanidine-HCl, pH 8. The 
cell dd>ris was removed by centrifugation, and the supematant containing the E, faecalis 
polypeptide was loaded onto a nickel-nitrilo-tri-acetic acid (^i-NTA") affinity resin 
column (QIAGEN, Inc., supra). Proteins with a 6 x His tag bind to the Ni-NTA resin 

25 with high affinity were purified in a simple one-stq) procedure (for details see: The 

QIAexpressionisty 1995, QIAGEN, Inc., supra). Briefly the supematant was loaded onto 
the column in 6 M guanidine-HCl, pH 8, the column was first washed with 10 volumes of 
6 M guanidine-HCl, pH 8, then washed with 10 volumes of 6 M guanidine-HCl pH 6, and 
finally the E. faecalis polypeptide was eluted with 6 M guanidine-HCl, pH 5. 

30 The purified protein was then renatured by dialyzing it against phosphate-buffered 

saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. Alternatively, the 
protein could be successfully refolded while immobilized on the Ni-NTA colunm. The 
recommended conditions are as follows: renature using a linear 6M-IM urea gradient in 
500 mM NaQ, 20% glycerol, 20 mM Tris/HCl pH 7.4, containing protease inhibitors. 

35 The renaturation should be performed over a period of 1.5 hours or more. After 
renaturation the proteins can be eluted by the addition of 250 mM immidazole. 
Immidazole was removed by a final dialyzing step against PBS or 50 mM sodium acetate 
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pH 6 bufTer plus 200 mM NaCl. The purified protein was stored at 4"" C or frozen at -80° 
C. 

Some of the polypeptide of the present invention were prepared using a non- 
denaturing protein purification method. For these polypeptides, the cell pellet from each 

5 liter of culture was resuspended in 25 mis of Lysis Buffer A at 4**C (Lysis Buffer A = 50 
mM Na-phosphate, 300 mM NaCl, 10 mM 2-mercaptoethanol, 10% Glycerol, pH 7.5 
with 1 tablet of Complete EDTA-free protease inhibitor cocktail (Boehringcr Mannheim 
#1873580) per 50 ml of buffer). Absorbance at 550 nm was approximately 10-20 
O.D7ml. The suspension was then put through three freeze/thaw cycles from -70**C 

10 (using a ethanol-dry ice bath) up to room temperature. The cells were lysed via 

sonication in short 10 sec bursts over 3 minutes at approximately 80W while kept on ice. 
The sonicated san^>le was then centriiuged at 15»000 RPM for 30 minutes at 4**C. The 
supernatant was passed through a colunm containing 1.0 ml of CL-4B resin to pre-clear 
the sample of any proteins that may bind to agarose non-specifically, and the flow- 

15 through fraction was collected. 

The pre-cleared flow-through was dspphed to a nickel-nitrilo-tri-acetic acid C^i- 
NTA") afimity resin column (Quiag^, Inc., supra). Proteins with a 6 X His tag bind to 
the Ni-NTA resin with high affinity and can be purified in a simple one-step procedure. 
Briefly, the supernatant was loaded onto the column in Lysis Buffer A at 4°C, the column 

20 was first washed with 10 volumes of Lysis Buffer A until the A280 of the eluate returns to 
the baseline. Then, the colunm was washed with 5 volumes of 40 mM Imidazole (92% 
Lysis Buffer A / 8% Buffer B) (Buffer B = 50 mM Na-Phosphate, 300 mM NaCl, 10% 
Glycerol, 10 mM 2-mercivtoethanol, 500 mM Imidazole, pH of the final buffer should be 
7.5). The protein was ehited off of the column with a series of mcreasing Imidazole 

25 solutions made by ai^'usting the ratios of Lysis Buffer A to Buffer B. Three different 
concentrations were used: 3 volumes of 75 mM Imidazole, 3 volumes of 150 mM 
Imidazole, 5 volumes of 500 mM Imidazole. The fractions containing the purified 
protein were analyzed using 8 %, 10 % or 14% SDS-PAGE depending on the protein size. 
The purified protein was then dialyzed 2X against phosphate-buffered saline (PBS) in 

30 order to place it into an easily workable buffer. The purified protein was stored at 4^ C or 
frozen at -80**. 

The following altCToative method may be used to purify £. faecalis expressed m E 
coll when it is present in the form of inclusion bodies. Unless otherwise specified, all of 
the following stq)s are conducted at 4-1 O^C. 
35 Upon completion of the production phase of the £. coli fermentation, the cell 

culture is cooled to 4-10°C and the cells are harvested by continuous centrifugation at 
15,000 ipm (Heraeus Sepatech). On the basis of the expected yield of protein per unit 
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weight of cell paste and the amount of purified protein required, an Expropriate amount 
of cell paste, by weight, is suspended in a buffer solution containing 100 mM Tris, SO mM 
EDTA, pH 7.4. The cells are dispersed to a homogeneous suspoision using a high shear 
mixer. 

5 The cells are then lysed by passing the solution through a microfluidizer 

(Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogcnate is 
then mixed with NaCl solution to a final concentration of 0.5 M NaCI, followed by 
centrifugation at 7000 x g for 15 min. The resultant pellet is washed again using 0.5M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

10 The resulting washed inchision bodies are solubilized with 1 .5 M guanidine 

hydrochloride (OuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the 
pellet is discarded and the E, faecalis polypeptide-containing supernatant is incubated at 
4^C overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 x g) to remove insoluble particles, 

15 the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 20 
volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by 
vigorous stirring. The refolded diluted protein solution is kept at 4^C without mixing for 
12 hours prior to further purification steps. 

To clarify the refolded E. faecalis polypeptide solution, a previously prepared 

20 tangential filtration unit equipped with 0.16 \im membrane filter with appropriate surface 
area (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is onployed. The 
filtered sample is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive 
Biosystems). The cohmm is washed with 40 mM sodium acetate, pH 6.0 and eluted with 
250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a stepwise 

25 manner. The absoibance at 280 mm of the effluent is continuously monitored. Fractions 
are collected and further analyzed by SDS-PAGE. 

Fractions containing the E. faecalis polypeptide are thai pooled and mixed with 4 
volimies of water. The diluted sample is then loaded onto a previously prepared set of 
tandem cohmms of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion 

30 (Poros CM-20, Perseptive Biosystems) exchange resins. The columns are equilibrated 
with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium 
acetate, pH 6.0, 200 mM NaCl. The CM-20 column is then eluted using a 10 column 
volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium acetate, pH 6.0 to 1 .0 
M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are collected under constant A2S0 

35 monitoring of the effluent Fractions containing the £. faecalis polypeptide (determined, 
for instance, by 16% SDS-PAGE) are thai pooled. 
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The resultant E. faecalis polypeptide exhibits greater than 95% purity after the 
above refolding and purification stq>s. No major contaminant bands are observed from 
Commassie blue stained 16% SDS-PAGE gel when 5 \ig of purified protein is loaded. The 
purified protein is also tested for endotoxin/LPS contamination, and typically the LPS 
5 content is less than 0.1 ngAnl according to LAL assays. 

6(b). Alternative Expression and Purification £nterococcal 
polypeptides in E. coli 

Tthe vector pQElO was alternatively used to clone and express some of the 
10 polypeptides of the present invention for use in the soft tissue and systemic infection 
models discussed below. The difference being such that an inserted DNA fragment 
encoding a polypeptide expresses that polypeptide with the six His residues (i.e., a **6 X 
His tag**) covalently linked to the amino terminus of that polypeptide. The bacterial 
expression vector pQElO (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311) 
15 was used in this example . The components of the pQElO plasmid are arranged such that 
the inserted DNA sequence encoding a polypeptide of the present invention expresses the 
polypeptide with the six His residues (i.e., a "6 X His tag")) covalently linked to the 
amino terminus. 

The DNA sequences encoding the desired portions of a polypeptide of SEQ ID 

20 NOS: 1-982 were amplified using PCR oligonucleotide primers from genomic E. faecalis 
DNA. The PCR primers anneal to the nucleotide sequences encoding the desired ammo 
acid sequence of a polypeptide of the present invention. Additional nucleotides 
containing restriction sites to facilitate cloning in the pQElO vector were added to the 5' 
and 3' primer sequences, respectively. 

25 For cloning a polypeptide of the present invention, the 5' and 3' primers were 

selected to amplify their reqsective nucleotide codmg sequences. One of ordinary skill in 
the art would appreciate that the point in the protein coding sequence where die 5* and 3* 
prim^ begins may be varied to amplify a DNA segment encoding any desired portion of 
a polypeptide of the present invention. The 5' primer was designed so the coding 

30 sequence of the 6 X His tag is aligned with the restriction site so as to maintain its reading 
frame with that of E. faecalis polypeptide. The 3' was designed to include an stop codon. 
The amplified DNA fragment was then cloned, and the protein expressed, as described 
above for the pQE60 plasmid. 

The DNA sequences encoding the amino acid sequCTces of SEQ ID NOS: 1-982 

35 may also be cloned and expressed as fusion proteins by a protocol similar to that described 
directly above, ^idierein the pET-32b(+) vector (Novagen, 601 Science Drive, Madison, 
WI 5371 1) is preferentially used in place of pQElO. 
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The above methods are not limited to the polypeptide fiagements actually 
produced. The above method, like the methods below, can be used to produce either full 
length polypeptides or desired iragements therof. 

S 6(c). Alternative Eiipression and Purification of Enterococcai 

polypeptides in E. coK 

The bacterial expression vector pQE60 is used for bacterial expression in this 
example (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311). However, in this 
example, the polypeptide coding sequence is inserted such that translation of the six His 

10 codons is prevented and, therefore, the polypeptide is produced with no 6 X His tag. 

The DN A sequence encoding the desired portion of the E, faecalis amino acid 
sequence is amplified from an E, faecalis genomic DNA prq> the deposited DNA clones 
using PGR oligonucleotide primers which anneal to the 5' and 3' nucleotide sequences 
corresponding to the desired portion of the E. faecalis polypeptides. Additional 

15 nucleotides containing restriction sites to facilitate cloning in the pQE60 vector are added 
to the 5' and 3' primer sequences. 

For cloning a E. faecalis polypeptides of the present invention, 5' and 3' primers 
are selected to amplify their respective nucleotide coding sequences. One of ordinary skill 
in the art would appreciate that the point in the protein coding sequence where the 5' and 

20 3' primers begin may be varied to asnptify a DNA segment encoding any desired portion 
of a polypeptide of the present invention. The 3' and 5' primers contain appropriate 
restriction sites followed by nucleotides complementary to the 5' and 3' ends of the 
coding sequence respectively. The 3* primer is additionally designed to include an in- 
flame stop codon. 

25 The anq)lified E, faecalis DNA fragments and the vector pQE60 are digested with 

restriction enzymes recognizing the sites in the primers and the digested DNAs are then 
ligated together. Insertion of the E, faecalis DNA into the restricted pQE60 vector 
places the E, faecalis protein coding region including its associated stop codon 
downstream from the IPTG-inducible promoter and in-frame with an initiating AUG. 

30 The associated stop codon prevents translation of the six histidine codons downstream of 
the insertion point. 

Hie ligation mixture is transformed into competent E, coli cells using standard 
procedures such as diose described by Sambrook et al. £. coli strain MlS/rep4, containing 
multiple copies of the plasmid pREP4, which expresses the lac repressor and confers 
35 kanamycin resistance ^Kanr**), is used in carrying out the illustrative example described 
herein. This stram, which is only one of many that are suitable for caressing E. faecalis 
polypeptide, is available commercially (QIAGEN, Inc., supra). Transformants are 
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identified by their ability to grow on LB plates in the presence of ampidllin and 
kanamycin. Plasmid DNA is isolated from resistant colonies and the identity of the 
cloned DNA confirmed by restriction analysis, PGR and DNA sequencing. 

Clones containing the desired constructs are grown overnight ("O/N") in liquid 

5 culture in LB media supplemented with both ampicillin (100 \ig/m\) and kanamycin (25 
(ig/ml). The 0/N culture is used to inoculate a large culture, at a dilution of approximately 
1:25 to 1:250. The cells are grown to an optical density at 600 nm ("OD600") of 
between 0,4 and 0.6. isopropyl-b-D-thiogalactopyranoside ("IPTG") is then added to a 
final concentration of 1 mM to induce transcription from the lac repressor sensitive 

10 promoter, by inactivating the lad repressor. Cells subsequently are incubated Airther for 
3 to 4 hours. Cells then are harvested by centrifugation. 

To purify the £. faecalis polypeptide, the cells are then stirred for 3-4 hours at 
4°C in 6M guanidine-HCl, pH 8. The cell debris is removed by centrifugation, and the 
supernatant containing the E. faecalis polypeptide is dialyzed against 50 mM Na-acetate 

15 buffer pH 6, siq>plemented with 200 mM NaG. Alternatively, the protein can be 

successfully refolded by dialyzing it against 500 mM NaCl, 20% glycerol, 25 mM Tris/HCl 
pH 7.4, containing protease inhibitors. After renaturation the protein can be purified by 
ion exchange, hydrophobic interaction and size exclusion chromatography. 
Alternatively, an affinity chromatography step such as an antibody column can be used to 

20 obtain pure E, faecalis polypeptide. The purified protein is stored at 4*^ C or frozen at - 
80" C, 

The following alternative method may be used to purify E, faecalis polypeptides 
expressed in E coli when it is present in the form of inclusion bodies. Unless otherwise 
specified, all of the following steps are conducted at 4-lO^C. 

25 Upon completion of the production phase of the E, coli fermentation, the cell 

culture is cooled to 4-10''C and the cells are harvested by continuous centrifugation at 
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit 
weight of cell paste and the amount of purified protein required, an appropriate amount 
of cell paste, by weight, is suspended in a buffer solution containing 100 mM Tris, 50 mM 

30 EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension using a high shear 
mixer. 

The cells ware then lysed by passing the solution through a microfluidizer 
(Microfuidics, Coip. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is 
then mixed with NaCl solution to a final concentration of 0.5 M NaQ, followed by 
35 centrifugation at 7000 x g for 15 min. The resultant pellet is washed again using 0.5M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 
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The resulting washed inchision bodies are solubilized with 1.5 M guanidine 
hydrochloride (GuHCl) for 2-4 hours. After 7000 x g centriftigation for 15 min., the 
peUet is discarded and the £1 faecalis polypeptide-containing supernatant is incubated at 
A^C overnight to allow further GuHCl extraction. 

5 Following high speed centrilugation (30,000 x g) to remove insoluble particles, 

the GuHCl solubilized protein is refolded by quickly mixing Ae GuHCl extract with 20 
volumes of buffer containing 50 mM sodium, pH 4,5, 150 mM NaCl, 2 mM EDTA by 
vigorous stirring. The refolded diluted protein solution is kept at 4^C without mixing for 
12 hours prior to further purification steps. 

10 To clarify the refolded E. faecalis polypeptide solution, a previously prepared 

tangential filtration unit equipped with 0.16 \im membrane filter with appropriate surface 

(e g-> Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The 
filtered sample is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive 
Biosystems). The column is washed with 40 mM sodium acetate, pH 6.0 and eluted with 

15 250 mM, 500 mM, 1000 mM» and 1500 mM NaCl in the same buffer, in a stepwise 

manner. The absorbance at 280 mm of the effluent is continuously monitored. Fractions 
are collected and further analyzed by SDS-PAGE. 

Fractions containing the E. faecalis polypeptide are then pooled and mixed with 4 
volumes of water. The diluted sample is then loaded onto a previously prepared set of 

20 tandem columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion 
(Poros CM-20, Perseptive Biosystems) exchange resins. The columns are equilibrated 
with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium 
acetate, pH 6.0, 200 mM NaCl. The CM-20 column is then eluted using a 10 column 
volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium acetate, pH 6.0 to 1.0 

25 M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are collected under constant A280 

monitoring of the effluent Fractions containing the E. faecalis polypeptide (determined, 
for instance, by 16% SDS-PAGE) are thai pooled. 

The resultant E, faecalis polypeptide exhibits greater than 95% purity after the 
above refolding and purification steps. No major contaminant bands are observed from 

30 Commassie blue stained 16% SDS-PAGE gel when 5 \ig of purified protein is loaded. The 
purified protein is also tested for endotoxin/LPS contamination, and typically the LPS 
content is less than 0.1 ng^ according to LAL assays. 



35 



6(d). Cloning and Expression of E. faecalis in Other Bacteria 
E. faecalis polypeptides can also be produced in: E. faecalis using the methods of 
S. Skinner et aL, (1988) Mol. MiCTobioL 2:289-297 or J. I. Moreno (1996) Protein Expr. 
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Purif. 8(3):332-340; Lactobacillus using the methods of C. Rush et al., 1997 Appl. 
Microbiol. Biotechnol. 47(5):537-542; or in Bacillus subtilis using the methods Chang et 
al., U.S. Patent No. 4,952,508. 

5 7. Cloning and Expression in COS Cells 

A E, faecalis expression plasmid is made by cloning a portion of the DN A 
encoding a E. faecalis polypeptide into the expression vector pDNAI/Amp or pDNAlU 
(which can be obtained from Invitrogen, Inc.)- The expression vector pDNAI/amp 
contains: (1) an E, coli origin of replication effective for propagation in E, coli and other 

10 prokaryotic cells; (2) an ampicillin resistance gene for selection of plasmid-containing 
prokaryotic cells; (3) an SV40 origin of replication for propagation in eukaryotic cells; 
(4) a CMV promoter, a polylinker, an SV40 intron; (5) several codons encoding a 
hemagglutinin fragment (i.e., an "HA" tag to facilitate purification) followed by a 
tennination codon and polyadenylation signal arranged so that a DNA can be 

15 conveniently placed under expression control of the CMV promoter and operably linked 
to the SV40 intron and the polyadenylation signal by means of restriction sites in ttie 
polylinker. The HA tag corre^onds to an epitope derived from the influenza 
hemagglutinin protein described by Wilson et al 1984 Cell 37:767. The fusion of the HA 
tag to the target protein allows easy detection and recovery of the recombinant protein 

20 with an antibody that recognizes the HA epitope. pDNAIII contains, in addition, the 
selectable neomycin marker. 

A DNA fragment encoding a E, faecalis polypeptide is cloned into the polylinker 
region of the vector so that recombinant protein expression is directed by the CMV 
promoter. The plasmid construction strategy is as follows. The DNA from a E. faecalis 

25 genomic DNA prep is anq[)lified using primers that contain convenient restriction sites, 
much as described above for construction of vectors for expression of E. faecalis in 
E, coli. The 5' primer contains a Kozak sequence, an AUG start codon, and nucleotides 
of the 5' coding region of the E. faecalis polypeptide. The 3* primer, contains 
nucleotides complementary to the 3* coding sequence of the £. faecalis DNA, a stop 

30 codon, and a convenient restriction site. 

The PGR amplified DNA fragment and the vector, pDNAI/Amp, are digested with 
appropriate restriction enzymes and dien ligated. The ligation mixture is transformed 
into an appropriate E. coli strain such as SURE^ (Stratagene Cloning Systems, La JoUa, 
CA 92037), and the transformed culture is plated on ampicillin media plates which then 

35 are incubated to allow growth of ampicillin resistant colonies. Plasmid DNA is isolated 
from resistant colonies and examined by restriction analysis or o&er means for the 
presence of the fragment encoding the E, faecalis polypeptide 
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For expression of a recombinant £. faecalis polypeptide, COS cells are transfected 
with an expression vector, as described above, using DEAE-dextran, as described, for 
instance, by Sambrook et al. {supra). Cells arc incubated und^ conditions for expression 
. of E. faecalis by the vector. 
S Expression of the E. faecaiis-HA fusion protein is detected by radiolabeling and 

immunoprecipitation, using methods described in, for example Harlow et al., supra,. To 
this end, two days after transfcction, the cells are labeled by incubation in media 
containing ^^S~cysteine for 8 hours. The cells and the media are collected, and the cells 
are washed and the lysed with detergent-containing RIPA buffer: 150 mM NaCl, 1% NP- 
10 40, 0.1% SDS, 1% NP-40, 0.5% DOC, 50 mM TRIS, pH 7.5, as described by Wilson et al. 
(supra ). Proteins are precipitated from the cell lysate and from the culture media using 
an HA-specific monoclonal antibody. The precipitated proteins then are analyzed by 
SDS-PAGE and autoradiography. An expression product of the expected size is seen in 
the cell lysate, which is not seen in negative controls. 

15 

8. Cloning and Expression in CHO Cells 

The vector pC4 is used for the expression of E. faecalis polypeptide in this 
example. Plasmid pC4 is a derivative of the plasmid pSV2-dhfT (ATCC Accession No. 
37146). The plasmid contains the mouse DHFR gene under control of the SV40 early 
" 20 promoter. Chinese hamster ovary cells or other cells lacking dihydrofolate activity that 
are transfected with these plasmids can be selected by growing the cells in a selective 
medium (alpha minus MEM, Life Technologies) supplemented with the cbemotherapeutic 
agent methotrexate. The an^lification of the DHFR genes in cells resistant to 
methotrexate (MTX) has been well documented. See, e.g., Alt et al., 1978, J. Biol. 

25 Chem. 253:1357-1370; Hamlin et al., 1990, Biochem. et Biophys. Acta, 1097:107-143; 
Page et al., 1991, Biotechnology 9:64-68. Cells grown in increasing concentrations of 
MTX develop resistance to the drug by overproducing the target enzyme, DHFR, as a 
result of amplification of the DHFR gene. If a second gene is linked to the DHFR gene, it 
is usually co-amplified and over-expressed. It is known in the art that this approach may 

30 be used to develop cell lines carrying more than 1,000 copies of the amplified gene(s). 
Subsequently, when the methotrexate is withdrawn, cell lines are obtained which contain 
the amplified gene integrated into one or more chromosome(s) of the host cell. 

Plasmid pC4 contains the strong promoter of the long terminal rqseat (LTR) of 
the Rouse Sarcoma Vinis, for expressing a polypq>tide of interest, Cullen, et al. (1985) 

35 Mol. Cell. Biol. 5:438-447; plus a fragment isolated firom the enhancer of the inmiediate 
early gene of human cytomegalovirus (CMV), Boshart, et al., 1985, Cell 41:521-530. 
Downstream of the promoter are the following single restriction enzyme cleavage sites 
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that allow the integration of the genes: Bam HI, Xba I, and Asp 718. Behind these 
cloning sites the plasmid contains the 3' intron and polyadenylation site of the rat 
preproinsulin gene. Other high efficiency promoters can also be used for the expression, 
e.g., the human B-actin promoter, the SV40 early or late promoters or the long terminal 

5 repeats from other retroviruses, e.g., HIV and HTLVI. Clontech's Tet-Off and Tet-On 
gene expression systems and similar systems can be used to express the E. faecaiis 
polypeptide in a regulated way in mammalian cells (Gosscn et al., 1992, Proc. Natl. Acad. 
Sci. USA 89:5547-5551. For the polyadenylation of the mRNA other signals, e.g., from 
the human growth hormone or globin genes can be used as well. Stable cell lines carrying 

10 a gene of interest integrated into the chromosomes can also be selected upon co- 
transfection with a selectable maricer such as gpt, 041 8 or hygromycin. It is 
advantageous to use more than one selectable marker in the beginning, e.g., G418 plus 
methotrexate. 

The plasmid pC4 is digested with the restriction enzymes and then 
15 dephosphorylated using calf intestinal phosphates by procedures known in the art. The 
vector is then isolated from a 1% agarose gel. The DNA sequence encoding the E. 
faecaiis polypeptide is amplified using PGR oligonucleotide primers corresponding to the 
5' and 3' sequences of the desired portion of the gene. A 5* primer containing a 
restriction site, a Kozak sequence, an AUG start codon, and nucleotides of the 5' coding 
20 region of the E. faecaiis polypeptide is synthesized and used. A 3* primer, containing a 
restriction site, stop codon, and nucleotides complementary to the 3' coding sequence of 
the E. faecaiis polypeptides is synthesized and used The amplified fragment is digested 
with the restriction endonucleases and then purified again on a 1% agarose gel. The 
isolated fragment and the dephosphorylated vector are then ligated with T4 DNA ligase. 
25 E. coli HBlOl or XL-1 Blue cells are then transformed and bacteria are identified that 
contain the fragment insCTted into plasmid pC4 using, for instance, restriction enzyme 
analysis. 

Chinese hamster ovary cells lacking ap active DHFR gene are used for 
transfection. Five ^g of the expression plasmid pC4 is cotransfected with 0.5 p.g of the 

30 plasmid pSVneo using a lipid-mediated transfection agent such as Lipofectin^ or 
LipofectAMINE.™ (LifeTechnologies Gaithersburg, MD). The plasmid pSV2-neo 
contains a dominant selectable marker, die neo gene from Tn5 encoding an enzyme that 
confers resistance to a groqp of antibiotics iiKluding G418. The cells are seeded in alpha 
minus MEM supplemented with 1 mg/ml G418. After 2 days, the cells are trypsinized and 

35 seeded in hybridoma cloning plates (Oreiner, Germany) in alpha minus MEM 

supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml 0418. After about 
10-14 days single clones are trypsiiuzed and then seeded in 6-well petri dishes or 10 ml 
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flasks using different concentrations of methotrexate (50 nM, 100 nM, 200 nM, 400 
nM, 800 nM). Clones growing at the highest concentrations of methotrexate are then 
transferred to new 6-well plates containing even higher concentrations of methotrexate 
(1 (iM, 2 (iM, 5 (iM, 10 mM, 20 mM). Hie same procedure is repeated until clones are 
5 obtained which grow at a concentration of 1 00-200 |iM. Expression of the desired gene 
product is analyzed, for instance, by SDS-PAGE and Western blot or by reversed phase 
HPLC analysis. 

9. Quantitative Murine Soft Tissue Infection Model for 
10 E. faecalis 

Compositions of the present invention, including polypeptides and peptides, are 
assayed for their ability to function as vaccines or to enhance/stimulate an immune 
response to a bacterial species (e.g., E, faecalis) using the following quantitative murine 
soft tissue infection model. Mice (e.g., NIH Swiss female mice, approximately 7 weeks 

15 old) are first treated with a biologically protective effective anaount, or immune 

enhancing/stimulating effective amount of a composition of the present invention using 
methods known in the art, such as those discussed above. See.e.g., Harlow et aL, 
ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd 
ed. 1988). An example of an appropriate starting dose is 20ug per animal. 

20 The desired bacterial species used to challenge the mice, such as E. faecalis^ is 

grown as an overnight culture. The culture is diluted to a concentration of 5 X lO' 
cfu/ml, in an appropriate media, mixed well, serially diluted, and titered. The desired 
doses are further diliuted 1 :2 with sterilized Cytodex 3 microcarrier beads preswollen in 
sterile PBS (3g/100ml). Mice are anesthetize briefly until docile, but still mobile and 

25 injected with 0.2 ml of the Cytodex 3 bead/bacterial mixture into each animal 

subcutaneously in the inguinal region. After four days, counting the day of injection as 
day one, mice are sacrificed and the contents of the abscess is excised and placed in a 15 
ml conical tube containing 1 .OmI of sterile PBS. The contents of the abscess is then 
enzymatically treated and plated as follows. 

30 The abscess is first disrupted by vortextng with sterilized glass beads placed in the 

tubes. 3.0mls of prepared enzyme mixture (l.OmI CoUagenase D (4.0 mg/ml), 1.0ml 
Trypsin (6.0 mg/ml) and 8.0 mis PBS) is then added to each tube followed by a 20 min. 
incubation at 37C. The solution is then centrifuged and die supernatant drawn off. 0.5 
ml dH20 is then added and the tubes are vortexed and thai incubated for 10 min. at room 

35 temperature. 0.5 ml media is than added and samples are serially diluted and plated onto 
agar plates, and grown overnight at 37C. Plates with distinct and sq)arate colonies are 
then counted, compared to positive and negative control samples, and quantified. The 
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method can be used to identify composition and determine appropriate and efTective 
doses for humans and other animals by comparing the efTective doses of compositions of 
the present invention with compositions known in the art to be efTective in both mice 
and humans. Doses for the effective treatment of humans and other animals, using 
5 compositions of the present invention^ are extrapolated using the data from the above 
experiments of mice. It is appreciated that further studies in humans and other animals 
may be needed to determine the most effective doses using methods of clinical practice 
known in the art 

10 10. Murine Systemic Neutropenic Model for £. faecalis Infection 

Compositions of the present invention, including polypeptides and peptides, are 
assayed for their ability to function as vaccmes or to enhance/stimulate an immune 
response to a bacterial species (e.g., E, faecalis) using the following qualitative murine 
systemic neutropenic model. Mice (e.g., NIH Swiss female mice, approximately 7 weeks 

15 old) are first treated with a biologically protective effective amount, or immune 

enhancing/stimulating effective amount of a composition of the present invention using 
methods known in the art, such as those discussed above. See,e.g., Harlow et al., 
ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratoiy Press, 2nd 
ed. 1988). An example of an appropriate starting dose is 20ug per animal. 

20 Mice are then injected with 250 - 300 mg/kg cyclophosphamide intraperitonially. 

Counting the day of CP. injection as day one, the mice are left untreated for 5 days to 
begin recovery of PMNL'S. 

The desired bacterial species used to challenge the mice, such as E. faecalis, is 
grown as an overnight culture. The culture is diluted to a concentration of 5 X 10^ 

25 cfu/ml, in an im>propriate media, mixed well, serially diluted, and titered. The desired 
doses are further diHuted 1:2 in 4% Brewer's yeast in media. 
Mice are injected with the bacteria/brewer's yeast challoige intr^eritonially. The 
Brewer's yeast solution alone is used as a control. The mice are then monitered twice 
daily for the first week following challenge, and once a day for the next week to ascertain 

30 morbidity and mortality. Mice remaining at the end of the experiment are sacrificed. 
The method can be used to identify compositions and determine appropriate and 
effective doses for humans and other animals by comparing the effective doses of 
compositions of tiie present invention with compositions known in the art to be 
effective in both mice and humans. Doses for the effective treatment of humans and 

35 other animals, using compositions of the present invention, are extrapolated using the 

data from the above experiments of mice. It is appreciated that further studies in humans 
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and other animals may be needed to determine the most effective doses using methods of 

clinical practice known in the art. 

The disclosure of all publications (including patents, patent applications, journal 

articles, laboratory manuals, books, or other documents) cited herein are hereby 
5 incorporated by reference in their entireties. 

The present invention is not to be limited in scope by the specific embodiments 

described herein, which are intended as single illustrations of individual aspects of the 

invention. Functionally equivalent methods and components are within the scope of the 

invention, in addition to those shown and described herein and will become apparant to 
10 those skilled in the art from the foregoing description and accompanying drawings. Such 

modifications are intended to fall within the scope of the appended claims. 
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(i) APPLICANT: Charles Kunsch 

* Patrick J, Dillon 

Steven C. Barash 

(ii) TITLE OF INVENTION: Enterococcus faecialis Polynucleotides and 

Polypeptides 

(iii) NUMBER OF SEQUENCES: 982 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Human Genome Sciences, Inc. 
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(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:. 4315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 1: 
AAGTCAATCA CTTGCAAGTC GTTTTCTGTC ATATGGCCGA CTAGCTCAGT 
ATATAGCCAC CAGGGAAAAT ATAACGATTA ATCCAAGCAT TTTTAGCCCC 
CGACTGATCC CATGAATCAA CGCCGTACCT TTAGGCGCTA AATTCCGCTG 
AAATATTCAT GTAGATTTTC CGCACCGACA TGTTCAAACA TCCCA.CACT 
TCAAAAGACT CTCCTTTTAA ATCACGATAA TCCATCAATT TGACAGTCAT 
AGATCTTCTT TTTCTATAAT ATGGCGAATA TGATGAAATT GCTCTTCACT 
CCAGTTGCTT TGGCTCCATA TTCTTTCACC GCAGTTAAAA TTAACGTGCC 
CCAATATCCA GTAAAGTGTC GCCCTCTTTG ATAAACAATT TATCTAAAAT 
TTAf TCACTT GCGCTTGTTC TAATGTATCT TCAGGCGTTt" TAAAATAAGC 
CTCATTGTTT GGTCAAGCCA TTTTTTGTAA AAATCATTTC CTAGATCGTA 
ATATCCTCTT GCGAACGTTT TTTTGAATGA CTTTCTTTAG GAAGCCATTT 
GCATTGTGTA AAAAGCTATC CTTTTGGTTA TACACATCAT AAATCAGsGC 
CCTTCGATTT CAATTTTGCG ATCCATGTAG GCTTCCCCTA AAGTTAACGA 
AGTAAATCCT TCACAGGAAT TTTTTCATTG AATACAATTT TAAAAACCGG 
CCTTGCCCAT ACTCTTTGAC GGTACCATCC CAGTATGTGA CTTGTGTCTT 
GACCATTTAA ACAGTTGACT GTACGTTTCT TTTTCTAACA TTGCATTCCC 
ACCATTTGAA GCCAAAACAA AA..GAAGTCG CTTTCCGGTA GTTCGTC..AA 
CAGTCCGTTC TAAACTGAAG CACAGAAAAG TTATCACCCC TTCTATGTTC 
TTTGCAATTACAGTTCTATT CTACTCCTCT TTTAAAAATT TGA..CATTCT 
TACCTACTAT TGTTATTCTT TATCACAAAA AAACTAGAGC CAGTCCTTGA 
TAGTtCTAA.. TATTATGCTT TCTTACGCAT CCGTTGTTCC GCATGAGTGT 



GACACCAGGA 
ACCTTGTTGG 
AACGACATCA 
CGTAATATGG 
TCGATCTTGT 
TAATGTAATC 
CCAGCCGCAG 
ATGATGAACT 
ACATGAATAC 
ATGGCTGTGA 
AATAAATTTA 
TTGGATATCG 
AGCGTTATTC 
ATCCCCCGAC 
TTTTGAAAAA 
TCCATTAAAT 
ACAAACACCA 
CGCTTCTTTT 
TTTAACGTAA 
CAGACTCCTC 
AAGCGCCATG 



60 

120 

180 

240 

300 

360 

420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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CCATACGTGC 


CCCACATAAG 


GATTAACTTG 


AATACCGTGT 


TTAATCGCCG 


CTGCTACAAA 


1320 


TTTTTTTGCT 


AAAGTTACTG 


CTTCTAACAC 


CGAATAACCT 


TTCGCCAAGC 


CAGCTGTGAT 


1380 


TGCCGCTGAA AAAGTACAAC 


CTGCACCATG 


ATTATAATCA 


GTAGGATATA 


ATTCATTTTC 


1440 


CAAAAGATGC 


GCGGTGTGAC 


CATCGTAAAA 


CAAGTCCAGT 


GCTTTTTCAC 


CAGCTAAGCG 


1500 


ATGTCCCCCT 


TTAACCACGA 


CATGCTTGGC 


TCCCATTTGT 


ACAATTCGTT 


TTGCCGCTTC 


1560 


TTCCATCTCC 


GCCAC6GAAG 


AAATTTCGCC 


TAAACCAGAT 


AAGATGCCCG 


CTTCAATTAA 


1620 


ATTAGGCGTG 


GCCACTAATG 


CTAATGGCAG 


TAAATCGTTT 


TTTAGGCCTT 


CCACACTTTT 


1680 


GGGTTGCAGA ATTTGTGCCG 


TTCCCTTACA 


AGCAATGACT 


GGGTCAATCA 


CGACTTTTTG 


1740 


AATTTTCTCT 


TGTTTAATGT 


ACTTACTAGC 


CATTTTAATA 


TTTTGTTCAT 


TACCCATCAT 


1800 


CCCTGTTTTC 


AAAGCCGCTA 


CTGGACCGCC 


TGCAAAAACC 


GAAATCAATT 


GTTTTTCTAA 


1860 


GAGCGTTTCT 


GGCAATTCAG 


TTACTTCATG 


TGACCAACCT 


GTCGTAGGAT 


CCATCGTCAC 


1920 


AATCGAGGTT 


AAACTTGAAA 


ATCCAAAAAC 


TCCATACTCT 


TCAAATGTTT 


TTAAATCTGC 


1980 


TTGAATCCCT 


GCCCCTCCAG 


TTGAATCGGA 


GCCTGCAATC 


GTCAATACTT 


TTTCCATTAA 


2040 


ATCACCTAAC 


CTTTTTCTCC 


AAGTATACTG 


7VAGAAACAAG 


TCTGCTAAAA 


CAGCCAATTG 


2100 


GCTTATTTTT 


TAGCCAGCCA 


ATTTCTAAAC 


AAAAAAAAGA 


CCAGAGAATA 


AATTCTCTGG 


2160 


TCTTACGTCC 


GAATACCCCA 


GTTTTTTACG 


CTGGTTAAAA 


CTATAGTTAA 


AAAGTTAATT 


2220 


ATTTAACGAT 


TTCAGTAACA 


ACGCCTGAAC 


CTACAGTACG 


TCCGCCTTCA 


CGAATAGAGA 


2280 


AACGAGTTCC 


GTCTTCGATA 


GCGATTGGGT 


GAATTAATTC 


AACGTCCATA 


GCAACGTTAT 


2340 


CACCAGGCAT 


TACCATTTCA 


GTACCTTCTG 


GCAATTCTAC 


AACACCAGTA ACGTCTGTTG 


2400 


TACGGAAGTA 


GAATTGAGGA 


CGGTAGTTAG 


TGAAGAATG6 


AGTGTGACGT 


CCGCCTTCTT 


2460 


CTTTTGATAA TACGTATACT TCAGCTTkGA ATTTgGTGTG TGGAGTGATT 


GTAGCTGGTT 


2520" 


TAGCTAATAC TyGyCCACGT 


TCGATATCTT 


CACGTGCTAC 


ACCACGTAtA 


AAGCACCGAT 


2580 


GTTGTCGCCT 


GCTTCAGCGT 


AGTCTAATAA 


TTTACGGAAC 


ATTTCAACAC 


CTGTAACAGT 


2640 


TGTTTTAGAT 


GTTTCGTCTT 


TAATACCAAC 


GATTTCAACT 


TCGTCACCAA 


CGCGAACTTC 


2700 


ACCACGTTCA 


ACACGGCCTG 


TAGCAACAGT ACCACGTCCA GTGATTGAGA ATACgTCTTC 


2760 


GACTGGCATC 


ATGAATGGTT 


TGTCAGTATC 


ACGTTCTGGA GTTGGGATAT 


ATTCGTCAAC 


2820 


TGCAGCCATT AATTCTAAGA TTTTTTCTTC ATAAGACTCG TCGCCTTCTA AAGCTTTCAA 


2880 


AGCAGAACCT 


GCGATAACTG 


GAACATCATC 


GCCTGGGAAA 


TCGTATTCTG 


ATAATAAGTC 


2940 


ACGAACTTCC 


ATTTCTACTA 


ATTCTAATAA 


TTCTTCGTCA 


TCAACCATAT 


CCATTTTGTT 


3000 


TAAGAATACA 


ACGATGTATG 


GTACACCAAC 


GTTACGTGAT 


AATAAGATAT 


GTTCACGTGT 


3060 


TTGAGGCATA 


GGACCATCAG 


CAGCAGAAAC 


TACTAAGATA 


GCTCCGTCCA 


TTTGAGCAGC 


3120 


ACCAGTGATC 


ATGTTTTTAA 


CGTAGTCCGC 


ATGTCCTGGG 


CAGTCaACGT 


GTGCaTAGTG 


3180 


ACGAGTTwCA 


GTTTCATATT 


CGATATGAGA AGTGTTGATT 


GTGATTCCAC 


GTTCTTTTTC 


3240 
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TTCTGGAGCG 


TTATCGATAG 


AATCG'IAGCT 


TTGTGCTTCC 


CCGCCACCGT 


GTTTTGATAA 


3300 


TACAGTAGCA 


ATTGCAGCTG 


TTAATGTAGT 


TTTACCATGG 


TCAACGTGTC 


CGATAGTACC 


3360 


AATGTTAACA 


TGGGATTTAG 


AACGGTCAAA 


TTTTTCTTTT 


GCCATTTTGA 


ATGTTCCTCC 


3420 


TAAAAATATA 


AATTTTATTG 


TTTGTTTTAT 


ATGTGATAGA 


CGTTACTAAA 


AATCGTCTGA 


3480 


TCATCATAAG 


CTATTTTACA 


CGAAATCTTG 


AAAA7WVGGAT 


AGTTCTAAAA 


ACAAACCTTT 


3540 


TATTCTCAAG 


GAAAATTATG 


CGTtGCCGCC 


GTTTTTCTTG 


ATGATTTCTT 


CTTGAACAGA 


3600 


CTTAGGTACG 


TCTTCATAGT 


GGTCAAATAC 


CATCATAAAT 


GTACCACGTC 


CTTGTGTTGC 


3660 


TGAACGTAAT 


GTTGTAGCAT 


AACCAAACAT 


TTCAGCTAAA 


GGTACCATCG 


CATTAACGAT 


3720 


TTGAGAGTTA 


CCGTGTGCTT 


CCATACCTTC 


AACACGTCCA 


CGACGACTTG 


TTACGTGTCC 


3780 


CATGATATCA 


CCTAAGTAAT 


CTTCAGGAAC 


AGTAATTGTA 


ACTTTCATCA 


TTGGTTCTAA 


3840 


AATAACTGGG 


TTTGCATTTT 


TAGCCGCAGC 


TTTCAATGCC 


ATAGAAGCAG 


CTACACGGAA 


3900 


GGCCGTTTCA 


TTTGAATCGA 


CATCGTGGTA 


TGAACCATCG 


TAAAGTTTTG 


CTTTGATATC 


3960 








\3 X 1. \^r\ 1 tvjJrWD 


TrTTPTAACC 


CTTTTTCAAC 


4020 


TGCTGGGaTG 


TATTCACGAG 


GAACCACACC 


ACCGACAATT 


GCGTTTTCAA 


ACTCGAAGCC 


4080 


TTTACCTTCT 


TCATTTGGTG 


TAAATTCAAC 


CCaTAcGTGA 


CCGTATTGTC 


CTTTACCACC 


4140 


AGACTGACGA 


ACGAATTTAC 


CTTCTGCTTT 


AGTCGCAGCA 


CGGAATGTTT 


CGCGGTAAGA 


4200 


TACTTGAGGA 


GCACCTACGT 


TAGCTTCAAC 


TTTGAATTCA 


CGTTTCATAC 


GGTCTACTAA 


4260 


AACGTCTAAG 


TGCAATTCAC 


CCATaCCAGA 


GATAACTGTT 


TCACCAGTTT 


CAACC 


4315 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2716 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ATTGTGCCAT AGATTTGACC ATCTGCTGCT GTTGTAGTTT CCCAGTTTGT AATTTGGTAA 60 

TGTTGTAAGA nCTTAGTTGG GACATTAATG GCAAAAGTTT TTTGGCGATA GGTATTAAAA 120 

TTGGTTTTAC CCACCCAACC TTCATAGGCT TCAAATTCAG CTCCACCTAA CAAAGGGTTG 180 

TCGGAAATAA TCCAAGTGTC TTTTTGATGA TTCCGAATAA AACGAACGAT TGTACTGTTA 240 

AAGCCTTTCG TTGTTGTACC GCCGTAATCT TTTTCTGCAG ACCAATGGTT CCAAACCGAG 300 

TCATACTCTA AAGCATTCGG AAATTCATTG GTTGTAAACC AACCGAGTGA ATTAATTTCT 360 

TTACTTAGTT GACGACTTGC CCAACCTGAT TCGCCTTGGT TGCCCCAAAC ATCAACATAA 420 

ATATAATCTA GATTCGGTGc TTTTtGTTTT AATcnTTAAA GCGCTCATAG CGACGACCAC 480 

TCAATGTAtC GGGtCTTkGT TTAATAAAAT AAGACGGATC TAACCAATCC CAGCCACGCT 540 
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TCGTTGGATC 


AACTAATTCC 


TCATTAAATC 


CTTTTGCTTC 


TGGGTAAGAC 


TCGGTGTCAT 


600 


TAATATGCAC 


ACCGAAAACG 


GCATTTAATT 


TATGTCCTTC 


ATC7VATTAAT 


TGATTCAACG 


660 


CTTGTTCCCC 


ACCAGGTCGC 


TGACCAATAG 


CACCGTAATC 


TGGATGCGCA 


GAATCATGTC 


720 


CTTCATTTTG ATACCCTTTT 


AGTAAATTCA 


TTTGTCCTAA 


TCCATCTGTT 


AAATTGTAAA 


780 


TACGTTTTGA TTCGTCTAAC 


GTCACTAAGA ATGGGTTTGT 


CGCCTGACTA 


GCAAAGTTAA 


840 


AAGGAATCCG 


TTGGTTGACT 


AATTCAGGGA CTTTTTCCGC 


ACCCATTGGG 


TTATTCATAA 


900 


TTGAACGGAA 


CCCAATTGCC 


GCATCTTGCC 


AATCCACCCG 


ATTGTCGTCG 


TTGCTATCTT 


960 


TTGAGAATTT 


AACTTTCACT 


TCTGGCGTTT 


CTCCAGTTGT 


GTAATCTTCC 


GGTGCATCAA 


1020 


ATGGTCGATA GGTCCATGCC 


CCACTTGACA AGGTTGTTAC 


AAAACCAGTC 


GCCGCTTCTT 


1080 


TTGTTTGTTT ATGGATTCGA TTGTTGTCAC 


TACCATCGAC 


AGAGCCGTCT 


CCATAGGCGT 


1140 


TCGTCCAAAA ACTTGCAGCA 


TAATTCGCCG 


TATTTAAGAA 


ACCATACATA 


TATTTTTTCG 


1200 


TTTCTTCTTG 


GCTTGTTGTA 


GTTAAATCTT 


GGATAGTATC 


ACCATTTTTA 


TTTCCGTTAT 


1260 


TTGCATTTGT 


CCCAGTATTC 


ATCACAACGC 


CAGCAAAAGT 


CGCCCCTTCA 


TCTGTACTAT 


1320 


TGACAGAAAT 


CAAGCCTTGA 


TTTGGAATTG 


T^TTGTGTG 


TATTTTGGTT 


CCTTCTTCAC 


1380 


GAATATCTGA 


GAGTTTCATA 


GCTAATTCTT 


GGCCTTCGGA 


AACAGTCAAT 


GTCGTTGTGA 


1440 


AACGCAAACC 


AATTTCTGGG 


ACAGCTACGT 


TATAAACGGC 


TTGaTTTGCT 


TCaCTCTTCT 


1500 


GTTTTTcAGC 


AGTTGCTTTA 


tATGCTTTGC CAyCAAyCAT 


TACTTGGTCa ATTTTCTCgT 


1560 


GTTGGCCATT 


AAAAATCACT 


TGATTACTCT 


GAGGGTCTTC 


GTAGCGAATA ATTCGTGGAA 


1620 


ATTGTGGATC 


AACCACGGCT 


GTTAAATCTG 


CGGAAGAGAT 


TTTGTCGCCT 


TTTTCAGCTT 


1680 


CATTGTCTTT 


AGCAACAACA 


ATGCTGGCTT 


TTGCTTTGAT 


GTTTGTATTT 


TCCAAAGTAC 


1740 


CGTCGACTTC 


AAAAGTTCCT 


GGTTCTTTAT 


ATGCATCAGG 


ATCGATTTCA 


TTCCAACGGA 


1800 


CTGCTTCATT 


GGCTTCTTTA 


CCAGTATTAT 


AGGTCACTTT 


TACTTGTTTC 


GGtAATTTTG 


1860 


GTGCGACACC AATTTTTGTA AAAACTTTGA 


CTGGCGCAAT 


TTGTGTCACT 


TCTGGCACAA 


1920 


TTTCTGGAAT 


GCTATCTACG 


GGGCCATTTT 


TAAAATAGTC 


ATAATTGACA ATTTTTTTGT 


1980 


CGTACCACAA ACGGACACCC 


ACATGTCCGG 


CTTCTGTTGG 


AATTTTGTCT 


CCATTGGCCA 


2040 


AAACAGGTTC 


TCCTTCGTAA 


ATCAACGTGG 


TGTTAAGCCA 


GATAGTAATT 


TTTTCACCAA 


2100 


CATAACGTAC 


TTTCAACAAA 


TAATTCGTAT 


CTTCATTTAA 


CGTTGGCCCA GAAATTGAAT 


2160 


CATTCCACGA 


ATTAGGGCTT 


TCAACTAACC 


ATTTGCCATT 


CGCGTTATAG 


CCTACAAAAA 


2220 


CCCAGCTATC 


TTTGGTATTA 


CCACGTATAA 


TCACACCCGT 


TCGACCGCTT 


CCAGCAGTGT 


2280 


ATTTAAACTT 


TGTTTCTACT 


TCACCATCAG 


CTAATTTAGG 


GGAATCTAAA 


TTTAAAGATA 


2340 


CGGCATTATT 


ACCTGCTGCT 


GCATCTCGTG 


AAATCGCCAA 


TGACTCTCCT 


GCTAATTCTC 


2400 


GATTTGTTTT 


ACCAATCACA 


TCTTGCCACT 


CACCAGTTTT 


CCCTCCTGGA 


AAATCAGATT 


2460 


GCCACAATTT 


TTCTTCTGTT 


GCGGTAGGTT 


CCACCATTGA 


TGAGCTTGTT 


TCAGTGCTGC 


2520 
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TGTTTGTTGT TTCTTCTGCA GACGTACTTA ATGGTACTAA AATCGTTGCG CTTGCCAATA 2580 

GTGTCAATGT ACTAAATCGT TTTATTTTTC CATGTTTCAT CTGaTAAATC AGCCCCCTGA 2640 

ACATTTTTGG AACACGCTTT CATTTATTTA ACAATTAAAG GGGGGTTnCC AAAAATTAAA 2700 

TTGTnTTCAT TTTTGG 2716 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1311 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



TTGAAATTGC 


TAGCGTTTTC 


CGCCATTGGC 


TATGTTTAGT 


CGCAACGGAG 


GGACCsCtGA 


60 


CTTTTTAGCG 


AAGCATcgTT 


GCCTGTTAAG 


aAAGTGACGA 


AAATTTCTAA 


TCCTGAAGGG 


120 


GAAACCGTGC 


TGGATGTCAT 


TCGCAATGGG 


AATGCGCAAG 


TGGTAATCaG 


TACCATGGAT 


180 


AAAAATCGTT 


CCAGTGcCAA 


CCAAGATGGA 


TTTTCAATCC 


GTCGGGAAGC 


AGTCGAACAT 


240 


GGGATTCCAT 


TGTTTACCTC 


ACTTGACACA 


GCAAATGCGA 


TTTTAAAAGT 


TTTAGAATCA 


300 


A6AGCCTTTA 


CGACAGAAGC 


CATTTAAGCA 


AAAATCTGGA 


ACAATCGTCT 


GAGGAGGACT 


360 


GTTCCAGATT 


CTTTTACGAA 


GAAATAGAAA 


ATCATTTATA 


GGGGAATCAG 


AGAATAAAGG 


4zU 


AGTGAAGGAT 


ACTGATGCAG 


AGAAAGCAAG 


AAATGATGAC 


CATTGTAGCT 


CAAAAACAGT 


480 


TAGCGCCACG 


GATTTATCAG 


TTAGACTTAC 


AAGGCGAGTT 


AGTGAAAGAG 


ATGACACGAC 


540 


CTGGACAGTT 


TGTGCATATC 


AAGGTTCCGC 


GTGCAGATTT 


ACTTTTGCGG 


CGACCGATTA 


600 


GCATCAACCA 


AATTGATCAT 


TC6AACGAAA 


CCTGTCGTTT 


GATTTATCGA 


GTCGAAGGCG 


660 


CTGGAACGGA 


AGTGTTTGCC 


ACCATGAAGG 


CTGGTGAACA 


ATTAGATATT 


TTAGGTCCTT 


720 


TAGGAAACGG 


CTTTGATATA 


ACTACCGTAG 


CAGCTGGTCA 


AACTGCTTTT 


ATCGTTGGCG 


780 


GAGGAATCGG 


GATTCCCCCA 


TTGTATGAAT 


TATCAAAACA 


ACTCAATGAA AAAGGCGTGA 


840 


AAGTGATTCA 


TTTTCTTGGT 


TATGCATCAA 


AAGAGGTTGC 


TTATTACCAA 


CAAGAATTTA 


900 


TGGCATTAGG 


AGAAACACAT 


TTTGCCACAG 


ATGACGGCTC 


GTTTGGCGCT 


CATGGCAACG 


960 


TGGGCCGTTT 


ATTGTCAGAA 


GCGTTAGCAA 


AAGGACGGAT 


CCCTGATGCA 


GTGTATGCTT 


1020 


GTGGTGCGAA 


TGGTATGTTA 


AAAGCAATTG 


ATTCTTTATT 


TCCAACACAT 


CCACACGTTT 


1080 


ACCTTTCTTT 


AGAAGAACGG 


ATGGCCTGTG 


GAATTGGGGC 


TTGCTATGCC 


TGCGTTTGCC 


1140 


ATAAAAAAGG 


AGACACTACT 


GGAGCAAAAA 


GTGTCAAGGT 


CTGTGATGAA 


GGTCCGATTT 


1200 


TTAAAGCAAG 


. TGAGGTTATC 


TTATGATGAA 


AAAyCCTTTA 


GCCGTTTCAA 


TTCCAGGTTT 


1260 


AACATTAAAA 


AATCCAATTA 


TTCCCGCCAG 


TGGTTGCTTT 


GGGTTCGGGG 


A 


1311 



(2) INFORMATION FOR SEQ ID NO: 4: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6948 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



TCCTCTGATG 


AACCGATTGA 


AAGAATTAAG 


• GAAAGAAnAA 


AATATTACTT 


TAGTTGAGTT 


60 


AAGTGAGGAG 


TTAGGTATTC 


CACGTTCCAC 


TCTTAATAGG 


TATGAAAACG 


AAGATAGCGA 


120 


ACCAAAACAA 


GAAaCTTGGG 


AAAAATTAGC 


TGATTATTAT 


GGTGTTTCTA 


CGGCTTATTT 


180 


AATGGGGATA 


TCCAACCAAA 


AGGTTAGCGA 


AGAAAAAGCT 


TTGACGGCCG 


CAAAGAAAGT 


240 


TTATCAAGTC 


TATCTTTCCG 


ACGACGATTT 


AGGAAAAGAA 


ATTCGAAAAG 


CTCTAATGTA 


300 


TTTTAATAAA 


AATGATTTAG 


ATAGTGTCTT 


AAAACAAGCA 


ATGCAGCAGT 


ATTTTACTAT 


360 


CCCAGCCGTT 


GAATGGAATA 


CAGAATTTCA 


ATCTTTAAAA 


AATACTGGTT 


TTCTATACAG 


420 


TTGGTTAGAA 


GGATATTTGG 


TAGACAGGTA 


TCGAAAAGAA 


GTGAAAACTA 


ATCATAATTT 


480 


AATTACAAAT 


ACGTATTATA 


GTATCCCTTC 


TATTGATGAT 


GTGAATGAGT 


ATGGAATTTT 


540 


CTCTGATGAA 


ATTCAAATAC 


CGTTGACTGA 


AGATTTTAAA 


CAAATTTTCA 


AAGTTGAGGA 


600 


TCTATCCACG 


AAAACACATG 


AAGAAATAAA 


AAAACTTTTT 


AAATCCGATT 


CTACGCTCAT 


660 


TGcTTACAGC 


TATGAAAGCT 


CCGTAGACGA 


TAAATTAAAA 


GATGATATTA 


ATATGATTTT 


720 


AGATAATGCT 


CGTTCCGAAA 


TTCTTAATCT 


AAAAGAAAAA 


TATCCTGATA 


AACCTAGTAG 


780 


AATAGaACAA 


ACAACCGTAC 


TCATATCTAA 


AGACGAAGAT 


ACTTCCTTTT 


GGTCAAGGAA 


840 


TGGCTCAAAT 


GATTATATAG 


ATGAATTGAA 


CTTATCTGAG 


GAAACTAAGA 


AATTATTTGT 


900 


TCATTTAGGT 


TCAGTATTTA 


TTGAGGAAAA 


AAAACGTAAA 


TCAAAAGCTA 


AAGAATGAGT 


960 


TTTTTTCTTT 


GTATAATCCA 


GTTATAGAAA 


CATCTCTCCC 


CTAACATCTA 


CATACTATTG 


1020 


GCGTGGTGGT 


AGATAGGAGA 


AAAACAATGA 


CACAAATTAA 


CAAGTATACA 


AAAAAAGACG 


1080 


GTACAACCGC 


CTATATGTTT 


AATGCTTATG 


TAGGACGTCA 


TCCCAAAACT 


GGTAAAAACG 


1140 


TTTATCGTAA 


GCGGCAAGGA 


TTTAAAACAA 


AAAAACAAGC 


ACAAATTGCA 


CTTGCTGAAA 


• 1200 


TTCTGAAGGA 


TATTGAAGAA 


AATGGACTAG 


ATAATAAACC 


CTCTGTTTTG 


ACGTTTAAAC 


1260 


AGCTTTACGA 


GAAATGGTTA 


GCACAGCAAC 


GGTTAACGAT 


TAAACCATCT 


TCAATCGCTG 


1320 


TGAATAAGCG 


TTTTGCTGAA 


AAACACATAT 


TACCTTATCT 


AGGTGACTGC 


AAACTTGACG 


1380 


AAATAACCGT 


TATACAGTGT 


CAGGACTTAG 


TTAATAAATG 


GTTTAATCAA 


GGACACAAGC 


1440 


AATACTCGTT 


CTATAGAAAA 


CTCACTGCTC 


AGATTATGCG 


ATATGGCGAA 


TCTATGGAAC 


1500 


TAATGAATAC 


TAACCCAATG 


AGAAAAACAA 


TCCTTCCTAA 


GTGGAAAGAG 


GAAGAAACAA 


1560 


AACTGGAGTA 


TTACACTAAA 


CAAGAATTGA 


ATCATTTTTT 


TGATTGTCTA 


AAACAATATG 


1620 


GTAATTTCAA 


ACAGTTAGCA 


TTTTTTAGGT 


TGCTAGCCTT 


TACAGGTTGT 


CGTAAGAGTG 


1680 


AAGTTCTAGC 


CTTGCAGTGG 


AAAGACCTAG 


ACATAACAAA 


CAAGAGCGTA 


TATATAGGCA 


1740 
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AGACCGTAGC ACAAGACGAA TTCTATAATA TACTCACTCA AACGCCCAAA ACAGCTTCTA 1800 

GCACACGTTC TATCAGCTTA GACAATGAAA CACTTCAAAT TATGGCTAAA TGGCGTACAA 1860 

TCCAACGTAG TGAcTATTTC CAAATGGGCT TTAATACATC TAGTGAAGAA CAATATATTT 1920 

TCACGAACAA CCATAATAAA CTGCTCTCAC CAAATGTAGT TAATATATGG CTCAATTGCT 1980 

TAATAAAAAA ATATGATTTA CCTAGTATTA CACCACATCA TTTTAGGCAT ACACATGCAA 2040 

GTTTACTTCT TCAATCAGGA GTGCCTATTA AAGAAGTTGC TGAAAGATTA GGACATACTA 2100 

GCACTGCTAT TACTGATCGT ATCTATTCTC ATGTTATGCC TGAAGAAAAA GAAAAAACAG 2160 

CAGATAAGTT CGCTCAATTC GTCGGTTTTT AATAAATGCA CAGACAAAAG CACAGATCTT 2220 

TACTATTTTT ATGTAAAAAA AAGAGCTAGA ATCCTTATAA ATAAAGGTTT CTAACTGGCA 2280 

TTAGTAACGC TATTAATGAA ACAACGTGTT TAATCATATA ATAAACCAAC AAATAAGTAA 2340 

ATATCAAATT CTAAATTCTG TAAAAAACGA ACAATCACGA TTAAAAAAAG TATTCCTTTC 2400 

TAGGAATACT TTTTAAATTA AATCGTTTGT GTGGTCGGGC GAACAACCAT TTCACTGATT 2460 

GACATACGAT CTGGCGTGTC TATGGCAAAG ACCACTGCTT GCGCAATGTC TTCCGCCTTT 2520 

AAGCCCCAAC TTAACTGTTC TAGATGTAAC GTTTCAGCTA CTACTCGATT AGAAATTGTT 2580 

TGATAAAGTT CTGTTTGTAC AGCACCTGGT GAAATAATCG TTGATTTAAT GTTATTCTCT 2640 

CTTTGTTCTT GTCGCAATCC TTCCATAATT GCTCGAACAG CAAACTTTGT TCCACAGTAG 2700 

ACAGCCGAAT CTGGATAAAC GACATGTCCT GCCACAGAAT CAGTAGCAAT AATCTGCCCT 2760 

GATTTTTGTT CAACCATAAT TGGAAGAACT GCCGCAATGC CATTTAAAAC ACCCATAATA 2820 

TTAATATCTA GCATTTGACG CCATTCTCCT TTTGGTGCTT CAATAAGAGG CGCCGTTGGC 2880 

ATAACTCCTG CATTGTTAAA AAGAACATCA ATTCGTCCAT ATTTTTCCAT TGTAAGCTTG 2940 

ATTACACGCT GAACTTCCTC TTCTTTCGTT ACATCTGCTT GTTGCACGAG AATCGTTGCT 3000 

TCAGGAAGTT CTTTTTTTAT AGCAATTAAA CGTTCTTGTC GACGTGCTGC AATGACTAAC 3060 

TTCGCTCCTT TTCTGGCAAG TAAACGGGCT GTTGCCTCAC CGATGCCACT GGAAGCGCCC 3120 

ATGATAACAA TAACTTTTTC TGATAATGAT TTCATTATAT TTCCTCCTTA TCTTCCTGAA 3180 

TCCAACGAAT AATTTTAGCT GGTGTTCCTG CAACAATAGC GTTAgcTGGc ACATCTTTAG 3240 

TTACAGTCGC ATTAGCTGCT ACGATAGCAT TTTCACCAAT CGTAATACCT GGTAAAACAG 3300 

TGACACCCGC GCCTAACCAC GCAAATTTTT TAATGGTAAT TGGTGCTAGC AATACGCCTC 3360 

GCCGTTTCAT TGGCAGTTCC GGATGATTGA CACTTAAGAG ATTAACGCGC GGACCAATTA 3420 

AAACATTGTC TTCAATTATA ATGCCCCCTA AATCAACAAA CATAGCTGCC CGATTAATAA 3480 

AAATATTTTT ACCAAAATGT ATATTGCGTC CAAAATCTGT ATAAAAAGGT AATAAAATCG 3540 

TAACGGTTTC GTCAATTTTG TCTTGGATAA TCGTGGCAAG AAGATTCCTT ACTTCATTAT 3600 

GTGTATAGGC TTGTTGATTT AATTCTATTA AATAGCGACT ATTTTCCTCT TGAATCTGAT 3660 

GAATAAGTTG AAATAGTGGT TCATCTTTAT CAATGCTACC TGTTGCAATT AGTTCTTGGA 3720 
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Q^.pYTTTTGT AAGCATTCCT AATCGCCTCC TTTTGCTAAA TTTATCGTAA TACGTGACAA 3780 

CTCCTCTGTC TAATACTTAT AATAAATAGA TAAGTA^GCG TTTTAGTTAT AGGAGGGAAA 3840 

GTAATGGAAT TACGAGTGAT TCACTATTTC TTAGCAGTGG TCCAAGAAAA AACGATTAGT 3900 

GGCGCAgcCA AACAATTGCA TGTATCACAA CCAACATTAT CTAAGCAATT AAAAGAATTA 3960 

GAGGAAGAAT TAGGCGTGAC ATTATTTATA CGAGGAAATC GACAAATACA ATTAACACCT 4020 

GAAGGAGAAT ATTTAGCTAA ACAAGGGCAA GATATTTTAA GCTTAGCGAA TAAAACAGTC 4080 

ACCAACTTGT CGCAAAATGA ATTCATTAAT GGCGAAATCA CGATTGGTGG CGGCGAAACA 4140 

AAAGCTATGT CTTTTTTAGC GAATGCACTA CAACAAATAA CAAGCCAGCA CTCAGCTGAT 4200 

ATTCACCTTC ATTTATACAG TGGGAATGCA GATGATGTAA TAGAACGGCT AGACAAAGGA 4260 

TTATTAGATT TTGGTTTAAT TATTGAGCCT GCACCTAAAC AAAAATATAG CTATTTAACA 4320 

TTACCAGTTG TAGATACATG GGGCTTAATT ACTGTAAAGG ACCATCCCTT AGCCACTAAA 4380 

AATGTTATTA CTGCAGCCGA TTTAAAAGAA GAACCTTTAT TTATTTCTCG ACAAGCACAA 4440 

GTTCCGAGCC AACTCTCTGA TTGGCTCGAA GCAAGTCTAG ATCAGTTCCG AATCGTTGGG 4500 

ACCTACAACT TACTTTACAA CGCTTCACTG ATGGTAGAAG CTGGGCTAGG TAGCGCCCTA 4560 

AGCATTGATG GTATTCTAGA AACAAAGCAA ACAAATTTAC GTTTTATCCC GCTATATCCT 4620 

GCTTTAACAG CCAAAATTAG TTTAATTTGG CGCAAAAATA CGGTTCTTTC CACAGCTGCT 4680 

GCATTATTTT TAGAACAAAT AAAAAAAAGT ATTCAACGCC CTGAATAATA GGTCGTTGAA 4740 

TACTTTTTTA AATCATCACT GTGATCGTAG CTGCTGCCAA AATCAAAACT AAGCCGATAA 4800 

TCGTTACGGT CATTTCCTTG GCTGTTTTCT TTTGACCTAA GAACCAAATA CCTGTCAACG 4860 

TTGCAAGCAC AACTGATGTT TGGGATAGAA TAAAGCCAGT TGCCAAGCCG TTCATATCTG 4 920 

GcTGCGCTGA AATTAAATAA GTTAACGCTG CAAAGGCGAA GAAAAAGCCT GAAAAAATTT 4 980 

GTTTGTAAGA AACGGCTTCT ATAAATGGGG ATATTTCCCC ACCCTTAATA GTGACAACCG 504 0 

CTGAGTAAAT AACTGCGACA ATTACCATGC CTATTGCTTG TGGTAAAAAG GCATGCATGC 5100 

CATCAATTGC CGTTGCTTGA GGTGCTGCAG AGTACGCCCA ATAGCCGATT TCACCAACGG 5160 

CTAATAGAAG AACTGCTTTT TTCATAATCC CAGCACTCTC AGCGGACTCC GTTTCCGACC 5220 

AGACCGTCAT TTTGGCACCA ATCATGATTA ACACTAAGGC AAATGCGCCT AACAATTTGG 5280 

CTGTCGCACC TGGCCAGTTT CCTAAGAAAA AGACACCCCA GAGAGAAGCA CCTAAAAGTT 5340 

GGAACGCGGT CGTTACTGGC ATGGCACGTG ATGAACCAAT CATCGTGAAG CATTTAAACG 5400 

TAATGATTTG CGCACAGGCC CAACCGACAC CAGATAAAAT TGAAAAGAAT AAATCCATGC 5460 

CCGCTGGAAA AGCTAAACCA TTAATCATTG AAAAAATAGC GGCAAAAATC AATGTTCCTA 5520 

AACTGGTACC GAGAATTTGA TTCACTGGAC GACCACCAAT TTTGGAAGCG ATTGTTGGGA 5580 

AAAGCCCCCA ACCTAGAAGC GGTCCTAAAC CGATTAATAA TGCAGTCGCA TTCATCTCTA 5640 

TTTCCCCTCT CTCGTTTAAA AGACAACGTT ACTTTCTAAT AAAATATTTG CGTAAGGAGT 5700 



wo 98/50555 PCTAJS98/08985 

289 

CATTTCCCCT GTCCGAATAA ATGCATGGCA GTTGTTTAAT TCTTGTTTCA TTTCTGTATG 5760 

AGGAATAAAA CTGATGGGAG TTTCTGGTAA ACGAGTTTCA ATTGCTGCCA ACATGTCAGG 5820 

ATTTTCTGTT TTAATTTCTT CTGCTAAATA AATTCGTTGA ACTGCCAATT CCTCAAGGAC 5880 

ATTGTTTAAA ACTTCCATAA AACTAGGGAC GCCATTGGTA ACTGCTAAAT CAATTTTTTC 5940 

TGTAGTCCGA GGCACTGGCA TGCCTGCATC ACCAATACTT AATTTATCAA AATGACCCAT 6000 

TTGTGCGATG ACGCGGGAAA TATCTGAATT AATAACTTTT GTTTTTTTCA TGTCGCTCGC 6060 

TCCTTTTTAT TTGTCTGCCA ATTCGTGTTG ATAAGGAATC GAAGGTTGGG CTCCAAAACG 6120 

TTGAACAGTC AACGAAGACG CCTTGTTTCC ATAACGAATA GCTTCTTCCA AATTGCTAAA 6180 

ATCTTTTTCT AATATACTAC TTAATGCGCC AATAAAAGTA TCGCCAGCAG CCGTTGTATC 6240 

7UVCCGCT-TTC ACTTTAAAAG CAGGCAC7AT ACCACTTCGT CCATTGACGT CATAAAAGGC 6300 

GCCTTTACTA CCTACTGTAA TAATTACTGC TTCAAyCCCT AATTGATGAA GTGCTTCTGC 6360 

GGCTTTACGC ATACTCGCTT CaTCTGTGAT TTTAATGCCT GTTAAAATTT CGGTTTCTGT 6420 

TTCATTTGGT ACAATCATAT CTGTCACGTT TAGTAATTCT TCAGGAACTT GTTCTAACGC 6480 

AGGTGCGGGA TTTAAAATCG TTTTGACGCC TGCTTTTTTA GCAATTTTAA ACGCTGCAAT 6540 

CGTACTATCA ATCGCACTTT CAAATTGGGC AATCACAAAA TCACTTTTTT CAATAATTTC 6600 

TTGATGTTCC TGGACTTGTT TTGGCGTAAA GGCGTTATTG GCTCCTGCGT AAATCATAAT 6660 

GCTATTTTCA CCAGCGTTAT CTACCAyAAT AAAGGCTTGG CCAGTAGCTG TTTTCTCTAA 6720 

AGTGGTGACC CCAGTTAAAT TAATTTCATC TTGACTCATT AAATCAGTCA TCATAGCTCC 6780 

AGCGCCATCA TTTCCCACAG CACCAATAAA ATATGTTTCT GCGCCTGAAC GTTTCGCTGC 6840 

AACTGCTTGG TTTGCTCCTT TACCACCACC TGCTGTAAAA TGTTCAATGG CGTGAATTGT 6900 

TTCGCCTGGT TTTGGCATTT CTTTGACTCT TAATGTTGTG TCTAAATT 6948 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

AATTTGTTGA ACCAGCGATT GCGCnGCnAA TACTGGTTTT TCCAGTGCCA GGCGGCCCAT 60 

AAAGAATCAT CGATGAAAGC ATTTTGGCTT CGACCATCCG ACGAATAATT TTACCAGGGC 120 

CGACTAAATG TTGTTGACCA ACTACTTCGT CTAAGTGACG AGGTCGCATT CGATAGGCTA 180 

AAGGTTGTTG CATAGAAGGA CCTCCTTTTT CATAATCATC TTATTATAAC ATAAAGAACG 240 

TACATyCGCT TTTTGGTGGk TTTTTGGGIcT TTAGATGGAT ATAGCGGAAG GTTcGTGtAT 300 

GATAGGGAGT GGTGAAAGAA ATGAAAGAAA CGATAAATAT CTTT7VACACA AAAACAACTG 360 
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AAGAAGTGGC ACAATACTTG TTAGGGATGT ATTTAGmGCA CGAAACAGCA ACAGGCGTTT 420 

TAGGTGGGTA CATTGTGGAT GCAGAAGCTT ATTTAGGTCC TGATGATGAG GCGGCCCATA 480 

GCTTTGGCTT GAGAAAAACG CCACGCTTAC AAGCCATGTA TGACAAACCA GGTACGATTT 540 

ATTTATATAC TATGCACACG CATTTAATTT TAAATATGGT GACACAAGAA CAAGGCAAAC 600 

CACAAGGGGT GATGATCCGC GCGATTGAAC CTGTTGAAGG TGTCGACAAA ATGATTGAAA 660 

ATCGCCAAGG ACGTCAAGGC GTGGAATTGA CCAATGGACC AGGTAAATTA GTTGCAGCCT 720 

TAGGAATTGA TAAGCAATTA TATGGGCAGT CGATTTTTTC TAGTTCGTTA CGGCTCGTGC 780 

GAGAAAAACG AAAATTTCCG AAAAAAATTG AGGCACTACC ACGAATTGGG ATTCCCAATA 840 

AAGGTCGTTG GACAGAGTTG CCTTTAAGAT ACGTCGTGGC TGGCAATCCA TATATTTCTA 900 

AACAAAAAAG AACAGCAGTA GACCAAATAG ATTTTGGCTG GAAGGATGAA GAAAATgAAA 960 

AAAGCAACAA TGCTCACATA CTTAGAGGAA CAACTTGAAA AACACTTAGG AGATTACGAA 1020 

gTAGGCCTTG ATTGGGATCG CAAAAACCAT ACCATCGAAG TCATTGTTcG TTTATATGCA 1080 

GAAAaCAATG AGCmAGTGGC GATTGATGAT GTTGATGGTA CGCTTTCAGA AGwAGAATTC 1140 

ATTGAATTTG AAGATGGTTT GTTATTTTAC AATCCGCAAA AGTCTGTCGT TGATGACGAA 1200 

GAGTATTTAG TCACAATTCC TTATGAAGGG AAAAAAGGgC TACGC7VAAGC AGTTTTAGAC 1260 

GGATTCATTC ACTATTTAAA AGTGGTTTTA GATGAAGGGC AAAGTGATTT GTTAGACTTT 1320 

TTATCAGATG AT^CAGCGGA AGTTTTTGAA TTGCATTGGG AGCCAGCAGA TTTTGAAGCG 1380 

ATGATTAAAA AAGTGGCAGA AACAGAAAAA GAACAATGGA TTGCGTATCC AAGTTATTAA 1440 

TTTAGAGGTG AGAACATGAA GTGGACAGAA GTAAAAGTCG AAACGGCTAG CGAAgCAGTT 1500 

GAAGCAATTT CAAACATTAT GATGGnArCT GGCGCAAGTG GCGTGGCCAT TGAAGATGCG 1560 

TTGGACATTG AAAATTTTGA AAGTGATCTG TATGGGGAAA TTTTAGATAA AGAGCAATTC 1620 

ACCCACATTA AAGAAGGGGC GATTGTGATG GCTTATTTTC CTGAAACAAC CTTCTTACCA 1680 

GAAATTTTAC CATTTATGAA AGAAAATATT TTACGCTTGC CTGAGTACGG CTTATCCATT 1740 

GGTAAAAACG AAATGACAAT TAGTGAAGTA GCAGAAAGCG ACTGGGCAAC TGCTTGGAAA 1800 

AAATATTATC ATCCTGTCCG TGTCACACGA TTCTTAACGA TTGTGCCAAG TTGGGAAGCC 1860 

TATCACGCAC AAGATGAAGC AGAAAAAATT ATCACATTGG ATCCGGGCAT GGCTTTCGGT 1920 

ACAGGCACGC ATCCAACGAC TCGTTTAACC TTGCAAGCTT TAGAAaCTGT TTTACGTGGT 1980 

GGCGAAACAG TTCTAGATGT AGGAACGGGT TCTGGTGTTT tGAGTATTGC AAGTCGCTAT 204 0 

TTAGGGGCTA AAGACGTCTA CGCATATGAT TTAGATGAAG TAGCAGTTGC AGCGGCAAAA 2100 

GAAAaTATGG ATTTAAATCC CATTGCGGCC GATGTTCATG TGTCAGCCAA TGATTTACTA 2160 

AAAGGGATTG ACCATTCTGC TGATGTAATC GTCGCAAATA TTTTGGCAGA TATTATTGTG 2220 

TTAATGATTG AAGATGCTTG GCGTTTGCTA AAACAAGACG GTACCyTCmt TAtTTCTGGA 2280 

ATTATTGAAG wTAmAAAAGC aATGGTTTTA GAAGCACTAA CGAAAGTCGG TTTTGTGGTG 2340 
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TTAATPAAnn 




X X Cl X 1^ X 


THAAflAAAPP 




2400 


Xnf\i\o lOWln 




T^AAAI^Aaf^A 


TTATPPAfiAA 
X X AX v^l^A^/vV 


AArrATTTHTA 




7460 


AX in 




1 isUva Al3 i I A 


TPPr^AATCAP 


(7PPrf^ A ATn AT 
VjV^LpUAA X oA X 


PPTTPTT ATT 
^oX XuX XAX X 






aaaTaaaaPT 


I7L.UAX 1 X XAI9 


C X V^AvjAX X V3 X 


fZP A A A TTf^ AT 
uVjAAAX XIaAX 


TP A AP ATPT*^ 
X ^AAk^AX V« X 0 




TTTACTTTAA 


AGAAATTAbl 


a a a/^an^a a a 
AAAGAAGAAA 


T/^PApaa apa 
Xi^GALAAAuA 


AX XGCl^AAX X 


Pa APTPappa 




TTGCGTGTGG 


GT T ACCAAAA 


ppppapaapT 
GouGAuAAQa X 


TAPAATPPAT 
X AuAA X uijA X 


rpf^mm/^ a A A A A 

X X X L> AAAAA 


ppp a ppp a pp 


z /uu 


TAGGTGGCAA 


TCAGTTTATC 


GbCTTTUCTG 


r*f* a a a a 4- nr^f^ 
UoAAAAc X V.*v# 


01 XI3XOAAAT 


GGGACv^GAA 


^ /oU 


AAAAAGAGCA 


/^e^r^T< -jk Jk^Tk Tk Ik 

GCGAAAGAAA 


7k7k A^A»T"1»A/^A 

AAAGAT T AuA 


A A A A n»P»P/"V*/^ 

AAAAAX iGCU 


AUAlaAAGCGG 


CGGAGCAATC 


■50 Ort 


GCATCGACAG 


CAGACACCAA 


GTGTTTCTTT 


r^f^^e^e* a a a a a 
GGTGGAAAAA 


A^A^A A/^ A A A 

ACACAAGAAA 


TCAT TGCGCA 


ZooU 


ATTTGACTCA 


TATGACACAG 


TCTTGGTTGC 


CrrttK f*/^ A A /** A A 

GTATGAAGAA 


n*/^ A /^/^ A AAA 

T C AG C AAAAC 


A A f /"'Is. A A A A 

AAGGAGAAAA 




AAGTCAATTA 


GCACAAGTAT 


TATCCACTTG 


CCAGCCAGGC 


GCACGTTTGT 


GTGTGCTCTT 


3000 


TGGACCTGAA 


GGTGGCTTTG 


CACCGCAAGA 


AATTGAACAA 


TTTCTGCAGG 


CTGGGGCGAA 


3060 


ACTTTGCGGG 


TTGGGTCCGA 


GAATTTTACG 


AGCAGAAACA 


GCACCGTTGT 


ATCTTTTAAG 


3120 


TGTGGTTAGC 


TACCAAATGG 


AACTTTTAAA 


TTAAACGAAA 


GCCCGCCTGT 


AAAATTCTTG 


3180 


TAGGCATTCT 


CTTTAAATAA 


ACAGAGAGTA 


AAAAAGCGAC 


AGTGATTG/\A 


^ tfvmtnt^ ^ ^^^^^^^ 

ATTTAAGGGA 


3240 


AATTTAGTAT 


M ih fit Ik J Ulillk^* 

AATAGAGTTG 


mm^w 7k Ik ^*§n^^ 

TTGAAAGTGC 


TACTGTTTCA 


AGTGGTACTT 


TTCTTTTTTA 


3300 


mm Ik Ik ft nniVk^lk 

TTAAAATAGA 


mik Ik Ik Ik ^^Mit^Tk 

TAAAACGTGA 


7k <<«^7k Tk 1— m/^ 

ACGAATgnTC 


mrrifTi A ino/^ A a a 

TTTATCCAAA 


GGTCACCTTC 


7k fTI A Z^ A f^f^ A f^*T* 


33dO 


GAATACCATG 


^^^ik Ik Ik vk ^ Ik Ik ^ 

CCAAAAGAAG 


AAATACTGAC 


AGGCCCAGCG 


^Tf^A rpO A AAA 

GTCATCAAAA 


TTGT AAG TAG 




CTATATGGGA 


/^^■jk /^ik Ik ^ Ik fi*f^ 
CCAGAACATG 


TTGAGTTGGT 


m/^TV A A A A A 

TCAAAAAGCA 


TTAACTTATG 


/^#p/^7V A A A A 

CTGAAAAAGC 




AGAGGAAGGT 


CAAGTCCGAC 


Ik A flir^ A /^/^f|»^ A 

AATCAGGTGA 


ACCTTACATC 


ATGGaTGLGA 






AGGCATTCTG 


GCTGAATTAC 


ATATGGATCC 


r*r^1\, wr% Tk r^wn^m a 

CCATACTGTA 


A A ^ A A f1> 

GCAAGAGGAT 


mmfpm A A A 

TTTTACATGA 


JOUU 


TGTTGTCGAA 


7k m fk Ik 

GATACGGACG 


rfi 7k Tk^AOrpf^/^A 

TAACACTGGA 


^OA'P'P'PA AAA 

CGATTTAAAA 


GAAGAATTTG 


^/^r* A A r* A A T 

GCGAAGACAT 


jbbU 


f^f^^l\ TV 'POfprp Ik 

GGCAATGTTA 


GTCGATGGCG 


TGACGAAACT 


A^/^ A AAA avpn* 
AGGAAAAAxT 


AAATAItAAAfP 

AAATATAAAT 


^TOAI^/^A a^A 

GTCAT&AAGA 


J / 


AUAATTAGCT 


GAAAATCATC 


r*r*'T% AAA »¥»/*• rpnn 

GGAAAATGTT 


ATTGCaUAATfa 


f^f^rr'n ar* A tt 
oCGCAACjAT r 


X AL.GGGTCA x 


J /ou 


TaTPr*T/^aaa 




u i X X otJACAA 


TaTr^ppPAPA 
X A X o^Ln.>AL< A 


TTA AA APATT 
X X AAAAwAX X 


TAPPTPA APA 
X A^Vf X AAo A 








a ar» a a a r*T*T»T 


APA AATTTAT 
AvaAAAX X XAX 


PPPPPTTTAf^ 
V3V>olk>^X X XAu 


PAPATPf^PTT 


3900 






f^/lIT* A ATT AflA 


aPATAPTPPP 
Av3A X MV^ X VjV^V^ 


TTAPf^TTATA 
X IAOV7X X AXr\ 


TT A APPPP A A 


3960 


PPAATAPTAT 




AX X XuAXoWi 


AAfZPAAAPCP 
AAoVaAAAwV?^^ 


OA X oAA^o X \D 


AAHPTTAT^^T 
AAV3v« X X AX U X 


4020 


GGCAGAA6CA 


GTCGAAGATA 


TTCGTTTAGC 


yACGGAAGAT 


TTAGAAATTT 


ACGCrGAAAT 


4080 


ATATGGTCGA 


CCAAAACATA 


TTTATTCAAT 


CTATCGTAAA 


ATGAAGGACC 


AAAAGAAACA 


4140 


GTTTAATGAA, 


ATCTATGACT 


TATTAGCCAT 


nCGTGTTATt 


GTCGATTCCA 


TTAAAsaTTG 


4200 


CTATGCCGTG 


TTAGGGcGAk 


CATACACGTT 


GGACACCAAT 


GCCAGGTCGC 


TTTAAAGATT 


4260 


ATATTGCTAT 


GCCAAAAGCG 


AATATGTATC 


AATCTATCCA 


TACGACAGTT 


ATTGGACCAA 


4320 
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AAGGGAATCC 


AGTAGAGGTC 


CAAATTCGGA 


CCCATGAGAT 


GCACCAAATC 


GCCGAATTTG 


4380 


GGGTGGCGGC 


TCACTGGGCC 


TATAAAGAAG 


#H^^m m IK ^^^^^^^^^ 

GCAAAGCGGA 


AAAAATTGAA 


ACAGATGAAC 


4440 


TGACGAAGCA AGTCGATTGG 


TTCCATGAAA 


TTATTGAATT 


« ^4k Ik Ik Ik 

ACAAGATGAA 


AGCTATGATG 


4500 


CTTCTGAATT 


TATGCAAGGT 


GTGAAAGAAG 


ACATTTTTAG 


CGACAAAGTC 


TATGTCTTTA 


4560 


CGCCAAGTGG AGATGTAACC 


GAACTGCCAA 


AAGGCTCCGG 


ACCACTAGAT 


TTTGCTTACA 


4620 


GTATCCATAC 


TGAAATTGGG 


AATAAAACAA 


CCGGTGCGAA 


GGTCAACGGC 


7W\AATGGTAC 


4680 


AGTTAGACTA 


TGTCCTAAAA 


AATGGGGATA 


TTATTGAAGT 


CCTAACATCA 


CCGAATTCTT 


4740 


TCGGACCTAG 


TCGCGACTGG 


CTGAAAATGG 


TTAAAACAAG 


TAAAGCCAAA 


AATAAAATCA 


4800 


AACGTTTCTT 


TAAAGAACAA 


GACCGTGAAG 


ATAACATTAT 


TAAAGGGCAT 


GATGCTGTCA 


4860 


TTAAATACAT 


GACAGAAATC 


GGCTTTACAC 


CGAAAGAATT 


TTTAACGAAG 


AATAAAATGG 


4920 


CGGAAGTGTT 


AGATAAATTT 


AAATTTCAAA 


CGGAAGATGA 


TTTATTTGCG 


GCAGTAGGTT 


4980 


ATGGCGAAGT 


AAGTGCGCAA 


GTAGTGTGCA 


ATCGTTTAAC 


AGAAAAAGAA 


CGCCGCGAAC 


5040 


AAGAGTTGGA ACGCCAACGT 


CAAGAAGCAG 


AAGAGTTGTT 


AAATCAGCCA 


GCTAAAAAAG 


5100 


AATCAGAAAA AATGAAAGTT 


CGTCATGAAG 


GCGGCATCGT 


GATTCAAGGG 


GTGGACAACC 


5160 


TATTGATTCG 


TTTAAGCCGC 


TGTTGTAATC 


CAGTGCCTGG 


TGATGAAATC 


GTTGGTTACA 


5220 


TTACCAAAGG 


ACGTGGCGTT 


TCAATTCATC 


GTGCGGACTG 


TCCTAACGTA 


CAACATCAAG 


5280 


AAGAACT7VGC 


ACAACGTCTA 


ATTGAAGTTG 


AGTGGGAAGA 


TACCGAGCAC 


AGTCGCAAAG 


5340 


AATACGCTGC 


CGATTTGGAG 


ATTTACGGCT 


ATGATCGTAG 


TGGCTTACTG 


AGTGATGTTT 


5400 


TACAAGTCAT 


TAGTTCGATG 


AC/UVAAAATT 


TAGTCGGCGT 


TGAAGCTCGT 


CCATCCAAAG 


5460 


ATAAGATGGC 


CCAGATTCAT 


GTTACTGTGA 


AAATTCAAAA 


CCTTTCTCAT 


TTAAAAACCA 


5520 


TCGTTGACAA AATTAAAAGT 


GTACCAGATG 


TTTATAGTGT 


TCGTCGAACC 


AATGGGTAAG 


5580 


TGTTAAAAGT 


GAGGCAAAAA 


AGATGAAAGT 


AGTTATTCAA 


AGAGTCAGTC 


AAGCACAAGT 


5640 


TGCGATAGAA 


GAGCAGATAG 


TGGGACAAAT 


CAAACAAGGC 


TTCATGGTGC 


TGGTGGGGAT 


5700 


TCATCAAGAA 


GACACGCCAG 


AAGATGTGGC 


TTATGTTGTT 


GGCAAAATCA 


GTAAATTGCG 


5760 


TGTTTTTGAA 


GATGACGAAG 


GAAAAATGAA 


TCGCAGTATT 


CAAGAGATTG 


AAGGATCAAT 


5820 


TTTGAGTATC 


TCTCAATTTA 


CCCTCTATGC 


AAAAACTAAA 


AAAGGCAATC 


GTCCCAGTTT 


5880 


TATTGAAGCG 


GCTCGTCCAG 


ATGTGGCGAT 


TCCTTTGTAT 


GAATTATTTA 


ATCAACAATT 


5940 


AGAAGCAGAA 


GGAATAGCGG 


TAGCCACAGG 


AGAATTTGGC 


GCGGATATGC 


AAGTGTCGTT 


6000 


GACCAATGAT 


GGTCCAGTGA 


CCATTGTGAT 


TGATACACGA 


GAAAAATAAT 


TGATTTTCCT 


6060 


GTCGGAACCC 


CTTGACTTTC 


TAAGA6AACT 


TTAGTAGTTT 


ATAGATAATA 


TTAAAAATCT 


6120 


ACGAAGGAAC 


. GAGTAAGTTT 


CATCTGTCTG 


CGAAGAGAGA 


GTTGCCGTGG 


CTGAGAGCAA 


6180 


CTTCAACAGT 


GAAAGTGAAA 


GACATTCTGG 


AGATGGACGG 


AAGAAAAGCC 


GTCCCGTTAC 


6240 


GAGCGTTAAT 


CGTTGAGGAA 


AGAGTAAAAT 


CTTTCAAAAC 


TAGGTGGTAC 


CGTGTAAAAG 


6300 
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AGACTTTTAC 


GCCCTTGTAT 


GTTCAACATA 


CAAGGGCTTT 


TTTCATTTTT 


AG7\AAAAAGA 




AAATGGCGCA CACATTTTTG TGGAATGATA 


GAAGGAGAGA 


AAGACAATGA 


GTTATCAAAA 


6420 


ACCAAAAGGA ACAAACGATA 


TTTTGCCAGG AACTTCTGAA AAATGGCAAT 


TTGTGGAAGA 


6480 




TTGATTTTTA AAGATTATCA ATACCAAGAA ATCAGAACCC CGATTTTTGA 


b34U 




la 1 AA 1 Al V« A v« 






GATATTGTTT 


CAAAAGAAAT 


DbUU 


GTATGATTTT 


«P7V A T A A A r* 

TATbA 1 AAAo 




v^vj X vxrvv^vsv^ X n 


CGTCCTGAGG 


GGACAGCGCC 


ccct\ 
oobU 


aATTGTl CbCi 


GCCTTCGTTG 


aaa aTaaaTT 
AAAAlAAAl 1 




GAATATACGA AACCATATAA 


6720 


AACCTATTAC 


ATGGGGCCGA 






CAAGCTGGTC 


GTTTGCGTCA 


6780 


ATTCCATCAA 


ATTGGTGTGG 


AACjVJA ill yjSa 


TA^'pr'a a a a^ 
1 Ao i AAAA^-. 


CCAGCATTGG 


ATGTTGAAAT 


6840 


CATGGCTATG 


GCTTTGGACT 


TCTl uAAAUA 


All AvjVjrt- A 1 U 


CAACAAATCA 


AATTAGTTAT 


6900 


TAATTCCTTG 


GGGGATAAAG 


AAACACGTGC 


TACLj 1 ACCG 1 


CAAGCATTAA 


TCGATTATTT 


6960 


AGAGCCCCAT 


ATGGCAGAAT 


TAAVsUCzAbtiA 


TT/^ar*aar*r*T 


CGCTTACACG 


AAAACCCATT 


7020 . 


GCGGGTGTTA 


GACAGCAAAG 


ATAAAAAAiiA 


L-AAbbliaAJ. 1 


GTCGCAGAAG 


CGCCCTCCAT 


7080 


TTTGGATTAT 


TTAAATGAAC 


CAT CT AAAbC 


A 1^ A TTTTr* a a 
AWVX 1 1 luAA 


GCAGTAACTG 


ATATGTTAGA 


7140 


TTTACTAGAA ATTCCTTATG 


A A AfifP^ AfP A/^ 

AAATTuAi Ab 


lAAiAliaul i 


CGTGGCCTGG ATTATTATAC 


7200 


ACACACAATT 


TTTGAAATTA 




^r»o»p A a a a HY* 
UCUlAAAAlVs 


GGTGCGCAAT 


CAACTATTTG 


7260 


T6CAGGAGGC 


CGATACAATG 


GTT TAb 1 1 iiA 


ar'aa'PTappr* 


GGCCCAGACA CACCAGGTTT 


7320 


TGGTTTTGGT 


ATGGGCATTG 


AvsCuAo 1 o 1 i 


pTTaapaaT/^ 

bi iAAV«>AAIV3 


GAAGCTGAAG 


AAGTTGTGAT 


7380 


TCCAGCGTTA 


TCTGAATTAG 


arrr'aTaTCT 


VaiViX A\7\9\3AX X 


GGTTCAGACA CCAACGTCGC 


7440 


AGCTTTGCAA 


CTTGTTCAAA 




I^X X X UV3X X X v« 


TCAGCTGATC 


GTGATTACAT 


7500 


GAATCGCAAA 


CCAAAAGCGC 




CCCCGATAAA 


TTACAAGCAA AATTAGTTTT 


7560 


AACAATCGGT 


GAAAATGAAT 




CATTGTCAAC 


GTAAAATCAA TGGCAACACG 


7620 


CGAAGAAAAA 


GCCTTCCCGT 


TAAGTGCTAT 


TCATGATTCA 


TTTGATGAAG 


TGTATGACGA 


1 f O A 


AATGATGACA 


AAAATGATTG 


AAGAATAAAA 


AGGAGAGATT 


GAGAAATGGA 


AAAACGCACA 


7740 


ACCTATTGCG 


GCAATGTATC 


TGCCGAATTT 


ATTGAAAAAG 


AAGTTGTATT 


AAAAGGATGG 


7800 


GTTCAAAAAC 


GTCGTGACTT 


AGGAGGCGTT 


ATCTTTATTG 


ATTTACGTGA 


CCGTGAAGGA 


T o c r> 
/obu 


ATCGTCCAAG 


TGGTCTTTAA 


CCCAGAAAAG 


TCAAAAGAAG 


CATGGGAAAT 


TGCTGATAAA 




TGTCGTAGTG 


AATACGTCAT 


CGAAGTTAAA 


GGTCAAGTCG 


TTTATCGTGA 


TAAAGAAGCG 




ATTAATCCCA 


AAATGAAAAC 


TGGCGAGTTT 


GAAGTGATGG 


CGACTGACAT 


TACGATTTTA 


8040 


MTACTGCGA 


AAACAACGCC 


ATTTACCATT 


GAAGATGACA 


ATAACGTTAA 


TGACGAATTG 


8100 


CGCATGAAAT 


. ATCGTTATTT 


AGATTTACGT 


CGTCCATCAA 


TGACAAACAA 


CATTAAATTA 


8160 


CGTCACCAAG 


TAACGAAAAC 


AATCCGTCAC 


TATTTAGATA 


ATCATGACTT 


TTTAGACATT 


8220 


GAAACTCCTT 


ATTTAGGCAA 


ATCAACACCA 


GAAGGCGCTC 


GCGATTATTT 


GGTTCCTTCT 


8280 
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CGTGTCCATG CCGGTCATTT TTATGCTTTA CCACAATCAC CACAACTCTT TAAACAATTA 8340 

TTAATGGGGG CAGGCTTCGA TCGCTATTAC CAAATCGTTC GTTGTTTCCG TGATGAAGAT 8400 

TTACGAGGCG ACCGTCAGCC AGAATTTACA CAAATCGATA TTGAAACGAC CTTTTTAACA 8460 

CCAGAAGAAA TTCAAACATA TACAGA 8486 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 789 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



AATGTCAAAA 


TTATTAGTTG 


TTAAAGCACA 


TCCATTAACA 


AAAGAAGAAT 


CACGCTCAGT 


60 


TCGTGCGTTA 


GAAACATTTT 


TAGCTTCATA 


CCGTGAAACA 


AATCCATCAG 


ACGAAATCGA 


120 


AATTTTAGAT 


GTTTATGCCC 


CAGAAACAAA 


CATGCCGGAA 


ATCGATGAAG 


AATTATTATC 


180 


TGCTTGGGGT 


GCACTTCGCG 


CTGGCGCAgc ATTTGAAACA TTAAGCGAAA ACCAACAACA 


240 


AAAAGTGGCT 


CGTTTTAATG 


AATTAACTGA 


TCAATTTTTA 


TCTGCAGACA 


AAGTAGTAAT 


300 


TGCTAATCCA 


ATGTGGAACT 


TAAACGTACC 


GACACGCTTA AAAGCTTGGG 


TAGATACAAT 


360 


CAACGTTGCT 


GGAAAAACAT 


TCCAATATAC 


TGCAGAAGGA 


CCAAAACCTC 


TAACAAGTGG 


420 


TAAAAAAGCC 


TTACACATCC 


AATCAAATGG 


CGGCTTCTAC 


GAAGGAAAAG 


ATTTTGCTTC 


480 


TCAATACATT 


AAAGCGATTC 


TAAACTTTAT 


TGGCGTTGAT 


CAAGTTGACG 


GATTATTCAT 


540 


CGAAGGAATC 


GATCATTTCC 


CTGATCGCGC 


GGAAGAACTT 


TTAAATACTG 


CCATGACCAA 


600 


AGCAACTGAA 


TACGGTAAAA 


CATTCTAAAT 


TTTCCGAGTG 


ATTTACATCA 


AAAaGAGACA 


660 


AGCAGTTCGT 


TCACTGCTTG 


TCTCTTTTGT 


TACGTAGATT 


CTTGTTGaTC 


TGGcTTATCt 


720 


TCATcATACA 


AACCGGaAAC 


GaTTTGGtAC 


CACTCaAAAA 


TGTGTGTAAT 


ATCACnTTAG 


780 


CTGCGGCAT 












789 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CGTCGTTTAT TACGTCGTGA CGGTAATGCA CGGGAAAAAA ATTAGGCATT AATGAAGCCT 60 

TTTTATACAA.ATTAGTGCCT GTTGTTGGCG AAATCATGGT AAGCTACTAT CCAGAAGTAT 120 

TACAACAAAA AGACTTCATC GAAAAAGTGG TTCGGACAGA GGAAGAAnGT TTCCATGAAA 180 

CCATCAATGA AGGTTTAAGC ATGTTAAATG AGGTTATTAA AGAAGTTAAA GATGCAAAAG 240 
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GCGATACATT AGATGGAAAA . ATTATCTTCA AACTTTATGA TACTTTTGGc TtCCCCGTAG 300 

AATTAACGGA AGAAGTTGCA GAAGATGAAG GCTtGAAAGT CGACCAtGCT GGGTTTGAAA 360 

CAGAAATGGA AGCACAGCgT GAACGTGCTC GTTCTGCCCG CAGTAAAGAA ACTTCAATGG 420 

GTGTTCAATC TGCTTTATTA ACAGATATTA AAGTAGAAAG TAAATTTGTT GGCTATACAG 480 

AATTAACACA TGATAGTGAA TTATTTGTTA TTATTCAAGG TGACGCACTA GTAAATGAAG 540 

CATCTGCAGG AACAGCCGAA TTAATTTTTG CTGAAACACC ATTTTATGCT GAAATGGGTG 600 

GACAAATTGC TGACCGCGGC TATGTAAAAA ATACCGCAGG GgAAGTCGTT GCCAACGTGG 660 

TGGATGTGAA AAAAGCACCA AACGGTCAAT TTTTACACAA AGTAGAAGTT TTGGCGCCAT 720 

TAGCAGAAGG TCAAATTTAT CAATTGCAAG TGGACGAACG GATGCGGACA CGTaTTTTGA 780 

AAAATCATAC GGCAACCCAT TTATTGCATC GTGCTTTAAA AGATGTGTTA GGGGaGCATG 840 

CCAACCAAGC AGGCTCATTA GTTGCACCAG GACATCTACG TTTTGACTTT ACTCATTTTG 900 

GGCAAGTTAC ATCAGAAGAA CTAGCGCGGA TGGAAGCCAT CGTTAACGAA AAAATCTGGG 960 

AAGCTATTCC TGTTGTCACA ATTGAAACAG ATATTGATAC AGCGAAAAAC ATGGGCGCAA 1020 

TGGCGTTATT TGGCGAAAAA TATGGCAAAG AAGTCCGTGT AGTTAATATT GGGGATTACT 1080 

CTATCGAATT ATGTGGTG6A ACACACGTTG CTAATACAGA AGATATCGGG ATTTTCAAAA 1140 

TTGTTTCTGA ATCTGGAATT GGTGCAGGAG TACGCCGTAT TGAAGCAGTG ACGAGTAAAG 1200 

AAGCGTATCA ACTTTTACAA GAAGAAGAAC GTCAATTGAA AGAGATTGCT ACATTAGTTA 1260 

AATCACCTCA ATTAAAAGAA GTGGTTACGA AAACAGAGCA ATTGCAACAA CAATTACGTG 1320 

ATCTACAAAA AGAAAACGAA CAATTGGCAG GCAAATTAGC GAACCAACAA GCTGGTGACA 1380 

TTTTTAAAGA TGTCAAAGAC ATCAATGGTG TTCGTTATAT TGCCGCTCAA GTCAATGTTA 1440 

AAGACATGAA TCAATTACGC CAATTGGCTG ACCAATGGAA ACAAAAAGAA TTGTCTGATG 1500 

TACTAGTTCT AGCAACTGCA CAAGATGAAA AAGTAAGCTT GTTAGCGGCT ATGACAAAAG 1560 

ACATGAACGG AAAAGGCTTA AAAGCCGGTG ACTTAATCAA AGCAATTGCG CCAAAAGTTG 1620 

GCGGTGGCGG CGGTGGTCGT CCTGATATGG CTCAAGCTGG TGGGAAAAAT CCAGCAGGGA 1680 

TTGCGGATGC CTTAGCAGAA GTTGAAAACT GGTTGGCAAA TGCCTAATTA AAAATGTTTT 1740 

CCCTATTATT AATATAAGGA GAATGCTCTT TATAGCaATA ATGACAGCCC TCTTCAAAGT 1800 

TATGAAATTG ATTTTGAAGG GGGTTCTTTT TTTtAAATGA TTAAGGAGAG GAAACAGTAG 1860 

ATTGCACGAA TGTGTAAGTT TTCATCTTCA ATTGACAATG GTATAATAAG CAACGGAGGA 1920 

ATCAAAAATA TGAAT6AAAA TCAATTaTCa AAACGTCTCG CAACGGTTGG TGATTTAATC 1980 

CCTAGAGG7UV GTCGCTTAGC AGACATTGGT TCAGACCATG CTTATTTACC AGTTGCTTTA 2040 

ATGTTAGAAA ATAAACTGTC TTTTGCTGTG GCTGGGGAAG TGGTGGAAGG CCCCTATCAA 2100 

TCTGCCAAAA CCCAAGTCAG TAAATCCAAT TTGACGGATA AAATTATCGT ACGTTTAGCG 2160 

AACGGGCTAG ATGCAATTGA ACCTGAAGAT CAmATAGACG TCATTAGTAT TTGTGGAATG 2220 
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GGCGGCACAT 


TAATCCGAGA TATTTTAGAA GCTGGACGCA AAaAAAATCG TTTAACAGGG 


2280 


AAGGAACGCC 


TTGTTTTACA ACCGAaTATT GGTGAACCAA CTTTACGCCG CTGGTTAATG 


2340 


GCAAATGACT 


ACAGTATTAT TGATGAAACA ATCGTGGAAG AAAACCGTAA ATTGTATGAA 


2400 


ATTATCGTAG 


CCGAAAAAAC AGAGCAATCC GTTTCTTATA CGGACCAAGA ATTACTATTT 


2460 


GGGCCAGTTT 


TAATTAAAAA ACAAGGACCG GTTTTTACTA AAAAATGGCA ACGAGAATTA 


2520 


AAACAGCGCA 


AAACTGTTCT TGCACAACTA GCAAAAGCAA GTGGAGAACA TATTGAAAAG 


2580 


CAAGCGAAAC 


TACAGCAAGA CCAACAATTA ATTGAGGAGG TACTGGCCAA TGGCTGTGAA 


2640 


CGGTAAGACA 


TTGATTCGAC GCTTCAATGA CTATTGTCCC GAATGGTTAG CAGAAACGGG 


2700 


AGATCCTGTG 


GGCTTGCACA TTGGTACATT AGATAAACCA ATTGAAAATG TGATGGTTAC 


2760 


CTTAGATGTT 


CGTCCTGAGG TCGTTGCAGA AGCAATTGAG AAGCAGGTTG ATTTAATTAT 


2820 


TGCGAAACAC 


CCACCTATTT TCCGCCCAGT TAAGCGATTA ACCACAGATA ATTTTCAAGA 


2880 


AAAAATGTAC 


GCTGATTTAT TAAAACACGA TATTGCTGTT TATGCAGCGC ATACTAACAT 


2940 


GGATATTATT 


GACAATGGGC TAAATGATTG GTTTTGTGAA TTATTGGGAA TTAAACAGAC 


3000 


CACCTTTTTA ACTAAAACAC ATACTGTTCC TTATAAAAAA TTAGCCGTTT TTGTGCCAAT 


3060 


TGATGAAgCT CCGCAAATGA GAGAAGCTTT AGGTCTAGCT GGCGCCGGGT CTCAAGGTGA 


3120 


TTATTCTAAA ACAAGTTACT CTTTAATTGG AACAGGACGT TTTACACCAA CACAAGGAGC 


3180 


AAATCCAACG 


ATTGGCGAAA TTGGTCAAGA AAGTGTCGTC CAAGAAGCAA AAATCGAAGT 


3240 


GATTTTCCCT 


GAAACTAAGC AAGAACAAGT TCTTGCAGCT ATGTTACAGG CGCATCCTTA 


3300 


TGAAGAACCG 


GcGTATGATG TTTATACCAT TGAAAATCAA TCAAAAGAAT TTGGTTTAGG 


3360 


TCGGGTTGGA GTATTAGACA AGCCTGTGAG ATTCTCTTAT TTTGTACaGC AAGTCAAGGa 


3420 


AGCTTTCCaA 


TTGGATGGCT TGCGGJcTCAT TGCAAAAGAT GATACTAAAA TGATTCAACG 


3480 


GGTAGCCATT 


TGTGGTGGAA GTGGTGAAAA ATTCTATCAT GATGCATTAC GTAAACAAGC 


3540 


CGATGTCTAT 


ATTACTGGTG ATGTGTATTA TCATACGGCA CATGACATGA TTGCTGAAGA 


3600 


TTTACCAGTT 


ATCGATCCTG GTCATTACAT TGAAGCGCTT TGTAAGCCAr AACTTGTGGA 


3660 


ATTAATGAAT 


CAATGGAAAC mAGAAAaTGA GTGGgCTGTA TCCATTTTkG AaTCAGAAGC 


3720 


GAATACCAAC 


CCATTTCGTT TTAAATAAAA AAATGAAGAT TAAAGGAGTG TTCTTTTATG 


3780 


TATGAAAATC 


TATTACCTCG TTTTTTACGG TATGTGAAaA CAGAAACACG CTCGGATGCA 


3840 


ACTAGCACAA CGACACCATC AACACAAACA CAAGTAGCCT TTGCACAAAC CTTAAAAAAA 


3900 


GAATTAGAAG 


AATTAGGGAT GAGCGATGTT ATTTATAACG AAACAAATGG TTTTGTGATn 


3960 


GCTACGTTAC 


CTAGTAACGT GGAGAAAGAC GTTCGTTCAA TCGGCTTTAT TGCCCATATG 


4020 


GATACCGCTG 


ATTTTAATGC AGTGAATGTT TCTCCGCAAA TTGTTGAGAA CTATGACGGA 


4080 


GAATCAACGA 


TTCCTTTA6A TAAAGAAGGC AAATTCACTT TAAACACGT^ AGACTTTCCT 


4140 


AACTTAAAAA ATTATCGTG 


4159 
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(2; INFORMATION FOR SEQ ID NO: 8: 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



ATCGATACAA 


TTTTTCTTTT 


GTCACAACAG 


AAGAAGAATT 


ATTAGAGATT 


TTAAAAGATG 


60 


CCTTAAAAGA 


TAAAGCCTTA 


GTTGTCAGTA 


CATTAGTCAG 


TAAACAACTA 


ATCACAGCGG 


120 


CGAAAGAATT 


TAGTGAACGA ACAGGGTTGT 


TATATTTAGA 


TTTAATGGCG 


CCATTTTTTG 


180 


AATTAATTCA AGCGAAAGCC 


GGAGTAGATC 


CTATTGAAGA 


GCCTGGACGA 


CGCCACCAAC 


240 


TAGATCGTGC 


CTATTTTGAT 


AAAATCTCAG 


CGATTGAATT 


TGCTGTAAAA 


TATGaTGATG 


300 


GCAAAAATCC 


TCAAGGGTTT 


CTTGATTCTG 


ATATATTGTT 


GTTAGGCGTT 


TCGCGGACCT 


360 


CAAAGACGCC 


AGTCAGTATG 


TATTTAGCGA ATCAAGGCTA 


CCGCGTTTCT 


AACTTACCAT 


420 


TAATTCCAGA 


AGTTCCATTG 


CCGCCAATTT 


TGGAAGAAAT 


GGATCCACAA 


AAAATGATTG 


480 


GTTTAGTTTG 


TTCGCCAGAA 


ACATTAGGAC 


AGATTCGTAG 


CAGTCGGTTG 


GCTTCCTTAG 


540 


GTTTAGGTAA 


TGAGACCAGT 


TATACCAATG 


TTGAACGGAT 


TGAACAAGAA 


TTAGCTTATG 


600 


CCGAAGAGAT 


TTTTGCGAAG 


TATGGCATCC 


CAGTGATTGA 


TGTAACCGCA 


AAATCTGTCG 


660 


AAGAAACAGC 


CTTTTTAATT 


AAAGAAAAAC 


TAGATGAAAG 


AAATTAAAAA 


GCTTGGCAAC 


720 


CTCACTAAAT 


TTCAGTGAAG 


TTGCCAAGCT 


TTTTATACTT 


AATTGAGCTT 


CTTTTTGTTA 


780 


GTCAATATTT 


TTTTGATCGA 


CAATGAGTAA 


AGCGATCATC 


ACGTAGACAA 


TCCAACTACT 


840 


TTTAGAATAA AGTGTATAGC 


GGGGTTGCCA 


TTTGCTAGCA AATTTTTCCT 


TGTATTCGCG 


900 


TAATCCTTGG 


AAGGAGTAAA 


TTTCTGAACC 


AAATTCATAA 


ATTAACGCAG 


CAATCCGTTC 


960 


TTGAATAAAA CTTTTTCTTG ATTGACCAAC GTTGGCTAAC GGAGCCATCC 


CTAAATTGAA 


1020 


ATAATGAATA 


TTT TCTGTTT 


TCATATATTC 


AAATAAATGA 


ATAAATAAGA 


AATCCATACT 


1080 


TCCAGAAGGG 


GCTTTTTCTT 


TATGGTGACG 


CATCAAGTCG 


ATGGTACCAA 


CTTCATTTGT 


1140 


GTAACTAGGA ATGATATTAG CGAAAGAGAC CATTTCTCCT TCAGCATTTC 


GGACAACAGC 


1200 


AATTGGGTTT 


CTTTGAAGAT ACGCTTCATC 


GAAAAATCCT 


AGTGAAAAGC 


CTTTTTCTTT 


1260 


TCGATTGTCT 


AACCAACTAT 


CCGAAATAGC 


TTTTAATTCG 


TGCATTTGTT 


CGGTACTAAA 


1320 


A6GAGGTTGG 


AGTACGTCAA ATGAATAGCC 


GGCTTTAGTA 


ATTTTATTAA 


ACACTGCGCG 


1380 


GGTACCTTTG 


AATTTTTTCC 


CAGACATCGT 


TUt^TGTTTCT 


AGATTAACTA 


AGGCTTCTTC 


1440 


GCCCATTTTG 


ATAAATTCGT 


AGCCATATTC 


GTGTAGTAAC 


ATCACGCTGT 


CCTCATTGGT 


1500 


TTCGTAAAAG 


ACGGGTACAT 


AGCCGAAACG 


ATCTGTTTCT 


TTAAGGAATG 


CATCAATCGC 


1560 


CGCAGGAAAA 


TCTTCTTTGT 


TCCCAGAAGG 


GTTGCCCATC 


ACAACGCATT 


TATTATTAAT 


1620 


GGTGTTGAAT 


TGTAAAAAGA 


CAGTCGGTTC 


TTCTTTGGGA 


TATAAGAAAA 


CCTGTTTATC 


1680 
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ATGAAGAAAG 


ACAAGTTCAC 


TGTCAATATT 


CCCGCCATAA 


GTTGTTAAGA 


TATTATGCAA 


1740 


GATACTATCT 


TCTGGAAACT 


CACCAACTTG 


GTGTTTTTTC 


CCTTGGAGGT 


AGCGAACAAA 


1800 


TAGAAAGTTA AAACTCATAA 


CTAAGAGGAT 


AGCGATAAAT 


CCTGACAACC 


AGATCCGTTC 


1860 


GGAAGGGAAC 


AAGAAAAATT 


CCACAAAATG 


ATGACGATGG 


TGTGGGAAGT 


TCGGTAAATT 


1920 


ATAAACACCA ATTACCAGGT 


ATAACAAAGT 


CAACAAGCCA 


AAGATAAAGC 


CATCAATCGT 


1980 


CATCCACTCC 


CAAGAGTAAA 


CCAGTTGTTC 


ACGGTACAAT 


TCTGATTTTG 


AAAAAATAAC 


2040 


GATAAATAAT 


AAAATAGCTA 


AATAGAAAAT 


GGAAAACATA 


CTAAAGTCAA 


CAACAAATGA 


2100 


ATACAAGATA 


GCACCAATTA 


ATAAAATAAT 


CGTTGGATAA 


TAGGCTCTTT 


tAaCmCGATC 


2160 


AGCATCCCcG 


TCCCAT/^TT 


mATAATGCAA 


AGCCTAAAAC 


AATACTCGGA 


ATTTGnATAA 


2220 


TAATATGTGA 


ACGGAAGGGG 


TTAATATCTC 


TTAACCAATG 


CACTTGAGTG 


AAGGCTTCTG 


2280 


GAATAGTGGC 


TAGGAGCACC 


ATCATAATTC 


CTGAAAAATA 


AAGTJ\AAACC 


ACTTCTAGCT 


2340 


TGTGGGCAAT 


TTCCAAAGTA 


ATTTGTTTTA 


GCAGTTGACG 


ATAATGAGTA 


TCCAACTTTT 


2400 


GGCTTAAGTG 


TTTTGTAAAG 


AACAAACAGC 


CAATTAAGAA 


CGGAATAATA 


TAATAGAATA 


2460 


AACGATAGAG 


CAATAACCAC 


AAAACAATAA 


TTtCACGATC 


GACACCTAAG 


TTGCTCAGTC 


2520 


CTAAAATCAT 


CATGACATCA 


AAACTACCTA 


AAGCCCCTGG 


AATCATCGAA 


GCAATCCCAA 


2580 


TAATCGAAGC 


TGCTACATAT 


AAAGGAACAA 


TATCAATTAA 


AGGAATCGGT 


ACATCTAATA 


2640 


AGTAACCAAT 


ACTGATAAAG 


GTAATTAAAA 


CACCTGTCCA 


TTCTAAAAAC 


GAaGAAACAA 


2700 


CTAGAGATAA 


GCGATTTTTT 


GCATCGAGGT 


TACCAAAGGA 


ACTATTTTTT 


TGGAAATGTA 


2760 


AAATAAGATA AAGTAACGGG 


AAATAAAGAC 


CGCCACCTAA 


TAACCAAATC 


CAATATTGCT 


2820 


GTAAATAAGC 


CATCTTTGGA 


TCAAACTGAA 


CAAGGAAAAA 


GCAAAGAATA 


CTATAAATAG 


2880 


ATAGACCGGA 


TAACACAAAA 


AGTAAAATAT 


GGGTTAACGC 


AGGTAAAATT 


TTCTTTTCCT 


2940 


CTGTTTTATT 


TCCATAAAAT 


TCAGAACGAA 


GTCCCATGCT 


GACCAAACCA 


CCGAATCCAC 


3000 


CAaTATTATT 


GATTGTATTG 


ACAATCCAGC 


TAGATTCAAG 


GAGTTCCCGT 


TTTGAGACTT 


3060 


TCAAGTTTAA 


AATGCGATTT 


AATGTTAAGT 


CATAATTTAG 


CATCGGAGTA 


ACGGAAACTA 


3120 


AACCGATAAC 


CATCATCAAT 


AACAATTTCC 


ATAGCGGAAT 


CTCATCAA/^ 


ATCTGTTTTA 


3180 


ACTGTTCAAA 


TGAGATCGTT 


TTACCAATTG 


ATAAAAGTTG 


GGCAACAACA 


ATAATTGAGA 


3240 


CCGCAATAAC 


AAACAGTGTT 


TTTAGTAATC 


CTGTATGCGT 


TTTGATCCAA 


TGAATAATTT 


3300 


GTTTCATTTA 


GACACCTTCA 


TTTCCTTTAA 


GAATTTTTCG 


ATTGTTTTAT 


TAAAAATATC 


3360 


TGGTGAGTTT 


TTGGCAAATA 


AATGACCCTG 


TTTATTTACC 


AAGACCAATT 


TGCCGGCAGG 


3420 


AATGGcATCA 


GCAATCGCCC 


GTGAATGGGA 


AAGCGCAATT 


AAATCTCGAC 


TGCCAGTAAT 


3480 


AACAAGAGTA 


G6ACAGGGGA 


TGTGTTCTAA 


ATCTTGTTTT 


GTCAGTCCAG 


TATCTTCAAA 


3540 


TAACAGAGCC 


ACGATAGGGA 


GCCGTGCGCG 


CAATTTTTTT 


GAAAAAAAGG 


CGCCAATACG 


3600 


ACAAAACCAA 


TATTGAAAAT 


AAGTGGCAAT 


CGTTACATAT 


TTTTTCAGAC 


CAGACATGGT 


3660 
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GACATTTCCA 


GT^TTTAAGA 


CGAGTTTGTT 


GACGAAATCA 


GGATATAAAT 


GGGTAAAAAC 


3720 


CATGGCAATA 


TTTGCTCCGT 


CACTAAAGCC CAATAAATTA ATTTTTGTTA AGTGTTCTTG 


3780 


TTGAATAATA 


CCAGCTAAAT 


CTTCTGCCAA 


AAGCGGAAAA 


GTTAATTGAC 


TTTGTGTATT 


3840 


GGTGGAACGT 


ccatgtc::gc 


GGGTATCCAC 


CGCAATAACT 


TGATACTCCT 


TGGAGAAATA 


3900 


AGAAATTTGA TAATTAAAAA 


AAGTATGATC 


ATTGCCATTA 


CCATGAAGTA 


ACACAAGGGG 


3960 


GTAATTTGTT 


kCTTTGCCAA 


AAATTTGAyA AAAAATTTGA CTGCCATCTA TTGTTTGAAA 


4020 


ATATTTTTCT 


ATTGCCATTC 


TCCGTCACCT 


ATCTCATCTA 


TAATCAATCG 


AGTCCTCTCT 


4080 


CATTATAGAG 


TCTTTTGTCT 


TTTTTTTGTA 


ATAAAAAACA 


GAAT^TTTTC 


CGAACAGTTT 


4140 


TGTAAGGTGT 


TGTAATATAG 


TGTGATATAG 


TAGAAGATAG 


CCACCTAAAA ATTAGACTAG 


4200 


AGGAGGGAAA 


GTTGATGTCT 


TTAATCGTTT TAATTGGTGC CCAAGCTGT C GGGAAAATGA 


4260 


CCGTTGGCAA 


GGCTTTAGAA 


AAGCAGTTA6 


ATGCTAAATT 


ACTATTTAAT 


CATCAAACCA 


4320 


TTGACTTATT 


TGCAAACTAT 


CTTGGGTATA 


CGGAACGTGc 


ATTTCAATTG 


TCTGATTCCG 


4380 


TTAGAAAGGA 


ACTTTTTCAT 


GCCTTTGTCG 


AAAATCCAGC 


CACAAATACA 


ACAAAAAACA 


4440 


TTATTTTTAC 


AGTGGTGATT 


GCTTTTGATC 


AAGAACAGGA 


TTTTCAATTT 


TTAGAAGAAA 


4500 


TATCAGAAAT 


TTTTCTTAGT 


AAACATGAGT 


CCGTTTATTT 


CGTCGAACTC 


GTCTCTTCAA 


4560 


CATCAATCCG 


CCTAGAACGA 


AATGTTCATC 


CAGAACGGCT 


AGAGGCAAAA 


CCTTCAAAAC 


4620 


GCGATGTCGA 


CTTTTCTCGA 


AAAGAATTAC 


TCAATACCTT 


TAAAGAACAT 


CGATTAGTAA 


4680 


GCCACGAaGG 


TGAACTAGAA 


AGTATATTTC 


CAAATGTAAA 


AATAGCGAAA 


ATAAATAATT 


4740 


CAAACATGAC 


ACCAGAAGAA 


GTAGCCACGA 


AGATAATTGC 


TACTTTTGGT 


TTGACATAAT 


4800 


GTGAAAGGAA 


CATCGGAATC 


AATGAAAAAG 


TTTTTTAAAT 


ATCTCGGTAG 


CTTGATGGTC 


4860 


CTTCTTTTAA 


GCATAGCTTT 


AATTTTGATA 


GGATATTTAA 


CCTTCCGAGA 


GTTTCGACCA 


4920 


CGAGCGGTTG 


AACCAGTTGC 


TATAAATAAA 


CAATTTTCAG 


AGCAAACCAT 


CGAAAAAAAG 


4980 


CAATCCATTT 


CTTTAGTCAC 


GTATAACATA GGCTATGCCG 


GCCTAGGACA AACAGAAGAT 


5040 


TTTTTTATGG 


ACGGTGGCAA 


AACTGTCCAA 


CCAGTTAATA 


A6GCAATGGT 


CCAACAGAAT 


5100 


TTAACAGGCA 


TTGAAGAAAC 


CTTAAAAGAA TTACCAGCGA 


TGATTTACTT 


ATTCCAAGAA 


5160 


GTCGATCGTC 


GCTCGCAACG 


TTCTTATGAA GTCAATCAAG AGGAAGAATT 


AC7VAAAACAG 


5220 


CTACAGCTGA ACAGCGCATT 


TGCTTATAAT 


TTTAAGGTAG 


ATTATGTGCC 


CATTCCTTGG 


5280 


CCGCCGATTG 


GACGAGTAGA 


AAGTGGTTTA 


TTAACATTAT 


CAAATGAAAA 


AATAACAGAG 


5340 


GCCAAACGCA 


TTGCTTTACC 


CAATCCCTTT 


CGTTGGCCCG 


TTAGCATTAG 


TAACTTAAAA 


5400 


CGAGCACTTC 


TAGAAaCGCG 


CTTTCCAATT 


AAAGGCACAG 


ACAAAGAATT 


AGTGGTTTTT 


5460 


AACTTACATT 


TGGAAGCCTA 


TGACAATGGG 


GAAGGGAAGA 


TTGCTCAAAG 


CAAAAAGTTA 


5520 


GCTGACGTAT 


TATCCCAAGA 


ATACGCTAAA GGAAACTATG 


TAATTGCTGG 


AGGTGACTTT 


5580 


AATCaAGTTT 


TCaAAGGCAG 


TCATCGATAT 


CCTGACTTAG 


GGGAAGCTGG 


TTGGGTACCA 


5640 
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GGAGAAATTG ATCCAGCAGA CTTACCTAAA CATTTTTCAC TTGCTTATGA TGATCAACAA 


5700 


CCGACTGTGC ' 


GGGTGCfTAA TAAGCCTTAC 


ACTGGCTCTT 


ATGAAACATC 


CCAAGTGTAT 


5760 


GTAATTGATG 


GTTTTATTGT TTCTGATAAT 


GTGGCTGTTC 


ATAGCGTGAA 


AACAAAGGAC 


5820 


GAACAATTTA AATATACAGA TCACCAACCT 


GTGAAGATGG 


AAGTTGGGTT 


GAAATAGAAA 


5880 


AAGGAGGAGA 


TAGATGAAAT TAGGGGTTAT 


TGCCGCAATG 


GATTTAGAGC 


TTCAAAAATT 


5940 


ATTGGAATAT 


TTTCCACCAC AACGAAAGAT 


ACAGCTAGCA AAAAATACAT 


TTTATATTTA 


6000 


TGAACAAAAA ACATCGCAAG TCATTATGGT TTGTGCAGGA CAAGGAAAAA 


CGAATGCTAC 


6060 


GCTATATAGT 


CAAATACTTA TTGATTCATT 


TCAAATAGAA CAACTAATTA ATATCGGTAT 


6120 


TTGTGGCTGT 


TTAAATGAAA AACTGCAACT 


CTTTGAGATG GTGCTAGGTG AGGAATACTG 


.6180 


CCATTACGAT 


ATTCGTGAAA GACAGTCAAT 


CAATAAATTT 


CCTTATCAGC 


TGTACTATAA 


6240 


AGGAGATAAA 


GGAATGCTGG CCGAACTTCA ACATCTAGAT 


GAAAAGATAA AGTGTGTTCG 


6300 


TTTTGGAACT GGAGAAGGCT TCGTTTGTGA CACAACTGAA AAAAGAAACT TAATTGAACA 


6360 


ATTCTCTATT 


GACTGTGTGG ATATGGAATC 


CGCAGCTATT 


GCACAATGCT 


GCTTTTTAAA 


6420 


TGATTGCGCT 


TTTGTTTCTA TAAGAATTAT 


TTGCGATAGA 


GCGGATGCTA ACGCAGTCAA 


6480 


AGTAACAGAA 


GATACACAAA CACAGGCAAT 


GGAAAAAGTA 


TTTGAATTGG 


TCTGTGCATA 


6540 


TATAAATTAA 


AAAAGAAAGA GCTTGGAACA 


AAAATCCAGA 


AAGATTTTTG 


TTCCAAGCtT 


6600 


AAAAAGTAAT 


ATAGGTTGGG GATAAAAGTA 


ACATTGTTAT 


TGTTCCAATT 


ATTTTACAGA 


6660 


TAGCCTTTAG 


TAATAGAATG AGTGGAATTT 


AGCATATTTG 


CTGGGGTTCC 


TGTAAAAAGA 


6720 


AGCTGTCCGC 


CATTTTTTCC ACCTTCGGGA 


CCTAGTTCAA 


TCAACCAGTC 


GGCACTTTTA 


6780 


ATCACATCAA 


TGTGATGTTC TAATAAAATC 


AAGCTGTTTC 


CTTCTTCTAC 


CATTCGATTG 


6840 


AAAAGTTGAA 


GACTTTGTTG GATATCAATT 


AAGTGTAAAC 


CATCTGTAGG 


TTCATCCATT 


6900 


AAATAGATAG 


CTTTTTTTTG GTGAAGTGTG 


TCTGCTAATT 


TTACCCTTTG 


CAATTCGCCA 


6960 


CCAGATAATG 


TTGAGAGCGA TTGATTGAGT 


TTAAGATAGC 


TTAAGCCAAC 


TTCCAGTAAA 


7020 


TTTTTTAATG 


AAAGAGCAAA AGGCTGGTCT 


TTGAAAAAAT 


CATAGCCATC 


TTTAACGCTT 


7080 


AAAGCTAGTA 


CTTCAACGAT ATTTTTTCCG 


TTATACAGAT 


AATGAAGCAC 


CTCTTCTTTG 


7140 


TAACGCGTTC 


CGTGACAGGT TTCGCAAATA CTAGTAACAT CCTCCATAAA AGACATATCG 


7200 


GAGACAATTA 


TGCCTTTCCC CTTGCAAGTA .GGACAGGCGC 


CTTTGG7\ATT 


ATAGCTAAAT 


7260 


AAAGCTGGAC 


TAACATGATT TTCTTCCGCA AACAGTTTGC 


GGACCTTATC 


AAAAATATTA 


7320 


AGATAGGTCA 


TAGGTGTGGA TCGTAAATTT 


GCGGTAATGC 


TTTTTTGTGA 


AAGATGAATA 


7380 


ATTTCTTGGT 


TATCAGCCTG GGCTTTTTGA 


TAAATTTCTT 


CCGCTAATGA 


TGATTTACCT 


7440 


GAACCAGCGA 


CCCCACAAAT AACAGTCAAA 


ACTCCTAACG 


GAACTTCGAC 


AGATAGATTG 


7500 


TTTAAATTAT 


GCAGTGTCGC ATGTTCGATT 


GATAAAGACT 


TCCTTGCTTT 


TCTTGGTTGG 


7560 


TCGTTTAAAG GGAGCGGCTC TTGAAGCGCT TGACTAGTCA AGGTTTTGGA GACTAAAAAT 


7620 
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GCGTCATACG TTCCTGAAAA CTGGACATGG CCACCATTTT CGCCTGCGAA AGGTCCGATA 7680 

TCGATGATAA AATCAGCTTC TCTAATTAAT TGTGGATTAT GTTCCACGAG AACCACGGTG 7740 

TTTCCTTTAT TTTTTAAATT GAGCAATGCA CGACTGATCC GTTCGATGTC CTTTGGATGT 7800 

AAGCCAGCAC TTGGTTCATC TAAAATATAC ATAATATCAT TTAAGGCGCT GTTTACATAC 7860 

TTAGCAATTT TAATCCGCTG TGCTTCACCA CCAGAAAGTG TTTCAGTTGC TCGTCCTAAT 7920 

GTGAGGTAGG AGAGACCGAT ATTAATTnAT GCTTCTAGAC GTACAAGTAG CTCTTCTTGA 7980 

ATAGTTTTTA TTAAGGATAG GTCCATTGAA CGAATAAAAG AATGTAATTC AGTGAGTGGC 8040 

ATGTCAACAG CATCAGCAAT ACTTTTCTGA TTAATTTTGC AGCTACGAAC ACGTTCATTG 8100 

ACTCTACTTC CTAAACAATC AGGACATCTT TTTACGGTAA CAAAGTGATT AAGGTATTTT 8160 

TGATGACGTT TGCCTTCGTC TGTATGCAAT ATGCTACGTT GCATACGCGG GACGATTCCT 8220 

TCATACAAAG CTGTATGAGG CCACTCTTTG GGTGGATTAG CTAGTTTTTG TTGTGGAGCa 8280 

TATAArmrTA ATGCTAACTC TTCAGGAGmw TAGTCCTTGA TTTTwkTATC CAAATCAAAT 8340 

AGACCACTAT GGGCATAACG CTTCCACCGC CAGTTGC 8377 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



GTTTTTAAAC 


ACGAACGTTA 


AACGGCCGAA 


TGTTCGGTTT 


TTACGTTATT 


TTTGTAAATA 


60 


ACTA6GAGGC 


AGTAAGTATG 


AAAGTTTTTG 


ATTATGAAGA 


TGTCCAATTA ATTCCCAATA 


120 


AATGTATCGT 


GAATAGTCGT 


TCTGAATGTG 


ATACTACAGT 


AACTTTAGGA AAACATTCGT 


180 


TtAAAATGCC 


TGTCkTcCmG 


CGAACmTGCm 


AACatTATgr 


TGrAACgATT GCGGAAcCCT 


240 


TGCTGAAAtG 


GTTATTTTTA 


TATCATGCAT 


CGGTTTGGAT 


GAAGAGGCGC 


GAGTGCCaTT 


300 


TATTAAAAAA 


ATGcAACAAA 


AGGGCTTGAT 


CACGTCAATT 


AGTGTGGGTG 


TGAAAGAAGG 


360 


GGAATACGCT 


TTTGTTGAAA 


CGTTAGCCCG 


AGAAGGTTTA 


GTTCCTGACT 


ATGTGACAAT 


420 


TGATATCGCC 


CATGGTCATT 


CAAATGCGGT 


GATTAATATG 


ATTCAACACT 


TGAAAAAATC 


480 


TTTACCAGAA 


ACGTTCGTTA 


TTGCTGGCAA 


TGTTGGAACA 


CCAGAAGCCG 


TTCGCGAATT 


540 


AGAAAACGCT 


GGAGCCGATG 


CCACCAAAGT 


GGGGATTGGA 


CCAGGAAAAG 


TTTGTATTAC 


600 


* TAAGATTAAG 


ACTGGCTTTG 


GGACAGGTGG 


CTGGCAGTTA 


GCGGCTTTAC 


GTTGGTGTGC 


660 


CAAAGCTGCT 


XGGAAACCAA 


TTATTGCAGA 


T6GTGGAATT 


CGGACCCATG 


GAGACATTGC 


720 


AAAATCTGTG 


CGTTTTGGGG 


CaACTATGGT 


CATGATTGGT 


TCaTTATTTG 


CTGGACATGA 


780 


AGAATCACCa 


GgTGrAACaA 


AAGTGGaAGA 


TGGTGTTGTT 


TACAAAGAAT ACTTTGGTAG 


840 
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CGCCTCTGAA TTCCA/U^AAG GCGAAAAGAn AAACGTTGAA GGCAAAAAAA TCTGGTTACG 900 

CCATAAAGGG AAGTTGGCAG ATACTTTAGT CGAAATGCAA CAAGATTTAC AATCTTCCAT 960 

TTCATACGCG GGAGGTCGCG ATTTAGAAGC GATTCGTAAA GTTGATTATG TGATTGTAAA 1020 

AnATTCTATT TTTAATGGTG ATACCATTTA ACATTCAACT ATAGACTGAA AAGAAGACCG 1080 

CACGTAGnAA GGAAATTAAA AT 1102 

(2) INEX)RMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1009 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

nAAGAATACG GTCTGTCTTT AACAACTTCA ATTGATGATA TTTTAGAAGA TAAAGAGATT 60 

CAAATCGTTG TGGAATTAAT GGGAACCATT GAACCAGCCA AAACGTATAT CATGAAAGCT 120 

TTAGAAAAGG GCAAACATAT TGTGACTGCC AATAAAGATT TACTGGCGCA ACATGGGAGT 180 

GAATTAGTAG CGTTAcCCAA AAACATCATT GTGATTTATA TTACGAAGCA AGTGTTGCTG 240 

GCGGGATTCC AATTTTAAGA ACAATTGCTA ATAGCTTAGC GGCAGACAAT ATTCAACAAG 300 

TTTTAGGGAT TGTGAACGGT ACGACGAATT ATATGCTTAC GCAAATGGTT TCTGCTGATA 360 

AAAGTTATGA AGAAGCTTTA GCAGAAGCGC AAgcGTTGGG GTTTGCTGAA GCGGATCCGA 420 

CAAATGATGT CGATGGAATT GACGCAGCCT ATAAAATGGT TATTTTAAGT CAATTTGCTT 480 

TTGGGATGAA TGTTTCATTA CCACAAGTTG ATATTCGTGG AATTCGCGGA TTGTCTTTAG 540 

ATGATGTCGC TATGGCTAAA CAACTGGGGT ATGAAATCAA ATTGATTGGC TCTGCTGAAC 600 

AAAATGA6AA TAGTATCTCG GTGGAAGTGG CGCCTATGCT AGTCAATCAA AAGCACCCTA 660 

TTGCATCTGT TCGGAATGAG TACAATGCAG TATTTATTAA AAGTGCAGGC GTGGGGGAAT 720 

CAATGTATTA CGGTCCAGGA GCTGGAGCTA AACCGACAGC AACCAGTGTG GTCAGTGATT 780 

TAATTACGAT TGCTAAAAAC ATTCGCTTGG CAACGACTGG CCATATGTTC AATTCGTATC 840 

AACATAAAAC ACAATTGACG AGTTCCGAAA ATGTATTTGG GCAGTACTAT TTTyCATTAG 900 

aTGTTCCAGA TACACCAGGA CaATTTTTGC AGTTGACGCA ATTAATGACT AAAGCAGAAG 960 

TCAGCTTTGA TCAtTAGTTC AACAGaAATC AGATGGACAG CGAGCAAGA 1009 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5558 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
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nCGGnTAAAA 


TTTTTTCAGC 


AiUL. 1 1 AaAx 


1 AU Wlla 1 1 1 Is 


(a X C i i 1 UAAg 


ACl ACi XGCG 


bU 


ATcAGCaCtT TgTAcAgTTA 




AxAi CsCUAaC 


Aa rL-GAualaU 


TTTtAACTaG 


1 OA 


TCAaTTATTT 


GGACATCTTA 


H Ik /^.««.#^#*» #'1111111111 

AAGgCGCTTT 


TACGGGAGCA 


A lll»IIA*^/*fk >T*Tk 

GaTTCCGATA 


Ik Ik Ik fnm/^^^ 

AAATTGGG SC 


180 


CTTTCAAAGT 


GCAGAAGGGG 


/^/^^ III ■III ■■•■Ilk mm 

GCGTTTTATT 


mmmik m^ik ik 


f*tnn%f^ Ik m^^mm 

GTTCATCGTT 


TGAGTCCTGA 


240 


AGGGCAGGAA 


AAATTGTTTA 


CATTTTTAGA 


TCAGGGTATT 


K finiiimt m#^^« Ik 

ATTTATCGAA 


TGGGTGAAAC 


300 


CAATCGGCCC 


ATTCCTGTTA 


CGTGTCGGTT 


GTGCTTTGCC 


ACAACTGAAG 


AAATTTCTAG 


360 


CACCTTTTTA ACGACATTTT 


TAAGACGAAT 


TCCTATTCAA 


ATTAAAATCC 


CaAGTCTTGC 


420 


TCAACGCACT 


CAAGCGGAAA 


GAAAACAATT 


AATTATGCGA 


GCATTTTATG 


AGGAACAGCA 


480 




AAAKPAGTTA 


CTATCACGCC 


ACAAGTTATC 


CAACTTTTAG 


AAAACCACCA 


540 


X X /\ X O X rWDy^fK* 


AATGTAriGTG 


AATTAAGAAA 


TAATATAAAA 


ATTATTACTG 


CACGCAGCTT 


600 


TGCCC^rC AAT 
X vj^v^vj\^v»..ri<v X 


PTTflATAAAT 

^ X X OiA X rUAr\ X 


GTGTTATTCC 


AATCACTTTG 


CACGATCTAC 


CGAAAGAATT 


660 


X X \3\3r\ X \^nn 


TCAATPAAAP 


TAGTTCCTGA 


TGAAAATGAA 


TTGCCTATTC 


GTTTAGATGA 


720 


GCAAACAAAT 


w X WW X Xn,w X X 


TGTTAGAGGA 


AACAGAGCTT 


GCGCAArGAA 


GAATTATCCA 


780 


CAGTTATGAA 




GTCTTTATGT 


CTCTCACCAT 


CATCATCTTT 


CCACTGCTAA 


840 


TAACGATATC 


TCGA7VAGAAA 


TTGAACGCTT 


ATTCGATGAT 


TTATTATTTG 


AAAAGAAGCG 


900 


agag;!\aaaat 


CACGAAATGC 


TATTGTTTAT 


TACGCAAAAT 


ATCCGTCAAT 


TATTGGAAAC 


960 


GATTGAATCG 


TCTTATCAAA 


TTCGGTTTAA 


TGGCAGTTTA 


GTCTATGCAT 


TAAGCACCTA 


1020 


tctgtttcaa 


CGAAGATGCA 


TTGATTGGTT 


TCCAGAACAA 


GAACCTACAA 


CAGTCATTGA 


1080 


cgaactttta 


ACAGAAGTTC 


AAACCAAATT 


AGCAAGTAGT 


TATGGtACGC 


TGAACAACTG 


1140 


ttaactttag 


TGAAACGTTC 


ATTAGATATT 


GAACTATCCC 


AGATGGAwCG 


AATTATTGTC 


1200 


ACCATTTATC 


TTCACTATTC 


GGGTTCAGTA 


7UVGAAAAGTC 


ACTATCCGAA 


AGCGGTGATT 


1260 


gttgcccacg 


GATATGCCAC 


AGCAAGTAGT 


ATTGCGAATG 


TTGCAAATCG 


GCTGTTAAAT 


1320 


gtgccaattt 


TTCAATCGTT 


TGACATGCCG 


TTAGATGTTA 


^m^^^^» Tk --<k <k ^ 

CTCCCArAAA 


AATTTCCGAA 


1380 


CACTTAATTC 


ACTATATGGA 


ACGTCAAGAA 


ACGAGGAATG 


GTTTAGTAAT 


TCTTTTTGAT 


1440 


atgggttcat 


TGAAAGAwAT 


TTATCAGTAT 


TTtCCGGCAG 


Ik Tk /^-^Tk /^*k Ik f^f^ 

AAGrAGAAGG 


JcCCtTTTTTA 


1500 


tTAATGraTa 


ATGtGACmAC 


mAGcTTGGCT 


CTG kCTaT wG 


GAGrAGCCmT 


rp, .Tk Jk^.^/^/^lk Ik 

TwAAGmCGAA 


1 c £:f\ 

Ibou 


kTTCTTTTGA AGAATTgCCC 


CAAAAAGcGT 


TAmCGGTyCA 


TCCmAATGaA 


TGGGAAaTCa 


1620 


TATTACCTGA AAATAAAACG 


bAAC-gXCaTuA 


TTTTAACAAC 


GTGTAgTACC 


Jk 7k Ik 

GGAATTGGCA 


1 £ior\ 
IboU 


CAGCCGTTAA AATTAGAGAT 


TTACTTGAAA 


AAAGTTTACC 


TGCGGAAGCA 


CAATTAAAGA 


1740 


TTATTCCTTG 


TGAATATAAT 


CAATTACGTA 


ATGCAGAGTC 


GATCAAAGAA 


AGTTTTCCAG 


1800 


AATATGA6AT. 


TGTCGGCATC 


ATCGGTACAA 


ACAACCCGTC 


TTCAAATGAC 


CTCCCGTATA 


1860 


TCTCTTTAGA 


AGAGTTGATT 


GCAGGAAAAG 


GAATTACGAC 


CCTTTTAGAA 


TGGACAAAGA 


1920 


GAGAATTAAC 


TAAAGATATG 


CTGTCCTATG 


TAAATCATGA 


GTTGATTCGT 


AaCTTTTCAT 


1980 



wo 98^0555 



TAGATCGGGT 
AGGTCTTTTT 
TAGCGGTATA 
AAAATTATAA 
AAGAGGCGTT 
TTTATGTCTA 
AAAATAAAAA 
TGAGGTGAAG 
GATTCAGACA 
TTATATGGAA 
ACAAAATGAA 
TAATTATGTG 
TGAAATATAT 
TGAAAACGGA 
AGAAGaATTT 
ATGGTCAAGT 
ATGATGACGT 
GAGTAAAACT 
CAGAAAAATA 
CTGCTTATCC 
GAAACATTTC 
TCGCAAATGG 
TAAATGTCTT 
TCGCTTTTAT 
TAACCGGTTT 
CGACCTTAGA 
TAGTGACTGG 
CAGCCTTAGT 
TGGGTTTGCT 
ATAAAGGGGT 
TCATTGTAAC 
TCCCTGAATT 
TTGCGATGCT 



CATTCAATCA 
AATTCAATTA 
TGTCCATGTC 
TGGCGCTGAA 
TAGTGTCATT 
TGATGTGTTG 
GGAGGGAGTG 
AAAATGGAAC 
TCATTAGCTT 
CAACCTTATG 
TTAATTGTGA 
GGGAAACAAG 
ACCCAAATAA 
CAAGAAAGTC 
TAGGAgGAat 
GGCTTTTTCA 
ACCTAAAAAT 
GGTCATTAAA 
CAAACTTTTT 
AGAAATAAAA 
TAAAGCAATT 
TTGTGAAGTT 
GTAGAAAGGA 
TGCGCAGTCG 
ATTTGTCGGA 
GTTAGCTTTC 
TGGGATTCTT 
CTTAGGCTTG 
CATCCCAATC 
GGAGCGCATG 
. TATTTCATTT 
TATTCAACAT 
TGCACGCTTA 



GTGACGATTT 
GAAGAGCGGT 
AGTTGTTTAA 
CAACTAAAGC 
GAAAAAGTTT 
TTTGGAAAAA 
CCACTTTTTT 
GTAAAATTAT 
TAATTTGTGG 
ATTTAGCTAA 
TTACAGACCT 
CAATGCATTT 
ATACTGTTGA 
TTTGTTATTG 
gAAnTGATTA 
TGGACCCAAG 
GATATTCGAA 
ACGATTGAAG 
ATTGTAGTTG 
GAAATTAACT 
AATATCACAC 
GAAATTCGAC 
GAAATAAAAA 
GAATATGCTT 
ATTGTATTAG 
ATCGGTTCTT 
GGTACAGCGT 
CCAATCGCAA 
ATGAGTCAAA 
CATTTACTTG 
GCCGTGGGCA 
GGATTGTCTG 
CTGATTAACA 



304 

TAGATACAGA 
GGCAACAAAC 
TTGAACGATT 
AGTGTCAAAG 
ATAGTGTCAA 
CGGAGTTTAA 
GGCACTCTTT 
TCTGGCAACA 
TGAAACAAAG 
GACGGTCAAA 
ATTTGGCGGA 
AATTTCAGGA 
TTCATTAGTT 
TAATGAGTTA 
AATTGTTAAG 
GCTTAGGAGC 
AAACAACCAT 
ATGCGATTGC 
AAAGTGTGGA 
TAGGAGGAAT 
CGGATGAAGA 
AAGTCCCAAA 
TGATAGGACA 
TAGGAACCTC 
GTGATGTCAA 
TTTCAGTGGG 
TTGCTATTAC 
CGTTAACGTT 
AAGCTGATCA 
CAGGCTTTGG 
GCAACACCAT 
TTGCAACGGG 
AACAAGTGGC 
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AAAAATTATT 
AATTCAAAAT 
AATCCGGAAT 
AACGGTCTTG 
TATACCTGAA 
CAACGCAGAA 
TTGCTATATA 
CATGGTAATT 
AATGTGGAAT 
GACATTTTAA 
AGCGTGAATA 
TTAAATTTAG 
GATTTAGTAA 
TCTAGTAAAG 
AGTCGATCAT 
CGATTGTATT 
TAAGCTAGCT 
CGCACTAAAA 
AGATGCTTAT 
GAAAGTAAGG 
A/^AAATGGTT 
TGACAAAAAA 
AGCGATTCTT 
GTTGCTTTCA 
AACGGGCGTA 
CGCATCAATC 
TGCTGGCGCA 
GaTTCTTAAA 
GTATGCTGAA 
CTTATCGTTC 
TAAAGGCTTA 
GATTATTCCA 
GCCATATTTC 



CGACAAGTAG 
GATCGTAAAT 
GAACCTATTG 
CAAGAGTTAA 
TCGGAGCTTT 
AGCGATTTTT 
AAGAAGTAAG 
TCGCTGCAGG 
GTCTTTGCGC 
AGAACAATCA 
ACGAATTCTT 
CAACATTAAT 
AACGTGCCGT 
AAATTTTAGA 
CGCTTGTTAC 
TT/^TCGCAA 
AAACCAGCAG 
AGTGGTGTTA 
CGATTGGCTA 
GAAAATACTC 
AAAGAATTGG 
GTAGCAGCAT 
TTAGGTTGCA 
AGACCAATTG 
ATCATGGGCG 
CCACCTGATG 
GGAACAGAAA 
AATATTTATT 
GAAGGAGCAT 
aTGctAGCGC 
CTTGATCTGA 
GCGCTAGGAT 
TTTTTAGGTT 



2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 
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TTGTCTTAAT GGCCTACTTT AAGTTACCTG TAACAGGAGT 
CAGCAGTAAT TGTTGTGAAT ATCATGAATT ATGCCAAAGG 
CAACAAGTGG GGAGGTAATT GATGATGACG AAGACTTCTA 
ATTACAAAAA AAGAACTTAA TCAAGTCTTT TGGCGTTCAT 
AATTATGAAC GTCAGATGAA TATGGCTTAT GTCTATGCCA 
TTATATCCTC AAAAAGAAGA GATGGCGGCA GCTTTAAAAC 
ACTACGCCGC ATATCGTGAC GTTTATTTTA GGAATTAATG 
GTTCTGGATG AAAATTTTGA AGTTGGAACA ATCGATAGTA 
CCTTTAGCTG GTTTAGGTGA TTCCTTTTTC TGGGGAACCT 
GTTGGAACTT CGTTAGCTTT ACAAGGGAAT GTGTTAGGGC 
TTTAACGTCC CCCATGTGAT TGTTCGTTAT TTACTGAATC 
ACTGGTTTTC TCAAAAAGAT TCAAGCAAAT GGCATGATGG 
TCmATAATTG GCyTGATrGt GGkArGTGCy ATGaCCGCCT 
CCgATTACAA TCAGCgGAAG TGGGaAAAAT GCaGTCACGG 
ATCATGCCTA AATTATTACC GCTTGCATCT TTTGGTTTTG 
GAAGTAAAAC CTTTAGCGAT TTTAGGTGGC ATGGCAATTG 
ATTGGTCTAT TTTAGGAGGC AAAATATGAA AACGATGCTG 
aGCAGCTCTA rCTCAGr.TTC TTAACCAATT TCAaTTAACT 
CACGCACTGt CTAATTTTAG CAACTGGCTC GTCACATAAC 
TTACCTTGAA GAAGTGGCGC CTATTTATGT GGAAATCCAA 
TTATGGCAAA GTGGATGATC GGTTTGATCT GGTCGTAGCG 
TGCTTCAACA ATTGGATGCC ATTGCAAAAA TTAAAAACCA 
CCTTGgACAA GTGaTGTCAC CAGCCCGATT ACGGaGGTTG 
AAAATAGGGc TTGAAACAGT TGGtTTTGTC ACAAAAGGGt 
TTAtTTCtAt TTGGtTTgAA ACTGGGCTAT CAAAAGCATC 
GAAGAAGAAT TAGCAAAACT TnACCAAAGC AATTCAGGAA 
AACCGAACAA TTTTGGGCAA AAAACGGAAA AAAGGAAn 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AnATnTGAAT CGTATAGCGA AGCAATTCTC TGATCAGAAT nATTTAACTT CGTTTGGGAA 



AGCAATTTTG 
AAAGAATGAA 
AAACTAAGAC 
TTCAAATGGA 
TGATCCCTAT 
GGCATCTTGC 
CTGCAATGGA 
TTAAAACCTC 
TACGCTTGAT 
CTiATTCTATT 
GTTGGGGCTA 
AAAGTCyCAC 
CCATGATTGA 
TyCAAAATAT 
TTTTTTATTT 
TCGGAATTCT 
GATTATrTAA 
GATACAGACG 
GyTTkTCAGG 
GAGCCGTTCA 
GTTTCGCAAA 
AACGTCTGTA 
TGGaTGaAGT 
ATACGGcAAC 
AATTAACAAA 
ATCGATTCGG 



GGTGCCATTA 
ACAATACAAA 
AGAAAACTTA 
ATTTTCATGG 
TTTAAAGAAA 
TTTCTTTAAT 
AGAAGAAAAT 
GTTAATGGGA 
TGCAACAGGT 
TTTACTCATT 
TAAATTAGGA 
ATTAGGCGCT 
TTTAACTATC 
CyTTGACGAT 
ATTGnAAAAA 
TGGCTCATTA 
ATGAGGAGCA 
TTTCAAAGGT 
CGGCTAAArT 
ATTTTGCTCA 
GTGGTAAAAG 
AAAACAGTGG 
CCTGGaTTTa 
ACTTTTaAAT 
AGTAGAGGTT 
TGATTTATAA 



4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5558 



60 
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AAATTTCTGG CGCAAAACGT ATAACGGAAA ATCACCCACT CAGGAACTAC TGAAGGTTGC 120 

AACGGAGGAT TTAGTACAAC ATAAAATGAT TAAACAGCTG" GCAACGGAGT 'TAGAAATAGC 180 

CCACTCACAA ACATTTGGGc AAGAAAAAAA AGAATGGCAA GACCAAAAGA GCAGTTTAAC 240 

GCTCTGGCAA TTTTTAGATG CAAAAGATCA ACAATTACAG GACCAAATAA AAGAGAAATT 300 

AATGGAGAAA GAAAAGCCAA CGCAAAAAGA ATTGCGGcAG GCCTTTGAGC AATTGGATGA 360 

TAAATwTAAA AAGACCGATT ACTTTGTTGA AGCAATTGAA ATTCCAAATT TTTCAGGTAA 420 

GCAAGCAGAA TTAGAAAAAA TAGCCGAAGC AATCTCTCCG AATCTATCTT ATGAGGAGAC 480 

GTTATTGGAG TGGCAAAATA AATTACCTAA TCTTGTTATT GAATCGTATC AATTGAAGAG 54 0 

TGCAGAAATC CAAAAAGAGG ATATTTATTC GTTGTCTGTT GGGGAGATTC TTAGTGAAAA 600 

AGCTGTTGGT ACCGTGGTGA AAGGATACCA CGAAAACCAA TACTTTTATA TTTTTAATAA 660 

GGAAGGAGGG CAACTTTTAC AGTTTGAAGA AGCACCACAA TTTGGCAAAA ATGCCTATAT 720 

CAACACACGC TACAAAGAGA AaTTAGCTAC GTATCmACmA GCAACCAAAG TAGACTTGTT 780 

GGAGAAAGAC AGAGAAAAGT TTTTTCAAAA TTATCAAAAT AAAAACAGTT AAGGAGCAGA 840 
AAATGAGAAA AAAAGCGTAT TTTAAAAGTG TAACCATCTT CCTTTCTATT TTGCTTGTTT 900 
TTTCCCAGCT TCAATTATTT AGTCTTCCTG CTTATGGGGA AACAGTGAGT GATCAACCGT 960 

TACTTTTTCG TTCAGTAGGT GTCGCTCAAA GTGGGACGAC CTACTATGTG GATGGGGAAG 1020 

GAGGAAATAA TGCCAATGAT GGGCAGTCGC CAGCCAGTGC TTGGCGTGAT TTTGAAAAAG 1080 

TGAATCAGAC GGAGTTTCAG CCAGGCGACC ATGTTCTGCT GAATGCGCAA AGTACCTGGA 1140 

ATAACCAATT GTTGCATCCA AAAGGGAATG GGACAGCGGC ACAAAAGATT GTCATTGATT 1200 

TTTATGATAC GAATGACAAA GGAGAAACCA TTTTTGAAAC AACACGGCGT CCGATTATTA 1260 

ACGGTGGTGG CACCTACAGC ACAGGGACTT TCAAACGTGC AATCTCAGGG GCCGTGCAGT 1320 

TAGTGAACCA AGAATATTGG GATATTTCAA ATTTAGAAGT GACAAATACA CCAGAGCTAG 1380 

ATAATCTCGA AGGCTATAAG AAACCAGGAG ATGCTCAAAG AGCGGGGATA TTAGTACTAG 1440 

GATATGAACA AAATCGAACG TTCAATAGTG TCACGATAAG GAATAACTAT GTTCATGATG 1500 

TTCAAACAGA ATATTATTTA AATCTAAGTG GGAATACTGC AACCAAACGA TTAAAAGCGG 1560 

TAGGTGGAAT TATTGTTTTA GGAAGTTGGT TTGATGAGAA TGGCAATGTA GTAACGGCAG 1620 

CTAATGATCA TCGAACAACG ACTGGATTTA ATGATATTTT GATTGAAAAT AATGTGATTC 1680 

AACGAGTGGG ACTTGAAGGA ATTCGAACAA AAGCTGATTC AGATACCtCA AGAGGCAACA 1740 

CTTTCTACAA AACGTTTTCT AATATTACGA TTCGTAATAA CTATTTAGAA GATATTGCTG 1800 

GCGACGGGAT TGTCTTGTCA GAAGCCAAAA GTGGCGGTGT. CGTCGAAGGA AATGTTGCTG 1860 

TCCGAATGTG CAATGCCGAT TATGGTACAC AAAATTACGC GGGCGTTTGG GCAATGTCTG 1920 
TAGATGACGG ACTGTTCCAA TATAACGAAg TATACGGAAT CAAATATGGA TTTAACGATG 1980 
CTGAAGCTTA TGATGTGGAC ATGCAGTCAA ACAATGTTAT TTATCAATAT AACTATTCAC 2040 
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ATCATAATAC TGGCGGCTTT 
ACAATATCTC AGCTAATGAC 
GAGCTGGTGG CTACAATTAT 
GAGCGGCAAT GCCAACTATC 
CATTATTTGG TGAAGGAAaT 
ATATTTTATA CAAAGAAGGT 
GGACACAACC AATTGAACGT 
TTATTTGGCC AAAAGAAATT 
CAAGTGGAAA CATTTTTGAA 
AAGAGTTGGC AGAGCAAGAA 
TTACTTCTAA AGAACGCTTA 
CAGCGATTGG TAAAGGCTTG 
GCTTGAAAAA TAAAGTTTTA 
TTCGTTATCA AAATCAAGTG 
CAGAGCAAGT TGAATTAACG 
AAAAAGAATT TAAGTTACAA 
CAGTTG7UVGT TGCAGCAACA 
AGGTCTCTTT TGAGGGTGAA 
AAACTGCCTA TGTTCAAAAG 
CCGCTGCCAT TAGTTCaGGA 
ATTACGCATT -AAAATTAAAA 
AAATTGATAC ACAAGAATTG 
ATGTGATGCG TTATGATGCG 
ATACGCAGTT CCAAGTAGAT 
CTTGGAACAA TGGTCCAAAT 
CTTTTACAAA TAGCTCAATT 
ATCTACGGAA GCTAATTCAA 
CGATTACAGA CAGTCGTTTG 
CAAAAGAAGG CGCACAAAAG 
CTG6TCAACA GCTAGCAGAG 
AAGTCAAACA TGAGTCGATT 
AAAAAACAAC TATTTTTTAT 
AAGTTCCAGC ATCTTCTTAT 
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TTATTGCTTA TGTCTGATCA AAAAAATTCG GTTATTCGGT 2100 

GGCGGTGGTA ACCGAGGAAC AGGTAAAGAC AATCCAGGCG 2160 

AAAGAACAAA GCATTTTCCA TTATTGGGTG AAAAATGATG 2220 

CACAATAATA CGATTTATGT TGGCGACGGT ATTTCAACAT 2280 

AGTTCTGACA ACAGTGGTAC AGTAGCCAAT TTTTACAATA 2340 

ACCGGACAAT TAAAATTCTT ATCTAATTAT CCAACAAATG 2400 

AAAATGGTCG ATAATCCAGA AAAATATTTC AAAAATAATG 24 60 

GCTACAGAAA AAAGTGGTGC AACTGTTGAA AAATTAGTAT 2520 

AAACCACAAT TAGAAATCAC AGATAACCCT GAAAAAGTGA 2580 

TTTACGACAT TGAAACCAAC AAAAGATAAT GTGGTTGAAT 2640 

CGTCAGCGAG CACAAATGTT CCGATTAAAA GAAAACTCTC 2700 

TCGGAAGTGA ACAGTCCGGC CGAAGACTTC TTTGGAAATA 2760 

GATATTGGTG CGCAACAAGC ATCCACAATT GAAAAAAGTA 2820 

TTAGAAATTA GTTCGGCAAC AGGTGTGTAT CCTAATTTGC 2880 

TACGAAGAAG TCGTTAATGA AGAAGTGGTT GCGACAGGCA 2940 

TGGGAAGCAA TCCCTCAAGA AAAAATTAAT ACAGCAGGCA 3000 

GTTATCGGCT TACCGATAGA TGCTGTCAAA GTAACAGCCA 3060 

CTTGGTGAAG GAAAAGACAC CGTTAAACTA AAAACAGCTC 3120 

AGTGATGGTA ACCGAGCATA CTCTGCTATA GCAGGCGGTA 3180 

GACGCCTATA AATATCCTTA TGGAGTAAAC TATACGGGCA 3240 

AATGCTAGTT CTGCGGGGTA TAATCGCCGG ATTTACGTTG 3300 

AAGAACTATC AATCACTAAA AAGTGCCAAT TTAGAACTAA 3360 

TGGAATGGTG CTGGAAATAC AAATGATGAA CGTTTGAAAA 3420 

GTTTATGGAa CAGATACTAA TTGGATGAGT AACACCATTA 3480 

AATTTAAATG TACCAAATGA AGAATTTATT GCTCGACAAT 3540 

ATGAATAATC AAAATACAAT CTCTATTGAT ATTAGTAACT 3600 

AGTGGTGAGA AAATTCCAGC CAAGTTAAGT TTCTTATTAG 3660 

CCTGGTTATG ATTCTGATAA TGCTGGGTTT GATGCTTTTT 3720 

GCCTATCAAG ACTTTTTAAC AGGCAAATTG ACCTTGCCAA 3780 

GATAGTTTAG CTCCTAAAAT TGTTTTATCG AATGTTTTCG 3840 

GAGGTCACGA CAGAAGCGGG TCAAGCCCCT AAACTACCAG 3900 

TCGGATGGTT CACAACGAGA AGTGACTGTC AATTGGTCAG 3960 

CAAAAAGAAG GAATATTCAC TGTTGTAGGA CGAGCAGCAG 4020 
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GTGTCTCAAT SCaATTATT GCCAATGTAA A--^7CACCGC TAAACATATT GTTGGATTCA 
AGGAATTGCC 7GCGCT3GAT CGACTAACAG GCACCTCGCG AGGCGAACTG AATTTGCCAA 
CTGAAGTAAT 7GCAAAATTA GATGATGGTA GCGAAAC3AA ATTGAAGGTT ATTTCTTGGG 
ATGACGA7GT TAGTAATTAT TCGCCA^^A GCCCGCCAGG TACTTACCAG TTCCCTGCTG 
CAGTAGAA3A AAAAATAGGA ATTGC6AATC C7GATGAACG AAAAATCTTC CAAGTCGTGC 
AAACTCA?3C GATTCCiGAA AGAATTCAAT T7GCTACAGA AACAGCCACT ATAAAATCAG 
GCGAAAATTA 7CAAATTCAA TCTAAAGTGA TTGGTCAAGC GCCGCACACA GAAACGGATG 
C7TGGA3CTC GCAAGTAACG TATGAGTTGG TCACACCAGA TGCTGGCAAT ACAGTAAGTG 
TASATG?jyVA CGGTCTSArT AGGACAGAAG CAACGACTGC TGCAGGAAAT TATCAAATCA 
AAGTAACAAG CAAAGTTTTG CCGMAGTGA CTGCACAATT TTCAATTAAA GTCACAAAAT 
AAACGRACTG TTTSAAGAGA CTGGAACAAG AATTGCTGTA CCTTGCAATC TTTCTAGTCT 
CTTCGAT7AT CTAGAATCAA TAAATA7TCT ATTGCCTACT ATTCATTCTT TTTAAAACAA 
GGTAAAATAA 7AAATGAAAG TAAAXGATAA AGGTGTGTAG ACAATGAATA CTATCTTAAC 
AGGTGA7C3T CCAACTSGTA AATT6CACCT AGGACATTAC GTAGGTTCXTT TAAAAAAACG 
AG7AGA.V.TG CAAGCAGACC CTACAAATCA ATTATTTGTC ATGATTGCAG ATTTACAAGC 
ATTGAC^T AATSCAAGAA ATCCCGRAAA AGTTTCAGCC ' AATGTTTTAG AAGTTGCTTT 
GGACTATTTA GCAGTTGGTT TAGACCCTAC GAAAACAACA ATTTTTATCC AATCGCAAAT 
TCCACAGtTA C-CTGAATTAA CAATGTACTA TCTAAATTTA GTGACCACGT CACGTGTTCG 
TCGGAATCCA ACGGTCAAAG CAGAAATTGA GCAAAAAAAA TTTGGGGAAG GTGTTCCGAC 
AGGATTC7TT ATAXATCCCG TTTCACAAGC AGCTGATATT ACTGCCTTTC AAGCGAATTT 
AGTCCC3GTT GGGGAA3ACC AAAAACCAAT G7TGGAAGAA GCGCAAGAAA TTGTGCATAG 
rrTTAA^CAG ACGTACGGTG AAGTTTtAGT TAGACCGGAA GCTGTGTTAC CACCAAAAGG 
CATGGGSCGC 7TGCCASGAA TTGATGGCAA CGGAAAAATG AGTAAGTCTT TAGGCAATGG 
TATTTATCTT 7CAGATCCAG CTGRAGTGGT ACAGAAARAA GTAATGAGTA TGTATACCGA 
TCCAAATCAT ATTCGTATAG AGGACCCAGG ACAAGTTGAA GGAAATATGG TCTTTACTTA 
CTTGGATGTT 7TTGGGAAAA ATAAAGRATA TATTGAASAA TTAAAAGAGC ACTATCGCCA 
TGGCGCTTTA GGAGATGTGA AAATTAAACG T7ACTTAATT GATGTCCTAG AAGAAGAACT 
CGCACCGATT CGTCGRCCrC GTGaAGBSCT GGCTAAfiAAT CCAGAAGCGA TTATGGAAAT 
GTTGCATAAA GGAAGTCTTG CTGCTGRAAA AGTGGCAGCA CAAACCTTAA CAGAAGTCAA 
AAAAGCAATG GGAATTCAAT ATTTCTAAGC AATAAAAGAA CTGGAAC6AG TCAAAAGAAG 
AATGACTTGT rCCAGTTCTT TTTAAG7GGT TTGAAATTGA CTATTATATA AGCTGGCGTA 
AAGAGCCGGA GC7TGTAATA GTTCCT6ATG CGTTCCrTTT TCAATAATAT CGCCATTTTT 
CATCACTAAA ATTAAATCAG CATTTTCAAT TGTGGATAAT CGATGAGCGA TAACGAAGCT 



4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 
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TGTTCGATTT TCTGTGACAG TTTCCATCGC TTTTTGAATA TGiSGCTTCTG TTCGTGTGTC 6060 

CACACTCGAA GTTGCTTCAT CGAGAATAAC AACGGGCGGA TTTGCTAAAA TGATTCGGGC 6120 

GATGGTTAAT AACTGTTGTT GCCCTTGTGA TAATGCACCA TTTTCGCTAG AAATAATTGT 6180 

GTCATATCCT TGAGGAAGGG TCCGAATAAA ATGATCACAT TGAGCAATTT TAGCTGCTTC 6240 

AATTATTTCT TCACGAGAAG CATCTTTTTT TCCATAGGCA ATATTATCTG CTACGGTTCC 6300 

TTCAAATAGC CAAGTGTTTT GTAAAACCAT GCCAAATAAA TTTCTTAGAT TTTGTCGAGA 6360 

AAGTTTCGTA ATATCAATCC CATCAAAAGT AATGGCACCT TGATTTATTT CATAGAAACG 6420 

CATTAATAAG TTGACTAATG TTGTTTTACC AGCGCCGGTG GGTCCCACAA TGGCCACTGT 6480 

TTTTTTCGGT TGAACAGAAA AATCAACATT CTTCATTAAA ATTTTTTCTG GTGTGTAGCC 6540 
AAATTGAACA TTTTTAAATT CAATGGCTCC TTTAGGAGAA GAAATAGTTT CTAAATGAGT _ 6600 

TGCTTCTGGC TGTTCATCAG CTTCATCTAA GATAACAAAA ATCCGATCAA TGGAAGCCAT 6660 

CGCTGCTTGA ATTGAGTTAA TGACATAAGA AGCGGTAGAA ATCGGCTCAG AAATTTGGTT 6720 

GATATATTGC AAATACGCTT GCAAGAAACC AATCGTAATA CCACCAGATA AAACGAGCAT 6780 

TGCGCCTAAG ATAGCACTAA TAATAAAA6C CAACTGATTA ATAAAACGAA TAGCTGGATA 6840 

AATCGCAAAA TTTAGAAACT GTGCTTTTTT AAAGGcACGA TAATGTTGTT GATTAACAGC 6900 

ATCAATTGTT TTTTCAGCAT TTTGCTGTTG ATTAAAGGTT TTTGTGACCA AATTTCCTGC 6960 

TAAATATTCT TCCATCTTAT TATTTAATTG ACCTAATTCA GCTTGACTTT GATCAGCAAA 7020 

CACCTTGTTT TTATTCGCGA TTTTTGTCGT CATGAAAGTA CTGCCGCCAA TTAGGAGCAA 7080 

CACTAAAATG GTTAATTTTG CATCAATATA GAACAACATG ATTCCTGCAA AAAGGATAGT 7140 

CACGACGGAA GTGAAAAATT GGTTGATGCC TGTTAAAAGA ACTTGTGACA ATTGGTTTAA 7200 

GCCAGTTGTT GAACGACTAA TGATATCGCC CACTTGGTGA TTGTCAAAGA AGGCCATTGG -7260 

TAAAGTTTTA AACTTTTTTG TCACTTCTTT TCTAATTCTT AAAGTAACCC GTTCGCTTAA 7320 

AGAAGCCATT GCTCGTTCCT GAATAAAAGA AGTGATACTA CTAATGATTG AAAAGAGAAT 7380 

TAAAAGTAAG ACGGGACCTA AGAGCGCTTC TTCAACTAAA GGAAGTGTCA TACCTTTGAG 7440 

CCCGACACGC TTGATAGCTT CTAAGAGGTT ATCAATCCCG ATTGCCATAA TATAGGTCAT 7500 

GGCAACAATT AATAAGTTTC CAATTAAACT GCAGATCATT AGACCATAAA AGATTG6ATG 7560 

CTCAGGTTTA ATCATTTTCC AGAAACGTTT AAAGCTTTGA AGATTCATTG TTTTTTTCAT 7620 

TGTATGGCTT GAAAAGAAAA CTTTCTTTCA AGCAGTCACA TCCTTTCTTT AATTACTTGG 7680 

GTAAGATACC TTGCGATTTA GCAAAGTCTT GGTAATAGCT ATTAGTGGTA AGTAAATCAG 7740 

CGTGGGTGCC TTGACCAACA ATCCTTCCTT CATCTAGGAC GATAATGTTG TCAGCGTTCA 7800 

TGATTGTACT TAACCGTTGA GCAACAATGA GTAAAGTTTT GTCCGACATT TGTGCATGTA 7860 
AAGCGGCACG TAGAGCGGCA TCAGTTTTGT AATCTAATGC GGAAAAGCTA TCATCGAAAA 7920 
TATAAACGTC TGCCGGTTTG ATTAAGGCTC GTGCAATACA CATTCTTTGT TTTTGACCAC 7980 
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CAGAATAGTT GGACCCGCCT TGTGCTACGA AACTTTCAAT CCCTTGTGGT 
TAAATTCAGA AGATT GTGAA ATTT CTAGTG CCGTTCTTAT TTCTTCTGTA 
CATTTCCCAT TAATAAGTTT GATAAGATTG TCCCACTGAA AAGAAAGGCT 
CATAACTGAT GACTTGGCGA ATCGTTTGCT GAGATAATGA GCGGATATCT 
AGCTAATCGT GCCGGCTGTG ACCTCATTTA TTCGTAAAAG TAACTTT^CT 
TACCAGCGCC AGTTGCACCA ACAATCGCCG TTGTTTTTCC TTTAGGAATG 
CATTTTCTAA AACAGGATCA GCTGTTTCTG TGTATTGAAA CGTGACGTGA 
TCAAGGCTTT CTCAGGATGA TAAGCAATGG TTGCCTCAGT ATGAGGATCG 
TTTCTTCTGA CAAAACTTCT TGCAAGCGAT TCAAGGAAGC AACGGACCGT 
CTAATACCAT AGCCGCGATC ATTAAGTAGG CCAAAGTTAG AAGCGTATAT 
CTGTAACGGT GCCAATTTGT AGGGTGCCTA CTGCAACTAA ATTACCGCCA 
TAGAGGCGTA CACAACGCCC ATTAATAAAA AGGCAACGGG AGTAATCCAg 
ATTCACTTTA CTCATGCGTT CAGCATAATT TTTAAATGTT TGATTGGTTC 
AAAATCTTGA TTATTGAACG CACGAATCAT ATTAATTCCA GTAAAGAACT 
CGTAGTAATT CGATCCATTT TTGGTTGAAT TTTTAAAGAT AAAGGGGTAC 
TAAAAAATAA ACAACGACTA AGTAAAATAC GATGGAAGCC AATGTTACCC 
AACTGAATAA GAAAATGTCA TCACTAAGGC AAATAAACTA ATAATAGGCG 
TAATTGGCAA AACAGCACAA TCATTGTTTG AACATTATCT ACATCGT 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1751 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TTGTAATTTT ATTGGGTTAA CGTTATTTTT TGGTATCGGG ATCTTAGTTG 
CAAAATTGTA GCGAATAAAG GGAAAAAGAG TAAGGGgACC ATTCTATTAA 
ATTGCTCTTG tTAGGTATCT TTTTATTTGT GaCTTTGCCa AGTAAAAATC 
GTTATGGATT AATCTGATTG GCGCAGTGAT TGCTAGTGGG GTGTTATTAA 
AGCTAGTAAA GTCGAAGTTC CTGGTATCCA AAAGGGCCGC AAAGAAGTAG 
CAACAAAACG GCTAACAAAG ACTGGTCTGG CAAGATTTTG GGTGTTGCCG 
GATTTTGTTG.ACTATTGTCA GTAGTATTAC CAGAATCTCT TCTATTGATG 
AACGATTCCT TTAAAAACGG AAG7UUUU\GC TGAAGTTTTA ACTTCGGCGA 
CaTTGCGATT GCACCACAAA CGGCGAAACG AAAAATGTTA CAGAAATTTT 



AAGGAATCGA 
GTTGCTTTGG 
TTTTGTGGCA 
GTGCCAGAAT 
AAAGTACTTT 
ACAAAACTAA 
TCGAAGCAAA 
CTAATTTCAA 
GGAATGACAA 
TCAATCACAG 
AACCACAAAA 
CAAAGATTTG 
GCTGCTCTTC 
CTCGTAAAGT 
CTTTTTTCAT 
AAGCGAGTGA 
CTGGAAAGAT 



CGAATGGCGC 
TGGTGATGGA 
TTTCaACCGT 
GTACACGCTT 
GGAAGAATAC 
TTTTGGCAAT 
AAATTTATCA 
AAGAAACACC 
CTGtCATTCC 



8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9047 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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CAATTcCAAT ATGTTTACTT TGGaTGGtAT AACGGCACAA GTTGTCAATG 
TTATGTAGCm ACAGTCGAAT TTAATGGCTy CTTTAAATGG CTAAAACtGG 
TGGCTATTTC ATCATTAGTG CGACAGATAT CAACGCACAG CCAGAATTTG 
AATTGTTTAT ACTCCTTCAG CTTACTTTGG CAAAGATGCC GCACGAAAAA 
TTATCCAGGT TATGCAGCAA CGGGCACAAT CAATTTAGAG CTGGACGACC 
TTACTATATT CAAACGCTTT ACAAAGAGTA TGGGGTTTCT GGTCGCATGC 
ATTTAAAACG GCAGTATTGA ATGCTACAAC AGGAGAAGTT AACGTTTATG 
AGCCCCAAGT TTCGTTGATG CGCCGATTAC TAGCTCAGCG GCGAATAGCT 
CTTTGGTCGT TACAGTCAAG GCTGGTGGAA CCAGACaATG TTTGGTGCCA 
CAAAATTCCA ACTGAAAATG GTATTTATGC ATCAGGACAA ATTACGCCGA 
GGAAGGGAAC CAGTTGCTTT ATTTTACGGA CTTTACAAGT AGTGAAAAAG 
CGCTTTAGGT TATTCCTTAA TCAATGCCCG AACAGGCCAA GTTACGTATT 
TAAAGTCGGC ATTATGGATA GTGATGGCGC CATTTCTATC GCTTCAAAAA 
AAAGAAATGG TUyVGCTAGTA TGCCAGTCCT TTATAACATT GATGGCGTGC 
TGTTTCGTTA ATGGACAGTA AAGGGATTTT CAAAAAATAT GTCTATATAA 
TAATGATATT GTAATTGATG CAGACACGGC ACAAGGCGCA TTAGACGCTT 
GTTAGTAACA AAAGGAAGCA ACAATGAAAA TACAGACAAA GCCAACTTGA 
AGGTAAAGTA GCTCGTGTCT CAGTAGTAGC TGGGGAAGCG CAAACGGTAG 
ACTAGAAGAT GAAAAAaCCA TTTTTAGCGT AACGaCTAaT AACAGTcCcT 
TTAAAaGAGG GCGaCCAAGT GACaTTTAAA GCCGTAGtGA CTGAAAATGC 
AATGTTGAAG A 

(2) INFORMATION FOR SEQ ID NO: 14: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3928 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CGTTTATTTT ATCAAACCAA AATGGATTTG TAAAACATTT CACATGTTAT 
ACTATTACTA GAAAAAAGAC AAGCCAAGTT TCCTTTATTT AGGAAACTTG 
TTATTCTTTT TACTTCGTTA TTTTTTGATA GTTTGCGAAT AATTGATTCG 
TCCATCAATT TTCTTGAGAT GGCCGTTATC TATTTTAAAC ATAGATGGCA 
TTTATTTTCT CGGATAAATG TTGACCATTC CTTCATGGTC TCtTTGTTGG 
ATAGTAAACC TGTTTAGGTG CTTGTAATAG TTTCTCGAAA TCCTGACAAT 
TtTCTTTTTT AAAACAACAT AAAATGTTTC GCCAGAGTGT TTTTTTTCTT 



GCGAGTATGT 
GGGAAgTACC 
tAGAAAAACC 
TTTACTCCGC 
AAGGGAATCC 
ACTATAATGA 
ACAGTCAAAA 
TAAATGAATT 
AAAAAGATGT 
TGATGAACAA 
ACCAAGATTC 
ATCGTGATAC 
TTTATCCAGA 
CAACGTGGAT 
ACGCCGTAGA 
ATCG/^TTGA 
AAACGATGCA 
TTTCATTTTT 
GGCGCTATTC 
TAAAGCAGCG 



TTAATAAACG 
GCTTGTCTTT 
CATCTCCTTC 
CATGTAAAAC 
TTTUITTCTAC 
CTGAACAGTT 
GATACTCATT 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1751 



60 
120 
180 
240 
300 
360 
420 
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GAAACTTAAT TCCTTTTTAT GCGGAATAAA 


TGTAAAACCT 


ACCAACT^C 


TAAGTAACAA 


480 


AATTAACAGA CTGAACGAAA TTTTTTTATT 


CATGTTTTTT 


CTTCCTTTTT 


AGTTTGTCTA 


540 


ATTTTTTCTT TTTTTATTTT TTGTTATGCC 


AAAGCTAATA 


GCCACTATAA 


GGACAATTAA 


600 


ACCTGCGATC ATGTAAAGTG GATTTACTGT 


TGTATTAGTC 


ATTGGTAAAT 


GAATTCCTTT 


660 


ACTTGCTGaT TTTCCAAAAT TGaCTGGGTT 


CGTTGGTGCT 


TGATGATTTG 


TAGGCGTCAA 


720 


CGGTTTTTTC GGTTTTAGCG GTTGTTTAGG 


TTCAGGTTTT 


TTTGGGTCTT 


CTGGCGTTGG 


780 


TGCAGGCGGT GTTACGGTTG GCTTGTTGGA 


ATGCAACACG 


TCACCTTCGT 


TGCCAAAGTT 


840 


TAAGTCGGCT TGGTTGGGAA TCCCGCCTTG 


TTCAATATAA 


GGCGCTAATT 


CTTCATCCGT 


900 


TGCGTCAGTT TTAATTTTAG TGGTAATAGT 


CATTGTGTAT 


GTATGACCAG 


CTAAGTAAGA 


960 


GTAGCTGTCA-TCTTTTTTGT TCATAGTAAA 


AGTTACTTTG 


TTATTTTCTT 


GTGTTACTAT 


1020 


GCCATTATCT GTAACATCTT TGCCATTTTC 


mTCAGwAACT 


TTCACGTCTG 


TGATGTCTAA 


1080 


CACTTTATTA ATGTCATCCA CCATGCTGGC TtGGGTCCAT GTGCTTGTTT 


CGTTACCGAA 


1140 


AGCTGTTTtG ACGTTCCATT TAAATTCTTG 


GTCACGGTTG 


GTTAAATCTA 


AATGTTCTTG 


1200 


GCCTTCGATA TCTTTTGTAA TCGTTGGATC 


TTCTGGcGTT 


GGTGCAGGTG 


GTGTTACGGT 


1260 


TGGTTTGTTG GAATGCAACA CGTCACCTTC GTTGCCAAAG TTCAAGTCGG 


CTTGGTTGGG 


1320 


AATGCCACCT TGTTCAATAT AAGGTGCTAA TTCTTCGTCC GTTGCGCTAg CTTTGATTTT 


1380 


AGTAGTAATG GTCATTGTGT ACGTATGACC 


ACTTAAATAG 


TCATAGCTGT 


CkGCtTGkTT 


1440 


GtTCATTTCA AAAGTTACTT TGTTATTTTC 


TTGTGTTACT 


TTGCCGTTAG 


CTGTAACATC 


1500 


TTTACCATTT TCATCTGTGA CTTTTACATC 


AGTGATGTCT 


TU^CACTTTAT 


TAATGTCATC 


1560 


TACCATGCTG GCTTGGGTCC AAGTGCTTGT 


TTCGTTACCG 


AAAGCTGTTT 


TGACGTTCCA 


1620 


TTTAAATTCT TGATCGCGGT TGGTTAAATC 


TAAATGTTCT 


TGTCCTTCTA 


CGTCTTTAGC 


1680 


AATATTTGGA TCAACTGGCG GTGGTGTTAC 


GGTTGGTTTG 


TTGGAATGTA 


ACACGTCACC 


1740 


TTCATTGCCA AAGTTTAAGT CGGCTTGGTT 


GGGAATCCCG 


CCTTGTTCAA 


TGTAAGGCGC 


1800 


TAATTCTTCG TCCGTTGCGT CAGTTTTAAT 


TTTAGTGGTG ATAGTCATTG 


TATACGTATG 


1860 


ACCACTTAAA TAGTCATAGC TGTCTGCTTG 


TTTGTTCATT 


TCAAAAGTTA 


CTTTGTTATT 


1920 


TTCTTGTGTT ACTGTGCCGT TAGCTGTAAC 


ATCTTTACCA TTTTCGTCGG 


TGACTTTCAC 


1980 


ATCAATGATA TCTAGCACTT TATTAATGTC 


ATCCACCATG 


CTGGCTTGGG 


TCCAAGTACT 


2040 


GGTTTCGTTG CCAAAGGCTG TTTTGACATG 


CCAATCGAAA CTATCTTCAC 


GATTGGTTAA 


2100 


ATCTAAGTGT TCTTGATTTT CGATATCTTT 


TGTAATCGTT 


GGATCTTCTG 


GCGTCACGTC 


2160 


ACACGCATGC GCATTGTCAT TTTTAGGAAT 


CGTCGCTTCT 


GGAAATTCCT 


TAGCAATTGT 


2220 


TCCTTGATTC ATTGTCXTCAC TACTTACAAG 


GGTTGCTGCA 


TCAATCGCTG 


TATTTTCTTT 


2280 


GACTTCGTAG TGAATCGTGA TGTTTCCTAC 


GTAACCTTCT 


GTACTAGTTG 


CTGAAATGGT 


2340 


TTGTCCGTTA ATTTGAACAG GCACATCATT 


TCCAGCATCG 


: TCCGTAGCGG 


TCGCAGATTG 


2400 
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AATATCAAAT 


TGATTGGCAA 


TCGTTAATGA AGCTGTTGCT 


GGTGTCGATT 


GCGCCAGACG 


2460 


ATCTTTGACA ATTTGTTTTA 


ATTGGGTTGT 


AAAATCATCA 


ATAGATTGGC 


TTGTAATAAA 


2520 


ATCTTCTGGT 


GTGCTAGACC 


CTTGTTGCAA CTCTTGTTTT 


ACAAAAGGAC 


CCACTTCTGT 


2580 


TTTATATTTA 


TCAAAGTATG 


AATTCACTGA ACTTAAAGAT 


TCAACACTTT 


CCCAATACGC 


2640 


ATTAATCATT 


TCATAGCCTT 


GGTTAGTAAT 


TTCTTGGTTT 


AACGCTAAAA 


CTTCTGCTGC 


2700 


TGCACCTTGG 


TAGTCATTAC 


TATATTCCAC 


TGAGACTTGA 


AGAGGATGTC 


TTGGATCTGG 


2760 


ATATTCATTG 


ATTGAATCAT 


TGGTATTGGT 


CTTATGCAAG 


TAACCATCTA 


AACGTGTATT 


2820 


AGCGACCCCA 


TCTGTCACTA 


ATAGGAAATA 


CGTTTTTCGA 


TTCGTTAAAT 


CTCCGTGTGT 


2880 


TTGATTGTAC 


GTATCTAAAG 


CGAGTTTCAA 


TCCTGGGGCG 


GTTGGCGTAC 


CACCATACGT 


2940 


CCGAACGTCT 


CCAAAACCAG 


AGACAAATTG 


GCTTTTATCA 


TAAGTCAATT 


GCGTATTGAC 


3000 


GCGCACATTC 


ATATCATAAT 


CAGCTGAATT 


AATTTTTGTC 


TTTCCATCAG 


GAAACATAAA 


3060 


TTGTTTTCCG 


CCGCGATATG 


AAGCCAGCAT 


CACGCGGTCT 


TGATCAGATA 


AGCCTTGAAC 


3120 


CACTTCATCA ATCGCTTGTC 


TTACATGTGG 


AAAATTATCT 


GAAAAACTAC 


CACTAGCATC 


3180 


TTCAACAACT 


ACTAAATCGA 


CTGGTTCCGT 


TTCACCCGCT 


TGAACTGGGA AAGTAGCTTG 


3240 


TCCTTCCAAT 


ATCCGTTTAC 


AATCAGCCAG 


ACTGTCGTCT 


TTCACTTTCA CCGTATTGGT 


3300 


CATTCCTTCT 


GTATCTCCTG 


CTTTAACCGC 


TTCTTCAGCT 


AAAGCCAATG 


TGGGACTGAA 


3360 


ATTTCCCACA ACTGTTCCAG 


CGACAATCAC 


CATTGTCGAT 


AATAATTTCG 


TTGCTTTCTT 


3420 


AAACATTTTA 


ATCCTCCCTA 


TTTTCTTTCG 


CTAGTGTTTG 


GGACACTTGC 


TTCTCTTTAC 


3480 


TTCAACTACA 


GCTCCTTAAA 


TCTTTCATCA 


AGTCGATGTG 


TCTCCTTGTT 


TTTGTGACAC 


3540 


ATCTCTTACT 


CAATTTCTGA 


TTCTATCACC 


TTTAAAGGCT 


AAAACTTGGC 


AGTATTCGGC 


3600 


TCTTGAAAAC 


GTAAAATTTG 


TCTTTTAAAA 


ATTAATGAAA 


AGACTATCCA 


CTGACAAAGT 


3660 


TGGTTTTCAC 


TTACTTTTTG 


CTACTTGATT 


CCTTTTAAGA 


CTCTTTTTTT 


ACACAAAAAA 


3720 


AGTAAAAGAA CCTTTACAGT 


CCTTTTACTT 


AATTGTTAGA 


TGCTATTCAT 


AATTTTCTTA 


3780 


GTTTTGTTCT 


ATAAGCTGAA 


TAGTTTAGCC 


TGTTTATTTC AACAATATTT 


GACGTTAGTC 


3840 


AGCCGTTTCG 


TTATAGTTAA 


AGAAAAGGCT 


TTTTGGTAGG 


AGTGAATACG 


ATGAATGAGT 


3900 


GGCCCCATCA GTTAATGACG 


CTTTTAAC 








3928 


(2) INFORMATION FOR SEQ ID NO: 15: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5259 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CGTTAGGTAC ATTGCTGGTT AATGGAACAA GTGAACAGTT ATATGAAACA AaCGAmaCCA 60 
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TTCACTTGGC 


TTTTCCAGCT 


GAGAAATTAG 


TTATTTTAGA AAAATAAAAA 


CCATTTTGGA 


120 


GGAAGATTTA 


AAAATGAAAA AACGCGTAAT 


TTTAGGGACA TTAGTCGCTG 


CAACGTTATT 


180 


AATGACTGCT TGTGGAAACA GCGAAgc/WVC TACGAAAAGC GAGAGCAAAG 


GTGGAAGTAA 


240 


TGCTTTAGTC 


GTTTCAACTT 


TCGGATTAAG 


TGAAGATATT GTCAAAAAAG 


ACATTATCGC 


300 


TCCATTTGAA AAAGAGAATG 


AAGCGAAAGT 


TACCTTAGAA GTAGGCAATA 


GCGCAGACCG 


360 


CTTTACGAAA TTAAAAAATA ATCCCAATGC GGGAATTGAT GTCATTGAAT TAGCACT^GC 


420 


AAATGCAGCA CAAGGTGGAA AAGATGGGTT ATTTGAAAAA ATTACAGAAA AAGAAGTACC 


480 


TAATTTAAGT 


CAGTTAACGC 


CGGGAGCAAA AGAGGTTTTT GAAAGTGGTG 


CTGGCGTACC 


540 


AATCGCTGTA AACAGTATCG 


GGATTGTTTA CAACAAAGAA AAATTAGGCA AAGAAATTAA 


600 


AAACTGGGAT 


GACTTATGGT 


CAGCTGATTT 


GAAAGGTAAA ATTTCTGTTC 


CAGACGTTGC 


660 


CACGACGGCA 


GGTCCTTTAA 


TGTTATACGT 


TGCTAGTGaA CATGCTGGTC 


AAGATATTAC 


720 


AAAAGATAAC 


GGGAAGGCCG 


CTTTTGAAGC 


GATGAAAGAA TTAAAACCAA 


ACGTTGTTAA 


780 


AACGTATTCA AAATCGTCAG ACTTAGCyAA TATGTTCCAA TCTGGTGAAA 


TTG7UVGCAGC 


840 


TGTGGTTGCT 


GATTTTGCGG 


TTGATATTAT 


TCAAGGCGCA CAGAAAACGT 


GACGTATATC 


900 


GTTCCAGAAA GCGGCACGTA TGCCAACTAT AATACAGTCA ATATTCCTAA AAAAGCACAA 


960 


AACAAAGAAA CCGCCTTTAA ATTTGTTAAC 


GCACGGATTA GTGAAGAATC 


ACAAAAGGCC 


1020 


AAAGCGATTT 


CGTTAAATGA AGGACCTACA AATCAACAAG TAACGTTAAG CGAAAAAGAA 


1080 


GCGAAAAATA 


AAACATATGG 


TGCCATTGCC 


GAACGTGCCA AAACAGTCGA 


TTTTAATTTT 


1140 


ATTAATAGTC 


AACTAGCGGA 


TTGGATTGAT 


CAATGGAATC GGACGATGAA 


TCAATAGTTT 


1200 


TAGAAAAGAG 


GGACAAAATG 


AACGTCGATG 


TATTACTAAA AAATGTGTGG 


CTGTATCAAA 


1260 


CAGTCACTCA AACATTCGTT 


CAACGAAATG 


TTGCGATTAA AAATGATAAA 


TTTTATTACA 


1320 


TTTATGAAGA 


AGAGAACGTC 


AATTTGCAGC 


CCCAAAAAAC AATTAATGCC 


GAGAATCAAT 


1380 


GGATGATTCC 


CGGCTTAATT 


GATGCACACA 


TGCATATTGA AAGTTCCATG 


ACGACACCTA 


1440 


CTATTTTTTC 


CAAAGCAGTA 


GTGCGTTAcG 


GTGTGACAAC GGTGATTGCC 


GATGCCCATG 


1500 


ATUVTGGCTAA 


CGTATTTGGC 


TTAGAAGGCT 


TAAAAGCTTT TATGGCCGCT 


GAAACAGAAC 


1560 


TAGATATCTT 


TCACGCAATT 


CCTTCCTCGG 


TTCCTTCGAC CACTCCCGAA 


TTAGAAACAA 


1620 


CTGGTGGGAT 


TATCGGCTTA 


GCAGAAGTAG 


CAGAATTACT CAAAGAGCCA 


AAAGTGATTT 


1680 


GTTTAGGGGA 


AGCCATGAAT 


TTTAAAGGGA 


TTTCCTATGA GCCAGATTCT 


TTAATTCGTC 


1740 


AAATTATCGA 


TTTGTGTCAA AAACAACGAC 


CAACCATGCC TTTAGAAGGC 


CACTGTCCTA 


1 Qnn 
loUU 


AAATCGAAGA 


TCAAGAGTTA 


GCAGATTTTC 


TATATAGTGG TATCACTTCA 


GATCATACCC 


1860 


ATCAATTTCC 


. TAAAACGTTA AAAGAAAAAA 


TTGAAGCGGG CGTCTTTATT 


CAATTTCAAA 


1920 


ATAAATCAAT 


CACTCCTGAA AATATTCAGG 


TGATAGTAGA CAATGACTTC 


TACAATTATG 


1980 


CAAGTATTAT 


CACGGATGAC 


GTGATGGCTG 


ATGATTTGCT AAAAGGTCAT 


CTAAATGAAA 


2040 
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ATGTCAAAAA AGCTGTTCAT GCAGGGTTGC 
ATACGCCAGC CAAACGAATG GGTCTCCATG 
CTGACTTTTT GTTATTAAAT GACTTAGAAA 
GAAAGGTCGT TTTTGAAAAA GGCGAGCCCT 
CAGCCACTTA TCAACAAACC ATCCAATGTA 
AGGTGGCTAC 7VACAAAAGAA ACAGTTCGTT 
CATTTACAGA GCGAATTACA AAAGAAATTC 
AAGCCAACTG CGCCTTATTA ATTGTGATGG 
TTTCTTTAAT GGATCAACCT CTTTCCGAGA 
ATCATCATAA TTTAATGGTG ATGGGTAATA 
AATTATTAGC 6ATGCAAGGC GGCTATTTGG 
GTCCTTTACC AATTGGTGGG ATTTTATCGC 
TACAAAAAGT CCGACAAGCT ATGCAAGCGT 
CCTTTTCTAC ACTATCATTA CCTGTCTCGC 
TGGATACGAA GTCACAACGT TTCTATCCAT 
ATGAAAACAC TCATTAAAAA TGTGCATATA 
AAAGCCGGCT ATTTAGTGAT TGAAGAAGAC 
CTTGATGAAA AGCGAATGGC TGCAAATCAA 
CCTGGGATGA TTAACACCCA TACCCATGTA 
GATGTGCCAG ATCGACTCCG GCGTTTTCTT 
TTAGTAGGAT GCAGTAGTGA TTATGCAATT 
TTTTGTGATA TGTATTATTT TGAAGATGAA 
CGTGCTTTGC TCGGAGAGAC GATCATTGAT 
GGCGGtCTTT TTTACGCGGA AACCTTTATT 
CCTATGCTTG CGCCACATGC ACCGAATACC 
GAACTTAGTC GGCAATACCA AGTTCCTGTG 
ATGGCTGAGT TTGAAAAAGC CTATCAAAAA 
TATTTGAGCG AGCCGTTTAT TTTAGCGCAT 
AGTTTAGCTG CTACTAATGG AAAAGCGCGT 
TCAGCCAAAG GCGTAgcGCC GATTAAGCAA 
GGCACGGATG. GACCTAGTA6 TGGGAATACA 
GCGAATTTTC ATAAGACAGC ACACCAAGAT 
TATCTGGCAA CGATGGGGGG CGCTAAAACG 
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CTATCGAAAA 


AGCCATTTAT 


ATGGCAACTT 


2100 


ATCGCGGCGA 


AATTGCACCT 


GGTAAAAAAG 


2160 


GTTTTGATAT 


TAATACGGTT 


TATAAATCTG 


2220 


TCCATTATCC 


AGAAAAAATC 


GAAGAGTTTC 


2280 


AAAAGTTAAC 


AGAAGAAGAT 


TTACTTTTAA 


2340 


GTAATGTGAT 


TCAAAAACAA 


GAAATTGGCA 


2400 


CTGTTGAAAA 


TGGATTGTTG 


CAATGGCAAA 


2460 


AACGTTACGG 


AAAAAATGGT 


AATATTTCTT 


2520 


AAGGAGCCAT 


TGCGACAACC 


TGGGCCCATG 


2580 


CGATTGAGGA 


TATTCTGTTA 


GCGCAGAATG 


2640 


TTGCTTCGGA 


TCAGCAAGTG 


ATGGCAACTT 


2700 


AAGCACCGAT 


TGAACAATTA 


GGCGCTTCTT 


2760 


TAGGCTATCA 


AAATATGAAT 


GAAATCATGT 


2820 


CAGCCATTAA 


AGTGACAGAT 


TTTGGGATGA 


2880 


TAGTTTTTCC 


AGAGGATGGA 


GTTCTTTTGC 


2940 


CTAACAATGG 


ATGAGCAATT 


TTCAGAAATC 


3000 


ACAATTGTGG 


AATTAGCACC 


7VATGACCACA 


3060 


GTAATCGATG 


GTCAAAATGG 


AATTTTAATG 


3120 


GGCATGATTC 


CGTTTCGTTC 


GTTAGGAGAC 


3180 


TTTCCATTAG 


AACAATTCAT 


GACAAAAGAA 


3240 


GCCGAAATGT 


TACTGAGTGG 


TATTACGAGC 


3300 


ATTGCTAAAA 


GTTGTGAAAA 


AATGAGTGTT 


3360 


ATGCCCACTT 


GTGATAGTCC 


GGAGCCTTCA 


3420 


CGCAAGTGGC 


AAGGCCATCC 


GTTGATTACG 


3480 


AATTCACCAG 


AAGTGTTGGC 


GAAAATTATT 


3540 


ACCATGCACG 


TTGCTGAAAT 


GACTTATGAA 


3600 


ACACCAATTG 


CTTTCTTAGA 


AGAACTGGGT 


3660 


TGTATTTTGG 


CAACAGATGA 


AGATCTTGCG 


3720 


GTCGCTCATT 


GTATCGGTGC 


GAATACTAAA 


3780 


ATGCTTGATC 


AAGGGATTAT 


TGTCGGTTTA 


3840 


TTAGATTTAT 


TCACCCAAAT 


GCGCATGGTT 


3900 


CGCTCCTTGT 


TTCCTGCTAA 


AGAAATTGTT 


3960 


TTAGGCTTGG 


CGGAGCAAGT 


CGGCTCATTG 


4020 
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TATAAPATTA 


ATTGAAACGC 


AATCAGTTAA 


TATGTTTCCG 


4080 


i\l i 1 X lianXla 






TCAGCAAATG 


CTAGCAATGT 


TGAAGCCGTT. 


4140 






ACTTflPPAAT 


AAAGAATTAC 


AAGAAGCTAA 


TCTCAAAGAA 


4200 


»pr» an r*/^ a H li 


nnl XAXnXktfiA 




ACGTTTGTGA AAGAAGCTAA AAAAAGAGCT 


4260 




wAu/Uvmt AAVa X 


aCfSAPfiTaTP 


GGATTGATTA AATAACGCTG 


GATTGACGTT 




TAGTGAT I 




r»a^^<^papap 


AAACTTTTAA 


TGAAAGAGTT 


GTCTGTGTTC 


4 JO 1/ 


TTTTTTGTTA 


TAAAuAxx 1 X 


xCi ill iCXA 


TATATATTTT 


TAAGGTAAGT 


TGTATTATAG 




taagtttgag 


AAGAxviAbAA 


X isAliAuVaAAA 


AGCAATGAAT 


ATACAACGGT 


TTATAGAAAA 




acgcaaagca 


CGTGGTTTGT 


ClvUAAACa X bA 


ATTAGCTAAA 


GGGATTTGTA 


CGCAAGTTAC 




GGTTAGTCGC 


TTTGAAAAGA 


ATGGTCAAGT 


CCCAACTTTA 


AAAATATTGA 


TCCaACTATG 




taatcgttta 


GAGCTACCTT 


TGGGGGAACT 


CTTCCCGCGC 


GTAGGGATTA 


AACAACCTGA 


*\ DOU 


AATTCTTGAA 


AAAAT GG AAG 


Ik Ik ff* Ik ^ 7k Ik nvr* 

AAGCAGAATT 


CTTCTTGATT 


ACCAGCGAAC 


ATGACCAATT 




gcaaacaatt 


TTAAAAAATA 


TTCCTTTTGA 


TGAAATCAAA 


GATTCCCAAC 


TGTTATTAGA 


4 □ UU 


atattattat 


TTACAAGGCT 




TTTTCAAAAT 


GCTTCGTTGA 


TGGATTGCTT 


4 0 OU 


ATTTACCTTT 


GAAAAACTTC 


TGT TTGAAGA 


GCAAAAATAC 


ACGAGCGATA 


TTTACCGTTT 


A Qon 


gttagctttt 


ACTGGAATTG 


GCATGGCCTA 


TGCCAAAGAA 


GGCGAAATTG 


AGAAAGCCGA 




ATTTTATTTC 


AATAAGGTCT 


TCAAGGAAAT 


TTATCTCTAT 


ACTATTCAAT 


CAATGGAAGA 


5040 


tacttggcga 


GTATTAAATG 


TTGTCTTTCA 


CTGTGGTGTT 


TTTTATGCAG 


AAAAAGGCGA 


5100 


tttagaaaca 


AGTGATGCTT 


TGTTAGAGTA 


TGCGATTTCC ATCTGTTCTG 


ATAATCACGT 


5160 


cacgtattac 


TTGGCAAGAG 


CAGCTTTCCA 


ACTTGCGAAA AATGCCTTAG 


CTGAAGAAAA 


5220 


gccgcaagag 


CAAATTTTAG 


AACTACTACA 


AGATGCCCG 






5259 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7684 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

AGGCCGTnTG AAGTGGaCCc TnGAgGAaTT wGtTCaAAAr GCmAAgGnTT TAATTGGaGA 60 

AGGCcAmCGT GGAACnTGCG CaACtnATTcA TTGrAGAACA TT^GaTGAA TTAGGCGAAC 120 

ACTATCATAA AATTACAGCG ATGTTAGCTG ATACAACgAC CgATGgTTTG TTTGATAAAA 180 

TTAAAGGATT GTTTCAATAG TTGAAAAAAA TCTGACATAA AAGTGTTTAG CTCCAAAAAC 240 

CAAGTAGGTA CGATGATTTC ATTTGTTCAT GCTTCACGGT GAAACATTGC AACTCTCGCC 300 

GTTTAACAGT TTTTGAGCGT GGAGCAAAAA TCCAAAGTGA TTTTTGTCCC ACGCTCTTCT 360 

TTTTTTATTT AACTTTTAAC CATTGTTTTG GCAACACATA CGCAATTGGA TTAAAAAGAT 420 
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AATCCGCTCC 


CCGTTTCACC 


ACAGGCATAT 


TCGTAATAGC 


TAAAATGAAG 


AGGGCTAAGA 


480 


AGAAAAGTGC 


CAGGATTCGG 


AAAGTAAACC 


ATTCTTTAAA 


GAGTGTGTCT 


AATTGCAAAC 


540 


CACTTTCCAA 


CAGAATAATT 


AAGGGCATAT 


GCAAGAGATA AGGGTAGAGG 


GTATCTTTTC 


600 


CGAGGGCGCT 


TAAAGGTAAC 


TTTCGTTTCG 


GCAATAATGC 


AAACCAAGAA 


AGAGACACCA 


660 


ATGCTGCAGC 


AACGTAAAAG 


AACAGCTGAA 


TGAAACTACT 


GTATTCAAAG 


GCCGGACTAA 


720 


TTyCAGCGTA GGAAGCCCAA 


CCAGTTAAGA 


aGCCCCAAAA 


TTTCGTTGCC 


AATGACCATT 


780 


GACTATTTAC 


AAATAAAAAG 


GAGCTTAAAA 


TTACGCCTAT 


CGCTGCCCAT 


TTGACTTTGC 


840 


GATGATCTAA AAAGGTTTGT 


AGCTTTTGAA 


TATCCATTAA GTAGCCACAA 


TAAAAGAAAG 


900 


GGAAGT^GCC 


CAGAGTCCGT 


CCTAAAGACA AGAAACCTTC CACAGGTAAA TAGCCTGAGG 


960 


CGCAgCcGAT AACAATGATG ATTGGCAACA TGATTTTCGG CTGAACGCGC 


TGAAGAAAGA 


1020 


TGGTAATTAA 


GTAGAAACTT 


GCTAAGGAAA GTAAGTACCA AGGAACGCCT 


GTTTCGTTCA 


1080 


GTAAATCAAA 


ATTCATTTTA 


TTTGCAAAGG 


CGAAATTAGC 


AAGCGTAATC 


AACAACTTCG 


1140 


CTAAGAGATA AAGCCAAACA 


AAGGAAA6TA 


AGCGTAGATA 


TTTTCGTTGT 


TTAATGGCAT 


1200 


GTTTACTAAA 


GAGTCCAGAG 


ACAAAAATAA AGGCTGGCAT 


ATGGAATGAA 


TAGATCCAAA 


1260 


AGTTGATCGT 


TTCCATGAGT 


CGTGAAGGAG 


AAGCTTCACG 


ATCTAAAATA 


TGCCCAAAAA 


1320 


TCACTAAGAA 


AATTAAAAAA 


AGTTTCACGT 


TGTCCCATTT 


ATAAATTCGC 


TTGTCGTTTT 


1380 


CCATAACGAT 


CACTCCTTTT 


AGTAGAAAAC 


AGTTCACTTT 


TGTTTAAGGC 


AGAAGAGGAC 


1440 


AATTTTTCCA 


TTCATTGAGT 


AGATAATTGT 


AGATACGAAC 


CGGCAGCCGT 


TAAATGAGCC 


1500 


CCCACATACT 


TGCGCTTAAA 


ACAAAAAATT 


AGGCAGGATG 


TTAGTCTTAA 


ATTATACTGA 


1560 


AAGAGTGACG 


ATCCTCCAAA 


GATTTTGTTT 


ATTAAGAAGT 


GAAAGATGCG 


GTGAAAATAA 


1620 


ATGATAAAAT 


TTCAAAATGT 


TTCAAAAGTA 


TATCAAGAAA 


ATGCCGTTTT 


GTCCGATGTG 


1680 


AATTTAACAA 


TCCAACAAGG 


CGAATTTTTT 


GTATTAGTAG 


GGCCGAGTGG 


TAGTGGAAAA 


1740 


ACGACGACTT 


TGAAAATGAT 


TAATCGGCTC ATTGAGCCAA CGACAGGTGA AGTGGCTCTC 


1800 


AATGAACAAG 


CGGTAACCAA 


TTATCCTTTA 


CGTGAATTAC 


6ACTAAAAAT 


TGGCTATGTT 


1860 


CTTCAGCAAA TTGCCTTATT 


TCCTAACTTA ACCGTGGCGG AAAATATTGA GTTGATTCCT 


1920 


GAAATGAAAA AATGGCCCAA AGAGCAACGC AGAGAGCGAA CCATTGAACT ATTGAAAAAA 


1980 


GTTCAGCTGG 


ATCCTGAGGA 


ATATCTGCAT 


CGTAAGCCAG 


CAGCGCTTTC 


AGGAGGCGAG 


2040 


CAACAACGTA 


TCGGGATTTT 


ACGCGCCATT 


GCAGCCGAAC 


CAGAAGTGAT 


TTTAATGGAT 


2100 


GAACCTTTTA 


GCGCGCTAGA 


TCCGATTTCA AGACATCATC 


TTCAACGTTT 


GGTGAAAGAA 


2160 


CTTCATCAGG 


AATTAGCGAG 


TACCATTGTT 


TTTGTGACCC ATGATATGAA TGAAGCTTTA 


2220 


CTGTTAGGAG 


AGCGCATCTG 


TATTATGGAC 


ACTGGCAAAA 


TTATTCAAGT 


CGATACGCCA 


2280 


GAAGCCATTC 


AAAAACATCC 


GAAAAATAAT 


TTTGTCGGTC 


AATTTTTCAA 


ACAATCTTCA 


2340 


CCAGAACTAG 


CGAACTATTG 


TGGCGCTGAC 


TTGACTGCAT 


TTTTAGAAGC 


CGCAGATGAG 


2400 
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ACGATTGAGC 


CAGTGCCGAC 


TGTTTTCATG 


GAAACCAATT 


TAAAAGAGAT 


TGTCCAACTC . 


2460 


ATTAATGCGC 


AAGAGAAAGC 


AAATTTAGTA 


GAAAAACAAC 


AAGTTTTAGG 


GTGTATCACC 


2520 


TCAAAAACGA 


TTACTCGTTT 


TATGGAACAG 


CTGTTGGAAA 


GCGAGGGACA 


GTAGATGCAA 


2580 


GCACTGATTG 


AAACTTTTTT 


CGCTCGAAAA 


GAAGAATATC 


TGCAAGCGTT 


AATCGAGCAT 


2640 


ATGCAATTGT 


CCTTTATCTC 


GTTATTATTC 


GCTGCGTTGA 


TTGCAATTCC 


TTTAGCCATT 


2700 


TATTTAACAA 


GTCATCGTAA GTTAGCGGAA 


AGTCTGTTAC 


AAGTGACTGG 


GATTTTCCAA 


2760 


ACCATCCCTT 


CTTTAGCCTT 


GCTAGGCTTA 


CTGATTCCGT 


TAATTGGTAT 


TGGTCGACCA 


2820 


CCGGCAATTG 


TTGCGCTAGT 


GATTTATGCC 


TTATTCCCAA 


TTTTACAAAA 


CACCTATACT 


2880 


GGTTTGACGG 


AAATTGATCC 


TTCGTTAGAA 


GAAGCTGCAG 


T^GCATTTGG 


GATGAGTAAA 


2940 


AAAGAAAAAC 


TGTTTAAGTT 


TGAGTTGCAA 


ATGGCGTTAC 


CATTTATTAT 


TTCTGGGATT 


3000 


CGAACGGCTA 


CCGTTTTAAT 


TATTGGTACG 


GCCACGTTGG 


CGGCGTTAAT 


TGGAGCAGGT 


3060 


GGTTTAGGGA 


CCTTCATTTT 


ATTAGGAATT 


GATCGAAATA 


ATTTATCGTT 


GATTTTTATT 


3120 


GGAGCCCTTT 


CATCGGCTGC 


ACTAGCAGTC 


TTATTTAATT 


ATGGAATTCA 


TTGGTTGGAA 


3180 


AAAGCGAATG 


GGCGGCGACT 


AATCATTGGC 


GGTACTATTT 


TAGGCCTTTT 


GTTAGGTGGT 


3240 


TCATTTTTCT 


GGAATCAACA AACTTCTTCA 7UVGGAAAAGC AGCTGACAAT 


TGCTGGCAAA 


3300 


TTAGGAGCGG 


AACCAGATAT 


TATTATTAAT 


ATGTATAAAG 


CCTTAATTGA 


AGAAAATAGC 


3360 


GACATTCAGG 


TGACGTTAAA 


GCCAAACTTT 


GGCAAGACTA 


CTTTTTTATA 


TAATGCCTTG 


3420 


AAATCAGATG 


AAATCGATCT 


TTATCCGGAA 


TTTACAGGAA 


CAGTGATCGA 


AACCTTTTTA 


3480 


AAAAATCCTC 


CTCAGCTAGA 


CAATCAACCT 


CAAGTAGTCT 


ATGAAGCGGC 


ACGCGCTGGC 


3540 


TTGT^AAAAAC 


AAGAAAACCT 


TACTTTGTTG 


AAACCGATGC 


GCTATGAAAA 


TACGTATGCC 


3600 


GTCGCAGTTA 


AGCGCTCATT 


TGCAAAAGCG 


CATCAATTAA 


AAACGATTAG 


TGATTTACAG 


3660 


AAAATTAGCA ATCAGTTGAA AGCAGGTTTT 


ACGTTAGAAT 


TTATTGATCG 


GCAAGATGGT 


3720 


TATAAGGGAT 


TACAAGAAAA 


ATATCATTTA AATCTGAATG 


TCCAATCGAT 


GGAACCGGCG 


3780 


TTACGCTATC 


AGGCGATTAA 


TAATGGTGAG 


GTAAATGTGA 


TTGATGCCTA 


TTCAACGGAT 


3840 


AGTGAATTGA 


GACAATACGA 


TTTAGTAACG 


TTAGAAGATG ATCAAGCGCT 


ATTTCCGCCT 


3900 


TATCAAGGAG 


CACCACTTAT 


TAAAACAGCA ACGTTGGAAA AATATCCAGA 


ATTAGCAGAA 


3960 


ATATTGAATA AATTGGCAGG 


AAAAATTAGC 


GAAGAAGAAA TGAGTGAAAT 


GAATTATCAA 


4020 


GTCAATGTGG AAGGGCAAGA TCCTAGCATA GTGGCGAAAG ACTATTTAAA 


AGAAAAGAAT 


4080 


CTTCTTAAAT 


AAGTATTTTT 


TACTGAAAAA GACTGGTATA TAAACATTTA 


TCCAGTCTTT 


4140 


TTCAGATAAA 


TAAAGAGTTT 


AATGTGAAAA 


GTACCTGTAT 


TTTTACTCAT 


TTTTGGATAT 


4200 


AATAAAAGAA 


ATGAATGGAG 


AGGAGTGGTC 


TATATGAAAA AAGCAGAACG 


TCAACGTTTG 


4260 


ATTAAACAAT 


TAATTATGCA ACAAGAAATT 


GAAACACAAG 


ATGAATTAAT 


CACTCGTTTG 


4320 


GAAGAAATTG GCGTCCGAGC CACTCAAGCG ACTGTTTCAA GAGATATCCG 


TGAAATGAGT 


4380 
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ATTGTTAAAA 


CACATGGTGC 


AGATGGTCGT 


GTCAAATATG 


CAATTTTTTC 


TCAAGCACAA 


4440 


GGCACAAGCA 


GTGAAGAGAA 


ATTACGAGAA 


TCAGTAAAAG 


ATTCAGTTGT 


TCGAATGGAG 


4500 


CGAGTACAAT 


TTATTGTCAT 


CTTGCATACA 


GAGATGGGAA 


ATGCCGATGT 


GGTGAGTAAC 


4560 


TTTTTAGATG 


AAGTAGCTTA 


TCCAGAAGTG 


GCTGGTACAG 


TGGCTGGTGC 


GGATACAATT 


4620 


ATTGTGATTA 


CACGTTC^GA 


AGAGGATGCG 


GAACACTTTA 


TCGAACGCAT 


CGAAAATATG 


4680 


ATTTTTTAAA 


ATTGAAAGCA 


TCAATAGAAG 


GGAGTTTTTT 


TCTTGGCAGA 


ATTGAAAGCG 


4740 


CTACAAAATG 


GTTCGGATAT 


TCGGGGAATT 


GCTTTGGATA 


CGGAAGAGCA 


AACTGCAACA 


4800 


TTAACCGCAA 


CAGCAGTAGC 


CGAGATTGCG 


GTGGGCGTGG 


TTCGTTGGTT 


ACAGGAT7UWV 


4860 


AAACAGCTGC 


CACGAAAGGC 


ACAACAGCGC 


CTGACAATTG 


CAATCGGACA 


TGATAGTCGT 


4 920 


CTAACGGCGG 


AGcatTAAGC 


AAGCATTAGT 


AGATACGTTT 


CTTTCATTAG 


GTATTCAGGT 


4980 


GATTGACGTT 


GGTTTAGCTA 


CGACGCCTGC 


TATGTTTATG 


GCTACTCAAT 


TTCCAACGCT 


5040 


TCAATGTGAT 


GCAGCGATTA 


TGATTACAGC 


TAGCCATTTG 


CCTTATTACT 


TTAATGGCTT 


5100 


GAAGTTCTTC 


ACAGCAGAAG 


GTGGAGCAGA 


GAAAGAAGAT 


ATTCGGTATA 


TTTTATCTCA 


5160 


TACTGATCCG 


CTGACGGCCA 


ATGAAAATGG 


CACACTGATG 


AAGCAAGAGT 


TACTGCCAAT 


5220 


TTATGCGGAG 


CATTTGGTGG 


AAAAAATCCG 


CCAAGGAATT 


CATTCGCCAG 


AAGAGAAACC 


5280 


TTTACAGGGC 


TTTCGAATTA 


TCGTTGATGC 


GGGTAATGGG 


GCTGGGGGCT 


TCTTTGCGGA 


5340 


GCAGGTTCTT 


CAAGTTTTGG 


GCGCTGATAC 


GACGGGATCA 


CAGTTTTTAG 


AACCAGATGG 


5400 


GCATTTTCCT 


AATCAT^^.i'AC 


CTAATCCAGA 


TAATTCTGAA 


GCAATGAAAA 


GTATTCAAAC 


5460 


GGCTGTTTTA 


GCTAATCAAG 


CAGATTTGGG 


CATTATCTTT 


GATACAGATG 


TAGATCGTTC 


5520 


GGCCGTGGTG 


GACCAATCAG 


GTGAGGTACT 


AAACCGA7VAT 


AATTTAATTG 


CTGTTTTAGC 


5580 


GGCAATTGTT 


TTAAAAGAAG 


CGCCTGGTAG 


TTATATTGTG 


ACAAATTCGC 


CAACCTCAAG 


5640 


TCATTTAAAA 


ACATTTATTG 


AAGAAAAAGG 


CGGCCAACAG 


ATtCGTTATA 


TTTCTGGTTA 


5700 


CCGAAaTGTC 


ATTAATAAaA 


TGATTGAATT 


GAaTCATGGG 


GGCTTTCAGA 


CACCGTTAGC 


5760 


CATTGAAACC 


AGTGGTCATG 


CAGCGTTTCA 


AGAAAACTAC 


AATCTGGATG 


ATGGTGCATA 


5820 


TGTCGTGGCG 


AAAATTTTAA 


TGTTGTTACC 


AGAATTGAAA 


CAAAACAATC 


7UVACGTTAGG 


5880 


AGATCTGATT 


GCTACATTGA 


AGCAACCGGC 


AGAAACCAAC 


GAATTTCGCT 


TCAAAATTAC 


5940 


TGCGGAGGAT 


GTTACTTGTT 


ATGGGCAACA 


AGTCTTGCGA 


GACTTTGAAT 


TGTTTGTAGA 


6000 


AAATCAAGCA 


GACTTTGCTG 


TGGACCGTGA 


AAACCAAGAA 


GGTGTTCGGG 


GAAACGTGTC 


6060 


TGGTCAATAT 


GGCAGTGGTT 


GGTTTtTGTA 


CGCTTAAGTC 


TGCATGAACC 


GTTGCTTGTT 


6120 


CrrCAGGTAG 


AGAATGACCA 


AAGCGATAAA 


AATGCGTGTG 


TCATCGAAAA 


AATTGCCACT 


6180 


TTTTTACAAA 


AATACGAGGA 


AATAGATAGT 


CAGCAAATAG 


AAAAATAATC 


TAGGAAATTA 


6240 


AAGGTGTTCT 


TATGGAAAAT 


GTTTTAAAAG 


GAAAACACGT 


TGGCTTTTCT 


CATTTTAGAG 


6300 


AAGAATATAT 


CGAACATATT 


GCAAAACAGC 


AATGGGATAA 


TGAATTGCTT 


CGGCATTTGA 


6360 
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GCTGGGATGC 


TTTACATCCG 


TGGGGGCTTG 


AAGAATGGAA 


AGATTTCACA ATTAATAAAG . 


6420 


GAGAA6ATGA 


TCGATTTCTA 


TTTGCTATTT 


TAGAAAATAC 


GACAGAGGAA 


TTTATTGGTT 


6480 


GGGTTTCGTT 


GTCTGATGTA 


CAGTTGAAAA 


ATAGAGGGGC 


CAATTTAGGG 


ATTGCTATTT 


6540 


TACAAAAAGA 


GCAACGAGGT 


CAAGGATGTG 


GCTTTGAGGC 


GGTGTCTTTA 


ATTTGTAAAT 


6600 


TTGCTTTTTA 


TGAATTGGGT 


TTGCACAAAA 


TCAGATTAGC 


TGTCAATAGT 


AACAATCAAA 


6660 


AAGCGATTCA 


TGTTTATG7UV 


AAAGTAGGCT 


TCAAAAAAGA AGGAATTGAT 


AGAGAAGCGT 


6720 


TATTTCAAGA 


TGGCCAATGG 


CTGGATATCT 


ACAATTACGG 


AATTTTACAA 


AAAGAGTGGC 


6780 


TACAAATGAT 


TAAGGCTGAG 


AGTTAGCTCA 


GGAACTGTTT 


AACACCAAGA 


CACTGAGCAA 


6840 


ACGCAGATAT 


GCCGTTCTCA 


GCAGTGAATG 


TGCGGTAAAA 


TAAAAAAGCA 


GAAGAAATCG 


6900 


GCGGTTACCG 


TCCGATTTCT 


TCTGCTTTTT 


TTTACCATAG 


GCCAGGTGTT 


GCAGTCACAA 


6960 


CGATTTGTTT 


TTTTGTAAAC 


GGATGGAGTA 


AATGAAGTTG 


TGAAGCATGG 


AGCATTAAAC 


7020 


GAGCTGCCTG 


CGATTTTGGG 


TTGTATAACG 


GATCACCAAT 


AATTGGATGC 


CCTTCGCTAG 


7080 


CTAAATGGAC 


CCGAATTTGG 


TGCGTCCGAC 


CTGTTTCTAA 


AACACAGGTA 


ACTGCGGTTT 


7140 


GATTATTAGA 


AAGAGCTTGG 


ACCACTTGGA 


CATGTGTAAT 


CGCTGAATCC 


CCTTTTTGAG 


7200 


GATCAATCCG 


TCGTTTCCGG 


CGATCATGTC 


GATCACGACC 


GATCGGCTGA 


TTGATTGTCT 


7260 


GCTCTTTTTG 


GCTAAAGTGA 


CCAGCTACAA 


TCGCTTGATA 


TTGGCGGTAA 


ATTTGTTTTG 


7320 


TTTCTAATAA 


ACGACCTAAA 


ATAGGTAAGA 


CAAAAGGATT 


TTTAGCAAAG 


ACAATCGCAC 


7380 


CACTGGTTTC 


TTTATCTAAT 


CGATGAACGA 


CATAAGGGAC 


CTGATTTTTC 


TCGGCTAAAT 


7440 


AGGCAGCAAC 


ATGATTTAAC 


AAGGTGTCTG 


TTTCCTGTGG 


CTGATTTGGA 


TGGGTTTTAA 


7500 


TGCCATAAGG 


TTTATTTACA 


ATCATTAAAT 


GTTCGTCTTC 


ATATAAAACC 


TTCACATGCG 


7560 


TTGCTTCCCC 


CAAATGAATA 


GTGGGACGTA 


CATAATCGGT 


TTCTTCAAAC 


GTTAAGGTGA 


7620 


CACAATCGCC 


AGCTTGGACT 


GGAAAATGAA 


AAGCAGCTGT 


TTCTCCGTTG 


ATTTTGACAT 


7680 


TTTT 












7684 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32768 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CGGAACACCT GnCGCCTATA AAAAGGTTGG CCAACAAACC CTGATGGCTG GGGCGATTGA 60 

TAAACCGAAC ACTTACAC6G TCAAAwTTga TGTAGAAAcC GAGGGTTCTT TAAAAATCAA 120 

AAAAGTTGAT AAAGAATCAG GTGCTATTGT ACCAGGAACG GTTTTCCATT TAGACTTTGG 180 

AAAAACTTTA CCTGCAAAAG AC6TGACTAC TGACAAAGAA GGCATTGCTA CATTGGATGG 240 
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GATTCCTCAT 


GGTACAAAGG 


TAACCATTAC 


CGAAAAATCG 


GTGCCGGCGC 


CTTATACGAT 


300 


TGATACCATA 


CCCATGACTG 


CTACGATTAA 


AGCAGGCGAA 


ACCATTTTTA 


TGACTTCAAA 


360 


AAATACGCGG 


GAAAAAGGTC 


AAA.TCATTCT 


GGATAAAACC 


GGGATTGAAA 


CAGGAAGTGA 


420 


TCTTTGGAAC 


GACAATTACT 




AAACACGTTT 

n<^nv«nwU x x x 


GCTATTCGCA 


AGGACAGTCC 


480 


TACTGGTGAA ATTGTTCAAG 


AAATGACCAC 


GGATGAAAAA 


GGTCATGCAG 


AAACAGCAAA 

nnnv«n^^innn 


540 


AGAGATTGCT 


AATGCATTGG 


AATTGGGAAC 


TTACTACGTG 
X X nx^ X nw\3 X w 


ACCGAAACCA 


AAGCATCTAA 


fiOO 


TGGCTTCGTG 


AATACCTTTA 


AAGr!Af?TAAA 


AGTCGAATTC 
x\j>\3nt\x x\3 


AAGTATGPCA 


ATPAAAPPGT 




GGCCCTTGTG ACCAGCAATA 


TnH iLH.fZflfZT'li 
x^M^n^\iVa^ana\.*^rx 


rlnriV^V.,<nrlV3nn 


r\X X nv^ X OV3VJIV7 


AAAPaaPTPT 


f £\J 


GATAAAAGAA 


GACAAGGATA 


V« X vtVs Alan 1 An 


aapapaar:/ip 


aCGGPaGTPT 


TTGaGGf^nap 


7fln 


TGAATATACC 


TTGTTTACTG 


r"T*2v nfifinTfin 


a a a a^zpf^GTT 

AAnAVsVrfUoX X 




nnOVo X X X X nn 




ACCAGAATTG 


GTAAAGGGAA 


LfLinnnVjV^ 1 i 


TpaTPaaapa 


I^TnaPTPT^^G 


PTTTaGaTGa 

^ X X X nljnX on 




AAAGAaTCAA 


GCTGCCGTTa 




UA 1 UaACUACj 


T TV r"p TV r^^rr^r* 
X AU X AL> X oo^ 


a ar* a a a r^r'a a 

AAvaAAAUUAA 




AGCACCTGAA 


GGATATACCT 


rpcT^ -% Tf^ -» 7\ 7\ 

X ItV^S i LyaAAU 


r* TV TV r'T TV TTT'T 


^'P».i'P^« » TV TO TV 

Vj X WX UyAXCA 


a a a a Y-r'Tur'a 

AAAAxoX WVaA 


1 n9n 

XUZU 


TgATAACGAA AAAAaTGCCG 




AwlX o X X AV^V3 


poaaaacaap 

OV^AAAAuAn^^ 


ooo X XAX X^u 




TTTTGGGTTT 


GATTTCTTCA 




A 1 L> jciaC yoin 1 


(aoL»A^ i \9V,»Co 


a a apTi^/^PTT 

AAAV« X Vtou 1 X 




TAATGACTTA ACCTTTAAAG 




roAAVTisiiA^U 


aa'pr'ar'aTr^a 
AAi uAV^AX \^A 


p app*ppp*ppa 

wAuoXuLpIoA 


1 Of\f\ 


AGATrAAGCA 


ATCACAGCTT 




Al X AuvaO XXX 


paTP/iPTapc 

uA X uVsVr X Av«u 


GaaaGTTTGa 

UMnnuX X Xun 


1 9 fin 


AAATCTTCCT 


TATGGGGATT 


Ai 1 I Ad liaA 


apaaPTa^aa 

AoAAu X AuAA 


pp^ppanaaG 


GaTTTpaaaa 

UnX X x\^nnnt\ 




AATTACGCCA 


TTAGAAATCC 


S^x iClAUl i i 


Taappaaaap 

X AAlauAAAA^ 


aaa^aacaPT 

AAAla AAVanV.* X 


X X \3 X Isnnlasnu 




TGAGTATGTC 


TTTACCATTA 


k.*\.*oAAV-Anou 


ppaaaa/^paa 


ppaaTTaaAA 


PGGTGAPPGT 


1440 


TCCTTACGAA AAGCTGACGA 


ar*aaacf*aTT 


X XV#XUlX XnoX 


TTGaaPPGTT 


TGATGPTPTA 

X ^3t\X X V* X n 


1500 


TGATTTGCCT 


GAGGAAGAAG 


aT a r"rT*p^2 ar* 


X XVi^X V^X X tn.«n 


aPTT^tnaaGG 


APGGAAATAA 
nV» Vsvinnrl X nn 




AAAATTGAAT 


ACCCTTGATT 


'p*p a f^f^f2. a^^**p 

X X nV^V^unoVr X 


afZTTnaTa&a 

nuX XlsnXnnn 


TTGftGATATA 
X XutnVjinXAXn 


APTTCPATGA 


1620 


AATCAAAGAA 


GACTGGTATG 


1 VvU 1 nl9v^ X \->/l 


AGPPATTRAT 


GTGGAAGPPA 


PAAAAGPTGP 
KxXinnno^ X Ov^ 


1680 


CCAAGAAAAA 


GACGAAAAAG 


Pr* A AAPCAf^T 


GGTAATTGCP 
o\3xnnx xov^^ 


GAAAPAAPGA 


CCACTTTGGC 


1740 


GAACAAAGAG 


AAAACAGGTA 


PTTRGAAAAT 


TPGGCATAAA 


T T AACGACCG 


AACAAGTTTT 


1800 


GGATAAAACT 


ATCGTCTTGT 


TTAAPTATHT 

X X M\rt\^ X 4» X \3 X 


GTATGAAAAC 


AAGGAAGCTT 


TTGAAGCAGG 


1860 


CGATGAGCCA 


GTAGCGAAGG 


ATGCTAGCTT 


GAATAATCAA 


GCCCAAACTG 


TTAATTGTAC 


1920 


GGTGGAACGT 


CATGTTTCCA 


TCCAAACAAA 


AGCCCACCTA 


GAAGATGGTT 


CTCAGACCTT 


1980 


TACTCATGGT 


GATGTGGTGG 


ATATGTTTGA 


TGATGTATCG 


ATTACCCATG 


ATGTGCTGGA 


2040 


TGGATCAAAA. GAGGCTTTCG 


AAACAATTCT 


GTATGCACTA 


CTACCGGATG 


GTACAAACAA 


2100 


AGAAATTTGG AAATCAGGCA 


AGATTGATTA 


TGAAGTAAAT 


GATAAAGAAT 


TTACCAAAAC 


2160 


TATGCTTGCG 


GAAAAAGTAG 


ATACCGGAAA 


GTTTCCAGAA 


AGTACTAAGT 


TTACTTTTGC 


2220 
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AGAAATCAAT 


TATGATAAAG 


ATGAGAATAT 


CAATGGGAAA 


CACAATGAGG 


ATCTAAAAGA 


2280 


AAAAAATCAA 


ACCTTAACGC 


CCAAAAAAGT 


TCCAACCACA 


CCACGTACGC 


CGAAACAACC 


2340 


GGAT^CACCA 


ACTGTTCCAA 


GTGGCTATCA 


AGAATCTAGT 


CCCACAGTGA 


AGACATTCCC 


2400 


TCAAACTGGG 


GAGAAAAATT 


CCAATGTTTT 


ACTGTTCATT 


GGTTTTACTT 


TGATTTTTGC 

A VJ*A. A X X X X 


2460 




TATTATTTCT 


GGAATCGCCG 


TAACTAAGGT 


GATGAGATGA 


TACAAGTAAA 

A «AS« J^rli^S A 


2520 






xjnwv^u^w X ^Vtf 


TAACAAAGTG 


GTCGCGCTAT 


T TTTGT ATPP 

XAX XUX/\X^^ 






ATGAAAAv-AG 


GAAAAGAAGA 


TGGAATTGAA 


ATTTGTTGTG 


CCAGATATGG 




rr'r: a ft ft priTT 


TCGAAAGTTA 

XUwfVV^U X Xn 


AGT T ATGPPG 

AOX XnXVTN^VirVSI 


GTGAAGGAGA 


AATTTTAACA 


GAAGGATATG 


2700 


ft /^r*^* ft ft Tft P 


TAPGfJT^SATT 
X n\AiJ\3 X UX\X X 


nf^TPI^ Pft TT 

VTVI X v«\3^ AU X X 


APPATPTATA 

AWrS.«A X ^ X A X A 


PTPPAGPAAA 

V« X l.«^«AV3^«AAA 


PAAPGGGPPG 


2760 


a li T ft T*P/I ft 


fSPTC^TftCTft 
KJKJ I ova X no X A 


Vail^ X vsV^VtoAAo 


ncfzrzTfz a a a a 

^l^uru X OAAAA 


GGATTTTGAT 

OVIAX X X XUtAX 


PAAGAPPAAP 

v^AAV9AV«V«AA^ 




rrr^T ft ft & t^rr* 


p/?Tr^a aTPPP 


PftTTTf^l^TTr: 
L>Ai 1 Xool io 


ppaaaccpTft 


1 V> A\3A 1 I V3 AA 


AA 1 X V3\7V3 X 


^ O O \/ 




CtlTflTTfZTUT 
^oiuX lolAl 


ox ioAXoAX X 


Tftf^Tfiaaafia 

X AVj X V3AAAU A 


aTftr:r:ftr:r:ftft 

A X Av3\3AoV3AA 


A^aGftAnaTGA 

Aa V3AA\3A X UA 


9Q40 


pftTTftftpai^a 




v3 X Ao A X X Lr AV3 


TATTAftPftTT 
XAX XAA\_>AX X 


Tnr; ft ft a nT Tn 

I V3\3 AAAVJ X X O 


PrSTTTTTPPr^ 

X X X X X 






apaaPTaprir: 


aa^tpaaaaTP 

AAuv« AAAA 1 Vj 


a nft: a Tnr: ft a p 

Aoo A 1 AAV^ 


nnTTanTftftT 

\j\D X X X AA X 


GAGGTAAAAG 

O AO\3 X AAAA\3 




aar*^ftftr*ftTft 


TaaTTTnaaa 

XAAX X XuAAA 


TPP T PTflP a P 
X X X l3^«A^ 


a ai^r^TSPCft a T 

AAV9V3\3V«viAA X 


CftTTPaftnTT 
oA X X V«AAy X X 


AGTwTPPPGG 

A\3 X W X 




PT ft ft TPT^ZPr* 


PTT ft P^SPI^ ft 
^X XA^oV^uAls 


HMpfp /2I "1**1* a H!*^ 
X X XUV^X XAXu 


ft TCPft 1? AC/^T 
A X \3V«A\3 A\3w X 


ftflftCTTGGTP 
AUA13 X X OvI X w 


AATPPAATPG 

AA X wwAAX 




T^^PftTftPft^ST 


P^SPPft ft TV ft T 
i«UV«l.«<AAX XAX 


'PT'PTTP^aP 
\3 X X X X X kitUAI.? 


ftft^nftftPTftP 
AAuuAAV^ X Aw 


XVjX^AAX X0\3 


TTTATTAAGG 

X X XAX XAA0^3 




r* ft /2 ft P/Sft TTT 


ft r;*p T/: ft ft a 

Al3 X C X X VI AAA 


p^zapaappaa 

^l3Av«AA\^L>AA 


ftTPftftfZfZftftft 
AX ^^AAOOAAA 


TTPTftPnft ft P 
X XOX AV^UAAV^ 


PAAATTGAAG 

^AAAX XV3AAw 


3300 


^ftftftf^&AATA 


fSfZTCAAflATT 


AT/^AAATTA 
AX\9\aAAAX XA 


ATGTTGAAPA 

A X 19 X X V7 AAV^ A 


TATTTTAGAT 

XAX X X X AUAl 


TGTPTPGATP 


3360 


AATATf^AAA 
nnX aX ss^snnn 


ft fl/ZTf; Af^'TT A 
AUV9 X Onl9 X X A 


APTf^AGGAAP 


AAPTPGTTAA 

AAV^XVm>OX xaa 


AGPATTAAPP 

AU^^AX XAA^^ 


TATGATGAAA 


3420 


ft ft a'I*^*P'I"PfP 
Annx vix X X w X 


HftTTATCPftT 
VaAX XAX\9V«iAX 


PAAHGTTTAP 

V*AA\3\9X X XAViv 


TTGHAGAAAA 

X X VUnWAnAA 


TAGPGAPAGT 


GAAGAPGAPT 


3480 


X xunxuixnv^n 


ft ft PT^ZflTT ft 

AUAV* X V30 X X A 


GAAGAAGAAG 

VIAAOAAOAAO 


ftf^TPPTTPVT 

AV> X X X w X X 


TATGGTTGTP 

XAXOV3X X\3X^ 


GAAGTAAATG 


3540 


AAAACTTATR 


TPCCPACGPA 


CAATPPCTPT 


TGGAAGAAAT 


PGGTGTGGAG 


ATGCCAGATG 


3600 


CCATTGAA6G 


PTTTTTAAAV 
^x XXX xxvmx 


PAATTGGTTG 

WalasX A w\7 A A ^7 


AAAAGAAACA 


ATTPPPCGTA 

rlA A wwWW^'AJ* 


GTAAT7VAATA 


3660 


GCTAGCATAA 


CTATATGAAA 


ATGCCCAATG 


AAATCCTATC 


CGGCGTTT T T 

W\3\7w\7 A A A A A 


TGATCGGAGG 


3720 


TGGAAACCAT 


CAAAATGTAC 

V^«**l*ft*ft *lk \J A 


AAAGAACATC 


GCATACGTGC 


AAGAGATCAG 


CATT TGGTTT 


3780 


ATCACTTTAT 


TCTTGGCTGG 


CTAATTGCTT 


TGTTAATCAG 


TTGGATGGGT 


GTCTTTTATT 


3840 


TTCAAGAGTT 


TAGACAATTT 


GATATTTCTA 


GGGTATCACT 


TTCTACTATT 


GAAACTGTTT 


3900 


GGTCCATGAA 


GGAGTTAATC 


TGCTTACTTG 


GAAGTCTCGG 


TTTTTCAGGA 


GCTATGCTGT 


3960 


TGCTTTATAT 


TCACTTTTTT 


CCTGATCATT 


GGCGAAGCTT 


GTGGCATCGG 


CAAAAACTAG 


4020 


CTCGGATGAT. 


TCTAGAAAAC 


CATTGGTATG 


AAGTGAAACA 


AACTCAAAGT 


GAGGGCTTTT 


4080 


TCAAAGATTT 


AAATAGTAGT 


CGAACCAGAG 


AGACTATCAG 


TTACTTCCCC 


AAAATCTATT 


4140 


ATCGAATGAA 


GGAGGGTTTA 


CTTTCTATTC 


GCGTTCAAAT 


TTCACTAGGG 


AAATATCAGG 


4200 
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AGCAGCTATT 


GAAGTTGGAA 


AAGAAGCTAG 


AAAGTGG AT T 


GTATTGTGAG 


TTAGTAGAAA 


4260 


AGGAACTCAA AGACTCCTAT 


GTGGAGTATA 


CTTTACTGTA 


TGATATGATC 


GCCAATCGAA 


4320 


TTGGGATAGA 


CGAAGTAGTG 


Gr!AGAAAATG 


GGAPTPTGfG 


ATTGATGAAA 

n X X Kin * U/vVl 


AATCAAGTAT 


4380 


GGGCTTATGA 


TTCTTTACCC 


PATATGTTAA 


TPGPTGGTGG 


PAPAGGTGCG 


GGGAAGAPPT 


4440 


ATTTCCTCCT 


CACCATTATC 




TGAAGTPAGA 


TGPAGAATTG 
ll3\^Al9AAX lO 


TTTATPPTPG 
X X X A X K^\^ X Kr\3 




ATCCCAAAAA 


TGCAGATTTA 


It PTIt A TT T A/^ 


CTAPGGTGAT 


rinnTC A PGT A 


TATTPTPAAA 
X A X X v« X v* Ann 




AGGAAGAAAT 


TTCTGCTTGT 




TTT A TP A A PP 


PATPATAPPT 
bAl bA 1 Abb 1 


PPTAOTA Af2P 
bbX Aul AAb^ 


A tiOf\ 


CAATGAAAGA AATGCCCAAC 




nil ATT a 
(sAbAAAAl xA 


IbbblAlbl 1 


PP A PTTPP A P 
bb Ab X 1 bbAb 


A CQn 

4 OoU 


A 21 f^TTT VP 

\^nnn\ii ± ± X X ± 


AATCTTTGAT 


A A *r n pr* Tpr* 


PTTATATPPP 


TPPPA ATPPT 
1 bbbAA i V^V3 1 


TTTPPTAPAA 
1 X XV«L>XAV«AA 






GATGTGAAAT 




1 Cj Ab A 1 I A 1 b 


AA 1 Ab i bAb A 


aPT A ATPTTT 
AbiAAibX 1 1 


40UU 


TG AG AG AT GG 


AATGAAAGCC 


1 1 L.AAb r rtiA 


bb i bAAAb A 1 


b 1 UObt^ 1 AA 1 


BPTPPTAPAT 


hOOKJ 


GCTCTAATCA 


ATGAAGTAGA 


TTGGTTTTGG 


T ATbAAbb 1 A 


bbl bAAb 1 bb 


IbibAAAiUi 




GCCTAAGTGT 


TTGAGCATAA 


bOCTGACTAT 


Abbbbbbbbb 


/^/^*pr^TVTv TV TV a 
b bTbAAAAAb 


A i AAb A A 1 bb 


y o u 


TAAGAATGTC 


TGAAAATGAC 


TGACGAATCT 


GCGAATGTAC 


bbbTbxAAAA 


bbb xbATb X X 




GTCGAAAGAT 


AGGAACATAC 


GGACGTATGG 


TGAAATACCG 


TV^^TV^4> •« ^0m^ 

ACGAGuaagG 


TTbCATbA rA 


DXUU 


TGAAAATCGG 


GATAGCTAAC 


GAAGAAGCTA 


mtk 1kf|kfpfp^7ki^ik 

TAATTTCACA 


bbCTCATAb r 


Ar*ATA#^^TA A 

AbAl AbbX AA 


DXOU 


GGTTATACAT 


GATAGCTAAA 


Ik Ik ^«ik pnnv^^ik Ik 

AACATTGGAA 


/^^iv ti*^n> Yv TV 
CGATGTAAGC 


TV^TV TV TV TV Ol* A 'P 

AbAAAAbTAT 


^A A AT A A AT^ 

bAAAlAAAXb 


3ZZU 


ACTACTTCTA 


GATAGGTGCA 


m 3k nt Ik n /'*f n ik ik fi* 

TATAAGTAAT 


TV TV /^/^ TV TV TV *r*Wf* 

AACGAAATTb 


CTTTGTTTCT 


^Tr' A A Af TA A 

b X bAAAb X AA 




GGGCACAGTA 


CCGATGAAGC 


GTCTAACAAA 


bb X bbAbbuA 


TAPPPPTA AP 
i Abbbb 1 AAb 


TPn* A TTT A AT 
IbX AX X X AAX 




AACAATGAAT 


GTTGAAACAG 


ATGAAGTTlJA 


AbbAx b 1 bb i 


AAbAAbb 1 bb 


/ZPA A ATPPPT 
bbAAAXbbuX 




TATGGAGGTG 


CCACGCTTGA 


Ai«Aw\AAAAl 


APPTT APTPP 
Abbi xAblob 


PA AT ATTAPn 
bAAIAi 1A\.>V9 


ATPTAPAAGA 
AX \tt X AV.pAAoA 




AACATTCGAT 


CAGCTATTTA 


LrlUAAAljiAA 


AA APPPPA A A 
AAAbbbbAAA 


A APTTTTATP 
AAbi 1 1 lAib 


AATTATAPGA 
AAX XAXAbUA 




GTTAATCATT 


TCTGAAAATA 


Al Al 1^1 X 1 i 


APPATAPPGP 


APTATPAAGG 
AL> 1 AX l.«AA\9i7 


PAAAPAAGGG 
V«AAAV«AA\3VJ\3 




GTCTTCGACA 


CCAGGAACGG 


AL^lv^Wl Iwi^ 


AATTGATAAP 
AAX XlaAX AAV« 


TATAAGGAAA 
X A X AAV3U AAA 


TGA;i PP A AGP 


5640 


AGAGTTTATA 


CATCTTATTC 


1 1 \.«AV7 i 1 


AGAAAATTAP 


AAAPPAAAAT 

AAA^^AAA/\X 


PPATAAAGPG 


5700 


AGTAATGATA 


CCTAAACCCA 


Ar*ry!:pfZAGAA 


APGPPC AT T A 


GG AATTPPaT 


GCATGATTGA 


5760 


TCGAaTTATC 


CAACAGATGT 


TPA AGPa GGT 


APTGGAGPPA 


ATATGTGAAG 


CAAAGTTTTA 

X X X .X 


5820 


TGAACATAGT 


TACGGGTTCA 


HAPPATTGAG 


AAGTGPAAAA 


PATGPTPTAG 


GTCGCATTAT 


5880 


GTATCTCATA AACATCAGTA 


AGATGCACTA 


TGCAGTAGAC 


ATTGATATCA 


AGGGTTTCTT 


5940 


TGATAATGTA AATCACCGTT 


TACTGATAAA 


ACAATTATGG 


AATATAGGAA 


TTTGTGACAA 


6000 


ACGAGTCTTA. GCTATTCTTT 


CAAAATCTCT 


GAAATCACCC 


ATTCAGGGAG 


AAGGTATTTC 


6060 


GAGCAAAGGA ACGATCCAAG 


GAGGTATCAT 


TTCTCCACTA 


TTATCTAATG 


TTGTTCTAAA 


6120 


CGACTTA6AC 


CATTGGGTCT 


CAAAGCAGTG 


GCATACCTTT 


GAAACAAAGT 


ATCCCTATAC 


6180 
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flTHPTTTAAG 


AGATAC7VAAT 


CTAAAGC!AAC 


GCTATATTGT 
wv^xnxnx xux 


6240 




0A1I9AI 1 X lA 


nnnX Un X 


TAATHATTRT 
xnnxvxnx xnx 


WVi^vJ X Vi«^OW«/\ X 


TAAAATGGTT 
X Anrm X vy VI x 1 






AAuO X Li 1 AVio Vr 


Taaaanar'ac 

X AAAAoAwnva 


V3^X Xnnr\x XA 


GATATPTPPA 
unxnx Vi^ x\k*\.rri 


ATGAGAAr^TP 




ft* Ik Jk Ik l^fp^/^*P^ 

TAAAAi Ubl 


AA^l X bub X A 


a a 0/^^ a a a Tr* 
AAvrbi«AAAXU 


^^a a*i**p'Pi**i*a 

\3UAAX X X X X A 


nCATTPaPPA 
uuAX Xk«AL>L*A 


TTTPPPTPa a 

XXX VaUu X bAA 


04ZU 


aCAAAaGGGG 


AAAAATGGbT 


TTGxAAX ibb 


UAX AX X XUXla 


apaaaaapaa 

Aw\AAAAl3AA 


a aaToa a a 0*1* 
AAAxbAAAx T 


b4DU 


AAAAAAGAGA 


•nntik Ik Ik ^ Ik Ik jk ^* 

TTAAACAAAG 


7\ 7k «n«nik Ik TV A 'P 

AAT r AAAbA X 


|k rp«prv7k 7k A 

AX XUAAAAVsA 


bUub X AUAva^^ 


aoaaa a»pooT 
Ab AAAA 1 bb X 


D34U 


CTCCTATTTA 


ATGCTCTAGT 


GCTTGGTGTT 


iJAXAAl XAX i 


X w\bb X Ax bb 


bAblbAXAX 1 


bbUU 


AACTTAGATT 


Ik Ik m Ik ^^^^ m 

TGAATAGGAT 


GAACTACGTT 


CTTTCATTCA 


^'pa/^^'pa<ika a 
bTACbTATAA 


oooaTi>afP/^a 
bbGATTATuA 


bboU 


AATTGTAGTA 


CAAGATGTTA 


fll*^^<^7k 7k ^7k 7k 7k 

TCCCAACAAA 


GCl rcccuTA 


^^rpfpTv ^Tk a a a a 
CTTACAAAAA 


GTTTTACTCG 


b / 


ACAACGATGA 


AAACGTATAA 


AGTAGCCGGA 


GTCTATTTGT 


ACCCTCTTTb 


r^r^ TV /TP TV ^POfP 

CGACGTATCT 


0 /oU 


ACaAAAACAA 


TTTACGGATT 


m "Vk 'ih m Ik ^^^^^^ Ik 7k 

TAATACGGAA 


GATACACCGT 


TTACCCCGTT 


TGGtCGTCaA 


CO A f\ 

oo4U 


aGGTTGGAGC 


ACAAAAGCTT 


Ik ^ Ik ^m^m^mm 

ACAGTCTCTT 


^ m 7\ m Ik y^/^Tk ik ^ 

GTATACCAAG 


Ik Ik mm Ik 7k r^ 7V Tk 7k 

AATTAAGAAA 


TV rprprv TV tP/^^TV TV 

ATTGATGGAA 


byuu 


TCGAAAATAC 


m^^^ ^ Mi Ik Ik ^ m 

TGAATAGAAT 


GGTTGAGTAT 


mntik ik mtk 7k mik 

T TAG AT AAT A 


i^Tk Ik mik mr^my/^ 

GAATATCTCG 


rnm Ik /^m/^ik ik m/*^ 

TTACTCAATG 




AAATCAGGCA 


AGTGTGAGAT 


TACAAAACAG 


TTTTTGCCAG 


CAAAAGCCGT 


TCACTGTCAT 


T 0 

7020 




CAAAAAGTCT 


TGGTGGAGAC 


-vk m m mmm^*% 

GACAAATTTG 


ATAACTTGAG 


AATTATTCAT 


7080 


AAGGATATCC 


ATCTTTTGAT 


CCATACCACA 


AATAAAATGA 


TAATCGATCA 


CTACGTCAAT 


7140 


GAACTGAAAC 


TGCTGCCAGA 


GCAAATAGCA 


^ Vk ^ ^ flV^^ Ik Ik ^^^^ 

AAAATCAACC 


TTTATCGAAA 


Tk 7k tn^m^*fik7k 7k m 

AATGTGTAAT 


•1 1 Art 

7200 


CTGCAAAACA 


mm^^ Ik m n ^ ik vk 

TTCAATAGAA 


1h m fiiTk 7k Ik fRTk ^7k 

AATAAATAGA 


TGGAACGCGG 


mTV rT»/^ TV /^/^m/^Tk 

TATGAGGTGA 


a aooorpoaoo 
AAGbGTCAGG 


/ZbO 


TACGGTGTGG 


AGCGGGGGAA 


7k 7k ^7k fn^^Tk^Tk 

AAGATGGAGA 


mniTV T^TV tp^Tk TV TV 

TTATATCAAA 


rpfp^tniipTk ^^fpa 

T TGTl Abb xA 


rpr*/^rprp a *p a a a 
ibblxAlAAA 


/ .3^0 


TGTTAACGAC 


^■jk Ik Ik 0 3k Ik )k f^n* 

GtAAAGAAAGT 


GCGGTGATTT 


rpTk TV TV T^TV 7k f^f^fX* 

TAAATAAbCT 


0 a a a f^a a a *l*f^ 
bAAAbAAAXb 


b X bATb X 1 bb 




/•fp^^ffir^jk t^fnf^ 


TGGTTTCTTT 


CTAATTCTGG 


/^rprp^rp/-»7k a a/^ 

UT rb 1 uAAAb 


a i^^/^r^ a TOO a 
AbbbbA 1 bb A 


aaa'pa'P'p'Pao 
AAA! Ax 1 1 Ab 


/ 9 u 


^ jk 7» r^r^f^r^ t* t* 
GAGACGGGAT 


TCGCGATCAA 


TTTAACTTCC 


GAGTGGCTT T 


GbbTbbXAXb 


nSa X bAAb xbb 


n cnn 
/ 3UU 




GATbTTTboA 


bAAbTTGATA 


Ik TV TV 7k ipfpip/^fprp 

AAAAXl J-bi X 


•paToaa aooo 
T A x b AAAbbb 


Ai bAAAbb X b 


/oou 


bAbGCTATGT 


GGATACTGGA 


GGGAGTGTGA 


TTAbTbAbT T 


rprp TV rp TV r^f^f^r* a 

TTAl AbbbbA 


b X X b X bbbAA 


/ bzu 


AAouisl Ai oA 


X 1 I 1 1 AbAA 


X UCAX LAAAw 


AAb X AbbAUA 


aToaaaaoaa 

A X L. AAAAo AA 


aaaToaoar^a 

AAA X oAUAu A 




i /iWaioAAAAA 


X X XAAXV^XAU 


Ta Tf* a Tr^^ 
XAbX XUAXUb 


i^/**p*p a a a a r* T 

bbX XAAAAoX 


aToaTaaaPT 

AXbAXAAAUX 


anpTTTpaar: 

ALtV^ XIX oArilj 


7740 




00 X oL^AA X AA 


aaaTfar'ar^a 

AAAXbACAbA 


LAjAX V:>AX Xv# i 


TTaoaaTaT*^ 

X X Al-» AA X A i ^ 


ppannrrariaA 


7ft00 

f 0\J\J 




rlX XAX X X\.rX X 


wXlaX XAX XI3A 


Tr*fzaaaafzaa 

XuV3AAAA\9AA 


aaa^aacad^ 

AAA^AAVaAV^V? 


AAAAAAPTGP 


7860 


CATTTTTATG 


GCTGGAAGTC 


CTGGGGCTGG 


CAAGACTGAG 


GCTGCCCAAA 


CTTTAACGGT 


7920 


ATTAAATAGT 


AATCTTTGTG 


TGATTGATGC 


TGATAAGTTT 


CGTGTATTAT 


TTCCA6GTTA 


7980 


TGTTGGGAAT. 


AATTCAGATG 


AATTTCAACG 


AGGATCATCA 


CTCTTGGTAG 


ATGCAGCTCT 


8040 


CGATTTAGTT 


CTTAAAAAAG 


GCTATAGTTT 


TATTCTTGAT 


GCTACTTTTG 


CGACCTCTAA 


8100 


GGTAAAACAA 


AATATTGAAC 


6GGCACTGAA 


AAAGAATTAC 


AATGTGCTTG 


TCTATTATGT 


8160 
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TTATCAAGAT 


CCATTTATTG 


CTTGGGATTT 


TACCAAAAAA 


TGAGAAGTGG 


TTGAAGGGCG 


8220 


TTTTGTTTCT 

X A X A %a AAA W A 


AAAGAGCGTT 


TTATCAATGC 

X X r\ X WrlTa X 


TTTTTTCCAA 


TCACGAAAAA 


ATTTGATGCG 


8280 


TCTGAAAGAG 


CAAATTTCAA 


GATCAACTGA 


CAATCAATAT 


CTTAGTGAAG 


GACTTTCAAA 


8340 


ATAPGATTTC 


TGATTTTCTG 

XwsaX X X X^Xw 


ATGGATATTG 


ATAATGTTGA 
nx«v&Xwx X 


GTTGACATTG 


CCAATCAGTT 


8400 


AT AP AAAAG A 


AATATTGGAG 


GAAGAGPTAP 


ATGACTGAGA 


AAGAAAAAGT 


TGAGGAAATT 


8460 


ATGGAAAAGT 


ATAACCGTAA 


CTTTTCCACA 

X X X X 


TTACAAAAGA 


ATGCTTCTGC 


GAAAGAGCTG 


8520 




TTAAATTTGT 
X xnnnx x xwx 


AGPAGATGAA 


TCCAACCGAA 


AACAACGTGA 


ATTGATTGGT 


8580 


•p "Pi^r* a T a a a n 


A AA AATA iAT 


ATGAAATGAG 


AATAGTTGTA 

An X £\\3 X X \9 X n 


GAAGCPAAGA 


GPAAAAATGT 

WvAAAAAX w X 


' 8640 


i Jl i V3V»i ill 


TTTPT a 'prjpa 

J. 1 lVirXnXw\i«n 


ATGAaAGTaG 

Ax V3AAAV7 X AVJ 


ATATGGAAGG 

A X A X U\3UrtAV7\7 


AAAGGAAATP 

AAAVIWIAAX 


APAATPAGPA 


8700 


1 WviL* 1 o A 


TaPi^pAPTf^a 


pffzpnpacfia 


TTT^zr'f^ a a n r» 

1 1 X uVii>VJAA^ 


AGAAATPGTP 

AOAAA X X 


PGTGGTPPTG 


8760 


i oCL. loll IJA 


X (jVaA^^ViiTV^oU 


X AoCy ov^v.-A X 


uA 1 V9 1 Ul i.«AL^ 




TA APAGGGGG 

X AAV.>AO\9\3U\3 






ACAAVs/viA X A 


Av3 C AAuUAAk^ 


f^aTTaanaaT 

oAl XAA\3AA1 


X VTVf X X O X A 


nPAAflGaTAT 

V3^AA0VJAX AX 


8880 


i/\/\oxAAl 1 1 


A X AVjV? X b X 


Taar*Tr'aTTT 

XAAV^X\^^X X X 


r*aaaaa/^TTa 

oAAAAA\9 1 1 A 


apappTTf^aa 

A^AV^V.>X XvtAA 


TTTnnaf^TG 

XXX OVS AVJV3 X V3 


8940 


a r* » a <^ T 


a a a r*/^/^ a a T 
AV' AAU bvsAA X 


TT a r* a TT a T 
X 1 aijA X 1 ax u 


r^r'TTaTTaaa 

V^Vji lAi. lAAA 


a p a Tr*!^ T p rir* 

AoA i V^va X 


aaT<^aaTaTfi 

AA X AA X A X O 


Qnno 


GTAl X XCUCA 


a a ai*a a aT'pa 
AAAlAAAx XA 


X Av^k^l^U X X 


\jx\jnj\3\^x XAO 


1 CoACCC X A 1 


TTaaaTr'a^ia 

1 lAAAldAoA 






1 i3l9L>l9 X AAL* X 


X C X ACAA 


a a ar^r'a'Pf ar* 

AAALt v« A X V9 AV3 


aaaaaTTTTT 

AAAAAX X X X X 


aappaapTf:/: 

AA\^l.*AAC 1 OV3 


7X,^\J 


a a a/'*'i'"i'M"i*a a 
AAno 1 1 1 X nn 


TrT'r*^ a*p TT a 

X\m>OL»oAX X xa 


WVi»X X XAXV^uX 


X AC 1 1 1 1 V7A 


PTATriTAAGG 
k« 1 A 1 V3 1 AAV3 V 


ATTPGTTTPP 

AX X XXX \*\^ 






Tfzr* a^<z/z a a a 

1 U\«AV.*lar\xAAA 


aTPATTf^aan 

AXV«AX 1I.«AA\3 


anaTTPTPPA 

AOAl 1 d VrVrfA 


PPTGAAATTT 

1 urAAA XXX 


GATTATATGP 

\3AX XAXAXVSV^ 


9240 


Tppap/iaafza 


X 1 A\^uV«C ill 


TaPTPPTATP 
i Av^ 1 1 Al 


AACaAPA&TA 
AAVaAAV^ AA X A 


TGTPATGGGP 


GATATTGTGG 

\3AXAX XV9XUV3 


9300 


X VJiAXO X xv>x 


TPATGAAGAA 


GATAAAGGAG 


TTPTTTTGGA 

X X w X X X X OOA 


ATTA7VAGGGT 
A X X Annwns x 


CGTGGTTGTC 

V«^7 X X X W X %^ 


9360 




AAPTTTTTT A 
nrW^X X X X J X/\ 


PTPGPPPAAA 


AGCGTAacTG 

AOVtfU X A^ ^ X U 


GtACGACTTT 

W W A^wA.^ XXX 


TTCGAAGATT 

X X ^W«ai«XWlX X 


9420 


GTPTAAAAGP 


PGGTGGPGTA 


ATGAAAPGPT 


TAGATTTGGC 

XAUAX X XOVJV^ 


AATCAATGAT 

AAX ^«AAX \3AX 


CTAGTAGGGC 


9480 


TATTCGATAT 


TPPTGATTTA 
xvv^xunx X xn 


APGAAAAAAT 


GTPAGAAGGA 


AGAATGTATT 

TXUw^Xl X X f\ X X 


TCCTTGTT TC 

X X X W XXX W 


9540 


GT aPPTW A A 
oxxiwVifX X Xnn 




TPCGGGGAGP 


TGTTGAAAGC 

X U X X wAAAww 


TGATGAAAAG 

X UtAX wAArVlU 


GATGGTATGG 


9600 


GGAATACCTT 

w\3n<^ X ^^^^ X X 


ATAPATTGGP 

/\X AV^AX X\3\9^ 


AGTPTPAAAA 


GTGAAGTTTA 


TTTTTGTTTG 

X X X X X ^3 XXX ^3 


TACGAAAAAG 


9660 


ATTATGAACA 


ATATATPAAG 


CTAGGGATTC 


CATTAGACAA 


AACTGAAACG 


TU^AAATCGCT 


9720 


TTGAGATTCG 


ACTAT^AAAAT 


GACCGTGCCT 


ATCATGCAAT 


TCAAGATTTG 


TTAAAGGGTC 


9780 


GTAGCATTGA 


AAGTACCACG 


TTTTCCATTA 


TCAATCGTTA 


CTTGGGATTT 


GCGGATAAAG 


9840 


TAGAAGGCAA 


AAGGCGAACT 


AACTGGCCAT 


TAAATGAACA 


ATGGGGGCGT 


TTTATTGGAC 


9900 


GGAATCGGAA 


AGAGATACAA 


CTGACTTCTG 


AGCCGAAACC 


GTATACCATT 


GAACGAACCT 


9960 


TAAATTGGCT. 


AGGTAGACAA 


GTCGCCCCTA 


CGTGGAAGAT 


GGCCAAGGAG 


TTGGATCGGT 


10020 


TGAATCAGAC 


AACCTATATC 


CAGGATATGG 


TACAGAATGC 


GCGCTTATCA 


GACCGGCATA 


10080 


AAAAGATTTT 


GGAACAACAA 


AGTATGGCAA 


TCGAAAACTT 


GATTATATGA 


AAAGAGGAAT 


10140 
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TGTGGTAGGA 


AGTGTAAAAG 


ATATGTTATT 


GGCATTTGGT 


GGCAGCATGC 


TTAGTGTTAC 


10200 


CGTGATGTGC 


TTGATGCATG 


CAAGTGTAGC 


GGCGAACCAA 


GCAATGGAAA 


AGAAAGTCTA 


10260 


GTGGTAATAA 


GTCAAGATAA 


TTTTTCAAAA 


GATTTTTAAA 


GGTAGTTAAT 


TTTTGAAGCG 


10320 


CACTG7VACTA 


AACCCGACCA 


AAATTAAAAA 


TTTTTTCGGG 


TACTTCCAAG 


GATTAGCGTC 


10380 


AATAACATAC 


TGTTATTTAT 


CTATCACGCT 


CXTTTGGTGGC 


ATATAGAGTT 


GATGGTTACG 


10440 


TAGTAGCGTA 


TCCACCAGAC 


GCGTAAATTT 


TCTTGCGGTT 


AGTACGATGG 


CTCGTTTGTG 


10500 


TTTATGCTTG 


GGTGTTTCAG 


CGTATTTTTT 


CTTATAATAC 


GCATGGTACT 


CAGGCTCGTA 


10560 


TfTCPTTACG 


GAGTTGGTGG 


CTTCAACTAG 


ATAATAACGG 


AGATAACGAT 


TGCCTCGT T T 

A \^\^\^ A Xtf AAA 


10620 




GTATTTTCAT 
oxnx X X x^nx 


ACTCAGAGTT 


ACCTGATTGG 


TTCTGTCGCC 

X X W X W X ^^^7ww 


AATATAAACC 


10680 




GPPAATTTGP 


TTTrATCTCA 
X 1 Xxi>/Ax^x\3m 


AAAGCGATTG 

/W\V9w vW> X X V3 


ATAGGGCCAA 


TTTCAGCGAG 


10740 




GPATAAAPAG 


GAPPGATTPP 


TGGAATAPTG 


GTTAAAPATT 
\3 X X ev\t\\^-t\ X X 


GATATTCAGG 


10800 


Ta<^zipnaprr; 


apTaaaTpnT 


PAATGGPTTT 


GTPTAAGTPT 

u X ^ X rVnU X ^ X 


TTAATTAAPT 

X X rv\ X X X 


TTTPAAGAPT 

XXX V«/VlV3/W^ X 


10860 


apr:aaTTTPT 


TTGGPTAGAA 


PAPPPAAAAC 


2VATATPTAPA 


GAGTCCTGAT 

X wV«r X VJ-ii X 


GGACTTTTCC 
\D\3n\^ X X X X \^\^ 


10920 




GAAPTACGTA 


TCGPTTTAGA 

X OOVr XXX tWDrt 


AATTGCCTTT 
rxnx X V3^\^ x x x 


GCAATTCCtT 

wV^Xm» X X L> X 


PGGGaTTTTT 

V«V7w wCI X X X X X 


10980 


aaAAPKATAP 


TTPPPTAGTT 

X X^^^X/l\3X X 


T TTG AATT AG 

X X XwnnX Xnw 


GTTPGPAAAP 


TPTTPTAATG 

XWX X^X/VAXV 


ACAGTTCGGC 


11040 


rvlO X X Ow X V^Vrf 


AT TGTGT AAT 
AX X o X \j xnnx 


X X X \^t\ X 


TAGAGTTAPT 
X nvsmu X X X 


AATGTGGnAP 
nnx \3 X vownv* 


TTAAAACGGA 


11100 


X \0\^n>^\^ X X X 


CPT TTT AGT T 

wWX X X XX\wX X 


CTTTAGAAAG 


CGTGTTGCAT 


TTATAATAAA 

X X X rV» X alrlss 


TGTTTTCGAT 

X \7 X X A X V^^7X^ X 


11160 


AAAGTGTTGC 
xvmu X w X X 


TTTGTACGAA 
X X X o X tx^\3nn 


PP ACPTGTT T 
^wnw^ X w XXX 


AATTAGCTGA 

X X f^w w X W 


TAACGCGTTC 

X J^f%^«w w\9 X X \^ 


TAGTTAAGTG 

A a a A \J 


11220 


TTGGAGGGrA 


AGGTATTCTT 
nwxfix x^x X 


^^x^x X xnnx 


GATGGAGTTG 

Wirl X ^7 wssX^ X X w 


APTTGCCTTT 

X X www XXX 


GGATGCGAAA 


11280 


ATAGTCAGPG 


ATATAAAAAG 
t\xtxx ntw%jns3 


p ATCAATT TG 


ATPGTTTTTG 

X WW X X X X X %9 


TTTTCTTCAA 

X X X X \^ X X wfV% 


AAATATCACG 


11340 


ATACTTTTTA 


ATTTTATTTG 

»\X X X XM^X X AW 


GT TGTTCAAC 


TGAAACCATA 


AGGTTCAATT 


GATTTAACTC 


11400 


AAGATCTTCT 


TTAAAAAACA 


TAGCTGGATG 


AAAGCTATAT 


AAAGATGTGG 


CCTCCATTCC 


11460 


AATCACTAAG 


PTPTPTATPT 

V* X ^* X \i# X«*X w X 


CAAATTTTTG 

w^^fU^X XXX X ^7 


AGAAAATTCG 

J^^TsirW^X X w^S 


AGAATGTATT 

X w X X X 


CTTTGATCTG 

AAA A A %9 


11520 


GCTTGCGCCA 


AGTTGACTAT 


TTCCATAAGA 


TGCTTCTTTT 


AAAACAGAGC 


AAGTTGAATC 


11580 


GTCTGTCATA 


AAGCAAGCAT 


CTAATTTTTC 


AGAACTAACG 


TCTAAACCGA 


CAAATAGTTT 


11640 


CATGTGGAAG 


GACCTCCTTT 


CAT TGATT AT 


TCGGAAATTG 


CTTGGGTACT 


GTGGGATATC 


11700 


CCAGAATCAC 


GTTGTCTAAC 


GACAGCCTCG 


CATAATAGAG 


TTTCATTCAT 


TAAACCTCTA 


11760 


AGCCACCCAA 


TCTcTACTAG 


GaTTGGTAGA 


GGTCAATGGa 


CTCAGCTAGT 


GTGTTGGaGG 


11820 


TTATACCAAC 


GGTTCGGGTC 


ACAGACTTTC 


TGTCGCAGTA 


AAACTGCAGA 


AGGTGATAGC 


11880 


GTCTTCCCGT 


AAGTCCTTTC 


CAAGACTTAT 


TATTCAGGAA 


TATCCTACAG 


TATCCAA6TA 


11940 


GTAATCGTTA 


TTTTATTTAA 


ACATATTAGG 


AATAGCCCCT 


AAGGGmGwTT 


GAAGGAATGA 


12000 


AAGTTTCTTC 


AAAATATATA 


TTACGAGGAA 


AGTGACAAGA 


TGAATTTCGG 


ACAAAATTTG 


12060 


TATCAATGGT 


TTCTAACCAA 


TGCCCAATCC 


TTAGTCTTAC 


TGGCAATTGT 


GGTCATTGGT 


12120 



wo 98/50555 PCT/US98/08985 

327 

TTGTTTTTAG GATTTAAGCG GGAGTTTTCT AAATTGaTTG GCTcCTGGTG ATTGCCTTGA 12180 

TTGCGGTAGG TCTGGTCTTT AATGCTTCtG GTGT7UVAAGA TGTCyTACTC aATTTATTCa 12240 

ATCgAATCGT CGGTGCGTAA GGAGGAGTCA CGATGGAACa GATGCGTGTT TATATTGCGa 12300 

ACTTAGGTCG ATATAACGAA GGCGaATTAG TTGGTGcTTG gTTTACACCA CCTGTTGATT 12360 

TTGATGmGaT AAAAGAACGA ATTGGATTAA ATAATGAATA TGAGGAATAT GCGATTCATG 12420 

ATTATGAACT TCCTTTTGAG ATTGATGAGT ATACACCAAT TGAAGAGATT AATCGGTTGT 12480 

GTGGTTTAGC GGAAGAGCTT GAAGGAACGC CGATTGGTGA GGTTGCTTCA GAGATTCAAC 12540 

ACGCTTTCTT CAATTCATTT GAAGAAATGG TGGAGCATCT GGATAATATT GTTTATTATC 12600 

CAGATTGTAA TGACATGAGT GATTTAGCGT ATTACTTGAT TGAGGAAGCA GGCGCTTTAG 12660 

GTGAAGTTCC TACTCATTTA CAAAATTATA TTGACTATGA GGCTTATGGT CGAGATTTAG 12720 

AAATTGAAGG GAATTATTTG GTTACGAGTC GTGGGGTCTT TGAGCTATGT GAGTGAGAGT 12780 

ATATTGAATA CTGCCCTTAA AGATACTACA ATAGAAAATA TCGAGTAGTA TTTCAAAGGA 12840 

GTGGAAGTGG TGAATATTGA TGAGGCAAAA CAGCTTGCAA ATTATTTTTC AGACGGATCT 12900 

CCTGGATATA TATTGGAATT CAATAATCTT GAATGGGACC TATATACTAA GAAAATAGTT 12960 

GGATTTGGAA TTCAAGAAAG ATTTGGTGGC TCTAAAGGAA ATTCCTTCAA AGAATTTCTT 13020 

GTCTCAGAGG ATATTAGTGA TCACCTAAAG TTCAAATTAA TATACTCATa AATGaAACTA 13080 

TGAAGCAAAT gCTTCTATTC GCCAATTTGa AAATCCAAGT TCaTATAAGT CaCCTAAAAk 13140 

GAATCCACTT GtGACTCyAw TGaTTgAAGA GGCaTATCaT AAGaAAGATA GTGCaAGATT 13200 

GCtTAATGGT ACTAAGTTTT CGAAAGAAGT GAATTTAGAG TATATTCAAC AACTTCCTAC 13260 

TCGTATAAGT AAGGATATTC AAAATAAAGA TTTCGATTCT GTTTTAACAA AATCAAACAC 13320 

TATGATTGAT GAAGTCTTGA AATATATCAT TGAGAACACA AAAGGAAAAA GCATTGATAA 13380 

TAATACTGTA AGGTCTAAAG ACTTAAGAAA TCAAGCCTTT GAAAGACTAA ATATAAAAAT 13440 

T6ACAAAGAT ATGGATAmAa GAATAAAaGC TTTAGTTGGT GCTCTAAACA CTTTGyCAGA 13500 

CAAAATTCTy GAGATGAGGA ATAGTCAAGG TGATGCCCAT GCTCAGGGCT CCAGGAGGAT 13560 

TGTAATTAAT GAAGAAGAAG CTATCCTTAC TGCGAACTCA GCAATGATTT TATGTGAGTA 13620 
TCTATTTCGA AAGTATGAAA AGAGAAAAAG GTTCAGTGAT GCAAAAGAGA GTTGATTtTT 13680 
ATCAGCTCTC ATTTTCTATC TTTyCAATTT TTGCGAATAG CTTCATAAGA ATACTAGTTt 13740 
GTTCTATAAC AAATTCTACT TCTAGGTCAG TGAATGTTTG ACTATGTTTA ACATTTTCAT 13800 
TTTGATAGCT TATAAGTCCt TTAATAATTA AACGGACCAA ATTGATTATT TCTTTTGGAA 13860 
TGCCCtTTTT TCTCAATAAG QGCAGGAGTT CTCCTTCGAT AGGCTGGTTT TCTAAAGACT 13920 
TTGAGTTTTT.TAAAATATCC TGCATAAAAA ACTCAAAGGC CAGCCTGATA CTGTCTAAAG 13980 
TATAACGGTT ATA6ATGCTG TTGTCGTAAG ACTGTATGGC TTTTTCAAAT TCTTCTTTTG 14040 
CTCTTGAGGA AGCAGTCAAC CAAGTGTTAG TTTCATTGAT GATATTTCTA TCTATTTTAT 14100 
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PATTAGAATA 


AAPT^^AAPrA 


TA ATTTTTTA 
x/v\x XXIX xn 


GTAAPACTTT 
u X /Anv^no x x x 


TAAAATPTTA 
XAAnAX^X XA 


GPAAPGTPTT 


1 41 fkO 

X M X 0\l 


TATTTGCTCG 


ATTTTGAGCA 
m:\ X X X X ux\w^n 


AGPAPAPTTT 


PAPAAAGPTP 


TTTTATAATT 
XXX xnxnnx x 


TTAAATTCPT 


14220 


\3± XWX X^X\9X 


GAAAGATTGT 
OrmnvAX x\3X 


AAATTATTTT 
/innX Xril XXX 


fSAATPCPnfST 


AflPPTTAf^AA 


CTAAACTTP^^ 


X 4 £OU 




Pf^AAATAAAC 
V«0/vvl X /innl3 


f^f^AAPA^AAn 


AI^TTAAATTP 


TfZPTPP A T AT 
X \J\» X ^V«A X M X 


TPACPAAATA 
X l^AOV^AAAX A 


X ^ J4 vJ 




PTPAPTTPPP 
^ 1 CAu 1 X CCl^ 


^APTAAAPTT 


PPTTAAPTPT 


mcc A r'T TPPP 
XvrCACX XoCC 


aTTPTTATTa 
AX XoX XAX X A 




AGT TTTWJA 1 


A 'P TV T A rr^vf^f^ 
AlAx ACCxCC 


a^paaaaaaTT 
AiAAAAAAi i 


a a a a T/^ 
AAUCAxlaAlla 


X 1 lAlAX XAX 


A mm A ^ A n A ^VP 

AX X AGAnAG r 




GAGCCGATTG 


7\ 1% rp 71 rit/** TV A A A 

AATATGAAAA 


AAA T* AAA A A 

AAATAAAAAG 


^fTiTv ^7V ff^ A 

CTACACCAGT 


ATCTGGGCAG 


fP*1>^A A A A^^n* 

TTGAAAAGGT 


14520 


CATTTATGCA 


ATCAATGACT 


TTCAGCTACC 


T TTTCCTGTG 


A ^^rnmn a a 

ACCTTTAATC 


Tk 7k m^^/'*fii«ii^ 

AGATGGCTTG 


14580 


GTTTGTTTTG 


TCCTTGCTGT 


TTGTCATCGT 


T TTTGCCCAT 


GTGCCTCCGC 


TATCTATGAT 


^ A C /t t\ 


AGAGGGGGCC 


TTTTTAAAAT 


ACTTAGGCAT 


ACCGGTGGTG 


GTCACTTGGT 


TCATGTCGCA 


14 /OU 


AAAAACCTTT 


GATGGTAAGA 


7k 7k /^/^7k f>n\^^ 

AACCACTGGG 


TV mmmmmTk ^/^7k 

AT TT TTACG A 


TCCTTTCTCA 


/^rTim7k m^7k 

GT T ATCAC TT 


14 / bU 


AC GG T T T AAG 


GTCACTTTTG 


7k /^/^7k 7k 7k /^7k Tk 

CAGGAAAGAA 


GGTCAAAGAA 


^Tk /^7k 7k 7k 7k 7k 

CAGAAGAAAC 


GATGGGATGA 


14 oZU 


AC CCA rCACl 


CTAGTAAGGA 


/~^n>f^rr\^ TV 7k^rT>7k 

GTGTGAACTA 


T G T ACCCG AT 


m 7k 7k rn 7k fnorn m 

TAAGTATGTT 


^7k TV 7k Tk mTk 7k rn/^ 

GAAAATAATC 


1 o o rt 


TTGTCTTTAA 


t CAAGAGGG a 


/**7k 7k ill/ •fniimwiK^ 

GAATGTTTTG 


CCTACTAwGA 


ACTTGTGCCT 


TAw AACT AT T 


14940 


CCTTCTTATC 


GCCTGAACAG 


AAGTATCAAG 


m^^Tk ^^7k mik 7k 

TGCACGATAA 


CTTTCGTCAA 


TTGATTGCTC 


15000 


AGAATCGTGA 


AGGCAAGATC 


CATGCCTTAC 


7k 7k 7h mm^^^^ik ^ 

AAATTGCCAC 


•m 7k Ik ^mH ^»|H' 

AGAAAGTAGT 


ATTCGTGCCA 


15060 


CGCAAGAACG 


Ik m^ik % vk 7k Ik M 

ATCAAAAAAG 


GAAATTACAG 


GCAGACTCAA 


AGAAGTCGCG 


7k 7k 7k ^7k 7k ^^7k 7k 

AAACAACGAA 


15120 


TTGACCTTCA 


TV A A O 7k rf*^/^A 

AACAGATGCA 


TTAGTATCTA 


TGATTGGCGA 


m TV /^ffk/^TV A f'P 

TAGTCAGATT 


GAT TATCGCT 


1 Sit on 


TTTTTATTGG 


CTTTAAACTA 


A mrn^^Tk a ^iv 

ATTGCAACAG 


7k rng"^ 7k TV Tk f^f^tn 

ATGAAGAGGT 


/^iv A mrrtTV tv tv tv 

CAATTTAAAG 


TV r^rr* r^rr* r^Ty tv tv 

AGTCTCAAGA 


1 CO ii n 
13^4U 


AATCTTTCCT 


TTCTGGGCTT 


^>7k 7k ^7k 7k flllllfll^ 

CAAGAATTTG 


TCTATGGGGT 


^Tv IV ^/**A n*^ An* 

GAACCATCAT 


TTGATGGGTG 


13JUU 


ATTTTGTTTC 


r^mm A rp^^A AT 

CTTATCCAAT 


^A TV /~* TV IV 7k ffrnf^ 

GAAGAAATTC 


^^/^/**A RVTV Tl TV 

GCCGATATAC 


7» TV •IH'I'^TV TV 

CAAGCTTGAA 


TV A Aonr* A A nyf^r" 
AAACTAATGG 


IDJdU 


AAAlalAAAi 1 


o/^r^T a r" a a ^ a 
GGC i AuAAGA 


»P»P/-» AAA r*»nr» a 
1 ICAAAGiCA 


GACGGG X CAC 


a r*r*r» a T/^ a "P 
ACCGAG 1 GAX 


o»p/^ a r*r**p a »p'P 
CXGACGXAX X 


134ZU 






/^A A A A A/^rr^A 

(aAAAAAGGCA 


CACCGx. T xGA 


AGAATATGAA 


X X ICAGCIAC 


134 oU 


r*aaaf*aaaaa 


pp*pa a a a*pr*a 


a a a r'fir**T*cir' 
uAAACoC X o\3 


•pr" a a a rr^TT a 
X laAAACo X X A 


Tr*aTTTaTT a 
XGAx X XAX XA 


paTPPTaPTP 
CAXCCXAoXC 






TpaapaaaaT 




*p a r*rY* a TPP a 

X ACIjCA X V9oA 


apaTPaaaaT 

ACAX uAAAA X 


PmTnAATPPT' 
LpIuX laAAX V«>l3 X 


X 


ATGTAf^f^CTA 


TTTn AP A ATP 


A a T a pn A TT^^ 


X AooVsVaAo A 1 


P/^aPTTTPPT 


TPTTPAGAAP 


X ■J o ovi 


X X X X X xi^x xn 


PPAGPAAPAA 


PA ATTP APTT 


TTPPPATTna 
X X^l^L^Al i\jiA 


TflPTTPAATfi 
X Av^ X i l-f AM X o 


AATGTfinAGA 


15720 


TTGTflAPPAA 


TAAAAAAPPT 


PTTPPPAPTr: 


TTPPPAATA A 
X X^oV^AAXAA 


^ a A A A A AC A A 
VsAAAAAAVTArV 


PTGAAAGATT 

\^ X OnnnOrA X X 


15780 


TAGACAACCA 


TGCCTACCAA 


TCCGATAATG 


AGACAAATTC 


CAATGTCTTA 


GATGCTTTGG 


15840 


ATTCAGTAAA 


TGAGTTGGAA 


ACTACCTTGG 


ATCAATCCAA 


GGAGTCGATG 


TACAAATTAA 


15900 


GTTATGTAGT, 


TCGAGTGAGT 


GCGGAATCC6 


TGGATGAATT 


AAAACGCCGT 


TGTGATGAAG 


15960 


TGTTCGATTT 


TTATGATGAT 


ACTAATGTAA 


AACTAGTCCG 


ACCCTTTGGC 


GATATGATGG 


16020 


GGCTACATGA 


AGAATTCTTG 


CCCTCAAGCA 


AACGCTATAT 


GAACGATTAT 


ATCCAATATG 


16080 
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T 1 iv^ 11^1 


^ i. 1 1 1 1 AV3L.U 


UoAC 1 1 lib 1 1 


1 IGGoGCTAL. 


•n/^, TV A A A »p A 

1 UAAAAo\^ 1 A 


lauuuAAl lAU 


1 £1 Af\ 
J.DX4U 




d 1 llsVaClA^ 


AA 1 GT AGAT A 


CGGGGCGGAA 


IGl X lAUdb 


AAA^UuGUAl 


1 COMA 


TAGCT rUAuA 


11 TO a a 
Al9vti9l3 i L.AA(j 


m if»^ m « rp tn ]k 

laiaTTCAGTTA 


CCAATGCTTT 


GGL> iGC 1 GUVa 


TTCCTTGGTT 


lozou 


CACTCGGTGC5 


CGuAAAATCu 


iiiimn jk « « ^ik 

TTTAGTAACA 


ATCTCTTGGT 


CTACTATGCT 


GTTCTTTTTG 


16320 


GTGGCCAAGC 


TGTGATTGTC 


GATCCAAAAG 


GAGAACGAGG 


TGGTTGGAAA 


^ Ik Ik Ik Ik ^mm^#^ 

GAAAACTTGC 


16380 


CAGAAATTGC 


GGATGATuVTC 


MTATTCTCA 


ACTTGACTAG 


tn 'Vk ^ Ik Ik ^ M Ik 

TAAACCAGAA 


7UVTCAAGGCT 


16440 


TACTTGATCC 


TTATGTGATT 


ATGAAGAAAA 


AGAAAGATTC 


AGAAAGTTTA 


GCAATTGATA 


16500 


TCCTGACTTT 


TTTAACAGGA 


ATTTCCAGTC 


GGGATGGGGA 


GAAGTTTCCA 


GTATTGCGCA 


16560 


AAGCCATTCG 


GACGGTTACC 


CAAAGTGATC 


AACGAGGATT 


GCTGCAAGTC 


ATTAATGAGC 


16620 


TTCGAAAGGA 


GAACTCTCCA 


GTCGCGGAAA 


ATATCGCCGA 


TCATATCGAG 


TCCTTTACCG 


16680 


ATTATGATTT 


TGCCCATTTG 


CTTTTCTCAG 


ATGGTTCGAT 


TACCCAATCA 


ATTAGTTTAG 


16740 


ATAAGCAGTT 


GAATATCATT 


CAAGTAGCGG 


ATTTGGTTTT 


ACCAGATGCA 


GAAACCACTT 


16800 


TTGAAGAATA 


TACCACGATG 


GAGTTGCTAA 


GTGTCTCTAT 


GCTAATCGTC 


ATTTCCACGT 


16860 


TTGCCTTGGA 


TTTTATCCAT 


AGTAATCGAG 


AAATCTTCAA 


GATTGTGGAC 


CTGGATGAGG 


16920 


CTTGGAGTTT 


TCTTCAAGTG 


GCTCAAGGAA 


AGACCCTTTC 


AAATAAGCTC 


GTTCGTGCTG 


16980 


GTC6GGCCAT 


GAATGCCGGT 


GTTTACTTTG 


TCACGCAAAA 


TAGTGACGAT 


TTACTGGATG 


17040 


AACGCTTGAA 


GAATAATATC 


GGATTGAAAT 


TTGCTTTTCG 


TAGTACCGAT 


ATTCATGAAA 


17100 


TCAAGAAGAC 


TTTGGAATTT 


TTTGGGTTGG 


ATCAAGAAGA 


TGAAAGT7UVT 


CAAAAACGCC 


17160 


TACGAGATTT 


GGAAAACGGC 


CAATGCTTGA 


TGCAGGATTT 


GTATGGTCGG 


GTAGGCGTGG 


17220 


TTCAGGTACA 


TCCAGTCTTT 


GAAGAACTCT 


TCCATGCCTT 


TGATACCCGT 


CCACCTGTTC 


17280 


AAGAGAGAAT 


GGAGTGAGGG 


^ Ik Ik Ik « M 4k 0* 

GATGAAAAAG 


AAACTTTTTC 


GGATTGTGGG 


AATCGTTTTT 


17340 


ATTAGCATAC 


TCACTCTGAT 


AGTTCTATTG 


AGTTTAGTTG 


GTACGGTAGC 


CGAGGCGACA 


17400 


GGGCTAGTTG 


ATGATACAGT 


Ik ^ Ik w 

TGAATCAGGA 


AATCTATATT 




ACTGAACAAC 


17460 


n%n fit^ik Ik ^ Mill* ^ 

TATCAACTAG 


ATTTCTTTGT 


GGATAGCTCT 


TGGGATTGGT 


TGCCTTGGAA 


TTGGGGTGAT 


1 /DZO 


vaCiCT T AGGGA 


AAAGCGTGAT 


GTATGGTCTT 


TATGCCATTA 


C T AATTTCAT 


TTGGACGGTG 


1 /3aU 


Av> 111 OTATT 


TGT CGAACGC 


CACGGGT TAT 


GTGGTTCAAG 


Tk Ik r'^f^n^T^ i*Tk Ik 

AAGCC T AT AA 


Ik rnn>7k ^ik fprnm/^ 

ATTAGAT TT C 


1 /o4 u 


Al 1 ICobAlA 


CGGCTGAAAG 


TATTGGAAAG 


A TV fP TV 1*/^^ AAA 

AATATCCAAA 


CATTGGCTGA 


'PA'P'PA/^'pr^A A 

TAT TAG T G AA 




A>\1 VsoAU i 


AGACCTCCGG 


ATTTTATTTT 


GGGTT 1 CTAT 


rpTV 1*1*/^ A A fP 

TATTGATGAT 


TTTAGCTCTG 


Iff ou 


GGGGTGTATG 


TGGCGTATAC 


CGGATTATTA 


ATVACGAGAAA 


CCACGAAAGC 


AGTTCGAGCC 


17820 


GTGATTAATT 


TTGTGATGAT 


TTTTCTACTA 


TCGGGTTCGT 


TCATTGCTTA 


TGCGCCTACC 


17880 


TATATCACAA. 


AAATCAATGA 


CTTTAGCTCA 


GATGTTAGTG 


AAGCGGCCCT 


TACCCTTGGA 


17940 


ACCGAGATCG 


TTGTACCCAA 


TTCAGAAA6C 


CAAGGAAAAG 


ATAGTGTGGA 


TTTGATTCGG 


18000 


GATAGTTTGT 


TTTCGATTCA 


AGTCCAACAA 


CCTTGGTTAC 


TACTGCAATT 


TGATGATTCC 


18060 
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AAATTGGTGA 


AGACCGGGTC 


AACAAGATTC 


TATCTGTGAG 


TCCAGATGAA 


18120 






AGAAGCGGTC 


AAGGCAGAGA TTGAAGACAA 


TGACAATGCG 


18180 


n iVTr'TPHrSTlV 


TTAPPAAf^AP 


CATGAACCGT 


TTAGGGATCG 


TGGTCTTTTT 


GGTATTGTTC 


18240 


AAT A 1 1 bou A 




TGTTTTTCTT 


CTAACAGGGA 


TTATGTTGTT 


CTCGCAAATT 

^x^w04>nnx X 


18300 


/^nto*P fp/^7\ 'PI* TV 

CTCTTwVl lA 


XCX X XuCwiX 


GTTTCTGCCA ATTAGTTTCT 


TATTGTCCAT 


CTTGCCGACT 


18360 


TATG AlaAu 1 1 




GGCGATTATT 


CGTTTATTCA ATACGATTAT 


van X o^oi^ov^vs 


18420 


GGTC? 1 v#AUu 1 


1*flflTtZ A T T AP 
XuviXlaAX Xn\^ 


GACTGCATTC AGTATTTCTA CGATGTTTTT 


CAATATTTCG 


18480 


GCTACuTA 1 C 


L.v«X XCX X X AX 


GGTAGCTTTT 


CTACAAATTG 


TGACCTTTGC 


PKCTATPTAT 


18540 


T T CAAATTGvj 


GXGACAX XAX 


GAGTATGTTC 


AACCTTCAGA 


GTAATGACAG 


TPAGTPAATG 


18600 


GGCAGACGGG 


rCATGCGAAb 


ACCGCAAATG 


TTGATGAACC 


GTAAATTGCG 


nPAAPTPAAT 


18660 


CGaaaTGTGG 


GCCGAACTCT 


GGCTTTTGGC GGTGCTGCAG CAGTygGaaT 


PPPA AA ATCP 


±0 / 


ATTACtCTGA 


TTTATGGTAC 


AATTTTCCCA 


TTAATCCGCC 


GTATGTACCA 


npppa A AAPP 




CAACAGG T AT 


TTACAAACGG 


ACATTTGCTA 


TAGATGAAAC 


ATTTCACGAT 


a A A AAA ATP A 


±OOH \J 


TTTTACGTTT 


TTGTGGCGTA 


GATTCAGCTT 


ATCATGTTTG 


GGTGAACGGC 


PATP AAPTTfl 


1 flQnn 

xo yyjvM 


GCTACAGTAA 


AGGTGCACGA 


AATGAAGCAG 


AATTTGACAT 


TACGCCTTAC 


PPTAAAATT/Z 
lal^X AAAAl lu 




GTGAAACAAA 


TGATTTAACG 


GTGCGAGTCT 


ATCAATGGTC 


AGATGGTACG 


TATTTAf2Af2^ 




ATCAAGATAT 


GTGGTGGTTA 


AGTGGTATTT 


TTCGTGATGT 


TGAATTACTC 


nf2Tf2TAPPAf2 


1 Q08Q 


AAAATGGGCT 


GGAAGATTTC 


TTTATTATTT 


CCGATTTAGA 


TGATTCGTAT 


CAAAATGGGC 




AC*1 IGGwlAi 


i ACl^GvT X AAA 


TTTTGGCAAG 


ACAAAGGACA 


ACAAGTTCAA 


TTGGAGTTGA 


19200 


1 GGAX UAGwA 


nnf*ik a a a ar'p 
ialsiaAAAAAOl? 


GTTCTTAAAG AGACTGTCGC 


AGGAAATCAA 


GGTACGGTTG 


19260 


AAi 1 1 iUAGU 


Ansa XXX bCUia 


TCTGTCACTG 


CTTGGTCAGC 


TGAAAAACCA 


TACTTGTATC 


19320 


AAv^lAX rrAi 


UAu^J. o X X X X X 


TCTGAAGGGG AAGTCGTCGA AGTTATTCCG CAAAAAGTTG 


19380 


(all X wUm\AA 


TATTPaTPTA 
1 AX XUAXI9XA 


TCTGGTGAGA 


CTTTCTTAGT 


CAATGGTGTG 


GCCATTAAAC 


19440 




P A ATPPTP AT 
uAAX KAa X UAX 


GATTATAATC 


CGAAAAATGG 


TCGAGTGGTT 


AGTCGTGAAG 


19500 


AAft.i iUAAAA 


APATaTTPPT 
AoAXAX iUoi 


TTAATGAAAC 


AATTTAATAT 


TAATGCCATT 


CGAACCAGTC 


19560 




ATPTflPTTAT 
A X V« 1 0^ X X rl X 


TTCTATGATT 


TATGCGATGA 


ATATGGAATG 


TATGTGATTG 


19620 




XXX X v3 X 


CACGGCTTTG 


AATTAACCGG 


AGAATATGAT 


TGGATTAGCA 


19680 


APGATCCAGA 


T\ X \j\D\3nr\r\\^\3 


GCTTATGTTT 


CTCGGATGGT 


ACGGATGATT 


CAACGGGATA 


19740 


AAAATCATCC 


ATCAATTCTC 


TTTTGGTCGT 


TAGGGAATGA ATCTGCGTTT 


GGTCATAACT 


19800 


TTATTGAAAT 


GGCCCGTATT 


GCAAAAGAAA TGGATCCAAC ACGTTTGGTT CATTATGAAG 


19860 


GTGATTTTGA 


. GGCAGAGGTC 


ACGGATGTCT 


ATTCTACAAT 


GTATACGTGG 


TTGGAACACC 


19920 


CAACTAGAGA 


ATTACTGATG 


AATACGATTA TTGAAAATTC 


AAAAAAACCG 


CATATTTTAT 


19980 


GTGAATATTG 


TCATGCGATG 


GGTAATGGCC 


CTGGGAATTT 


AAAAGAATAC 


CAAGAGCTAT 


20040 



wo 98/50555 PCr/US98/08985 

331 



TTTATGCTCA 


TGATAAATTA 


CAAGGTGGCT 


TTATT TlabGA 


Ax liia 1 1 X V3 AC 




zuiuu 


AATCTGTAAC 


AGATAACGGT 


GAAGTTTATT 


ATCGTTATGG 




l^bCGATGATC 


Oft! <cn 


CAAGTAACAA 


AGATTTTTGT 


ATTGATGGGA 


TGTTGATGCC 


TGACCGTACG 


CCATCTCCTA 




GCTTGTATGA 


ATACAAAAAA 


GTCATTG7\AC 


^^^^ ^ mm ^ % Ik 

CCATTACTiAC 


TAGCGCTATT 


GACGTTCTGT 


202o0 


CAGGTGAATT 


CTCATTATTG 


AGTCGGTTTG 


ACTTTGAAAA 


CTTGGCTATC 


TTTAAGCTGG 


20340 


TTTATACAAT 


TACTGAAGAT 


CAAACAGTGA 


TTCAATCAGG 


GACAGTAGCA 


GTGCCAGCAA 


20400 


TTGCTGCTAG 


AGCAGAAGGA 


AGACTGCACT 


TACCTTATCA 


TTTAGATTTT 


CCTAAAAAGG 


20460 


CAGGAGCCGC 


CTATTATTTA 


ACTCTTTCAT 


ATCAATTGTU^ 


AGAAACGACT 


GCCTATGCAT 


20520 


CTGCTGGTCA 


TGAATTAGCT 


ACCGCACAGT 


TTGAATTGCC 


AATTGCAACA 


CCAGGGATTG 


20580 


AGATCACACC 


AGTTGGCTCG 


CTAATGGCAA 


AGGAAATTGG 


CCCTCATTTA 


TACATTGAAG 


20640 


GACCAAACTT 


TTCAATTAAT 


TTTGATAAAG 


TGAAGGGCGC 


CTTAACGAAT 


GTGACACGAG 


20700 


ATGGTAAAAA 


ATTATTGCAT 


AAAGGGCCGA 


AATTTACATT 


TTGGCGGGCA 


CCAATCAGTA 


20760 


ATGATATGGA 


AATTATTGAT 


GAAATGAAGA 


AGAAATATTT 


CTTACATTTA 


GAGCATGAAA 


20820 


TTGTCCGTTC 


TTTTGAGTGG 


AAAAAAGTTG 


ATGATTTTAT 


CCAAGTCATC 


GTAAAAACAA 


20880 


TTAACGGAAC 


GACTAACAGT 


GCTTGGCATT 


ATCAATGTAC 


CTATCAGTAT 


TTAATTGCAC 


20940 


CAAATGGCGA 


AATTTTCTTC 


GACTTAAAAG 


GTAGTCCTGC 


TGGTAAAATT 


GAA7UVTGCGC 


21000 


CAGATATGTT 


ACCACGTTTG 


GGTGTGACTC 


TCCATTTAGA 


CAAGTCTCTA 


TCAGAAGTAA 


21060 


AATATTTCGG 


TAAAGGGCCG 


AGAGAAAACT 


ATGTAGATTC 


ACAAGAAGCT 


GGTTTACTTG 


21120 


GCGTTTATGA 


TGCTACAGTG 


GCGGAAATGT 


TCACTTUVTTA 


TGTTGTTCCA 


CAAGCCAATG 


21180 


GTAACCACAT 


GGCGACTAAA 


TGGTCCGCAT 


TCACGGATGA 


TCGCGGCCAA 


GGGGTAGTAG 


21240 


CAACAGCGGC 


TGATTCATAT 


AATTTTAGTG 


TTTCTTATTT 


TGAAGAACAA 


GCGCTAGATG 


21300 


TTGCTAAGCA 


TACCAATGAA 


TTACAAGAAA 


GTGAGTATGT 


TGTTTTAAAT 


ATCGATTACA 


21360 


AACAAAACGC 


CCTAGGTAGC 


TACTCTTGCG 


GCCAGTGGCA 


ATTAGAAAAA 


TACCG7VACTA 


21420 


CGTTTGAAGA 


ATTCCAATTG 


GCATTCCGCT 


TAACACCTTT 


TAATAATAAA 


GAGATTCAAG 


21480 


CAGCGGATGT 


CGCACATGAA 


CGAGTAAAAC 


GACCAACAAT 


TTCTTAAATA 


AAAGCT CAG A 




ACAGTGCACT 


TCGGTCTCCT 


GTTCTGAGTT 


TTTTTAATGA 


AATTATCAGA 


AATTATTGCl 




TTTTTTTCAT 


GGAAATAAGC 


AAACTATTTT 


GTTATCTAAT 


AAAAATAGTT 


CAT AA i Ao 1 A 




ACATAGCGAA 


TTATTCGCTA 




mm ft f^f^y^ /*»/** iv m 

TTAGGAGGAA 


TTATTTTATG 


CAlAUi 1 1 Al 


Z ± / ZU 


GGGTTTTAAT 


CGTTGGTGCT 


GTTATCGGTG 


CTAtCgcTGG 


CGCGCTTACA 


AGCAAAGGTC 


21780 


AATCAATGGG 


TTGGATTGCG 


AATATCGTTG 


CCGGTTTAGT 


TGGTTCATTT 


ATCGGAGAAA 


21840 


AAATCTTAGG 


. TAGCTGGGGT 


CCTCACGTAG 


CAGGTATGGC 


AATTGTGCCA 


TCAATTATTG 


21900 


GGGCAGTAAT 


TTTAGTTGCT 


6TGGTTTCAT 


TCTTCTTCGG 


ACGAAAAAAT 


AGCTAATTAA 


21960 


GT7UVATAATT 


GTTTAATTAG 


TTAAAGGAGG 


TCGGGACAGA 


ACCGTTTAGC 


TCCGAGAAAT. 


22020 
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AAGAAGAAAT 


TTCCGAAAAT 


TGCTTTTTAA 


TTTTTGGATA ATTTCGGCTT 


ATTTCCGAAG 




GAGTTGGTTC 


TGTTCCCGCC 


GTTTATCAGT 


TTTTGAGCGT 


GGAGCAAAAA 


TCCAAAGTGA 


22140 


TTTTTGTCCC 


ACGCTCCTTT 


TTTTATTTTC 


TCGTTAATTT 


TACGACAACT 


TCACAACGAG 


22200 


CGGTTTGTGG 


AAACATATCA 


ACAGATTGTA 


AATAGTCTAC 


TTGATATACT 


TTGGCTAATT 


22260 


GAACTAAATC 


ACGTGCTAAC 


GTGGATACAT 


TACAAGAAAT 


ATACACCATT 


TTCTTAGGTG 


22320 


GTTGTTTCAA 


AAGGGCCGTT 


AATAATTTAC 


GATCCAAACC 


AGTCCGTGGG 


GGATCAACTA 


22380 


CAATGGCATC 


AGGTTTAAAG 


CCTTCTTTGA 


ACCATTTAGG 


TAATAAGTCT 


TCAGCTGTTC 


22440 


CAACAGCATA 


ATGTGTGTTC 


GTAACACCAA 


\J X\^\3 X X X X\J\^ 


ATTGAATCTA 


GCATCGTCAA 


22500 


TTGCGGCGGG 


AATGGTATCC 


ATACCACGAA 


\^ X X Ur> X V-IVrVJV^ 


X X O X X X\^\^\^ X 


AAACTTAAGC 


22560 


CAATGGTCCC 


GACGCCACAA 


TAAGCATCAA 


v.nrir\V.>rlvj X X X Vi^ 


O X X X X X w X 


AAATCTAAGG 


22620 


CCTTGATTCC 


TTCATTATAT 


AAAACTTCCG 


X X XV3X XV>«X OV3 


ATTfAATTGG 
fx X X \^f\r\ X X o V9 


AAAAATGCTC 


22680 


GCGGCGATAA 


ATCAAAGACA 


ACGTCGTTAA 


X\3 XS3 X X ^ X X^ 


AATGCTTTCT 
nn, X X X x\^ X 


TTGCCCCATA 


22740 


AGTGAAGGGT 


ATCATCACCC 


ATTACTAGAG 


AGGTTTTTTT 


ATTTTGGACA 


TTTTGCATAA 


22800 


TAGAAACAAC 


TTCTGGTAAT 


TGTTCATTAA 


TTGCGCGAAC 


CATTTTTTCT 


TTTTGTGGAA 


22860 


ATTTCTGCGA 


TTGTGWATG 


AACACTACTT 


GGACTTCGCC 


AGTTTGGATG 


CCCACTCGAA 


22920 


CCATTAATGT 


TCGAAAGATA 


CCACTATTTT 


TACGTTCATC 


ATAAATTGGT 


AATTGAAAGT 


22980 


CATTCAATAA 


ATCAACTAAT 


GTATTCATCA 


CCTTGGTTGT 


CGCAGGCTGT 


TGGACCAAAC 


23040 


AGTTATCAAT 


CGGTACTAAT 


TGATGTGAAT 


CAGCTTGATA GAGACCAGCT 


TCTACTTGGC 


23100 


CATCGATTTG 


GCGTAATTGG 


AATTGGGCTT 


TATTTCGATA ATTCCAAGGA 


TTATCCATCC 


23160 


CAATTGTTTT 


TCGTAGTTCA 


TAGTTTTGAT 


AATTGGCTGG 


TTTAAATTTG 


TTAAGCGCCT 


23220 


GCTTTAATAA 


ATCTTTTTTA 


AAGTCTAATT 


GTGCTTTGTA GGCTAAATGT 


TGTAGTTGGC 


23280 


AGCCGCCACA 


TGTTTCATAG 


ACAGGACAAG 


GAGGGACCAC 


GCGATCTTTA 


GCAGCTTTTT 


23340 


TGACAGATTG 


AAGGGTTCCT 


TCAGCAAATT 


TTGGTGTAAC 


ATTTGTGATC 


GTGGCAGTCA 


23400 


CTTCTTCTTT 


CGGTAGCGCC 


CCAGGAACGA 


AAATAATTAA 


TTTTTTATAA 


TAGCCGATAC 


23460 


CTTCGCCATT 


AATTCCTAAT 


CGTTTAATTT 


TTAAAGAAAT 


TTGTTGGCCT 


TTTTTTACTA 


23520 


GTTGTTGGGT 


CATATTCTTT 


CATCCTTTAG 


TCATCTCACA 


TTTTTCACTT 


CCTCCCATTT 


O O C O A 

23boU 


TAGCACATTT 


TTATTTATTT 


GGCTTCCAAG 


GATTCTATGG 


TATAGTAGAA 


TATATGTTCT 


/ Jb4 U 


GATTTAGAGG 


GGCAGGAGAA 


TAAGATGACA ACAATCACAC 


GAATTAGTAA 


AGATAAAGGT 


O A A 
2 J /OO 


GAATTTTATC 


TACTTTGGCT 


ATCTTCAGGC 


GAAAAACTGC 


GTGTCTCGGA 


AGATATTTTA 


23760 


GTACGGCAGC 


GTCTGCTAAA 


AGGCCAAGAG 


TTATCAGATA 


CTTTAATAGA 


AGAAATAAAA 


23820 


AAAGCCAGTT 


• CTTATGATGT 


CGGGTTACAA ATGGCGATGA ATTATTTAAG 


CTATCAATTA 


23880 


CGTTCAAAGA 


AAGAGATTTT 


TACTTATTTA AAAGAGAAAG AAATTGTACC 


TGAAGATCGA 


23940 


GTAAAAATCG 


TTCAACGTTT 


AGAAGAGCTG 


CGGTTGTTGG 


ATGATGCAAT 


TTTTAGTGAA 


24000 
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AGTTACGTAC 

TTAAAGCAAA 

GATGAACAAC 

AAAAGTTTTa 

AATGAAATCA 

CAACAAGCGT 

TCTAAAAAAG 

ATTCAGCAAT 

GTGGAGTCCC 

AAAACGCAAT 

TATGCTACAA 

GTTTCCGACG 

AGGGTTAGGT 

AGAATTTGGT 

CCCCTATACG 

CGGTAACGTC 

GAGTAGTCGT 

GGATTTTAAT 

GTGTGAAAGT 

TTATCCAGTC 

TGAAAATAAA 

TATGTGGCTG 

GGCGCAACCA 

TTTGGTCGCT 

ACGTACCTTA 

TTATGGCCGA 

ACAATTTTCA 

AGAACACGAA 

GTTTCCTCTA 

TAAAATAGAC 

GAGAGAGGAA. 

CATTTTGGTT 

GAAAACAATT 



GGACAGCTAT 
AAGGTATTAG 
TTAATGTTGC 
AAGATGCACT 
TTGATCTAGC 
TAGACAAGGA 
TCCAAA/JVGT 
TTATTAGCGA 
GCAAAAGTTA 
TTACCTTGGC 
CAAACTCGCG 
ATTCAAGATT 
TaCTATTCAC 
GGGAAAATGC 
GCTGGTGCGA 
ATGCGGGTAG 
AAAGTATTT6 
CAAGCTTTAA 
TGTCCGCTTC 
AAATCAAAAA 
AAGCAAGAAT 
TTTCCGATTG 
GCAGAAACGG 
GAAGAACCTG 
GGCGAAGTCG 
AATACAGGCG 
GACTATGTTT 
AAAAAATAAC 
GGTTATACCC 
TTGTCTCTTG 
GTACAGCCAA 
TTTGTCACAG 
GATAAAGGCT 



GCGAACCAGT 
TGAAGAAGAT 
CACAGCAaCT 
TCaGAAAATG 
TTTGGAATCT 
AGGCGAACGT 
CAAACAAAGC 
AAAGGAAGTA 
GTTCATTCCA 
GAGCGAATAC 
TAGATACAGT 
TAGCGGAAGC 
GAGCGCGTAA 
CTGACACAAT 
TTGGCAGTAT 
TGAGCCGTTT 
AAGCGGC6AT 
TGGATCTAGG 
AGCAATATTG 
AAGTTAAGCC 
TTTTATTAGA 
AAGAAATTAG 
AAAAACAATT 
TCAACTTTTT 
TTCATATTTT 
AATTAGCCAC 
TTCCAT^CC 
AAAAAAGAGA 
CTATATAAAT 
TCTGCTTAGA 
TGAGTAGGAA 
TCGGCTCAGC 
ATGTTCCCAT 
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GATAAAGGAC 
ATTCAGCATG 
GCAGAAAAAG 
CGTCTGCATC 
TTAGCTTTTG 
TTATGGCGTG 
TTATTCCAAA 
GAACATGACG 
GGAAGATTTT 
AGATGCATAT 
CATTGATTAT 
GCCAGATGAT 
TTTAAAAGTG 
CG7UVGATATT 
TGCCTTTAAT 
ATTTGAAATT 
GCTTAA7ATC 
CTCAGCCGTC 
TGCGGCCTAT 
6AAAGATGTG 
ACAACGCCCT 
CAAAAAACAA 
AACTTTAGAG 
TACGGACTAT 
CAGTCATTTA 
ACTAGAGTCG 
ACAGCAAAAA 
GGCTTTTTAA 
TCTGCTATGT 
GGAGTAGTAA 
GCGAAAAATC 
ATACTTTGCT 
AAAAAATGAT 



CACGAAATGT 
GTTTGACCTT 
CCATGAAACG 
TGATGCAAAA 
AAAAGGACGA 
CCAATCAGCG 
AAGGGTTTGA 
AATAAAGAGC 
TTAGCTTGGT 
CGTATTTGGA 
TTTTATCGAT 
AAGTTGTTGA 
GCAGCGCAAC 
CGGAGTTTAA 
CTCCGCGAAC 
GATGCAGATA 
ATCGATCGTG 
TGCACCCCAA 
CAGGCAGATA 
TATTATGTCG 
GAAACCGGTT 
TTTCAACAAT 
TTAGAGCCTG 
GAAACAGTTG 
AAATGGCATA 
CAGCGATGGG 
ATGGTGGAAT 
CAAACAAGCC 
TTAGTTTTGA 
TGATGGTGGA 
AGCTTAATTA 
GTAGCGGGTA 
TATAATGAAG 



GGCACAACAA 
TTATACGTTA 
CTATCGAACT 
GGGCTTTACA 
AGAACAAGAA 
ATTTGATTTT 
CTATGATTTG 
CTTGGGAGGC 
ATGAACGAGA 
TTTCTGAAAT 
TTATGGAATG 
AAGCTTGGGA 
AGATTGTTTC 
AAGGAATCGG 
CAGCAATT6A 
TTGCAAAAGC 
AGCGTCCAGG 
CTTCCCCTAA 
AAATGACCGC 
GCACTATTAT 
TGTTAGCCAA 
TGCAAAAACT 
TAACAGAGCC 
TCTGGCAAAA 
TCTTAGTCTT 
TGGCAGCACA 
TGTTCAAAAA 
CCTCTTTTTT 
AAGGGACTGG 
GACTTTTTAA 
GTTTAGTCAT 
GCTATTTAAA 
CGCAAAATAA 



24060 
24120 
24180 
24240 
24300 
24360 
24420 
24480 
24540 
24600 
24660 
24720 
24780 
24840 
24900 
24960 
25020 
25080 
25140 
25200 
25260 
25320 
25380 
25440 
25500 
25560 
25620 
25680 
25740 
25800 
25860 
25920 
25980 
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1 i i iwi 






ATTHAAPnnA 


AATTAnnPAP 


«. u 


AA( - i Ai3\3AU i 


uA risC I A i I3A 








AAATAAPPTA 


«^ ox w 


TTTAAGTTI G 


CCACGGGA 1 A 


V9I I 1 luX iUA 


AAi iUAiVaUV? 


aaaaaTTa^^f 

AAAAAl lAwt.^ 


a arr'fa'prT'a 
AAbbbA 1 bUA 


OCT cn 


GCGAATTGAA 


GCCGCCTATA 


C;CTACGAT6i3 


AL^CAAGAVaU T 


1 U IGT i, AAvJA 


A fPfPO AO A A 

GAbTTGAGAA 


0 coon 


ATTATTGAAT 


ATTCCAATCA 


ATCATTACb 1 


TGTGTTTAAC 


TTTTTATCTT 


TTATTAAGT T 




AATTGATGCG 


GTTGGCGGCA 


TAGATGTCAA 


TGTCAAGCAb 


GObT r IbATb 


GTGTCACCAA 




AGACGGGCCA 


GGATCCATTC 


ATTTTGATGC 


AGGGAAACAG 


CATTTAGATb 


OdtAOOA A AOO 

GTACGAAAGC 


0 £ il A A 


TTTATCTTAT 


GCCCGTGAAA 


GACATAGCGA 


fn Ik Ik Ik Ik mffi 


ATGCGTGGAT 


TIOOOnOA AO A 

TCCGACAACA 


CA 


AGAAATTATT 


CAAGCAGTTG 


AAGACAAGTT 


GAAATCTGGT 


CAATCAATCA 


n*o A A A An* A A ft 

TGAAAATAAT 


OCCOA 

zbOZU 


GGACATTATT 


GATTCGTTAA 


ATGGAAACAT 


TCAAACTGAT 


GTGGATTCCA 


ATGAATTGAC 


26580 


TCATTTAGTC 


AAAGAAGGTT 


TGACTTGGAC 


CAATTATGAT 


AAACAACAGC 


TTTCTTTTGA 


26640 


CTGGCGCACT 


TTTAGTAATG 


AAGGGCGCAG 


TATGGTTGAA 


CTATACCCAG 


ATAGTATTGA 


26700 


AAATGTCCGT 


CATCAATTAC 


GTGTGTCTTT 


AAATTTAGAA 


7VAGCCAGATG 


AACGAGATCA 


26760 


AGACGGCTAT 


GTCTTCCATA 


CGAACGGTGA 


ATTTTTATAT 


CAAAGTGATT 


ATACCGTTCA 


26820 


AGATGAAGCA 


GCTGAGGAAA 


ACGAAATGAC 


TTCCATCAAC 


GGCAATACGT 


ATATTGGTGT 


26880 


TCCTGGTAAT 


ACACAGACCG 


GCCCGTTGCC 


ATCAGTTAAA 


ACGG7VAAATG 


GCTTTATAAA 


26940 


ATAATTTATT 


TTCCAAGTGT 


AGAAAATCCG 


AACATAACAG 


ATTAAATAAT 


CCGCTTACTT 


27000 


ATTATTCTTC 


TAAGGAAAAA 


ACGAAGGATT 


TTAGCAAGGA 


AACAGCGTAT 


CATTTGATAT 


0"1 A ^A 

27060 


AAACCCAACA 


TTCCCTTTAT 


AATGAAAGTA 


ACAAGCAAAT 


TTTTGTTACT 


m 7k 0 7k Ik 0000 7k 

TAGAAGGGGA 


OT1 OA 


AGGGAGCAAG 


GAGTTATGTT 


TAGTTATACA 


GATACCATTT 


TGTCAATTAT 


OOAA AOAA T**!* 

GCAAAGAATT 


001 Qn 
Z /loU 


GAAGTATACA 


ACGAAATTTT 


Ik «t ^^^^ 

CAAAGCCATC 


m^mTk 7k 7k ^7k 7k f> 

TCTAAAGAAG 


mm/^7k 7k ^ 7k 71 m 7k 

TTCAAGAATA 


00 OfP^POfP 7k 7k rp 

CGGTTGTAAT 


Z /Z4U 


CAGGCTGCAC 


ATCGGAGAGG 


CCGTGACACT 


TATATGTTTT 


GCCGAAATAA 


TbTbAATUbl 


Z / JUU 


TTTTTTACAG 


AAGAAATGAT 


GTTTCGTAAA 


AATTTAGAGC 


r^r^rn 7k mfp 7k m/^ |k 


Ta aaoaao/^a 
jlAAAbAAbCA 


Z / JDU 


ALCaAAAATCT 


TGCTAGAAGG 




TATAAAUAAG 


bAbi Ai X 1 1 i 


r'Trr'TT ar* ar* 
b 1 bb 1 1 AbAb 




GCATTAAGTG 


AATCTTGCCA 


AGTGACAGAT 


GAATT b AAAT 


AlCAAi TAbb 


aaTrTTTT^aa 
AAi bbbbbAA 


Z 


AAA^3AA 1 1 


L-l lAO lAl A 


AATTAATCAA 


f fTTfT a a a r* 
(jL. lib i AAAb 


AAbV^V^ i b 1 bA 


fz/mp a T a a 

bbbbAi Ikrfiin 




AGCGCTCATA 


GTGTTCATAA 


7« rp/^m Tk 7k Ik 7k n 

GATGTAAAAA 


^m#p 7k /^/^ *^ a fp 7k 

GTTAGbLiAlA 


bblblbAAlb 


ibbbAbi iAi 




CCCTAACTTT 


TTTATTTCAA 


TATCATCTGA 


TAAGUA i AAL. 


bl IbbbbAIb 


b 1 i AAbAi b 1 


97fifi0 


AAAATCACGT 


CACCTTGATA 


AACAAA6CCA 


GCTTTTTTAA 


TCAGGTGCTG 


CATAGCGAGA 


27720 


TTTTGTGGAT 


6TGTATCGAT 


TCGAATATCA 


GTAGCACCTT 


GCAAACGAGC 


GGCTGTAACC 


27780 


AGTGAGCGCA. AAAGCACCAA 


AGCCAGTCCT 


TGACCTTGAA 


AGGAACTTTT 


TAGAGCAACA 


27840 


CAATGAATTG 


CCGTATACGT 


AGTTTGAGAA 


TGAGGTTGCC 


AAGAACCATT 


AAAAATCGCT 


27900 


TCATAAGCAG 


GATCAATCTC 


CGAACATAAA 


ACACCAACGC 


CTGCCAAGGC 


ATCCTCAACC 


27960 
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ATTAAAATAT 


AGCAAGTTTG 


CTGGTCGATA 


TCTTTTGCTA AAATTTCTTG 


ATTCGGGCUA 




TCGCCGTTTT 


GCCATTGGGG 


rATGCCACTT 


TTTTGT7\AGG 


TGCGGCGTCC 


ATCTTCAATG 


28080 


ATAGCAAGGA 


TGTCAGGCAA 


ATCAGAAGGT 


TCTGCTTTTC 


TTAAGTAAAC 


AGTCAAAAAA 


28140 


TATTCCCTCC 


TCTTAGAAAC 


CTGTTGTTGT 


TTTCAATTGG 


TAAATTTTTG 


AAGGGCGGCC 


28200 


AATGCCAACG 


GGACGTTCAC 


CTAACTCTTC 


AAAATATTGT 


GTCATGGCTT 


TTTTAAAATT 


28260 


TGAATGGTCA 


ATGCTGCGAT 


AATCAACGCC 


GAGGAACTTC 


GCAAAAACTT 


TACGGGCTTC 


28320 


CGTAATGGTG 


AAATCTTTTC 


CTAAGACTTG 


AAGCACTTGC 


GGTTCATGTT 


CCATTTTATC 


28380 


AACAACACGA 


TTAAACGCCT 


TAATTATAAT 


TTCACTATGG 


TCGAAGGCTA 


ATGTATCTTT 


28440 


ACCAAGAGAG 


GCCGCTGTTT 


TCAAGTCTAA 


AGTAATTTCC 


ACATCTTCAT 


GAGATAACGT 


28500 


GATGTGTTGA 


CCATGCCGTT 


CAAGATTGAA 


CCAATGGACT 


TCTTTGGCAT 


CGTCGCCAGC 


28560 


AATT/IAAGGT 


TCTTCACCAA 


TAAAGGCTAA 


ATAGCTGACT 


GTCACGACCC 


AGCCACGAGG 


28620 


ATCTCGATCT 


GGGCGACTAA 


AGCTGTGTAA 


TTGTTCAATA 


TTTTCTTGAG 


AAATCACCAC 


28680 


ACCAGTTTCC 


TCTTTTGTTT 


CTCTTAAAAC 


ACTATCTTCA 


GTAGATTCGT 


TGCGATTAAC 


28740 


AAAACCGCCA 


GGAAGCGCCC 


ATGAGTTTCT 


A7UVGGGGTGA 


CCTTTACGTT 


GAATCAGTAA 


28800 


GACTTTTAAT 


TGATCAGCTT 


CCTTGTTGTA 


ACAAAGAAGA ACCATATCAA 


CGGTTAAAGA 


28860 


GGGtTTTTCA 


TATTCAGGTA 


ATTCTTGTTG 


ATGATACCAA 


GTTAAAAACT 


CTGCTAAGCT 


28920 


AGCTTGGCGT 


TCATAATAAT 


TTTTTTCTTC 


GGCTTTAGAA 


GCAAATTGTG 


GCACAATTGC 


28980 


TCCTCCTTTC 


AAAAAGAAAC 


ATGGTTTTCT 


TTTATTGTAT 


CATTAAGTGG 


CTGAAATACA 


29040 


ATATTCAAGC 


GTAAAAAGAA 


AAGCAAGCCT 


GGGACAAAAA 


TCACTTTGGA 


TTTTTGTCCC 


29100 


AGGCTTGTAC 


TATAAGAAGT 


TTGTTCTAGA 


CGTTATTTAT 


TGTGTTAATT 


CTAATAATTT 


29160 


ATTTTTCAAA 


TCTTCTTCCA 


TATGGCCAAG 


TTCGATTTCG 


GCAGCTTGGC 


GT TTTTCTTT 


29220 


TCCTTCTTGT 


TGAATACGTA 


ATGTTTCTTG 


GATAGTTTCA ATTAAATCAT 


TTTGTGTTTT 


29280 


TTGtAATGTT 


TCAATATCGA 


CTAGGCCACG 


TTCGTTTTCT 


TTTGCTGTTT 


CAATTGCAGA 


29340 


AATTTTTAGC 


ATTTCTGAAT 


TTTTCTTCAG 


TAAGTCATTG 


GTGGTTTCAG 


AGACTTGACG 


29400 


TTGAGCAGTC 


ACAGCATCTT 


TTTGGCGGAG 


TAAGGTTAAG 


GCGATGACTA 


CTTGGTTTTT 


bU 


CCATAACGGA 


ATCGCCGTAT 


TCACCGATGA 


TTGAATTTTC 


TCAGCCAAAG 


CTTGGTTAGT 


o c o n 
29520 


ATTTTGGATT 


AAACGAATTT 


GTGGGGCTTG 


TTGAATCGTA 


ATTTGACGAG 


CCAAGCGTAA 




ATCATGGGTT 


CTTTTTTCTA 


AACGGTCTAG 


GAATTGGTTA 


TAATCATTAA 


CAATTTGGAC 




ATCCATTTGA 


TCTCCAGATT 


CTTCCGCTTT 


TTTCATGGCT 


TCAGGGATAA 


TTTGCGTCCG 


29700 


TAGTTCTTCT 


GCTTTTAATT 


CCCCAGCGGC 


AATATAGATG 


TTTAAGGCAT 


CAAAATAATC 


29760 


TTTATTTTTT 


. TGATACAATT 


GATCTAACAT 


GGCATTATCT 


TTTAAAAGAC 


CATCTTTTTC 


29820 


ATGGGTTAAT 


TTGACAGCAA 


TTTTGTCAAT 


TTGCGCACCA ATTTTTTGAT 


ATTTGGCGGT 


29880 


AATTTCGTAA 


ACGGATTGTT 


TGACTTTACC 


TAACATTTTT 


TGAATAAAGT 


TGCCTTCGCC 


29940 
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AGCACGTAAT 


TCATTGGGAT 


TGGCTTCATT 


TAAACGAAAC 


ATTA6ATCGG 


TCAGTGAATC 


30000 


ACCAACAG6A 


CCAATGTCTT 


GTGCTTGAAC 


GTGGTTTAAC 


ATCGCTTGCG 


AAAATTCGCC 


30060 


GAGTTTGATT 


TGGGCATTTG 


AACCATAGGT 


AATTACTGCT 


TGTGAATCTT 


GGACATCAAT 


30120 


TTTTGAAGCT 


AATTCTTTTG 


CTTGGGCTTG 


ACGTTCTTGT 


GGTAATTTAT 


CAATTAAACG 


30180 


AGGAGCTGCT 


TGCTGTTCTT 


TTAACTCAGA 


TAGTTCATTT 


TGTTGC6TTG 


GTGTTAGTTC 


30240 


ATCCATTGAT 


GAAAATGGAT 


TACTTAATAA 


ATCGTCCAAT 


GTATTGGTTA 


CAGATGTTGT 


30300 


TTCTTTGGGT 


TGTTGCTCAG 


TCATTGTTTT 


TCCTCCCTAA 


TGATTATTGC 


TGGTCTTCGT 


30360 


TTACTTGAGG 


TGATTCCTCA 


GTTGCAGCTT 


TTTGCGACAA 


GCTGCTTTTA 


GCGATCGACA 


30420 


TTTCGACATC 


T7\AATCGTCT 


AAGTCATCGG 


AAACGATTTC 


CTCATAATCA 


TTTTTAACTA 


30480 


ATTTTGACAA 


TTGGTCAATG 


ATTTGTGCGC 


TTTCTTCTAA 


TTTTTCATAC 


GTTTGTTTGT 


30540 


TTTTTACTTC 


GTGTTGTTCG 


ATTTCTAAAT 


GTTTACTTGT 


TAAGTCAACG 


ATATTTGGTA 


30600 


AATGTGTATA 


GAGAAAGTGA 


TTGGCTAAGT 


GTAACTTTTT 


AGGTTCTTTC 


ACTAACTCTT 


30660 


TAAACAGAGC 


TTTAGAAACC 


TTCGTAGTAT 


CATTACGTAA 


GTCAATCGCC 


CGTAATTTAG 


30720 


TTGAACGATT 


CATGTTTTCT 


TGCAATTGTA 


TGATTTGTTT 


TTTGGCTGTG 


CTCATTGTAC 


30780 


TTCTGAAAAA 


ATCAATTTCT 


TGTGGAGATA 


ACCCCAACTC 


ACGATAATGG 


GCTTCTTTAT 


30840 


CTTTCGTTAA 


GTAAGGCATT 


TGATCGCTTT 


TGTCGGCGGC 


ATTTTTCTTT 


TTTCGATAGA 


30900 


GAACGAATAC 


TAAAAAGCCT 


ACCAGAATCA 


CAAAAGCGAA 


GAAGCCATAT 


TCTCTAGCGA 


30960 


CAGcAATGaT 


TAAGGCTCCT 


AAAAGAATAT 


AGCCAATCAG 


TAATTTTGTA 


TTTAATTTGT 


31020 


TCATTTATAT 


AGTTCCTCCT 


ATCTATCTAA 


TCCTCGAAAG 


AGTCAGCTCT 


CGTTACAATA 


31080 


GTTATTTTAG 


CATACTCTGT 


GAGAG7VAAAG 


ACATACAGCG 


GAAGATGTAT 


TTTATGATTT 


31140 


AGTGAAATCC 


TAAGAAAAAA 


AGTATAAAAC 


GCAAAAAAAG 


ACCCGTCCTA 


ATTGGGGGTA 


31200 


GGACGGGAAA 


GGAGTTAAAA 


ATGAAAAAGT 


GTTTTGTTAG 


GGTTGTTTGT 


TGGTATGCTT 


31260 


AAATAATACA 


AGGAAGTTGT 


GAATTTTTTA 


TGTCTATTTT 


TTTTCGAAAA 


TGTAATTTTT 


31320 


TTCTAAAGCA 


ATAAATTTTT 


TTCTTTGaAA 


AGGGTTTAAC 


TAGAGAAGAA 


GGGGTGCGAT 


31380 


TTTTATCATT 


TTAGGGTATG 


ATAAACAAGA 


AAGGTTGGTG 


AAAGtATGTT 


AAATTATGAG 


31440 


GACTTCGAAG 


rAAAAACACT 


TCmACgTCGA 


GrAATTTTTA 


AAGGrAAGrT 


TATTGATGTT 


31500 


TTTTTGGATG 


ACGTGGCTTT 


ACCGACAgCG 


GCACAGCTAA 


ACGCGAATTG 


GTTTTTCATT 


31560 


CAGGCGCAGT 


AGCTATGATT 


CCGTTAACCG 


CAGAAGGAAA 


AATCGTCTTA 


GTTAAACAAT 


31620 


TTAGAAAACC 


CTTGGAACAA 


GTAATTTTAG 


AAATTCCGGC 


GGGAAAAATT 


GATCCTGGAG 


31680 


AAGAAAATCA 


ACTGGAAACG 


ACGGCTATGC 


GGGAATTAGA 


AGAAGAAACT 


GGGTATCGTG 


31740 


CAGGTCAGCT. 


TACCTACATA 


AATTCAATGT 


ATCTGTCACC 


AGGATTTGCT 


AATGAAAAAT 


31800 


TGGCTCTCTA 


TTTGGCAACG 


GATTTGCAGA 


AAGTCGAAAA 


TCCGCGTCCT 


CAAGATGAAG 


31860 


ATGAAATCTT 


GGAACTTTAC 


GAGCTAACAA 


TAGCTGAAGC 


CAAAGCAGAG 


GTGGCAAAAG 


31920 
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GCACGATTTG 


TGATGCCAAA 


ACACTTTTTG 


CTATACAATA 


CTGGGAATTG 


TATTTACTGC 


31980 


AGAGACAGTT 


TAAGGAGGAC 


ACGCAATGAG 


CAAAGGTCCG 


TTAGTCACTC 


GGACAGAGCT 


32040 


TCGCAAACGC 


AGAGAAGCAG 


AAGAAAAAGA 


AGCGGAACGT 


CGTCAGCAAG 


AAGAGCAGAA 


32100 


GCTGGCGGAA 


AAAGCGTATA 


AGCGAAAAGA 


AAAAGAAATT 


TCGACGTTTT 


ATCGTAAAGA 


32160 


AAAG7KAAAAA 


CaAAAACCGA 


TCAACAAGTC 


ACGAGTAGGA 


GAATACTCGA AGCGTCGAGA 


32220 


ACGGAGTACT 


TGGTTAAACA 


AGGCAATTAT 


TATTGTAGCG 


ATTTTATTAG 


CCGTTGTGGC 


32280 


ATATATCGTT 


TTGAATTTAT 


AGAAAAGAGG 


AATCACTATG 


AAAATTGGAA 


TTATTGGAGC 


32340 


AATGGATCAA 


GAGGTCAAAA 


TTCTAAAAGA 


AAAATTGACA 


GACACGATGT 


CATGGGAACG 


32400 


AGCAGGCGCT 


TTATTTGTTT 


CTGGTTCGTT 


AGGAAGACAT 


GAGGTGATTG 


TGGTTCGTTC 


324 60 


AGGAATTGGT 


AAAGTGGCCT 


CAGCTGTGAC 


CACGACTTTA 


TTGATTCATC 


AATATGGCGT 


32520 


AAATATGGTG 


ATTAATACAG 


GTTCTGCTGG 


TGGTATCGGT 


GAAGGACTAA 


AAGTTGGCGA 


32580 


TTTAGTCATT 


GCAGACAAAT 


TAGCTTATTT 


TGATGTTGAT 


GTCACTGGTT 


TTGGTTATGC 


32640 


CTATGGACAA 


TTGCCTGGTG 


GTGTGCCTTT 


GTATTTTGAA ACAAGTGAAT 


ACTTGCGGAT 


32700 


GGAAATTGCG 


AAAGCCGCTG 


7UVAAAACTGG 


TTTAGCTATT 


CGTAAAGGAT 


TGATTGTTAC 


32760 



TGGAGATA 32768 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6930 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



AAGTAGAACG 


TAAATGGTAC 


GTAGTAGATG 


CAACAGATGT 


TCCTTTAGGA 


CGTCTATCAG 


60 


CAGTTGTTGC 


TTCTGTACTA 


CGTGGTAAAA 


ATAAACCAAC 


ATTCACACCA 


CATGTGGATA 


120 


CTGGAGATTT 


CGTAATCGTA 


ATTAACGCTG 


ATAAAGTAAA 


ATTAACAGGT 


AAAAAAGCTA 


180 


CAGACAAAAT 


TTACTACCGT 


CACTCAATGT 


ATCCTGGAGG 


ATTAAAATCA 


GTGACTGCTG 


240 


GTGAATTACG 


CGACAAAAAC 


TCTCGCCGTC 


TTATCGAAAC 


TTCTGTAAAA GGTATGCTAC 


300 


CTAAAAACAC 


TTTAGGACGT 


AAACAATTTA 


CTAAATTGAA 


CGTATACGGT 


GGAGCTGAGC 


360 


ATCCACATGC 


TGCACAACAA 


CCAGAAGTAT 


TAGATATCAC 


GAACTTAATT 


TAAGGAGGGA 


420 


AATTCATTGG 


CACAAGTACA 


ATATAGCGGC 


ACAGGCCGTC 


GTAAAAATGC 


AGTTGCCCGC 


480 


GTACGCTTAG 


TACCAGGTAC 


TGGTAAAATT 


ACTGTAAACA 


AAAAAGACGT 


AGAAGAATAT 


540 


ATTCCACATG 


CTGACTTGCG 


TGAAGTTATC 


AACCAACCAT 


TTGGCGTAAC 


TGAAACAAAA 


600 


GGCGCTTACG 


ATGTAATCGT 


AAACGTAAAC 


GGTGGTGGCT 


ATGCTGGACA 


ATCAGGAGCT 


660 


ATCCGTCACG 


GAATCGCTCG 


TGCATTGTTA 


CAAGTAGATC 


CTGACTTCCG 


TTCTGCTTTA 


720 


A/UVCGCGCTG 


GGTTACTTAC 


TCGTGACGCA 


CGTATGGTTG 


AACGTAAAAA 


ACCAGGTCTT 


780 
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AAGAAAGCCC GTAAAGCTTC ACAGTTCTCA AAACGTTAAT ATATCTTATA TATCAACATC 840 

TCAAGACACT TTCCAAATTT TGGAGGGTGT CTTTTTTTGT GTTTTTAGGT ACGACAATGG 900 

GTACGAGTTT TTTAAACCAG CTATTCATTT TATATTTTTT GCTTCTGCTA ATCGTTGTTC 960 

AAATTTTTCA TAAGTGGATA GATCAACGAT GTTTGGTGTA TTCACATAGA CTTCTGTTGT 1020 

TTTTGTATCA GAATGGGTGA GCGCTCTAGA AATTTGTTCC ATGGTTGCGC CACCTTCATA 1080 

TGCTAGCGTA CTAAAAGTAT GTCTTAACTT ATGAGAAGAA GTATTTATTA AGTGTTTGTG 1140 

TCGACGTTTA ACTGAGTTGA TGCGGTAGTT AAGATAGTCT ATGTGTACCG GGACATTCAC 1200 

ATTTCCCTTT CTATCAATAT AAGTGAAAAG AAATTGCTCT GCATTTTGnT tAATTCCAAG 1260 

TTGTAAAAGT TCTTGTGCTT GTTCTTTTTT CCATTTCATT AACAACTCTA TAAGGGAGGA 1320 

AGGCAAT-TGG aATTTAGTGT TTTTTCTTCC TTTGGTAGGA GTTAAATTTC CTAGCTTGwC 1380 

TTTGCTTTGG ACAAGTAGTA TTGTTCCATT CTGTAAATCG ACATGTTTCC ATTGCAGAGC 1440 

ATAACTTTCA CTTTTACGAT CGCCTAAATT TAATGTTAAG GTAAAAAGGA CAAAATCTTG 1500 

AMGGTTAGA TGTTTGTTAT TGTAGTCTTG TTGAATGGCT TGAATCCACT CAAGAAGTTG 1560 

TTCTGCGGTT AGTGCTTCTC CTTTTAGCGT TCACTCCATT TTTCTACGTT GTTTTAAAGG 1620 

ATCTCCTACG TAACGAGTAA TCTTAGCAAC ATGATTGTAT TCGATGTAAT CAAGAAGTTG 1680 

TGCAATATCA AACAATTGAT TTACATAGCT TTTAATCGTT TTGATATTTG CATATTTTTG 1740 

CGCTTTCGCA TTTAATTTTC GCAAAACTAA ATCTTTATTT GTATTCAATT CAAAGAGAGA 1800 

GTAACTACCG AACATAGGTA GAATATGAAG ACGAAACAGG TCTTTAGTGT TTAAAATCGT 1860 

TTGTTCTGAT GGAATCATTC GGTTTCTCCC ACTAGAACCA TTGCAATACA TGTCTAACCA 1920 

GACCTGTTCA TAAAATTCTT TGAAAGAAAT ATTTGCTTTT AATTCAAAAG CTCGTTCATG 1980 

TTTTTCAAGT TGTACTTTTT CAATTTTTTT AAGAATATCT TTTTCAGCTA CCAGAGCTTC 2040 

ATTTTTGGAA GAGAACGTTT TTCTGTATCG CTGT6AAGTA ACACCCAGAA TTTCTCTTAC 2100 

TTCTTTAGGA TAAAATACTT CGACTTTATA ACTTCCGCTT TTCATTTTTT TGATTGCCAT 2160 

TAATATTTCC TACTTTCTTG CGTGAATCAG AACGTAGAAA GACTATCGTT TGATTATAGA 2220 

ATAGATACTT GAATGTTTAT AGGTTGGTTT AAGCATATTT TTTTAACCAA TTATCAATCT 2280 

CCAACTTATC ATAAAGTACT ATTCCGTTAA TACTGATTCG TGGAAGTCCA ACTGTAATTA 2340 

ATTTATCAAT AGTATTGTTG GAGAGATTAA GATATTGGCA TAGCTGATGC TTTTTAAGAT 2400 

AACGATTATT TAATCCTAAA TCAATTTGAA ATTGTGTGAT TTCATTTTCT AGTAGTTTGT 2460 

GAATGGCTTC CTGTAATCTT TGTTTTTGAG TGTCAGACAA TA7VAATAGTG AGTTCGTTCA 2520 

TTTATGCGTT CCTCCATTTT TCTTGATTTT TGTGTAACGA ACATTTTGTT CTTCAAAACC 2580 

TATTTCTTTG TTTGGGATAT TTTGTTCATT CCAGTCTTGA AAGAAGACAG GTCTTTTACT 2640 

TACATAATTG TTGGTATVATA AGACTTTTTG ATCTTTTACT TCATAAGGTC GATCTGTCAG 2700 

TCTTTTTATT TCGAGCGGAT GCTTCAAGTT TTGAGATTTA AAGAAGGCTT TTTTCTTATG 2760 
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CTCTTTTAGT TCTAAATCTT TATCAAAATA CTTGCTGACA TACCGTCCTC TATTTTCAGC 2820 

GCTATCTATG TCAATTTTGT TGATTTTGAT TAGTCCATGT CCCCAAATAG CTGTTAGTTC 2880 

TGCATGAGGT AAATAGGGGA AGCTGAATAA AATAACGTGA TAATGAATAG CGCCATGTTT 2940 

TTGTTTCTCC CACGTAGCTA GGTACTTAAT TTGGGCTTTC TTTGTCTTGT ATAAATGGTA 3000 

GTTTAGTCGG TTCATAAATT TTTTGAACTC ACTATTTGTA TAGGCTATAT CTGTTATATT 3060 

CTTTTTAAAA GTTAAAGTTA AAAATTTTGT CTGGTTATCC AAATTCATAT CAATGAGTCG 3120 

AGCAACGACA AAGCGTTGAT TTTTGTAATG TTTTTGTTTT CTCTTTAAAC TGTCATATTG 3180 

TCCTTGTGCG GTTAAATCAT CAAATGTCTG TTTTGTAGTT TTTAGCCAAT CAGGTTTTTC 3240 

TTTCTTTTCA TCATTATCTG TTTTTCGTTT TGTTAAGATT AATTGTTCAT ATTCCCAAAT 3300 

TTCAATGTAC GTAGGGGTTT CAATGATTTT TTGATTGTAT CCATGTAGCA ATCGCAATCA 3360 

CTCCTACTAC AGAAATTGAG TGTTCTTACT GGTTATTCAG TTGTCAAAGA ACGAGTGTAT 3420 

AACTTTTTGA TTTTTCGCCA GATAATGTTC TATAAATCAA GTTAAGTAGG AAGAAAGCCT 3480 

ACGCCTTTCT TCCAGAGCAA TCATCCATCG TGTGGTTGAT TTTCGAGCGG AAAGCGGCTT 3540 

GAGCTTTGAT GGCTCTACCG CTTTCGTCGA AACCACCCGC TGGAAGTCTG CTCTTGGGTG 3600 

GATAGTGATT CTTCAAGTTG AGCCAAGCGT TGTTCTAATG TTTGATAGCT TCTTTCCAAT 3660 

TGTTGGTACT TTTTTGTTAA TGCTTGAAAC AACTCCTCTT TATTTGCCAA CTTAACTTCA 3720 

ATTTGATTGA TGATTTCAGT TAGTAATTCA CTATGATTCA TGGTATCTAA ATGATGTTTC 3780 

TTATCAAAAA GTTTTCGTAA AGAAGTATTT CTGCCCAATA TAGGGATCAG TTTTATTACT 3840 

TTTTGAAATT TTTGGATTTC TTCCAGTGAA AACATTCTGG TTTTGCGGTG ATTGATCGTT 3900 

CTTTTAGTTC TTGTGTGACT GGCAAAACGA ATATAATCAT AGTGAAATTG GTTiATCGCAC 3960 

TCTTTTTGAA TAAAATTTGC CCATTTGTTT AAAGTAGAGA GGGAATTCAG CCCCTCTACT 4020 

TTTTTTAGCA CGTCATGGGC GGTGAAGTAA TCAGTCATTT GTTTGGAACT AGTGTGGCTT 4080 

TATAGGCTGT AGCAGTTAGT CCATAGCTAA CAAATTGTCC ATTTACACGA CCCCAAGGAG 4140 

TAATCATAAG ATTATCAAAT CGAACGATTG CCATACCTTT TTCATCTAAT GTTTCTTTCG 4200 

TGATTACAGG AGCTTCACTT TTAACTTTGA CACTGATTTT TTCAAATCGG AGTTTTGGAA 4260 

GAACACATTC ATACGTCCAG CCCAACGTTC GCCGTCTTTT TGCCAAAGGG AGACTTGGAC 4320 

TACTTGGTAT TCATTTTCAA GCTTTTCTTT AGGTAAAGTT AAATCGTTAA TTTTTACTGC 4380 

CATTTCATAT CCTCACTTTC TTTTAGTACA TTTTTTAGTA CAAACATATT CTAATGGAGC 4440 

TAGAAAATTT TGTTAATATT TTTTAGCACA AAAATTAGTA CAAAGCTGTT TTACTCTTAG 4500 

TGCATTATTT GTTATGATTG TTTTGATAAA AATTGGCTTT TAACCATTGA GGAGATTGAG 4560 

CTTTATGAAA CCAAACAAAA AAAAGTAGGC GAAAGAATTC GAGAAATTAG GACAAATCTA 4620 

GGCTACAGTA TGGACGAATT TGGAAGTTTA TTAGGTGATT CTCCACGAAt TCTGTGAACA 4680 

ATTGGGAAAA AGGGGTCAGT ATTCCTAAAA GAGATAAACT AGAAAAAATT GCGATATTAG 4740 
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GGAATATGTT 


ACCAGACCAG 


ATACTATATG 


GTCGAGCAGA 


TGAATATTTA 


TATGATTTAA • 


4800 


TTGATAGCAG 


TTTCAATGTT 


AAATTCAGCG 


ATAATATTCT 


ttatgaaatt" 


TTTGAAAGTG 


4860 


TACCACCTGA 


AAAGCGTTCT 


TATGATGATT 


TGATGTGGTT 


AGCCGTTGCG 


AAATATTTTA 


4920 


TTGATAATGG 


TACATATGGA 


AAAGTTGTTG 


GAAAATTCAA 


TTATATTTCA 


ATGCTAGGTA 


4980 


TACCGAACCT 


TTATACAGGA 


AGCTATCAAA 


ATGAATTTAT 


TGAACAACGT 


GAAGATTATA 


5040 


AACAACAAGA 


AGTTAAATAC 


TATGTATATG 


CAGATCCGGA 


AAAAAACATA 


TTACACATTA 


5100 


TGCCATTTGT 


TCCTAATGAG 


AAGAATATAG 


AACTGCTTTA 


TGAATTTCCT 


GAATTATTAG 


5160 


AACAACAAGC 


AGATCACGAA 


GTATTTACTG 


CAAATTTTGA 


ACAAATTGGG 


CTTACCTTAA 


5220 


AAGGATCAAC 


CATTGTGTAT 


TATGGAATTG 


ATAAACAAAA 


ATTAGAAGCT 


AAAACACAGG 


5280 


CATATAAATA 


TGATGAAGCG 


GTTGATTTAT 


ATAGAATAGC 


TCAACTACCA 


GAAACTTTAT 


5340 


CAATGTTTGA 


TGATGAAATC 


CAAAAGCAAA 


TTTCGTTTTT 


AAAAAGTAAA 


TAATTGCATA 


5400 


TAAAAAGAAG 


TCAACAGTGT 


TTCATAACTG 


TTGGCTTCTT 


TGTTTTGCTA 


TTTAGTTTAT 


5460 


AAGTTAGGCC 


AGAATCAAGT 


ATATCGTAAG 


TCTAAATTCT 


AACCTTTTCT 


CTTCTTTTTA 


5520 


AATTTTCAAG 


CTGTTTTTCT 


AAACTCTCAG 


TACGTTGTAG 


AAAGTCATTG 


AGAAAGATTT 


5580 


TGAATATTTG 


GGCATTTGCA 


TGGTATTTTC 


GAATGATCTT 


TTTTTTATTA 


ATTGTTTCTT 


5640 


GTTCAAAATC 


AAAAAATTTG 


TTTAATATCT 


CTAATGTTGA 


TGTCCATGAT 


TGAAGTTGTT 


5700 


CAAGTGTTAT 


TTCTAGCGCA 


TTGATATCTT 


GTTGAGTAAT 


GATTTTAATC 


GGCTGATTCT 


5760 


TCTTTGTTAG 


TAATTCTTTA 


TTTTCTTTTT 


CCGATGTCAT 


AAGGTTTCCT 


CCTGCTATTT 


5820 


TACAAAATCC 


ATGTCTATAC 


GAACAACAAT 


CTAAAATTAT 


TATACTATAC 


CATGTTTATA 


5880 


TAGACAACAC 


GGGGAGTGAA 


TATGAATGAA 


CGAAAATGAA 


GGTGGTAGAA 


AAGTGGAAGT 


5940 


GCTGATCAAC 


GATTTATTAG 


CAGAACGTCA 


AATGTCTTTA 


AGAGAACTCG 


CAAGATTGTC 


6000 


GGGAATTGAG 


CCTTCAAATT 


TAAGCAATTT 


GGCAAATGGa 


AA/^GACAGA 


AAATTTATTT 


6060 


AGAACACATT 


GAAAGAATCG 


CAGATGCGTT 


GGAGATTGAT 


GATATATCGA 


GGATATTAAA 


6120 


GTTGAATAGG 


AAGTAGTATT 


GG6GATATAA 


AGAATAAATA 


TAAAAATAGT 


TAAAGATATT 


6180 


GTTAAATACT 


TTAGATTAAA 


TGAAAAAATT 


ACAGAAATAA 


ATTTGATAGA 


AGAAAAGTAT 


6240 


AAAGCGAACT 


ATTTGATTTT 


TAATATTTAC 


TGATGTTTAC 


GATGAAGGTG 


GAATATAAAT 


6300 


GAAGAATGAT 


AACACTCTTT 


CTGAAGAAGG 


AATTGATACA 


GATAAAGGAA 


TAGTTGGTTC 


6360 


AATTTTTGAA 


AGTATGCTAT 


CGGATACTTT 


ATCAGAAATT 


AAAAATTTAG 


ATAATGXTAA 


6420 


AATACAGAAG 


ATAGTAGAAA 


ATATAGGGAA 


ATATGATAAA 


AAAGAGTTTT 


TGCAAAGACT 


6480 


TGCTGCACTA 


AGAATTCCCT 


TTGAAAATAG 


AGATAAGGCT 


GTATTATTGG 


ATGCAACAAC 


6540 


TACAGCAACA 


CTGAATTGGT 


TATCGGAAAA 


CAACTGGAAT 


TTTAATGGGC 


TTTCAATGAG 


6600 


CTATGGTAAA 


TTCAAAAAAG 


TTATTCAACA 


GATTAACCAA 


TTAGACTCTA 


AAATGGCTAT 


6660 


TGATCCATTG 


GATAATCCCT 


ATATAGATAA 


TATCCaaTTT 


TATGGTAATC 


ATAAGGTAAT 


6720 
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GCCTGGAATA AATTTTGCAA GCTCGTATAA TTTGCAAATG ATGATCCAGT CTATATTTTT 6780 

ATC7VAATCGT ACAAAGTTAT CTGATGAGAA AAATAGATAT ATATCCATTT CATTAAATGA 6840 

TAACCTTATG GTTTCTACCG ATGTTTGTAA AAATGTCAGA TTCCAAACGA ATTnCCATCA 6900 

TTCACTCGCG AAATATnTAT nCCTAATAAA 6930 



(2) INFORMATION FOP. SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25580 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



TGTACTTTCG 


TCCAATATTT 


ATTTCTTCTT 


CAAGCTGATA 


AATAGACACA 


TCTCTATCTT 


60 


TTGCTAGTTT 


TTTTATACGT 


TCCAAAAGGT 


TCATATGAAA ACATTCTCCT TCTAGTTACg 


120 


xnGTTATTTmC 


AAAAAAAGTT 


AGTAAAAATG 


TTTGACATTT 


ACAATATTTG 


TTAGTAATGT 


180 


AATCACGTAA 


GCTAATTTAT 


TAGCTAATAA 


GTTCTCAAAT 


AAAACCTATA AACAAAACTT 


240 


GAAAATCGTT 


GGGGAACGGT 


AAAA6TATTG 


TTTTAGAAGG 


CTTTTAAAGT 


CTTATTTAGC 


300 


TATGGGTTCA 


TTTTACAATA 


TGTGTTAGAA 


AGTGTCAACG 


AATTTTATTA 


AATTAGCTAA 


360 


TTTTTTAGCT 


TACAAATTAA 


AAAGAAGGAA 


TGGAAGAAAT 


GAAAATTGAC 


GAaGCTGTTT 


420 


CAAAAGCAAT 


GAAAAAAGGa 


AAATATATTT 


ACAGGGAGTC 


TGAAAATGAC 


TGTTCAGCTC 


4oO 


ATGTAAATAT 


ACTTCCTACA 


AATACCTATG 


ATTGTTGTCT 


ATTACTACAA 


GAAAATAGTA 


540 


ATTCTGTTGG 


CAAACGGTGG 


AACCCTACAG 


CAAATGATTT 


AATGGCGAGT 


GACTGGGAAA 


600 


TACGTTAGTC 


AAGGCCCAGT 


GTTTTTGTAA 


TAACTTTAGC 


TGCGATTTCA 


CTCATTATGT 


660 


CAATTGAGAC 


ACTTGcTAGT 


TTGGATGATA 


TTnTTTTTGA TTCCTTCCAA 


tTTTCGGATC 


720 


TCGAATATTG 


TCCAAATATT 


TATGACCATC 


GAATGTAAGG 


GAAGAGATAC ATGTCATATA 


780 


AGGTTTGTTA 


GATGCAAACT 


TAACATATCC 


TTTTATAAGT 


TGTGCATCGT 


CATCACCTAA 


840 


TTTGGAGACG 


GCATAATTAA 


TTTCCTCTGA 


ACTATATTTT 


TTTAGTAGAT 


CTGATTTAAA 


900 


TATTTTATCA 


CCAAAAGCCG 


GTTCATTGTA 


CGGAAGGCTT 


TCGCTAAATA 


ATAAAATATC 


960 


ACGAATACAA 


TCATGAGACA 


GTTTCATATC 


ATTCACCACC 


TTATCAGTTA 


TTTCAGCAGA 


1020 


CCACTTGCTG 


ATAAGGAAAT 


TATATCAAAA 


AAAGAAAGTG 


AGGTAATTAA ATGTCACAAG 


1080 


ATTTAGCTAT 


TGAAGTAAGG 


GCAGCGCTAA 


TTCGTGCAAA 


GAAAACTCAA 


TCTTGGTTAG 


1140 


CGAAACAATT 


AGGGATTTCA 


AGTCCGTACT 


TATCTGATAT 


TCTTCATGGT 


CGTAGACGTT 


1200 


CAGAAGAGCA 


AGTTCGGAAT 


ATCAmAAAAA 


TATTAGATAT 


TAGATAGGAG 


GTGTAGGTAA 


1260 


AAGTGGAAGA 


AGCGATTATC 


AAAGTTGACT 


TACAAAACTT 


AAAGAAGTTG 


ATCAAACAAG 


1320 


CAAAGGAGCA 


AGCTGACCAA 


CTTCAAAAAA 


CTTTAGATGA 


GATAAATAAA ACTAAAATCC 


1380 
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TAATTTLT TA 1 AUCL-A 1 1 


CTGATCCAGC AGCAGACTTC 


ATATCTTCCC 






GTTTGTGTTT GAGCTAAl CaA 


AATTATCTAA 


TTTATTGTCA 


TCTATTGATT 






GAAATCAAAT CCGGATTTTT 


CAATAAACTC ATCAATATTA GAGAATTTAG 


lAi 1 i 1 W\Ai 


130U 


CATAAATTTT TTAGTGAATA 


GTTTATCGAA 


TGGTACAGAA 


TGTTCGTCAT 


CI AAAiaAC i u 


IDZU 


GCCATTTTTA GCAAATTGAT 


TTAGCTCATG 


TTGTAATTCG 


TCAAATCCAT 


i X AAi IwoAA 


1 con 


TTTCATAATA TTTCACCTCG 


CTTTCAAATT 


CATTTTACCA 


AGAGGTGAAT 


TGTAAAACAA 


1740 


TCAGTATAGG AGGTGATAGC 


AATGGAAGTG 


ATTTTAACTC 


CAGAAAATGA 


AGCTTCTCTA 


•« Q f\f\ 


AGAGATTTTG TACACGGAAT 


TATTGTTGAT 


HAAATAGAAA 


AAGCaCGaAG 


aGaTACCGcA 


1860 


GTTGaTAAgC gaGtCtTAAA tCaAACaGaG 


rTTGcaAAAT 


AtTTcaATGt 


TTCCACAACA 


1920 


ACAATAAGGG AATGGGAGAA 


GCTAGGTCTT 


CCACATGGAT 


CAGTAAGTAA 


ACAAGGGAAG 


1980 


TTCTACGACA AAGAAGGGTG 


TCGGAGATGG 


CTTCTATCAC 


AAAAAAGATA 


AATCTTGGGC 


2040 


AAGCGAAATC AAGGGAGGAA 


ATAAATATGA 


AAAAAATATA 


TCACTTAAGA 


CGTATGGCGG 


2100 


CACTGTTGGT TGTATTCGGA 


CTAGGTCTAT 


TGGTAGGTGG 


CAATATTGGC 


CCATTAATCC 


2160 


AAAACATATA TATAGCAGCT 


TTTATCATTT 


GGTTGCTCTA 


CTACGATTTA 


GCGTTGGAAG 


2220 


ATCGAGAAGA AAAAAAACAA 


AAATAAAGAC 


CCACTTCGAC 


GGCCATCAAA GTAGGTCAGT 


o o o n 

2280 


TACAAATATC AAATTCAAGG AGAGTGTACC ACATGAATAG AAAAATTGAA AGAATGATTA 


2340 


TTGAACTTGA AAAAGAATGT 


AAGGCACAGA ATGTTGAACT 


TCTTTTATGT 


GCTACAAATT 


2400 


TTGAAACAGG CCAAGGAAGT 


ACTGCGTTTT 


GTGGTTCAGT 


TATCGGTTTA 


GCTATACTCT 


Oil crt 


TGCAAAAACT TGTAGGTGAT 


CTAAAAGAGC 


AATTAAGTAT 


AAGCGAATCT 


TGTGATTGTC 


2520 


CAGAATGTGT AGCAGAAAAA 


GCCGAAGATG 


CTGCAAATGA AAAATCTATG 


GATGAACTAC 


o CO n 


TAACTGCATT TTTACGAGGT 


GAACTGCAAT 


GATTGAAGTA AGAGGTTTAA GTGATGATGT 




TTACGAATTA ATGTTAGCGA ATGCTCAAAA 


TAGGATTATT 


CAATCAATTC 


GAACTGCAGC 


AA 

2700 


AGCAAATGGT AATACAAGTT 


GCGTAGTGAA 


TAGTAAAGGT 


CTTACATCAA 


CGTTTTTATC 


z / bU 


TCAATTAGAA ACAGAAGGAT 


TTGATCACGT 


TGAACTTGAA 


GAAAACAAAA 


CGAAAATATT 




TTGGGAGTGG TGAAAATGCC 


TGAATTTGAT 


TCATTAGGGG 


CTAGACAAGA 


ACCTCCAGAA 




GAAAAAGAAG CATTAGAGCC 


AACATGGGAA 


TATGACGAAG 


AAGAGGAGAA 


TGACAATGAG 


^ 7*1 V 


TAACGATTTA ACACAAATAA 


CACAACGATC 


TTTAGATGAA 


CAAGTCATCG 


GAAATTTGAA 


^ V7 V V/ 


TAGATTGCAA GAGCAGGGAT TAGAAATGCC ACCAGGTTAT AGTCCACAGA ATGCATTGAA 




AAGTGCTTTC TTTGAACTAA CCAACAATTC AGGAGGAAAC 


CTTCTTCAGT 


TGGCAGCTAA 


3120 


CAATCCAGAA ACTAAAACAT 


CTATTTCTAA 


CGCCTTGCTT 


GATATGGTCA 


TCCAAGGATT 


3180 


ATCGCCAGCT- AAAAAACAAT 


GTTATTTCAT 


CAAATATGGA AATAAAGTTC 


AGCTTATGCG 


3240 


CTCATATTTC GGAACCATGG 


CTGTATTAGA TCGAGTAACT 


GGAGGGGCAG ATATCACGCC 


3300 


TGTTGTAGTA AGAGAAGGCG 


ATGTATTTGA AATTGCTATG 


GATGGTCCCG 


ACTTAGTTGT 


3360 
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TGCTAAACAT 


GAAACGGCCT 


TCGAAAACCT 


AGACAACGAC 


ATTAAGGCTG 


CTTATGTGGT 


3420 


TATTAAGCTA 


GCAAATGGTA 


AAGAAGTAAC 


AACCGTCATG 


ACAAAGAAAC 


AAATTGATAA 


3480 


GTCATGGAGC 


AAAGCAAAAA 


CAAAAAATGT 


TCAGAATGAT 


TTTCCAGAAG 


AAATGGCAAA 


3540 


AAGAACTGTC 


ATCAATCGAG 


CTGCTAAATA 


TTTAATCAAT 


ACTAGTAACG 


ATAATGATTT 


3600 


ATTTGTGCAA 


GCCGCTAAAG 


ACACACTCGA 


AAATGAATTC 


GAACGAAAAG 


ATGTAACACC 


3660 


AGAGCGAGAA 


GAGCAAGCTG 


CGGTACTTGA 


AGAAAAACTA 


TTTTCCAACA 


ATATAAAAGC 


3720 


TGTTGATCAA 


GAAAACGAGA 


ATGAACGAAT 


TACACGTGTA 


GCTGACGTAC 


CAGAGCAACC 


3780 


CGATATTGAA 


CAAGCCAAAC 


CAATTGAAAA 


AGATAATTTA 


ACGAAAGTAG 


CTGACCAAAT 


3840 


TTTAGAAGAA 


CCAGTTCAAG 


AAACTTTAGA 


TGTGATGGCT 


GGTTATGAAA 


CCAATCAGAA 


3900 


AGAGAGTGAA 


GCTGATGTCT 


CAACGATTGA 


AGAAGACGAT 


TATCCTTTCT 


GATGAAAATT 


3960 


ATTATTCACA 


AGAAGCGGAC 


CTAGCTTATA 


TGTCTGTCTC 


TCAATATAAA 


AAATTTCTTG 


4020 


AATGTGAAGC 


TGCAGCTCTT 


GCCAAGTTAA 


AAGGCGAATG 


GACACCAGAA 


AACGATCCTA 


4080 


AAGCATTGCT 


AGTTGGTAAT 


TATGTTCATT 


CTTACTTTGA 


ATCACCAGAA 


ATTCATGAAG 


4140 


CATTTAAAGA 


AGAAAATAAA 


AGCAAGATGT 


TTTCTTCAAG 


AAAACCGTTT 


GGCCTACTGA 


4200 


AAGATTTCCA 


AATTGCGGAG 


CAGATGATTG 


AAAGATTAAA 


ACAAGAAGAA 


GCCTTTTTAA 


4260 


ATATTTATCA 


AGGTGAAAAA 


GAAGTGATCG 


TCACAGGTGA 


AATTGGTGGT 


GCAATGTGGA 


4320 


AAGGGAAAAT 


TGATTGTTTG 


AATTTAGAAG 


AAAAGTATTT 


TGTAGACATC 


AAAACAACCA 


4380 


AAGATATGCA 


TGAGAAGAAA 


TGGGATGAAC 


GTTTAAACAG 


AAAAGCAAAC 


TTCATTGAAC 


4440 


GCTTCGGTTA 


CGTGTTACAA 


ATGGCTGTTT 


ATTGCGAACT 


GCTTCGGCAA 


CAATATGACA 


4500 


AAATTTTTCT 


TCCTCTCATT 


GCAGCCGTTT 


CGAAACAAAC 


ACCTAGTGAA 


GCAAAACTAA 


4560 


TCACTCTTAG 


CGAAGAAAAA 


AT6ATTTACG 


AATTAGAAGA 


ATTAAAAGAA 


AACATCGAGC 


4620 


ATGTTGTGCG 


AGTGAAAAAC 


GGCGAAGAAG 


AACCAGTTAG 


TTGTGGGATT 


TGTGAATATT 


4680 


GTAGAGGACA 


CAACAAAATT 


ACAAATTTTA 


CCAGTATGGA 


CGATTTATAA 


AAGGAGGCGA 


4740 


GCAGGTTGGC 


CATTGGAGGT 


TGGATAAAAC 


TTTATAGGAC 


CATTCAAGAT 


CACTGGATTT 


4800 


GGGAAAATCC 


TCAATACCTT 


AAATGGTGGC 


TTGATTTAAT 


ATTCATGGCC 


AtCACCAgAC 


4860 


AGAaAGATTC 


TTTTTGATGG 


AGAGTTAAAA 


cGGTGAAAGT 


CGGAGAAAGA 


wTTACATCCG 


4920 


AAAAAAAaCT 


TtCTGAAaGG 


TGGGAaGTAG 


CCGAAACACC 


GTCAGAAAAt 


TTTTAGATTT 


4980 


GCTTGTTAAA 


GATGACATGA 


TTGAATTAAG 


CAGAAGTAGA 


CAAAACGGGA 


CATGGTACAA 


5040 


AGTCAGTAAC 


TACGCAGAAT 


ATCAAAACTT 


TTCTGAAATA 


AAAAAACAAC 


GAACTGAACA 


5100 


ACGAAGGGAA 


CAACAAGCTG 


CACATCAAAC 


TGAACAACGA 


ACTGAACATA 


AACAAGAACC 


5160 


TAAAGAACCT- 


AAGAACCTAA 


GAAATAATAA 


TAATAACAAT 


AAAGGGGCGT 


TCATTCGTTC 


5220 


AATTTGGGAA 


AATAACGGAT 


TTGGATTGAT 


GTCGTCTAAA 


ACCATGACCG 


ATTTT6ATTA 


5280 


TTGGATTTCT 


GATTTTGAAA 


AAATCGGAGC 


TAGTCAAAAA 


GAAGCTGAAC 


AATTAATTGT 


5340 
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TAAAGCTATT 


GAAATTGCTA 


TTGATGCAAA 




TATAACTATA 


TCAATGCCAT 




ATTGMAGAT 


ww^^%^^ m ft ^^ft ft ft 

_TGGGAACAAA 


^^ft ^^^^^^fnfM^^ ft, 1^ 

GAGGGTTCAA 


ATCTGTTGAG 


GAACGAGAAG 


CGGCAAGGAA 


d4DU 


GCAAAAGAAA 


ACAACCAAAA 


w ft ^vft ft, ft ft fn^^ 

AACAGAAATC 


AAACACAGGT 


CATTCTGATT 


ACGATGATCT 


DD£\J 


TGGATTTTAG 


ft M%^^ ft ft ft 

GAAGTGAAAG 


AATGCAGTCA 


GCATCAGATG 


GATTTTCAAA AATGATTAAA 


C C O A 

5580 


ACGTTGCTTT 


ATATCACGCC 


TGATCCATGT 


CCAGAGTGCG 


GAGGAAATCT 


TTATGCTTGG 


5640 


CGTGCAAAAA 


ACAAAGATGG 


GTCCGATAGG 


TGTCCGCCAA 


CTTGCATGGA ATGTGGCTAT 


5700 


AAAGCACGCA 


AAAAAGCAGA 


AGACCTTGAA 


ACAGAGAAAA 


TGTTTAACGA 


TAGTTTGAAA 


5760 


GCCAGAGCGA 


TTAATTATCT 


GAAATATAGT 


TCGCTTTACA 


CCGACAAAAA 


TTTAATTAAT 


5820 


TGTCGTTTTA 


AAACTTACAA 


AACAGTAGAC 


ACAGAAACTA 


AGCTTGCTTT 


TGAAATTGCA 


5880 


AATCGAGCCA 


CAACTGAAAT 


TCTTTTGAAT 


AAACCAATTC 


ATATGATTCT 


TTCAGGCAAA 


5940 


AGTGGTGTTG 


GAAAAAGTCA 


TTTAGCTATG 


TCAACGGCTT 


GGGAAGTGTT 


GGAGAAATCA 


6000 


AACTATGATA 


AACGCTGCTT 


ATTTATTAGC 


TATGCGGAAC 


TCTTAGAACA 


ACTAAAATTT 


6060 


GCGATGAATG 


ATGA6CAAGC 


CAGAAAAGAG 


ATAACAGGTA 


GTTTGATGGC 


AGAAATCAAA 


6120 


AGCGCAGATT 


TAGTAGTTCT 


GGACGATTTA 


GGAGCCGAGT 


TGGGAGTTAA ACAAATTGAT 


6180 


GATAGGAATA 


AAAGTACTAA 


CTTCAATAAC 


GACACCTTGA 


ATCGCATTGT 


AGAAGCTCGG 


6240 


CAGAATAAAG 


CAACTATTTT 


TACTACGAAC 


TTAACTGGTA 


AAGAAATGAG 


TCAAGCTTAT 


6300 


GGGGAGAGAA 


TTCTTTCTCG 


TATCATGAGT 


AATTCACAAG 


GTTTTGTGAT 


GAAAATTGAA 


6360 


GGGACATCAG 


ACAAACGAGT 


AGCAGGCATC 


TAAAATGTTA 


TTTTTAGCGA ATATATTCAG 


6420 


CGTAGAGCGG 


TTTTACAATC 


AAGTGAATAT 


AAATAGATAC 


AAAGAAAGAA AAACGGATTA 


6480 


AAACGCATTT 


TAAAGCCTTA 


T^VAACAAATC 


GATAGAAAGG 


GGAATCATTC 


AATGCCGTAT 


6540 


GTAGTGAAAA 


TTTCAGCCTA 


TCTTGGCAAA 


GATGGTCGAC 


CTGTAGCCAA 


TTTAAAAGAT 


6600 


GCTGTGCTAT 


TTGAGCAAAA 


AGAGACAGCA 


GCTATTGCAA 


CAATCGTATC 


TGGCGGAACC 


6660 


GTTTCAGAAG 


TAAAGGAAGC 


CATTATAATA 


CCAGAAAAAC 


CAAAGAAACA 


TATAGGGAAA 


6720 


TCTATTAAAC 


GGATTGATAA 


GAAGGAACCG 


ACCGAAAAAG 


CTACCAAAAG 


TAATCAAGCC 


D /oO 


TGGATGAAAG 


GGGCTAAATA 


AGAATGAAGT 


GTGTTAGATG 


TCAAGATCAG 


CGCGTGATTT 


CQ A f\ 


GGGGCAAAGA 


CAGATTTAAT 


TATGCAACAC 


CTATTCCATG 


TCCTGIUVTGC 


AACAAAGATG 




GAAAAGCAGT 


TCGAGCGGAA 


ACTGCGACCA 


ft ft ft ^i^^^i^ft 

AGGAAAGGGA 


GTTAAAACAA 


TGCAATCACC 


O!^d0 


AACAGCCCTG 


ft ft Mtft m ^^^^.^^m 

AATAAGCGAG 


1ft ft, ^^ft ft ft ^^ffVl 

GAAACAAAGT 


#^ft ftffiifffi^^ftfi* 

CACAATTGAT 


GGTTACACAT 


TTGATAGCCA 


/02U 


GAAGGAAGCT 


AACTTTTATA 


CAAAGTTTGT 


CAAAAATTGT 


GGGTTACCTT 


TTGAAGTTCA 


7080 


TCCGCGTTTT 


AGACTAACCG 


AACTTACACC 


AACTGCGGAT 


GGTATAGGCA AAATTTCGGC 


7140 


GATAGCTTAT 


TCACCTGACT 


TCATCATAAA 


AAACTTAGAT 


GGGAGTTGGA GACATGTCAT 


7200 


TGATATTAAA 


AACTCTTTTG 


GCGTGTATGG 


TATTGACCAA 


TCCGTTAAGC 


TTCGTTTTCG 


7260 


TCTATTTGCC 


CTTCGATATG 


GTCATCCAGT 


TGAAGCGATT 


GTTGTTCGTG 


CTAGAGATTT 


7320 
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1 /v\rio 1 \Dr\ 1 r\\^ 1 i^rinuvy X v7 


T AAP T AAGCC 


TTTAAACGAA AAAAGACCAT 


TCATAACCGA 


7380 




AAGATGOU^C 


TAATTATTAA ACGAAAGTAG 


GAAAATAAAA 


7440 


TuACAAAAUA AviXAAAl 1 Iv^ 


AGACCAGAAG 


TGAAAAAAGT GACATCTAAA 


TCAAACGGAA 


7500 


ATATCbAAb 1 1 AI 1 Au X I? 


GTTAGCAACG 


CTTCATTAAA AGGGAAATAT 


GAAAGTTTAA 


7*160 


ACGAATTTTT AGGTAAAACA 


GTATCAACGA 


CCATTGAGCC AGAAACAGTA 


GAATACAaGG 




TACCAGTTAA LAAGL^rGACU 


AwTAAACCGn ATGTCGAATA TATTGyGxonT AACGACGGAA 


r uO \i 


CAGTTGAAGT tctaaaagaa 


GAACAAACTT 


CTTTAGAAAT GGGCGATGAT 


GTGCAAGAAG 


1 f H\J 


tTGaagaagt tgctgtgcaa 


GTATCGAAAG 


AAACCATTGA CGAATTCATC 


AAGAAGGCAA 


1 ouu 


CAACAATCGa ATGGCCaGAA 


TCAGTAACAA 


TCAACGTTCG TGGCGTGTTG 


CATCGAATCG 




ATGAAGGGGA AGCCTTaGAA 


GAAATTGCAG 


CTGATCATGA TGTTTCAGTT 


GAAAATCTAA 


1 7ZU 


TCAACCAAGT TGAACTTGCA 


CGCCAACATT 


TTGCACCGTT TGCAGATTCT 


TGGAGCAAAA 


T QQ n 


ATAAAGAGAA catcattttc 


CCTGAAAAGA 


CAGTTGAAGA TGATGAAGAA 


GAAATCGAAT 


ou^u 


aataatctcg tagaaagtga 


GTGTTCATTT 


TGCTTGAGAT TTATTATACG 


CCAACATCCG 


Q1 An 
olUU 


CAATTATTGC GGATGCACTG 


GCTAAAACAT 


ATGAAGTCGT TTCTTTAGAA ACAGCTAGAA 


0 J.DU 


ATATTGCCAA GAAATTTAAG 


GCTAGCTTGA AGCAGAAAAC GGATCTTTAT 


GTAATTGAGG 


ozzu 


GAATTTTGAT TGATGCTGGT 


TATAAAAAAG 


AGCCAGTGAA TTTATAGAAG 


GGAGTGGAGG 




TTTGCGGCCG CGATTAAAAA 


GCTTTTTACT 


CCGTTGAGAT TATGAATATT 


TACCAAAAGA 


0 J4U 


AAATGGGTAA AGTTATGAAA 


GTCCTTTGCG 


GTAAAAGAAG TGAAAACATT 


GGGTAAATAA 


Q 4 nn 


AAAGTGAAGA AGAGTAGCTT 


TTTACTACTC 


TTCTTCGCTA GAAAAAATTT 


TTGAACAGAA 




nt^t It It It 7t /^/*/*?t Tt /*^»p/^Jt /^^P*!* >t 

TAAAAAGCCA AGGTCAGTXA 


CATGTATGGA GCCACGAATA CTTATAATGC 


TAAACCCTGT 




It rft^^mrnrrtit 7t it it it It 7t It 1\ f^^TT^ T* 

ATCTTTAAAA AAAGAACTGT 


TTGCAAAAAA 


ACTATTAAGA TTATATTTCA 


AAGTTTTAAA 


o«j WW 


ATAAGTTGAA TTTTUAAAAl 


CATCATAAAT 


TTTTGTATGT TTTTTTTGTG 


TTAATTCAAT 


R640 

W W V w 


AGAAGGTCTT ATATUAATTA 


ATCCATAAGA GCGTAAAG^T TGTAAAGAGG GACTTAAATC 


R70n 


AGAAAAATT A T i xl LJlACAA 


TTCCTTTCTG 


AGTAAGGATT AAATTTTCTA 


TATAATCTAT 


0 ' WW 


7\ /^/->/^fp»n TV r*/^ A rr»rp TyCf^'n rT^f^T* 

Auv-G 1 1 AULpA 1, r AbGAGCU 1 


GTACTCTAAC 


CTTTATAGTT GGGGTAATAC 


TAAGTTTATA 


w 0 «• w 


Jt It n»i|i»iii|»t|ii|M|i It It vpTV 

L.AAlliiiXx AAlAlLsAlilG 


CATCTATAGT 


CGTTAGTTCA GATAAAATAA 


TTGAAAACCT 


8880 


1 oul^unnoiva X ^1 X loni XXX 


TTCTGTTATC 


AACTAATCCA GCAaGTAAAT 


TAGAAAAGTA 


8940 


X X LprU,«u\.tXiXA lV«XXL*AX^XX 


CTAACTGATA 


TCTAGCGTCT TCCATTGTTT 


TTAAAGCAAA 


9000 


CCCTAATTTC GAATCATCTC 


TGTTATCTAC AGGGATACTA TCGGTTTTAG AATTAATCTT 


9060 


TTCTACAAAA TCAATATAAG 


ATTTTTCACT 


AACAACATTT AGTTTTCTAA 


GAGGACCTAC 


9120 


AACATAATTT. ACAATCCCGC 


CAAGGCCTTC 


TCCAATTGAT TTTGTAGCAG 


GTCCAAAGAG 


9180 


ATTAGTTATC AATTCCTTTG 


6TAATAGTTC AATGTTAATT AAAGATTTTT 


TATCGTCAGA 


9240 


CATTTAAACA TCTCCTTTTT 


CAAAATATTA 


TACCAATAAA GAAAGGAATT 


TTCTATGCAA 


9300 
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CTTAGACCAT ATCAAGAAGA ATCTCGTTCC GCTGTTCAAA ATGAATGGGA GAATAAGAAA 


9360 


AAGAAAACAT 


CATTAGTTTT 


GCCTGCAGGA 


TGAATCTGAA 


TGAAAATATA 


TTGATATGAA 


9420 


AGTTCAAAAA ATACTGGGTG 


TTGGACAGAA 


GGCCTAGATC 


ACTCAAATAA AAAATTTGAA 


9480 


AAGGAAAAAG 


GCTACTTGAT 


GGTTGACGAT 


CCTGAGGAAG 


ATGAAACAGT 


AAAACTAGTC 


9540 


ACAATTGAGG 


TGATAAACGT 


TTTAAAATCC 


TTAAGAATGA 


CCGTAACTAT 


CAAAAAGGCG 


9600 


ATATCTTACG 


CTTAAACGAA 


TATCAAGACG 


GACAGTATAC 


AGGTGATGTT 


CATGTCTCAG 




AAAT7UVCATA 


CATTACAGAT 


TATACTCAAC 


AAGATGGTTA 


TCTAGTGTTA 


GGAATTAAGT 




GAGGAGGATT 


AATAAATGGA 


ACAATTCTTA 


TTAACAAAAA 


CTGGTAAAAA ACAAATTGAT 




ATAAAGGGTA 


CGGGAATGGA 


TCATAACGAA 


ATTGTCTTTA 


CGTTAGCTGC 


TACTTTAGTT 


9840 


GGATATAGCA AAGAATTAGG 


CCTAACAAAA 


GCAATACTAA 


ACGAAAGTAT 


GTACGTGCTG 




TGGAAAGATG 


GTGAATAAAT 


GAAACGTAAC 


TGTAAAAGTG 


TAATAAATAA 


AGTTAGTGGC 


QQAn 


ATTGC7UVTAA 


TGATTCTTGT 


AGCAAAAGCA 


ACCGTGAGCC 


ATTTCGTGTA 


T GG AAATG AC 




ATAACAAGCA 


GTGACCTTGT 


TTATTTCCTT 


TCATGCTCGT 


TTATTTTGGG 


AT T AGGGCT A 


XUUOU 


TATTTAGGAG 


GTTCCAGTGT 


ATGAGTTATC 


CAGAAGTTTA 


TATCATAGGA 


AGGCAAGTCG 




ATGGCGTTTA 


TGTTGAATAC 


TTACATGGAG 


CAGAGCAAGC 


CGATTTATTT 


TTCGATTATA 




CGATAGCTCG TGATGAAAGA AATCATATGA ATAAAACCAA TATGAAAGAT 


GGTACTTGGA 




AAATTTTGAA ATATGGTAGA 


CCGATAACG6 


TCGAATTAAA 


CAATAAAAAA 


GCCAACCGAC 




CAATGGTTGA 


CTAAGAAGAA 


TATTTTACCA GAAAAGTGGT 


AGCTTGTGAT 


ATGTGAGGTT 




ACTTTGCCCC 


AAACATTGGT 


CACAATAAAA ATATTTTATC ATGAGTAAAG 


AAAGCTGCCA 


10440 


ACAAAAAAAG 


CCGGATTTCT 


CCGACTGTGG 


ATAATGTGTG 


GATAACCTTG 


TGGAAAAGTC 


tosoo 

±,\j 


GrrATCATTA 


TACCACAAAT 


CACAAACGGG 


GGAATCAAAG 


GATGGTACTT 






AGAnATATGA 


AACACCGGAT 


GCAAAGGACG 


TAGATATGGA 


GCAAACTAAA 


CATAACGTCA 


10620 


GTGTGTTCCT 


GTCTGCCTAT 


CTTGCTGCTA GATGTCGTGT 


TGGCCAGCCG 


AGGGAACCAA 


106B0 


AAGTAACAGC 


TTCATTCTCT 


TTGGTTCCAC 


CATCCACGGC 


CAAAAACACT 


TTTGAAGCCG 


10740 


AGCAAATGTT 


AATCCAGAAA 


GAAGAAGCCC 


AAGAAGAGTT 


TGATTATTTG 


CATAAGCTTT 


10800 


TTGTTAGAGG 


TTATTCTGCG 


ATTCAGCATC 


CGCACAAACC 


AGATGTTACC 


GAGCGAAGAA 


10860 


AAAGAATCTT 


CTATGATCGA 


TATATCAACG 


GTAATCCAAT 


CTATCTAGCA 


GCGCAACGAA 


10920 


ACTGCATCAG 


CGAAGAATCA 


GTGAAACAAG 


AATCTAACAT 


GATTATTGTT 


CAATTTGCTT 


10980 


CAGCGCTGGA ACTGGTTGCT 


TTTAAGTAGC 


CATTTATTAC 


ACTTTTTATA 


CCTCTTTTAT 


11040 


ACACTTTATC 


TACACTTCAT 


ATACCTTCTA AACGAGTTAT 


TATGATAGTG 


TCAAAAAAAT 


11100 


AAGAAATGCG 


.ACACACTTAC 


ACAAAATTTA AACGGAACGA TTGCCTACTT 


ATTTTTTTGA 


11160 


TTTGAGATTA CAAGGAAGTA AAAAAATTCT 


ACTTTCTTCG 


TTTAGTCACT 


TGTGATCTCA 


11220 


TTTAAATTCT 


CTCGCAAACC 


ACCAATTATA AAACTAAAGA 


AGTGAGGTGA 


ATTTCCTCTC 


11280 
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rrTTTTTrCT ACAGGTTTGC GAGAGTTAAT GGAGCATAGC TTAATCGGCA GAGCAGCGGT 11340 

CrCCAAAACC G7TGGTATAG GTTCGAGTCC TATTGTTCCA GTAAGTGGCA TAAGCTGCTT 11400 

P.-J^.TAAAATA GATCGTCAAT AAATGTTCGG ACAAACAAAT TGGCGCTACT ACCTTTCACT 11460 

AS3G:7rGrAT T7ATATGCAG TCCTTTTTGT TTTAAGTGTA GTAGATTTTT CATTTTGAAA 11520 

G3-GGATA.--G7 AAGACAATGC GTGTATTAAT TAGAAGTTCA 6CATCTGGTT CAGAGTATTG 11580 

GGAr.-XCGAA GAAAAAAGAA ATGTGTTTGT ACCTAAAGGT CAAGAACCTG ATTTTGAAGT 11640 

TACTGAAAAT CCTGAATCAA TGCTAAGTAA AGAAGCT6AT TTACATGTTG GTGGATTACC 11700 

AATTACTGTA GGGAATGTAA CGATTGATAC TGATGGAATT AAAGGCGAAC GATTATTAAC 11760 

AACTGCAAGT GCTGCTGATG ATGAAGAACA AGATGAGCTT GTTCCGTCTG ATGATGAATC 11820 

7GT7GTAr?A GAAGAAATGA ATGTAAAAGA ATTGCGTGAA TATGCAAAAC GAAAAGGTAT 11880 

TGAGP.TTCCA AGTGCTGTAC GTGCAAAAGG TGAAATTCTC AATATTATTA AAGAATCTGA 11940 

ArAATGCGCT ATTGCCAGTT TGAAGGTTGC TCTAATACAA CAGAAAAAGG AGCTTATTGT 12000 

TCCGPACATG CTAGGAAGTC AAGAAAAAAG AAAAAGCCAA TCAATGTTTA TCATCATGAT 12060 

AACAAATCAT TTTATCGAAC AAAAGAATGG CAAGATGTCG CTGACTTTGT CTATGAAAGA 12120 

GAAG3TGGTT GCTGTCAAAG ATGTGGCCGT TTTGTATTTG GAAGGCAAGC GCATCGGCAT 12180 

CATG7GAr7C CAATCAAGAA GAACGAAATG CTCAAACTTG ATCCAAACAA TATTCGTTTG 12240 

T7GTGTCCGA AGTGTCATGT GATTGAAGAA AATGAAGCAG ATGAGAAAAA AGTTTTTCCA 12300 

TCTTATT?7A AJUU^ATGAAG CCCCCCTATC AAATCTGATT CAAATTTTTT GTCGGGGGAT 12360 

AGGG7AGGGG GCAGTCACGC GTGTCGTTAG GTCAAAAATT TTAAAAATAA AAGGGGGGTG 12420 

TATAAAAAAT GACCACAAAA GCGCAACGCA AAGCGATTAT TGATGAAAAA GTTAATCACG 12480 

AJSAAAACGCG AATTTTAGAA ATTATGCGCA AGTCTGATTT ATACACTATT ACTCTTGATC 12540 

CAT73AT7GA ATCATACTTG GATATTTTTG AAGTTTACCA ATACAAATAC ATGCTGTGGA 12600 

AJ^.GAJ\AAJiGG ATTTCCCGAA ACCCAAAAAA CAACAAACAA GGyTGGAGmT AyTAACAATA 12660 

GCAA3CA7CC ACTAGCGCAA CAAGTCGAAG TTTGGGCCGA TArAAAAATG AAAGCATTGG 12720 

A7TTATTAGG ATTGACCAAT AAGTCAAAAA CAGGCAGACA AATTACTGGT GGTTCAACAG 12780 

CTAGAGCAGA TGnaGAAATG AAACGGCCAG AAGAAAAGCC TGTAAATGAA TTGGCAGAAC 12840 

ATCGGAJi-AAA ATGGCGTAAA AAGGCAGGGA ATGAAACATG ATTGAACCTG GTGTAAATTA 12900 

TGCT3ATTTA TTTGCGAAAG AAGTTCGAAA ACATCCTAAG AAATATCCGA AAACGGTTCG 12960 

T7tAGC:AATA GATCGTTGGT ATCGATGGAA GAAACGAAAA GATATTTG6T TTGATGTTGA 13020 

TCGTGCAAAT GAAATGATGG ACTGGGTAGA ATCTTTTATT GTTCaTACAA AAGGCGATAT 13080 

GGTAGG7AAA.CCATTTCTTT TAGAGCCATG GGAAAAATTC ATTTATTCTT GGATTTATGG 13140 

C7GGGTAAAA GAAAATGAAA AAGGGCAAGT AGTCCGTGTT ACTCGTGAGG CATACGTACA 13200 

AATACCAAAG AAAAATGGGA AAACATTAAT AGCCGTAGGG GCGTTGGGAT ATGCTATGTA 13260 
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TGGCGAAGGT GCCTTATCTG TCGATTGCTA TGCATGTGCT TCTGATTTTG CGCAAGCCCA 
GTATGCTGCT AAGCCTTTTG CCGCTACTAT CCTAAATAAT CCAGTGCTAT TAGATGGGAC 
TAAAATATTT AAAGGTCCAA AAGGCACCGT TTCAAGTATT ACGTATGACT ATTTACATGG 
AGATATGGCT TATACAAATA AGTTTATTGT TCAGACAAAA AACATTGATA ACATAGAAGG 
TTCCAATCCA TATTTTGTTT TAAATGATGA GCTGCATAAA CAAGAGAAAA TGGAACAGTA 
CGATAACTTT AAATCTGCAC AAATTTCATT GCCACAGCCG TTAATGTTTA ATATTTCTAC 
AGCTGGTAAA GGAAGTAGTT CGGTTGGTAT TCGTGTTTAT AAAGAAGCAA AAGAAGTCTT 
GAAGCGCGAT GATAATGATT CAAACTTTGT TTTGATTTAT GAACCAAATA AGGGATACGA 
TTGGACaGAT AGAAAAGTTT GGGAAATGTG CAATCCTAAC TGGGGAATAT CTGTTGATTT 
GTCTGCTTTA GAATCAGCCT TTAAAACTGC GCAACGTTCC GCTCACTCGA AAGCTGAATT 
TTTaACGAAG CATTTAGATG TGTTTGTGAA TGGTGCAGAT AATTTCTTTG AACAAGATCa 
AGTGGAACCG TGTTTGGTTC CCACAAATGA ATTAGGTAAC TTAAGTGGTG AGCCATGTTG 
GATTGGTTTG GACTTATCTA AAAGTCGAGA TTTAACTTGT GTATCATTAA ATTTTCCTAC 
ATGGGATGCC GAAGGAAAAG CGATACTCAA AGTAAAACAA TTATATTTTA TTCCTAGTGA 
AAATATTGAT TTTCGAGAAA AGGAAGATAA TGTGCCATAT TCTGAATTAG CAGAACAAGG 
ATTTGTTGAA TTTTGCGATG GTAAGTTAAT TGATCAAGAA CAAATATTTC ATTTTATTGA 
AGATTGCATG GATTTTTATG ATGTTCAACA AGTCAATTAT GATCCAGCGA TGAGTGACCG 
ATTAGTTGAA AAATTGGAAA ATTTAGGCTT GGAATGTGTG CAAGTTGATC AGTACGCAAG 
AGTATTGAAC TCGCCGCTTG AAGATGCCGA GCGATTATTT TATGAGCAAA GGATTATGTT 
TGATAATCCT TTATTTTTGT ATTGCGCTTT AAATGTGGTT GTCAAAATGG ATTTTCAAGG 
CCGTAAAGTA CCAAGTAAAA ACCAGTCAAA GAGAAAGATT GATGGATTTG TTGCTTTCCT 
TTGTGCGCAT AAGGAAACAA TGGATCAAAT GATTGATGTC AACGAAGATG ATATGGATGA 
ATATTTAGAT TCTATCTATC GATAATAGAG AGGCGGTGAG ATTTTGAAGC TAAGAGATAG 
ACTTTCAAAT GCAGTCTATG GATTTTTGGA AAAGCGTGGC TGGATTGAAG ATATTTATGG 
CAATGTAACA AGATATTCAC AACGTTTTGT TAACGATTCT TCTATTATGG AATCGTCTGA 
TGTTTATGAA TTGGTACAAG ATATTTCTAA TCAAGTTGCA CTAGCAGAAC CAGTAGTAAT 
TGGGCCTGAT GGCGAGGAAG TGAAAAATCA TTTCTTGTTA AACATATTGA AAAATCCTAA 
TGATTATTTA ACTGGTTTTG AATTTGCAAA GCTTGAAACA AATACATTGT TAATCAATGG 
AGAAGCTTTT CCTATTACAG ATAATGACCA GTTACATTTG GGATATGGTG TTCAAACGAA 
ATTAGATGAT CGTTTGATTG AAAGATTTTC AATGAATGGC CAACCAATAC CAGGGAGTAT 
GATTCGTCAT-ATAAAAAATA TTGGTGTGGA TTCCTTAAAA GGTGCTGGAA TTATTGATCT 
TGCAAAAAGC ACACTAGAAG GTGTTTTAAG TGCTGAAAAG GTTTTGACAG AAAAATATAA 
GAAAGGCGGC TTGCTCGCTT TCTTGTTAAA GCTGGATGCG CATATCAATC CAAATAACAG 



13320 

13380 

13440 

13500 

13560 

13620 

13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 

14760 

14820 

14880 

14940 

15000 

15060 

15120 

15180 

15240 
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CGCTCAACAA AAGATAGTAA AAGCTATTTr APATC^GTTG GA^^GAAACGC AAGATAATGA 
TAGl<»TJP^eTTri^ATGA TTCCTTTGGG CAAAG3ATAC TCAATTGATA CTTTAfiAAAG 
CCCAATTG?.? 3AT3C;iGCTA TTCTTAATTA TTtGGGTGTT cftCAAAAAAG ACCTAGGAAA 
tTTTTAGGAA 7AG.--TSTAfiA TACTTATCAg CATTAATGAG AACaGaTATT GnAAAAGCAa 
TGATGTATCr 3CAC;.J^.CAAA GCAaTTAAmC CAATATTAAA AAATAAGAGC 3AGCATTACT 
CGGCTCTTTr TTTTAtSCCT AATICTGGTI ATCGA3TGGA ATGGAAAATT AATATTTTGG 
ACTTTGTACC TTAT7CCACC AAAACAAATA TTGGGTACAA CArTGTTCGA ACTGGTATTA 
CCAGTCCT& 1AAT3T3GCA GAAArGCTTG GTTTTCCrAG ACAAAATACT GAAGCARCAC 
AAGCCGTCTA JATTTCAAAT SAT7TAACGG AAATC3GCAA AAAGAATGCT ACCGATAACT 
CATTGACAAC A6AG3ATGAC TTGA^-JSGGAG GXGGTAAGAA TGAAGAAACA GGAAATTCGG 
ACATTTGAa:^ TCACAAACCr 7AAAACAAGA AGCGA^GA-AG ATAGTCAAAC ACAGATTGTT 
ACTGGTTATG CGGC3GTGT7 TAAT.^^JGTCCA ACAGAATTAT GGGAAGGCCT AAATGAAGTG 
ATTAAGCCTG GAGC7TTCAG 7CGT3CTTTG TCAAArtCTG ATGTTCGTTG TTTATTCGAT 
CATGACTGG3 GCAAAGTATT A3GGCGCACA AGAAG7GGAA CTTTGAAACT TGAAGAAGAT 
GATAAGGGAC 7ACGA7TTGA AGTT3AGTTG cCCCA^TACA ACTGTTGCCA ATGACTkGAT 
TCAATCAATS TCACG75GGS zlCATTAvTCA GTGTA3CTTT GCrTTTTATC CaACGGAAGA 
AACTTGGGAT TATASTTCAG ACCCMTTTT AAGAACTATC CaTGAAGTCG AATTGTATGA 
AGTTTCTATr GTTTCTllAC CTGC7TATGA AGATACSGAA GCAGCACTAT CAAGAAACAA 
ACAAGAAATG AAGCAASATA TTAA.iACTAG AAAAAAATTA ATTGAAAAAA TCAAAACAGC 
GCTTGAAGC3 TAGGASGAAT TTAT7ATSAA OAAGSATTA TTGCGTCAAX JACAAGCTCG 
TCACGAGAAA CGAITAAGTG A7TTACAAGG CyVAArTGAA TCTGGAGAAG TGCGTGAAGC 
AGATTTAGAT 3CAGTTAATG AAGAAATTGA TGGTTTAATC GATGAATTAA WGCCATTAA 
AGCTGAATTA 3GGGATSATA ATTC-JGAATC TGGTGATGGT GAAGGCGATG fiCGGGACCGC 
TAAATCTGAT AATACTGAT5 ATGfcAAGTAA AGAAGATCGT GAGAAAGATA CGAACGGAAA 
CAACGATGAT AAA;l:«:GAAG-AAAA7CGTGG CGGCATGATT AGCCAAGAAC AGCGTGATGG 
TTTGTTACGC ACAATTCATG AAGG-ATGGA GGCTASAJiAT GTGATGTCTA ATGAACAACG 
TGAAAAACAA ATTCGTAAAG CATT7GCTGA TTTTGTTATT GGTAATATTT CAGAAA6TGA 
AGCACGTGCA TTAGGTATTG AAAC?:GGCAA TGGTTCAGTG ACAGTACCAG fJVGTGATTGC 
ATCXZGAAGTG ATTICTTATG CTCAAGAAGA AAACCTA7TG CGTAAATACG GAACTGTGAT 
TCGCACGGCT 3GCGATGTGA AGTA7CCaAT TCTTGTGAAA AAAGCAGAGG CTAATGTAAA 
CAAAAAAGAA- CGTACGACAG ATAT7ACTGA AACAGCGATT CAATTTGATG AAATTCTACT 
T6ATCCAGCA 6AATTTGATG CATTAGCAAC TGTAfiGGAAA AAACTATTAA fiAATGTCTGG . 
TGTGCCAGTA GAAGA7ATTG TTGTAGAAGA ATTGAAAAAA SCATATSTTC GCAAAGAAAT 
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TAATTATATG TTTAATGGCG ACGATGCAGG aAATGAAAAC CCAGGAGCTT TAGCTAAAAA 17280 

AGCTGTTGCA TTTGAAAAAC CTGTA6ATTT AACAGCTGCA GGTGCTGGTC AAAAATTATA 17340 

TGATGCATTG ATTGAATTTA AAAATACACC AGTAACAGAA GTAATGAAAA AAGGTCGTTT 17400 

TATTATTAAT CGTGCAGCTT TAACTGCTAT TGAAAAAATG AAAACAGATG ATGGATTCCC 17460 

ATTGTTACGA CCATTCACGC AAGCAGAAGG TGGTATTGGT TATCAATTAG TTGGTTATCC 17520 

AGTTGATTGG ACCGATGCAG CAGATAAAAA AGGTGAACCA GATACACCAG TATTATATTT 17580 

TGGTGATTTT TCTGCTTTCA AAATTCAAGA AGTTATTGGA GCGTTAGAAA TTCAAAAATT 17640 

GGTTGAAAAA TTCTCTGGAA AAAATCAAGT TGGTTTCCAA ATTTACAACT TGTTAGATGG 17700 

CCAATTAGTT TACTCACCAT TCGAGCCGGC TGTTTATCGT TATGAAATTA CAAAACCAGT 17760 

AGGTGGTTAA TGTGAATAAC GAaGCTGAAA CATTATCTTT AGAAGAAAAA TTCAAAGCAC 17820 

ATATTCATTT TGAAGAGGGG ATGGATGATT CCAtGCTCTC TTTTtATTTA AATATGGCAA 17880 

AAAATTATGT AAAAACTGCa ACTGGAGGGC AAGAAGAATA TTtAATTTTG ATGGTTGCTG 17940 

GTATgCTTAT GAaTATCGTG TTTCAGAAGA TGAATTAGAT AAGGCGTTGA ATGCGATCAC 18000 

GCCATTTATC ATCCAAGGAG TGATTCAACA TGCCGAAGAG GCAGACGAAT AGGTTTCGAT 18060 

GGAAAGCGGA CTTGCTAAAT GTAAAAGAAG AAACAGATTC GAACGATAAA GTAGTTACGA 18120 

CCTATAAACT TAATAGGCTT TTATGGTACG AAGATATTGG AGTAACTGCA CAAGAAAAAT 18180 

ATCTTTCACA GCAAGCCAAA ACAGACGTTG TCAGACGGAT TAAAGTGAGA TTGGATAAAT 18240 

CTATCACAGA AAAGTTTAGC GCTGTTAGAA TCGATTCTGT GACCTATAAA ATCACTCGTA 18300 

TTTACACAAA TATGGATAAA CGAGAAATGG AGTTGAGTTT GGCTTATGTC GATTAGTTTT 18360 

GAAAAATTGA GAATAACTCT GAAATCAGTA GGCGTACCTG TGACACGTGA TAAAGCGGAA 18420 

AAAGGAACGG ACTATCCATA TATTGTTTAT TCAAATGTTA GTCAAGGTAA AAAAATGGCA 18480 

TCGTCTAAAG TGCATAGACG AATGCCCTAC TATCAAATCT CTTTTTATAC AACAGGTACT 18540 

GAAAATOATT TAATTGCTTT AGAAAATGCA TTGGAGGAAG CTGGTATTCC TTACACTGAT 18600 

TTTGTAGGCA TTCAAGGCGA TGAAAATGAT GATACTGTGA CAAATTTTTA CACATATGTG 18660 

AGGTGTATTG AAGATGGAAA ATAATAATGG TTTTGCAGAT ATGGCAGACT ATTTAGGAAA 18720 

GCTTTCGCAA GTAGATGCGA CAAAATTATC AATAGAATCA TTAACCGCTG CAGCTAATTT 18780 

TTATATGGAA AAATTACTAC CCAATATACC TAAATCGCTT CTAAAAAAGA AGCACATGGT 18840 

TGATCAAGTG AAAGTAAATA TTAAAGATAA TGAAGTACAA GTAGCTTTTG AAGATACAGC 18900 

TTTTTATTGG CGATTCGCTG AAAATGGAAC AGTAAATCAA AAAGCGCAAC ATTTTGCCAG 18960 

TGGAACATTT GAACAAAATA AAGATCAAAT TGAAAAAATT ATGACTCAAC AAATATTAGA 19020 

TTTATGGAAA GGATGAGTAA ATTGGGAAAA CAAGATGTGT ATTATTTTGA AGGCTTAGAT 19080 

GACATCTTAA TTGCCATGAT GACCACAAAA GATGCAGTAG GTACAGAACC AGCATTTGGC 19140 

GAGGTTGTTC GTTTGCCAAT AGCCACAAAA TTGGGAATTA AAGGAAATGG AACAGCTTTA 19200 
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GAAAAATGGG CATCAAGTAA AATGTTCCGA CGCGTAAGTC GTGAAACGAA ACATGAAATT 
GCGCTAGATC ATGTGGGCAT TCCTATTGCG GTGATGGATG AAATAAAAGG ATTAATCGCT 
CAAAGTGGAG TGACTTTTGG TAAAAACACT GCGCGAGAAT TTCCTTATTT TGCCTTTGGG 
TTTATCGGAA ATATTGAAAA TGGTGGAAAA AAAGCTGTTT GGTATCCTAA AACGCAGTTA 
TCAAATGTTA TTGATGAAGA ATACACTACT GCAGAAGATG AAACCAAAAT TGATGATGTA 
ACTGCTAACT TTGTTTCAAC TGGTTTAACA TACAATAATG TTATGTATTC AAGTTTTGAT 
TCTAATCGGG ATAGTGCTTC AATAGAACTA TTTGAAAAAT TTATCGCACA ACCTGTTTAT 
GACGAAGAAC AATGGAAGAA ACTAGCAGGT CCTTCACGTG GAGGTGGCAG TGAATAATGG 
CAAAATTAGC GGATTATGGG ATTGTTATTT CAGATACACC CACAGTTACT ATTCAAGGGC 
ATCAGTTTCC CATTTTGTTA ACGATGGAAA CAATGGAGTA TATTGCTGAT GTTTATGATG 
ATGACTATTC AAAATTTGAA GCAGATATGA ATGAAATGAT TAACAAGAGT GGTGGTC6TA 
TTTCATCAAA AGATTTATCA GCTTCTGATT TAAAAATTAT GCGTGCATTG ATTTACGGCA 
TGCTAAGAAC TGGTGGATTG GAAGAAACGC CAGAAACCAT TTTTAAGTTT TTAGGTATGA 
GTGCCACAAT TGTTGAAATT TATGGTGCAT GTATGGAAAT ATTTGCAAAG CAGAATTTTC 
AAGTTGAAGA CTTAAAAAAA TCCAAGAAGC CACAAGATTA TCAAACTCCG AAAAAAAGGA 
AAAACAAAAA GAAAAAGCCT CAACGGAAAT AGGAACGCCG TGGGCTTTTT ATTTATATGT 
AGCTCTCACT CTTTTGGGAT GGAGTGAGGA CTTTTTTTTA AAAGCAACTC CCAACTTGTG 
GCTTAAGTCA TATATTCAGT GGTTAGTAAG TAATACGGAG TTTGAACCAC CCAGAAGTGT 
GACAATGGAT AAAAGTCCTT GGTGGTAGGA AAGGAGCGCT AACGTGTCAA AACAAGAATC 
CGATGTTGTC TTAAATTTTA AGATGAATGG AGAAATAAAC TATTCACGAA CAATTAAAGA 
CATTAATAAA GAAATGAACT TAGCGGCTAC CGAGTACAAA AACCAGGTAT CCGCAATGGA 
TAAAAATGCA ACTCAAACTG AAAAATTAAC AGCTACAAAG AAAAAATTAG AAAAACAATT 
ATCTTTAGCT GAACAAAGAA CAAAATTATT ACGTGAGGAA TACGAAAAAT CAGTAAAAGA 
AACTGGTGAA TATTCA6AGC AATCACAAAA GCTGTATAAG CGTTTATTGG AATCCGAAAC 
AGGTGAAAAT AAACTGCGTT CTGCATTGCA AAGTACCAAT GAAGCTTTGA AAGAGCAAGG 
TAATTTATCA ATAAAAACAG CTGAAAAACT AGCCAAAATT GAAAAAGCTG GAGACAAAAT 
TAAATCAGTT GGGCAAAAAC TGTCTGTTGG ATTAACAGCA CCAATTATGG GAATTGGTGC 
TGCTTCTATT GCCGCATTCA AAGAATTAGA TGAATGTTTG GATAATATAA CAACAGCAAC 
AGGGGCTACT GGTAGTCAGC TAGAATCTTT ACAAGCCAGT TTTAAAACAG TAGCAGGTCA 
AATACCCGCG GATATGCAAG ATATATCAAC TGGTATTGGT GAAGTAAATA CTCAATTTGG 
CTTAATGGAT.AAGCAATTGG AAGATACAAC AGGCCGAATG CTTAAATTTT CAGAAATTAA 
TGGCTCAGAT GTTTCTCAAT CAACTATTAA TGCAAAAAAA TCAATGGACC TTTTTAGGTT 
GTCTATTGAG GATTTGCCAA TGATTTTAGa TTCAGTATCT AAAACTAGCC AAGATACTGG 



19260 

19320 

19380 

19440 

19500 

19560 

19620 

19680 

19740 

19800 

19860 

19920 

19980 

20040 

20100 

20160 

20220 

20280 

20340 

20400 

204 60 

20520 

20580 

20640 

20700 

20760 

20820 

20880 

20940 

21000 

21060 

21120 

21180 
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AGTAGGGGTA GATCaGTTAT TTGATGCCGT AAAtAGAGGT GCGCCCCAAC 


TCAAAGCTAT 


21240 


GGGACTTGGT 


TTTTCTGAAT 


CAACTACGTT 


AATAGGTCAA 


ATGGAAAAAG 


CTGGTATTGA 


21300 


TTCAGCAGGA ACTCTTGGCT ATTTGGCAAA AGCTAGTGTC 


GTATATGCGA 


AAGATAATAA 


21360 


AACCATGCAA 


GAAGGGCTTA 


GCGGAACTAT 


TGAATCTATT 


AAAGGGGCCA 


CAACTGAACA 


21420 


AGAAAAACTC 


ACTATTGCTA 


GTGAAGTTTT 


TGGAACTAAA 


GCTGCTTCAA AAATGGTAGA 


21480 


AGCTATTGAT AGCGGAgCGT TGTCAATGGA TGGTTTAGCA 


GATTCAGCAA AAAACGCAGC 


21540 


TGGCACTGTG GATCAGACGT 


TTAGTGATAT 


TCTTGATCCA ATTGACCAAG CAAAGTTGGC 


21600 


ACAAAATCAA 


TTTAAAATAG 


CAATGGGTGA ACTTGGAGAA CAAGTACAAA TAGCATTATT 


21660 


ACCAGCATTT 


CAAGCTGCAA 


CGGATGCAAT 


AAAAAAAGTT 


TCAGAATGGT 


TTGGAAGTTT 


21720 


AACAGATAGT 


CAAAAGCAAA 


CCATACTGAA AATAGCTGGT 


GTTGTGGCTG 


CTATCGGTCC 


21780 


AGTATTAGTA GTTTTAGGAA CACTTGCTAG 


TTCCATTAGT 


AGTTTGATTC 


CAGTTATTGC 


21840 


TTTTATTGCG 


TCGCCAATTG 


GTTTAGTAAT 


TGCGGCGGTT 


GCCGCTTGGG 


tAGCTGCAAT 


21900 


CGTAGTTGCA 


TATAAT7\AAA 


TCGGTTGGTT 


TAGGGATTTT 


ATCAATACCT 


CCTTTAAAGT 


21960 


AATTAAAGAT 


ATTGTGGTTG 


GTGTATTTAA 


TGTTTTG7WV 


GATACGACAA AATCTACTTT 


22020 


TGATTTCATC 


ACAGGATTTA 


TTGGTGGTGC 


CATGGATGGG 


GCTGCAAAAA 


TTATTGGCGA 


22080 


TTATGTAAAT 


GAAATTAAGC 


GTATTTTTGG 


CGGTATCGTT 


GATTTTGTAA 


CGGGAGTATT 


22140 


TACTGGAGAC 


TGGTCAA6AG 


CGTGGCAAGG 


TGTTGTTGAC 


ATTTTTGGTG 


GTATTTTTGA 


22200 


AGGCATCGCT 


GCAGTAGCTA 


AAGCTCCAAT 


CAATGCCATG 


ATTACGTTAA 


TCAATGGATT 


2226.0 


TATTGGTGGA 


TTAAACA'VrA 


TAAAAATACC 


TAAATGGGTG 


CCAGGAATTG 


GCGGTAAAGG 


22320 


ATTTCATATT 


GGAAAAATCC 


CTTATTTAGC 


AGAAGGTGGA 


ACTATTCTAA 


ATGGCCAAGC 


22380 


CATTGTTGGT 


GAAGCTGGTC 


CTGAACTATT 


AACCGCTAAA 


AACGGCAAGA 


CAACAGTAAC 


22440 


TCCATTGTCA 


CCAGAAGAAA 


AAGCTCGTGG 


AATTGGTGGT 


GCTTTGAAAG 


GTGGCAACAC 


22500 


TATTGAGCAA 


CATGTTCATA 


TTGGCCAAGT 


AGATGCAAAT 


AATCCGAGTG 


AGTTAGATCG 


22560 


AATGAATCGC 


AAGCTTTATA AAGCAAGTGC 


GCAAGCTTTC 


TATGACTTaG 


GAGGTGTTCC 


22620 


aACGTGATTT 


TTATGAATCC 


TGATGAACCA AATTTCATTT 


GGAAAGATTT 


GAATGCAGTT 


22680 


CGTGATATGG 


GGTGCATTAT 


CGAAAATGAG 


CTGTCAGAGG 


TTTTACCA/U^ 


TAAACGATAT 


22740 


GAAACGTATT 


CGATTATCGG 


AAGAAGTGGT 


GAATTTAATG 


AAACGTTCAA 


TGATTATGAA 


22800 


CCCTTTGATT 


ATGAAATTGA 


AGATGTAACT 


ATTCCATATG 


AAAATTTAGC 


GGCAGTCAAA 


22860 


AGATGGTTAA 


CTGGTAAAAG 


TAAACTTATT 


ACTCACAATG 


ATGAAGATAA 


ATATTTAGAT 


22920 


GCTATTTGTA 


CAATGAGTAA 


ACCAACTTCA 


TTCA7WVATG 


AATGGGGTGT 


TTTTTATACC 


22980 


TTTAACGTTG 


. AATTTAGATG 


TCAACCGTTC 


AAAAGAAAAG 


TAAACGAACA 


ACCAAAAGTG 


23040 


ATTAAAACAA 


AATCAATTGA 


AATTACTGAT 


CACGGTGATG 


AAATTGCTTT 


TCCTTATATC 


23100 


GAAATTAATT 


CAAAAGGTGG 


CGATATTACG 


TTAAACATTG 


GTAGTAACTC 


ACTAACGATT 


23160 
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TTGCGTACAC AATCAGGAAT CGTCACTATT GATACCGAAA AGGGAAAAGC AATACAAGAA 23220 

GGAAATCCAC TATTTACACG CGGCAGTTGG ATAAAAACGA ATCCCGGTCA ^WVTAA^ 23280 

AATATATCAG GAAATTTTAT AGAAGCTAAG TTTTGGAATA GGAGCGCGTA TTTATGACAC 23340 

AAAATTTTAT TTATGCCTAT ACTGCTATTC CTGAAAATTT AAACGATAAC GGAATGGCTT 23400 

TGCCAGATTG GCAAGATTTA CCAGAAATTA ACCGTGTGTT AAATGGTGTG TATCGATTCT 234 60 

ATGGTAACTA TGCAAGAGAT GGCCAGTATC GCTCATACTT AAAAAAAGGA AACTTTCTAA 23520 

AGGCACAAGT TGAAGATGGA TCATATCAAT ATTTTGAGAT TTACAATATT AAAAAAAATC 23580 

TGCAGTCAGT TTCAGTGACA GCGAGACACA TTGGTTTTAT GGCAAATAAG AATTTCATTA 23640 

TTGATTCGTT CACTGCTAAC GGAAATGGCA CGCAAATTAT GAATAATTTA AAGGCTGCAT 23700 

TAACGTTTAA GCAACGGTTT AACTATTTGT CGAATGTCGG TACTACACAT CAATTTACAG 23760 

CAAAACAAGT AGGTCCAATC GATGCAATTA TTGGTTCTAA CAATGGCAAT CAAAATTTAA 23820 

CAGGTGTTAC TGGTGGAGAA TTAGAGATGG ATAACTTT/^ TTTGAAATTA GTAAAACAAA 23880 

TTGGAGCAGA TAATGGCTTT AGAATTGATT TTGGAATTAA TTTGGAAGCT ATAGATGAGG 23940 

ACTATGACGA CGAATCAATT ATAAACAGTC TTTTTCTTAT CGGTGGCGTA CCAGACAATG 24000 

ATTATGACCA AGATAAAGAG CCAATCACGT ATGGCTTTTT AGAAATTGCT GGTGTAAATG 24060 

ATAGTAACAG AAGAATTGGA AAACGTGAAA ATTCGGAATG TAAAACAGTT GATGAGCTTA 24120 

AAAAATGGGG CCAGTCATTG TTTGATAAAG ACCGCATTCA TGAACCAAAA GTAACGCACA 24180 

CTATTAGCAT GGTAGCATTA GAACACACTT TGGAGTATGA AGACATGTAC GAAGAGCTTT 24240 

CTTCTTTGCA TTTTGGTGAT GTAGTACATG TTAGAGCAAA AGAAGTCGAT ATTGAAGTAA 24300 

CGGAGCGCAT GGTGGAATAT ACTTGGTTTC CGACTTTAGG CAAATTTAAA AATATTGTTT 24360 

TGGGGAATGA TTTATCACTT TACACCTCAA CAGTAAATAA TCAAACTCAA GAGCTAAAAC 24420 

AAAAAATTGA TAATCGAACA GAAACATTAG TACAAAATGT TTTAAATGCA ACGGCATGGA 24480 

TTACTGGAAA CAGTGGTGGA CATGTCGTTT TTCGTCCAGA AAAGGCCCCG TCTGAAATTC 24540 

TTATCATGGA CAAAAACAAA GTAGCTACCG CAAAACGTGT GTGGCGCTGG AACTTAAATG 24600 

GTTTGGGTTA TTCCTCCAAC GGCGTGAATG GTCCGTTCGA ACTTGCTATG ACTTCTAAGG 24660 

GAGAAATCGT TGCTGATTTT ATTAAAGTGG GCATTATTAA CGCGAATGTT TTACAAACAA 24720 

GCTTTAATAA AGCAACAGAC GATGTACTAA AATTAGTAGC TGGTGCTTTG CAAATTTGGA 24780 

ACAATAAGAA AAAAATCATG GAATTGACTA AAAAAGGGAT GGAATTTTGG AATGGCTCTA 24840 

GTCATATTGG GACAATTGGT ACGAAAGGAA ATCCCTTTCC AGGGGTAGTA GATAAAAATG 24900 

GAAATCCTGT AGTTTCTGAT GGGAATTCAT TGCTATTAGT CGCAGATAAT CCC/U^AAAAA 24960 

TTATTGGTTT- GTCTAACTAA TCAGGAACAG GACATTTAAT TACTGGTCCT ACACAGTTTT 25020 

TTGTTGGAAA TAATTTTAAC TTTTTTGGTC CGAATGGAAG TAAAGCAATT CTGACAGTTG 25080 

ATCGATTGAT TGTGGGCGGC AAAGAAGTTA TACCTGGTCA AAATGGTGGT GGCGGTTCTG 25140 
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GAGCTGGAAC AGGTGGTTAT CCATCAGAAG TTACAAGCGA TGCAGATAAA 
ACTTATGGAG TTACCTATTA GCTAACGGAT ACAGCAAAGC AGCTGCTGCA 
GAAATGTACA AGGAGAAGTT GGTCCAAGTA TGAACCCAGA TACCGAACAA 
CAGCTTACGG ATGGGTTCAA TGGGACGGTT CAGCATATCC ATTGGTAGGC 
GGAATGGCCG AGAATATGTA CAACGCTTAA TCGCAGCTGC AGGTATCAAA 
GGACGTCATT AGCCCAAGCT CAATTAATTA ATTGGTGTAT GTTCAATGGG 
GACAAGTAAG TCCATTAACa GTTGATGaAT TTaAAnnTCT ACGGTCTTTG 
TTACTTCTGT TTTTTCAGAA 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1260 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: dotible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
ACTTAAATAC AAATGTTGGT CGTGCACAAT GGGAACCTCA AAGTATCGAA 
ATACGTTTAT TACTGGTAAA TTAGCCAGTG CAGGTGTCTC AGGTTTTGAA 
AACAAACAGC TACTCGTTGG CACAAAACAA ACATTACAAC AGGTCCCCTT 
GGAACTTAAC TGCCCAACAT AGAACTGCTT CTTGGGATTA CTATATTACT 
GGAATCCCAA TCAACCATTA GACATTAAAA ACTTCGACAA AATTGCTTCA 
AACAAGAAGT TCCTAATAAA GTTGTTAAAC AAACAATTAA TATTCCGACA 
GTTATCATGT CATTTACGCT GTCTGGGGCA TTGGTGATAC GGTGAACGCC 
CGATTGATGT GAACATTCAG TAAAAATACT CACATCTTAA ACTAAATTAA 
AAAAAATGAA GCTAAAAAAA ATAATTCCTG CTTTTCCCCT TCTTTCAACC 
GCTTGTGGTT AACGCCTACT CAAGCTTCTG CAGATGCTGC GGATACGATG 
CTGGCAAAAA AGTGTTGGTT GGATATTGGC ATAACTGGGC CTCAAAAGGA 
ACAAACAAGG AACATCAGCA TCACTAAACC TTTCAGAAGT AAATCAAGCC 
TACCGGTTTC CTTCATGAAA AGCGATGGCA CGACACGGAT TCCTACGTTC 
ACCAAACGGA CACTGCCTTC CGACAAGAAG TCGCACAATT AAATAGTCAA 
TTTTATTGGC ACTTGGTGGA GCAGATGCAC ATATTCAATT AGTCTUW^GGC 
CCTTTGCGAA TGAAATCATT CGTCAAGTGG AAACATACGG CTTTGATGGT 
ACTTAGAGCA ATTGGCGATT ACTGCTGGCG ACAACCAAAC CGTCATCCCT 
AAATAGTC7VA AGACCATTAT CGAGCACAAG GAAAAAATTT CATCATTACG 
AATTCCCTTA TTTAAAACCT GGTGCCGCTT ATGAAACATA CATTACTTCC 
ATTATGATTA CATTGcCCCA CMTTATATA ACCAAGGCGG CGACGGtGTC 



TTTGCTTGGG 
GGTATCCTyG 
ATAGGCGGTC 
GCACCAACTT 
CAAGACTATA 
CAATGGTTAG 
GTTGGTAACG 



GCGCCCAAAA 

CCTTTGGATG 

GACATCACTT 

AAAAATGGCT 

ATCGACGGCA 

GACCGCAAAG 

TTTTACCAAG 

AGGAGTTCTC 

GTTGCAGTTG 

GTAGATATCT 

CGCGATGGTT 

TACAATGTCG 

AAGCCTTATA 

GGTCGCGCAG 

GATGAACAAG 

TTAGACATCG 

GCTACGTTGA 

ATGGCACCAG 

CTAAATGGTT 

TGGGttGATG 



25200 
25260 
25320 
25380 
25440 
25500 
25560 
25580 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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AAGTTATGAC TTGGGTTGCT CAAAGCAACG ATGCTCTAAA ATACGAGTTC CTCTATnATA 1260 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8188 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TCCAATGGCT TATCGATTAC AACAACTGGA AGTtCTTTTA AATCAAGCTT GCTTTTTCCA 
ATTAGAtACC TATGGAAACC ATCAwTAATA ATATATTCGT CTTTTTTATC ATCACAATAG 
CAAACAATCG GTTGTGTATA TCCGTCTTTT AAAATCGATA ATTCTAAAAG CTCCATTTCA 
GGTCCAGCGA CATGATTTGG GTTATAACTA TTACTTTTTA TTTTCTCTAC TGGAACCATT 
TTTACATTTA AAACGGGCAT ATCAATCTTG TTCATTGATT TCTCTCCTAA AAAATAAATA 
ATTCTTTGTC TCTCTATATT GTTTAAAACA ATAATGTTTC AGATTATTTA AGTACTGGTC 
^^Y^rp-j.^;^ GAAATTCGTA CACACTTTTT TTCACATTTC TTGTCTATGA AT^TAAAAC 
CTTCTCCAAT AAATACTTTC GAGTATCGTC ATTTTCATCT TCAACAAAAA CATCTCCAAT 
ATCAATAAAC TCATTGGAAA TTTGAATGGA AGAAAATCCA ATTATCCTTT CCTTATTCAT 
TAACAAAAAC CAAATAGTAC TAGATGTATT TATTAAATAA GGTAATTTTC TTCTATATAC 
TCTTTCAGCA AAATATTTCC CCATTAAACT GTAAAAATTT TGTyTATTGT ATGGGGGTAT 
ATAAACTTTA ATATCTATTT TGCTCATTTC ATTCCTCCTA TTTTTTGTAC AATATTTTCG 
TATTTCTTTC CTTTTAATTG ATTTATCCCC AAAAAGTCTT TATATTCTTT AAACATTTCA 
TATAGTTCAG CTTCATCTTT TTTCGTTAAA CTATAATTTC TAGTAGATAA AACAAAGTCA 
YXTTTTTCTA AAGCTTTAGC TATCCCTTTC CAGTCATTCC AAAGCCTTtC ATCTTTTATC 
CAATCTTTTC TTTTTGATTC ATCTGGCATA TCTTGGATAT CCATATtATA tGTATtCktG 
TAATAaGCAA AAaGAATACA TATCTTTCTG TAATAATGAT TCATTAAAAG AGGAGAATAT 
AATCCTAATG TTTCTAGTAA AAAAATAGAA TATCTTTGCC AAGTCATAAA TGAAGGTTTT 
TGAGAAGTAT TTAATCCTAA AAGACTAGTT TTACCATATA TATTTCCAAA ATTAACTCCA 
CTTACTCTTT TTACTAATTT ATCCCATGTT TCAgGTTCAA GTTCAGCAAA TTGATTTAAC 
GCTTTTCTTT GTTGTAATCC GAAAGGCTGG CATACTCTCA TGTCCCGGTA CTTATATCCA 
ATCTTATACA TTTTCTCATA GATATAATTA TATTCTAGTT CACATTTAAA AGTTGCAGCC 
CAAATGTCAC CAACTTTCCA ATCAAATAAA GGATAAAACA AAATATTTCC TTCTAAAGTT 
ATATTTATCC. AACATCGATT CTTATATTTA TTTTCACCTT TTGTAACAGC ACTATGTCTG 
TGTAAACTTT CTTGTGCCCT TATACCTACA CCACATGATA TAGCACTTTG GGGATATTGT 
TCTTTATACC AATTTTGAAA ATAGCGAAGA AATCTGTCGG GATTACCATT TGATAATGAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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TAATATTTTT TTAAACCGTC ATCAATATTT TCCAAAGTTA TGGCATTTTT AGGAAGAGAA 1620 

TGAATCCATT TTTCTTTTTC GCTTGGATCC CACATTTTCC ATTGAGGTTG AAAAAAACTA 1680 

GTGTTGTTAT CCTCAAAAAA AGGTAAGCAA AAATGATAAA TATTTTTTAC TCTAGGTAAC 1740 

TGTTCAATTT TTTTTATAAA ATCAACAGTT GCTGTGTAAT TTGCTTCAAT ATTTAATATT 1800 

AATAGATCAA AATTTCTATC TAATTTTTCT GCAACTAGAT TAGCTAACTG AACCATTAGC 1860 

CCACTATCCT TACCGCCACT AAATGAAAAA TACACCTTAT CAAATTCTTC AAATATGTAT 1920 

TCTAATCTCT TTAAAGTAGC ATCAAAGACA TTCATTTTCC ATCCTCCCTA AAAAATAAGC 1980 

ATTTAATATC AATGGTAATA CATTAAAAAC TTTGTCAATA ATAAGGATAT ATTCCAAAAT 2040 

CCATTCATTG GAAATAGAGA CCTTCTTTAT TAATGTATTA TTACTTACAT AGCGAAgcAA 2100 

TaCAAACATT ATCCAATAGT TTCATCTACA AAAAATAAAT TACTTTTAAA ACATCTTGAT 2160 

TATGAATTCA TTTCCCAAAG GCCTTTTAAA ATTGATACAG ATGTTCGTTC ACTTTTTTTA 2220 

GACTTATTGT ACAAGTCTTT TTAATAGTCC CAGAACTUUUV TAGTTTGTTT ATCCAGCTTC 2280 

AAATCATTAA AGGCTAATGA ACGGTTTATT TTTTCGATTA CTTTAGATAA AGCTACAGAA 2340 

ATTTTTATAT TTTCACAATG CTCAGTTTGA ATTTCAATAA TTATATCATC AATTAGTTTT 2400 

TGAATTTCTT ACCTATCAAC ATATTCAGTC TTAGAAAGAG AATCTTTTAA AGCTACTAGT 24 60 

GATTCAACGA TTTCAGACAT TTTCTTCCTC CTTTCTAGTG GCCTGTAAAA CCGTGTAGAA 2520 

CAGAGTCCAT CGAACCGGTA ATTACACTAT CTACaGCGCC ATTCCAATTT ACTCTAcATT 2580 

TTCATTTTTC AGATTTGTCA GCCCATACAG ATAGCCTTTT TGACGTTCAT ATTCGGATTA 2640 

TACCATTCAG CCGCAGAAGC AGATTCGCTC GACACAAACA ATCCCATACC AACAAAACCA 2700 

ATACACATCA CACTATAGCC AACTAGTTTT TTCCTCTCGk ATCCATCTCC TAAAAAATAG 2760 

AATAGTtCTT GTAATTATTC ATAAGATATA CTTTTCCGTT TTTTATCCTT TCTGCTTTAT 2820 

AACAACTTTT TTCAAGTAAA GAATGACATA CTTCTTCTTA AAAAAACAGT CTGTTTTTTT 2880 

CTATTTTTTC GGTGTATAAT TAATACGAAT GTTCATTTTT TGTTTTAATA CGCACTCCGT 2940 

ACATAATAGA TCTTGATTGC ATCAATCTTT TGTACTTTTT GATTTATCTT AAGCTGAATG 3000 

AATAGAAACC TAGTTACCGC ATTACTTTTT TGTTCTCCAA ATAACAAGTA ATGATCCGTT 3060 

TGGGTTATAT CTTGTGAAAC ATTATGAATA C6ACCAATGT TTTTTTGTTG TACATGTGGA 3120 

TCGTTGTCAA TTCCATTTTC TTTTATACAA CAATCGATTA AATAGTTTTT TACGGATTGA 3180 
TTTCGTTCGT ATATGTCCGT GTAATTTACC CATTTTCTTC CAAAATCCTT TAACAAATGT 3240 
TGAATTACTT CAAATATGTT GGATTTTATT TTGTTTTCCT GATTTCTTTT ACTTTCTATA 3300 
CTTAAACTTT CTGCTTGCAT CATCGTTTCT CTTGATATTG AGGAAACTGC TGTTTTTTTC 3360 
TTCAGAAGAA-ACCCACCTAC AACTTCTAAG GACAAACAAA AGCAACCAAG AAAAAACAAT 3420 
ACTTTTTCAC AATTGTTTCA TCTCCCTTTT CTACTAAAAA TTTTCATTGA GAGAATTCTT 3480 
TTATTTGGCT TTACGAATCA AACTTCGAAT TTTCGTTTGG TTATAGGTGC GTTGATACAG 3540 
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TGCTATGATT CCCCTTGCTT GGCGTTCTGT TCCACAGTTG AAAAATAATT CTCACTATCT 3600 

GAAACATCCG TTATGATACC GGTGTGTCCA TAGATTCCAG GAGACAATTG GCTTTACCAG 3660 

TTGGCCATGA AAGGATTCTT TTAAACGACA CTGAAATAAA AGTIAAAGAAG CCAAATAATT 3720 

ATAAGGCTTC CTTCCTTGAA ATTGTTTTCG TGTCATTGAA AAAGCAAATA ACCACGGACA 3780 

AAAAATTTGT CGAGTAATCG TATGTTCCAT ATCAAATGGA AAAATTCGAA AAATCCCCCC 3840 

CAAAAAAGCC AATCCTAAAC CATACCACAA ATGTTTGTGT TACGGGACGA GGTAACGCGA 3900 

CCCCGTCCCA CAATTTATAA AACACTCGCT CTATATACCA AATCTTTGTC CAATTGCGTA 3960 

TATTGCTCAT TTTATCGCCC CCATGACCGC ATGCGATCCA GAATCCAATT GAATACTCCA 4020 

GACGTATCTC CAACTAACCA AACAGCTACT GCCAGCACAA GAATCAATGG AATGGCTTTG 4080 

CTTGCTTTTC GTTCAAAGGC AATATAGGCC GCACCACCCC AAGCATCGCC AATATAATCA 4140 

AATAATTAAA ATCCCCAACA AAATTGGTAA ACACATTTTG AAAAAAATCA TTTGCTTTTT 4200 

TCCTCCCTGT TAATACACAT TGAACTT^TG ACAATACACA TCACCACAGA GCACAACTGA 4260 

TAAAAATTTC ACTTGATATT CCTCATTATG CATTTGTTCT GTTCTTTGAT TGCTGTTTTT 4320 

ACGTAAGAAT TATAAATAAT TTCTCACACG GCGATACATT ATTTTCACAG GAACATGCGT 4380 

ACAATAAATC CGGTTCTGCG TACTTTTCTT TGTTTTAGGA TCAAGTTTCA TAGAATAGAA 4440 

ATCATAAAAA TATTTACCAC TTTTTTATCA TATGTTCCTT CGATCACTAG TGAAAAGCTA 4500 

GGGTGGTAAA TCCAACAAAT ATAATTGATA CTTCCCTCTA CTGGATAAAG GGACAAAACC 4560 

AAGTACCTTT AAATGGTCTG ATACAGTCAT AAACACGATC TATCTCCTCC CAAATAAAAC 4 620 

AAAAAAAGGC TGTAATCGCA TTTTGATTAC AGTCTTTTTT TGTAAATGTA TTAATTAAAG 4 680 

CTTTACTTCC GGTCTTTTTA TTTAATGACT TTAGTCAGTT GAAATCTCTA AAGAGATTTC 4740 

GACAAAAAAA CCACGCGTAA TGCGCGTGGA GGTCAGAGCT TTAGCGATAA AAGTTTAGCA 4800 

CATACTCGTT GGAATTGTAA GTATCATCTT GTGTTTACAC CAAAATATAG AAGAAAAGTA 4860 

ATCTATGGTC AATTGAGGCG AGATATAGGG AAAATACTAA GAAAATTGTG TGAGATGAAA 4920 

GAAGTCGAAA TCATTGAAGC GCATGCAATG CCCGACCATA TCCATATGTT GGTAAGAATT 4980 

CCACCGAAGT TGAGTGTATC AGGTTTTATG GGTTTCTTAA AAGGAAGAAG TGCAGTAATC 5040 

ATTCATGAAC GTCATGCGAA TTTAAAATAC AATTATGGAA ATCGATCCTT TTGGTCAAAA 5100 

GGTTATTATG TGAGCACTGT AGGACTGAAC CAAAAAACGA TAGCAAAGTA TATACGAGAA 5160 

CAAGAAGCAG AAGATCGAGT AAGGGATAGT ATAAACAAAA GAGAGTATAA AGATCCATTT 5220 

AGGAAGTAAC AGCTAGATAA GAATAGAAAG GTAGCGGTTG GCGGTCTCTA AAAAAGCTCC 5280 

CTTAAAGGGA GCAGCAGGTA GCGAGCCCTT CTAGGGCTAT TaAAAAGCCA CCCGCTATGC 5340 

GGGTiSGATAT.TTACTTAATA AATCAATTTT CCCAAATTAT TTTCTTTAGC TTTCTTAACG 5400 

ATTTCTGTTG CAACTACCAC ATCTAATACA GCCGAACCAA CTGTTTTGAA GATCGTGATT 54 60 

TGTTCTCCAC TTGTCCGACC GACCAATTCT TCATTGATCA GTTGGCCTAG TTCTCCATGA 5520 
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TAGCTGTCTC 


TTTGAATATA 


TCCATCTTGC 


AGCGGTGAAA 


TGaAGTCCCC 


AGCTTCTTTT 


5580 


AGAACACCGT 


CCATGGTATC 


AAAAATAACA 


ACATCTGCGG 


CTTTAATTAT 


TTCTCGTGGa 


5640 


ATTTCACACA 


TTTCCGGTGT 


GTAAGCACCG 


ACTCCATTGA 


TGTGCGCACC 


TTTTTTGACC 


5700 


CACTCAGCAG 


AGAATGTCGC 


ACGTTTTGAA 


GTGGTCACAC 


TTGTGATGAT 


GTCTGCTCCT 


5760 


TCAACGCATT 


CTTGACTGGT 


TTTTGTTGGG 


CGCATCGTTA 


CAGAAAATTG 


TTGCATCATT 


5820 


TCTTCAGCAA 


ATTGGTTGGC 


ACGCTCAAAA 


TCAATATCAA 


AAATCCGAAC 


TTCTTCTAAT 


5880 


TTACGAACGG 


TCAACATTGC 


TTCTAATTGT 


GACTGTGCTT 


GTCCACCGGT 


TCCAATCAGC 


5940 


GCGCCGATTT 


TAGCATCTTT 


TTTTGCAAGA 


AGTTCTGTTG 


CTGCTCCTTG 


AACAGCGCCT 


6000 


GTTCTTAATT 


GTGTCAAATA 


AGTACCATCT 


AAACAAGCGG 


ATACCATACC 


CGTTTCTGGA 


6060 


TCCAATACGA 


TCATCGTTGC 


TGGTACACTT 


GGTAAGTTTT 


TCTTGATGTT 


TTTAGGATAC 


6120 


ACGGAAACGA 


TTTTGACACC 


TAGTGCTCCT 


TCGCCACCTT 


CTACATAAGC 


TGGCATATAT 


6180 


AAGCTTTGTC 


CGTTACTTTT 


TGGCACATCG 


ATATTGGTGC 


GAAGAGGCAC 


AATTGCCTTT 


6240 


CCTGCTGAAT 


ACAATGACAA 


TGCTTTTTTG 


TCTGCATCGA 


TGGCTTCTCT 


CATTGAAAAA 


6300 


TTTTTGATAA 


TATCTTCTTT 


TTTAAGAAAT 


AGCATTTGAA 


TCGCTCCTTA 


ATTAGCTGCT 


63*60 


GTAGCTTTTT 


TTAATTGACT 


GCTCAACATC 


GTATATGCAC 


TGTATAGTAA 


GACCAAAATC 


6420 


ACGATCCATT 


TCAGAATATA 


GGTATCTAAA 


GATTTGACTA 


GGAATACAGC 


GACCAATACC 


6480 


CCTAATACAC 


CGAAGGTTGA 


AGTGAAAAGT 


GTGATTTTTC 


TGCTGTATTC 


ATCAAATTTA 


6540 


ATGAATTGAA 


CACTGCCGAT 


CGGTACTGAG 


AATGTACAAG 


CCCCCATCAT 


AATAGGAAAG 


6600 


GCTGCAACTG 


GATTCAAGCC 


TAGCAAGTAG 


ACAGTTACCA 


TTGTTAACGC 


ATAAGAACCA 


6660 


ATGCCGATAT TGTTCAATGC GCCATAGACA AAGAGTAAAA ATCCTGCCAA GATTAATTTT 


6720 


CCGCCATACA ATTCACTGGC TGTTCCGTTT GAGgGAAkCC AATTCATTTG TCCCATAAAA 


6780 


ATTAAGAATG 


CAGCAATGAC 


CAATCCTATG 


CCGACAAACA 


GTTTGATCGT 


TTTCTCTGGT 


6840 


AACTTCACGA 


CAAATCTTGG 


TCCTAAATAT 


GCACCGATAA 


TTTGGCAGAT 


AATACAAACA 


6900 


GCTAATGTCT 


TGATCCCTAC 


GTTGATAGCA 


GTGATATAAG 


ACAATGCCAT 


GACGGCTACC 


6960 


GGTACTACAC 


ATTGGGTATT 


CAACGTTCCG 


GGTAGTTTTT 


TCATGCTGAC 


CCATTTCAAC 


7020 


TTTGGATAAA 


GGACGGTTCC 


GATAGCAAAG 


TCGGAAACGC 


CAAATGTTGA 


AAGAAAGAAG 


7080 


GTGATAAATG 


ACGTGAAGGG 


CAACAGACGC 


ATATCCCCAG 


GTTCTTCTTT 


GAAATTTTGT 


7140 


TTGTTTTTGA 


AAAGGTCTCT 


TAGAAAGTAA 


ATAATAAAAT 


ATAGATTGAT 


TGCGACAATT 


7200 


AGTGCCAGCA 


AAATATTCCG 


AATCATAAAA 


ATTCCTCCTA 


CTAGTAAATC 


TTTCTGATTC 


7260 


CCTCAAAGTT 


CTCACCATAT 


TTTGCTTTAA 


GTTCTTCAGA 


AGATTCTGCA 


ATCACTTTTA 


7320 


GTGTATATGC 


. ATCATGTTTG 


ACATATTCTT 


TAGCTAGATT 


GACCATGGCG 


TGGTTTTTTG 


7380 


TCCAAGTGAC 


ATCGGTTGAT 


TGCAGTTCTT 


CATTAATGGT 


GCCAAACACG 


ACTTCTTGCT 


7440 


CATCGGCGAC 


GATGTTGATC 


CATCTTGCCC 


CAAAGTCATC 


AAGTTTGTCT 


GTTTCAAAAC 


7500 
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GGTAGAACTG AATAAAATCA 
CTTTAGCAGT GAATCCgTGA 
ATTTTtCACC AAATGCTCTG 
CATGTCTTCT TCATCTTTTT 
aTTTTTAACA CTTTTTTCTA 
TTTAGGTTCT GATTTATTGA 
ATATACTTTC GAGCGTGGCA 
TTTTCTAATA GGGTGGTATA 
ATTTGAATCA TCGTTTCCAT 
CAACTATAAT TTTTTTTTGA 
TTAGCTTAAA AACAAGCTTT 



CATGAATGTA ATACTTTTCT AAGGGAAGAT CATATTCATG 
GTACAAATTT ATCGACTTTT TTTTCGGCCT GCTGCAATGC 
GGTTCTCATG CCAAACTTGG ATGAGCAAAC TTTCTTTTGC 
CTTaTCCaGA ACATACTcGA TGCCATCAAC TTTnCCACAG 
CCTTAATCAT TcCTAGATTC TTTGTAAGAa AAGaAAGATC 
ATTTCTCTAA AAACTCaTTT GCCGAGATGG CATGATAAAG 
CGACAATCAA ATTCTTTTTT AATAGTTTTT CCAAAATGTT 
CcCTGAGATT TTACTTACTT CATATCCTGT AAGATTACCT 
CACTCTTGTT TCCATCTCTG AAAAATCATA TTTTTTCATG 
TGATTAATCC TCACTTCATT TTTAGTAGCT ACTACGGTAA 
TCATCTGTCA AGAGTTAGTT CAATAAAACA CATATAAGAA 
tTATTAAAaA TaAATaATAT AATAGGGA 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TGATGAGCAT AAGGAGGACA GGCATGAACA AAAAG7UVTAA TGGCATGAAA 
ACTACGTCTT AGTAGTACTT GCCATGGTCA GnGTTGTCTA TTTCATTTTT 
ACCAACAATC GCCAGACATC GATTACTCAA CCTTCCAACA ACAATTGGAA 
TCAAAGACAT GACCATTCAA CCAACTAATG GGGTTTACCG GATTGAAGGT 
AAAAACAAGA AGTGAAAGAC ACAGGTGGTC TGTCATTGTG GGGCTCAACG 
CAAAAGGCTT CACAACAACC GTTTTACCTA GTGATACAAC CTTAGCTGGC 
CGGCACAAAA TAACAAGGTG AAGCTAGTTG TCAAAGAACA ATCAACAAGT 
TGTCACTGTT GTTTAGTTTC TTACCATTAG TGATTATTTT CTTCTTCTTC 
TGAGCCAACA AGGCGGTGGC GGTGGCGGCG GTGGCCGTGT GATG7UVCTTC 
AAGCCAAAGA AGCCGATAAA AAAGCTAACC GTGTCCGCTT CTCAGATGTA 
AAGAAGAAAA ACAAGAATTA GTTGAAGTAG TGGAATTCTT AAAAGATCCA 
CTGAATTAGG CGCACGGATT CCTGCGGGCG TACTATTAGA GGGACCTCCA 
AAACGTTACT TGCTAAAGCC GTAGCAGGTG AAGCAGGTGT GCCATTTTAC 
GTTCAGATTT CGTTGAAATG TTTGTCGGGG TCGGTGCTAG CCGTGTCCGG 
AAACAGCGAA GAAAAACGCG CCTGCAATTA TCTTTATCGA TGAAATCGAT 
GTCAACGTGG TGCTGGTATG GGTGGCGGAC ACGACGAACG TGAACAAACC 



AATGGTTTAT 

GGAAATAACA 

GACGGAAAAG 

CAGTATAAAG 

CAAGCGTCGT 

ATTCAAGATG 

GGTGCTTGGT 

TACATGATGA 

GGTAAATCAA 

GCTGGTGCGG 

CGTCGTTTCG 

GGGACTGGTA 

TCAATCTCAG 

GATTTATTTG 

GCAGTTG6TC 

CTTAACCAAT 



7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8188 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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TATTAGTTGA GATGGATGGT 
ACCGTTCAGA CGTGTTAGAC 
TGGTAGGACG TCCAGATGTC 
AACCGTTAGC TGATGACGTT 
GTGCTGATTT AGAAAAT^TC 
AAAAAATTGA TGCTTCTGAT 
AAAAAgATCG CGTGaTTAAT 
ACACAATTGT TGGTTTAGTC 
CTCGTGGTCG CGCAGGCGGT 
CTAAAGAAGA TATGTTTGAA 
TTATCTTTGG CGTTCAATCA 
CTCGTAGCAT GGTAACTGAA 
GAAACCATCA AGTCTTTGTC 
TGGCGTTTGA AATCGATCAG 
ATGAAATTAT CGAAGCGCAC 
ATGAAACATT AGATGCAAAA 
GTGCTGACAG CGACTATCCT 
CTTTAGAAGA AAAAGAT-oCG 
AAAAAGAATT ACATGATGAA 
AAGTCCAATC TGAAGAGAaA 
ACGAAGATCG TTATAAATAA 
ACTGCACCTT CTATAATATT 
ACCTGATTCC TAGTATTATC 
GAATGGAAGA TTATTTAGTA 
TCAGTGCCAC TGAAACTGTT 
TGCAGCGTTA GGACGCACAT 
CGATAAATTA ACTGTAAAAG 
TAATGGCCGC GGAGAAACAA 
TGCAACAGGT AAAATTGATG 
TAAAGATTTA GGATTAAAAG 
CGGTGAAGAC TTTACATATT 
AGGTGTATTG GTGGACACGG 
AATGCCAGGT GCCGATGAAA 



TTTGATGGCA ATGAAGGAGT 
CCAGCGTTGT TACGTCCAGG 
AAAGGTCGTG AAGsgATTTT 
GATTTGTUUVG TTGTTGCACA 
TTAAACGAAG CAGCCTTAGT 
GTGGATGAAG CCGAAGACCG 
AAAAAAGAAC GCGAAATGGT 
TTAAGCCGTG CTCGTGTTGT 
TATATGATTG CTTTACCAAA 
CAAATCGTTG GCTTACTTGG 
ACAGGGGCTT CAAATGACTT 
TATGGAATGA GCGACAAATT 
GGTCGTGATT ATGGTCAAAC 
GAAGTTCGTC GTATCTTAAT 
CGTGAACAAC ATAAACTAAT 
GCAATCAAAT CATTATTTGA 
AGTGAAAAAG AAGCGCAAAC 
CAflAAACAAG TTGAAGAAAA 
GCAGAAGAAG TCAAAGTAGA 
AAAGATTCAG ATTCCAATTC 
CCAATTAGTA GAAGGTGCAG 
AAGATATTCT ATAGAAGGAA 
yGACTAATGG aCmAAAAAGC 
AAAGCCCTTT GTTATAAAGG 
AGCGAAcGCa aAGAAGACAT 
TGATTGGAGC CCTTTTACTA 
TACAAGGAAA TGGCCCAGCT 
AAGGCTACAT TAAAAATCCA 
TACGTGGTGC TGTCGGTAAC 
AAACCTTTTC TGGACAAACA 
TTATGGCGGT CTCTGAACAA 
ATGAATCTGT GAAAGCAGCC 
GTACCATTGA TTTTATTGAA 



TATCGTAATT 
TCGTTTTGAC 
aCGTGTTCAT 
ACAAACACCA 
TGCCGCTCGT 
CGTAATTGCT 
GGCTTACCAT 
CCaCAAAGTA 
AGAaGATCAA 
TGGACGTACA 
TGAACAAGCA 
AGGACCTGTT 
AAAAGCTTAC 
GGATGCTCAT 
CGCTGAAAAA 
AACAGGTAAA 
ATTCG7UVGAA 
ACAAGACTTT 
AAGCGAACAA 
TGAATATGAT 
TATTTGTCGA 
GATTATTTGG 
ACCAAGAGAG 
TTCAATACGA 
GATACATGGA 
GGAGCGACTT 
GGCGCTATTA 
CATGTCAGCT 
GAAGGAATTT 
CCTATTGTTT 
GTTCCTTCAG 
GGTGGTTTTA 
CAGCGTCTAG 



GCTGCAACGA 
CGCCAAATCT 
GCGAAAAACA 
GGTTTTGCTG 
CGTAACAAGA 
GGTCCAGCTA 
GaAGCAGGAC 
ACAATTATCC 
TTCTTAATGA 
GCAGAAGAAA 
ACAGGGATTG 
CAATATGAAG 
TCAGAACAAG 
ACGAAA6CGC 
CTATTAGAAT 
ATGCCTGAAG 
GCAAAACGTG 
GAAGAAGCGA 
ACAGAAAAAG 
CGCAATAACT 
GTTGACGAAT 
TTGAAAAAAC 
nGAGGAAAAA 
GCATATGCCA 
GTTCTTCTAC 
TAAAAGGTGA 
TTGTCGATAG 
TAAAGTTGAA 
TCACAGTCAT 
CTGGTGAAAT 
CTATCGGTTT 
TGATTCAAGT 
CGGAAGTGCC 



1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 
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ATCGCTTATT 
GTGACTGTTC 
ATGCGATGAT 
ATGACTATAG 
AAAAGTAGGA 
CGTAGCTCCG 
CGCaGGATTA 
AACCTTAGAA 



ACCAATTTCA CAATTACTAG AAAATGGAGA AACACCT6AA CAAGTCTTGT 
gggAgaagac GAAGTAGAAA TTTTAGAAAA AATGCCTGTA CAATTTAAAT 
CAAAGAAAAA TTTGCGACCG CTTTGATTGC TGTTGGGATC GATGAATTAA 
TGATGAAGAT CATGGAGCAG AAGCAGTTTG tcaattctgt aacaataaat 
tgaagaagaa ttaattgaac ttcgcgatga agcgattcgc aatacgaaac 
ggaaagacta tgtggaaaat tggagacgtt gaaatcccca accgtgttgt 
atggcgggta ttagtaacgc cgcttttcgt gtgactgtaa aagaatttgg 
gtcgtttgtg aaatgattag tgacaaagga attaaattaa gaaacaaaaa 

tGCTATATAT TGATGAGCGG GAATAnCCTT TAAGTGTACA ATTTTGG 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21126 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CTTGAnACAA TTGGGGCAAA AAATGTTTTG GCAAATGAAA AAGGCTGGTT ACCAGTGACA 
GAAGAAGCGG CCATTGCTGC AAAACCAGAG GTCATTTTAA CCAATGTTAA CTATATGAAA 
GATCCCGCCA AAGAAATTTT AGCTCGTAAA AACTGGGAAA GCGTTCCAGC AGTTCAAAAC 
AAAGAAGTTT TCGAAATCGA TAACATGTCT AGCTCTTTAC CAAATAACCA CATTACCAAA 
GCATTAAAAC AAATGGCAAA AGCTGTTTAT CCAGAGGAAT ACAAGGATTT AAAAGATGAG 
TAAAAAAACG TGTCTCATAC TGATGCTCTT GATTAGTATG AGCATGCTCT TTTTGGGTAT 
TAAAATTGGA AGTGTTCAGA TTTCGTTAAC GGATTTGCTT CAAATTTTAG TGAAAAAGAC 
GGTTTCAGAA GATGGGATTT TAGAAGGAAT CATCTGGAAT GTTCGGTTAC CGCGTGTCGT 
GATGGCCTAT CTCGTGGGCG CAGGCTTAGC AGTAAGTGGC ACGGTGATGC AATCGCTCTT 
AGGTAATCCT -TTAGCTTCTT CGTATACTTT GGGCGTATCA GCGGGAGCAT CCTTAGGTGC 
TGCGTTAATC ATGGTAACTG GAGTAACTGC TTCGATCTTA GGGGCATTTT TATtGCCTTT 
AACTGGTTTT ATCTTTGGCT TAGCAACGGT CTTTTTAGTT TTGTTATTTA CGCAAACAAT 
GGATAGTCAA ATGTCCAATC AAACAGTTGT TTTAGTAGGA ATGATTATGA CCTTGTTTGT 
CGGTGCTATT TTGACGTTGA TTACAGCACT TTTCCAAGAT TATTTGAAAC AATTAGTCTT 
TTGGCAAATG GGTAGTTTTT CTGGCAGTAA TTGGCAAAAA ATTGCTATTT ATTGTCCAAT 
CTTGCTTGTC. AGCAGTCTGT TTCTTTGGTT TGATGCCAAC GCATTGGATG TTCTCGGTTT 
AGGAGAAGAA CATGCGATGT TAGCGGGAGT AGAAGTCAAA ACAGCAAAAC TACGCATTAT 
TCTTTTGGCT AGTTTATTGG CAGGAAGTGC GGTTTCATTT GTCGGTGTGA TTGGTTTTGT 



3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3467 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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TGATTTAATA 


GCACCACATA TTGTCCGTCG CTATTTAGGG GCGACTCATC 


GCTGGCTGAT 


1140 


TCCCGGTTCA 


GCAATCTTAG GTGGGACCAT CATGGTACTA GGGGACACGA 


TTGCCCGCAC 


1200 


GATTTTATCA CCAAGAGAAA TTCCCATCGG GGCCGTAACA GCCTTGATTG GAGCGCCATT 


1260 


CTTCTTGTAT 


ATTTACTTTA AAAAAAGGGG CGTTGCTGCA TGATTGAATT 


AAAAAAAACA 


1320 


GCTGTAACGC 


TCCAACAAAC CCCCATTTTA AAAGATATTT CCCTCGTTTT 


TCCGACAGGT 


1380 


AGTAAAACTT 


GTATTTTAGG TCCAAATGGT TGCGGTAAAA CGACACTTTT 


AAAAACGATT 


1440 


GCCGGACTTG 


TCCCTTATAG TGGTTCTGTT TTAATTGATG GTCAAGAAGT 


TCATGGACAA 


1500 


AAGCGGAAGG 


AATTGGCAAC GAAAGTAGCG ATGATGAGTC AATTCACGAC 


AGTTGCTTTT 


1560 


GATTATACGG 


TTTACGAAAC AGTTTTGATG GGGGCGTATC GACAACAAGC 


ACAACGTTTT 


1620 


TTACCTATTG TTTCtAAmCA rGAAAArGAA CGAGTCCTTw ATwATTtAGA ACGCACGGGT 


1680 


TTATTGCCTT 


TAAAAGATAA roGTTGTCAAT CAGCTTTCAG GAGGACAACA ACAACGAGTA 


1740 


TTTTTGGCTA 


AATTATTTGT ACAAGATCCA GAAATTATTT TATTGGATGA ACCCmATAAT 


1800 


CATTTGGaTA 


TTCGGTATCA ACAAGAATTG ATTCAGCAGT TAAATGAGTG 


GAGTGCGCAA 


1860 


GAAGGGAAAA 


CGTTAATCGG GGTGTTTCAT GATATCCGTT TAGCGTTAAC 


GCTAAGTGAA 


1920 


AAAATTGTGT 


TTATGAAGCA AGGGAAAGTG GCCGCGCAAG GTGATTTTCA 


AACGTTGGCT 


1980 


TCCAAAGAAT 


TTTTACAAAC GATTTTTGAA ACAGATATTG TCTCCTATTT 


TCAGAAACAA 


2040 


CATAAAGTTT 


GGGAAACTAT TCAATAATTT TGACAAGTTT TTTGGAAACA 


GGTAAGATAA 


2100 


AGGCAAAGAA 


TCAGAAACAC AATGGCGTGT TTTAACAAAT CGGCAAAGGA 


GCCCAAGACT 


2160 


AAGTACGTGA 


TGGTACTTGC TTTGGGCTCC TTTTTTATGG AGGTAACGAT 


GAAACGAATC 


2220 


ATTTTAATGG 


GAGCTATCGG TTGTGGCAAA ACCACGTTAT GTCAAGCTCT 


GCAAGGCAAA 


2280 


GAATTAATTT 


ATGATAAAAC ACAGGCGGTG GAATTTCATA CAGAAATGAT 


TGATACACCA 


2340 


GGCGAGTTTA 


TTTTGCATCG ACAATATTAT AATGCATTAA ACGTCACAGC 


GGCAGAAGCA 


2400 


GATGTGATTG 


GtTTGGTTCA AAGTGCTGTA GAAACACAAC AAGTTTTTTC 


ACCAGGATTT 


24 60 


GGTAGTATTT 


TCCCGAAAGA AATCATTGGT ATTTTAACCA AGATTGATTT AGCACAAGAT 


2520 


TCTCAGCAGT 


TAGAGATTGT TCGGCAACAA TTAAAAAGTG CGGGTGCAaC 


AAGAATTTTT 


2580 


GAAATTTCTT 


CTGTTGAAAA AATTGGCTTG CAAGAATTAG TAGATTATTT 


AGAGGAGGAT 


2640 


GAAGCAGAAT 


GAAAATTTAT ACAAAAACTG GCGATAAGGG AATGACGAAA 


TTAGTCGGCA 


2700 


GCTCAACCGT 


CGCTAAGGAT TCTGATCGAG TTGAAAGTTA TGGAACGATT 


GaTGAATTGA 


2760 


ATTCTTGGGT 


GGGCTATATT ATCAGCCAAC TTCCACAAGA AAATCAAGAA ATTAAAGAAG 


2820 


AACTAGAAGC 


TTTACAACAT TTACTTTTTG ATGCGGGAAC AGATTTGTCA 


ACACCGATTG 


2880 


AAGCACAACG 


ACCATTTAAA TTACAAAAGG CAAGTGTGCA TTGGCTAGAA 


CAGCGGATTG 


2940 


ATTTCTACAC 


TGCACAATCA CCAGATATTG ATCGTTTTAT TTTACCTGGC 


GGAACCCCAG 


3000 


CA6CCAGTAT 


GGTTCATGTG GCTCGGACGA TTGCACGGCG CGCAGAACGC 


ATCATTGTTC 


3060 
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GTTTAAATTG GACAGCCAAA ATTAACGAAG AGGTATTAAT TTTTACCAAT CGTTTATCAG 3120 

ATTATTTTTA TGCGTTGGCT CGCTGTTTAA ATGTTCAAGC ACAACGACCT GATGTTTTTT 3180 

ATGAACGAAG CGAAATGGTT TTCCATAAAA TTAAAGAAGA TGGATTGTAA ACAGTTGCTT 3240 

TTCTAAAAAA CTTATGGTAA ATTTCAGATA GaTTAAaCGA ATATTTTATG AaAAATATGA 3300 

ATGGAAGCCA GTGAGAATCT GGCACGGTCC CGCCACTGTG AAGAAGCAAG GTTGCTTTTA 3360 

AGTCAGGTCT TTTCATTTTT TCATTATTGG GCATGCTGTT TCGAGGCAAA ACAGGATGTT 3420 

TCTTAACAAC GCTTGTGrTG AAAATCCAGC CCAAGATATT TGTATTAATC CAATTAATGG 3480 

CACGAGGAAG TGCTTTTGGG AATTTTGGGC TGGTCTTTCA ATGTTGAAGG AGATTTCAGG 3540 

TACAATGACG TACTGCTATA TTCCCTTTTT TTATTGACAA TTAATTAAAG GCGTTGTACT 3600 

CTATGGATAA GGTTTCGTGT ACAATGGCGT ATACATAAGG AAGCAAAGAC GCTTCAGACA 3660 

GATGATTTTT CGTTTGTTTG AAGTGTCTTT TTGTATTTTA TTAAGGAGGA ATTGAGATGA 3720 

AAACAATTCA TTTCCCTACC GAACTATGGG TAGGAGAAGG CGCATTAGCC AATTTGGAAA 3780 

CACTTCATGA TAGACGTGTT TTCATTGTCA CAGATCCATT CATGGTTGAC TCTGGATTTG 3840 

TGAATGAAGT GACAAAGCAT TTAACAAAAA GTGAATGGCA AATTTTTAGT GATATCATCC 3900 

CCGATCCACC AATTGATAAG ATTGCTGCGG GAATTAAGCA TTTAGCGACA TTTCAAGGAG 3960 

ATACGATATT AGCTTTAGGT GGGGGATCCG CTATTGATGC GGCCAAAGCA ATGAAATTTT 4020 

TTGGCAAACG TACTTTGCAA ACTCAGATTG CTGAGTTTAT TGCGGTTCCT ACAACTAGTG 4080 

GAACTGGTTC AGAAGTTACG AATTTTTCAG TGATTACGGT TGCTGAAACT GGCACCAAAA 4140 

TCCCGCTTGT AACGGATGAA ATTCAACCAG AAATTGCGAT TTTAGATACA AATTTAGTGA 4200 

TGAGTGTCCC ACCAAAAArC ACAGCGGATA CAGGAATGGA CGTCTTAACG CATGTTATTG 4260 

AAGCGTATGT TTCAACAGAA GCAAATCCGA TTTCAGATGC GTTGTGCGAA AAAGTCGTTC 4320 

GCTTAGTATT CGACAATTTA GAAATAGCCT TTAATGAGGG TAGCAACCAA CAAGCCCGTG 4380 

AAAATATGCA TTTAGCTTCA TGTATGGCAG GGATGGCGTT TAATGTTACT tCGTTAGGCT 4440 

TGAATCACGG TATTGCTCAT GCGGCCGGTG CACGTTTACA TGTGCCACAT GGACGAATGA 4500 

ATGCAATGCT CTTACCTGAA GTCATTGCAT ATAACAGTGG CTTAGCGAAT GGCAAAGTCA 4560 

CCAACGAACC AACAGCTAAA CGTTATGCGC AGTTAGCAAA TTGTCTAAAT GATACACAGA 4620 

CAACCAATGC ACGGATTGGT GTTCAACAAT TTATTCGACA AATCAAGCAA CTCCGTCAAA 4680 

AATTAAACAT GCCTGCCACT TTTAGTGACT ATGGTTTACC AAAAGAAGAA GTCCAGGCTG 4740 

CAATCCCAAA AATAGCTGAA GGCGCATTGA TGGATGGCTG CACCAAAACA AATCCTGTCC 4800 

AACCAACAGC AGCAGAAGTG ACAAAAATTT TAAATTCTAT CTTATAAGCG ATTGACAGGA 4860 

TGAAATCGAT ArCATATAaT GTGGGTAGCA CAACGGCGTG CTTCmAAATT TAAGAGCAAA 4 920 

GAAGCTCCTT AGTAGAAAGC GAAAGTGGAA CCTTTACGTT TTTTATTGAG GATTTTTTAT 4 980 

TTGCTAAATT AAAGGAGCGA GTCTGGTTGG AAGAAAAACA ACGAATGATT CAAGAATATG 5040 
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TTCCTGGGAA ACAAGTAACG TTAGCGCACA TCATTGCTAG TCCTAATAAA GAAATTTATA 5100 

CAAAATTAGG ATTGCCAGAA GGCACCAGCA ATGCACTAGG TATTTTGACA ATCACTCCGA 5160 

GCGAagnCCA ATTATCGCTG TTGATATTGC GACTAAAAGC GGAGATATCC AAATCGGTTT 5220 

TATTGATCGT TTTTCTGGAT CGGTAGTCAT TTCTGGTGAC GTAAGCTCAG TGGAAGCAGC 5280 

CTTACAAGCG GTGATTGaAG GTTTACAACA GATTTTAAAC TTTTCTGTAA CTTGTAAAAT 5340 

TACCAGAACA TAGCGTTGAA AGGACAAGGG CGTGAAGAAA ATGGATGGAC GAATTGTAAT 5400 

AGTCGATGAT GAACCTATTA CGAGACTCGA TATTCGTGAT ATTGTGATAG AAGCAGGGTA 5460 

CGAAGTAGTC GGTGAAGCAG CTGATGGTTT TGAAGCAATT GAAGTATGTA AAAAAACACA 5520 

ACCTGACCTA GTATTAATGG ACATTCAAAT GCCAATTTTA GATGGTCTAA AAGCAGGTAA 5580 

AAAAATTGTT CAAGACCAAC TAGCCAGTAG TATTGTCTTT TTATCTGCAT ACAGTGACGT 5640 

ACAAAATACG GACAAGGCTA AAAAATTAGG CGCACTGGGT TATTTAGTTA AACCTCTAGA 5700 

CGAAAAATCA TTAATACCTA CAATTGAAAT GAGCATTGAA CGAGGCAAAC AAACGCAGCT 5760 

GTTACTAAGT CAAATCGATA AATTAAGTTT AAAATTAGAA GAACGTAAAA TTATCGAAAA 5820 

AGCCAAAGGT ATTCTTGTAA AAGAAAATCA TATATCGGAA GAAGAAGCCT ACCAAATGTT 5880 

GCGTACGTTA AGTATGAACA AACGCGCACG TATGAGTGAA ATTGCAGAaT TGATTGTAAT 594 0 

GGATGATGAA TAAATGAAAC GATTAGAGCA ATTATGTCAC CAATATACCA ATCTTTCAGA 6000 

ATCAGACATT AAAGAATTAC AACGTACAGC ACGTTATCTT TCTTCGACGA CACTCTATCa 6060 

AAGTGCAGAT GTTTTTATTG ATGTGTATAA GGAAATGTCA CAGCAAGCAC TAGTCGTTTA 6120 

TCATAAGCCG CCTGCGAAGA CAACTTCTTT ATATAGTGGA GATGTCGTCG GGATGGAAgC 6180 

TTTATTAAAA AATGAACCTG GTGTTCTGCG TACCATGCAA ACTAGCtTAA ATAGCATTGG 624 0 

TTtGTTAGCA GTGACTCAGG AAAATCGTTT GATTAAGCM AATATCTATC CCATTCGCAA 6300 

TGAACATCGA ACGATTGGTG TCATTATTGT CGAGATTGCG GCAGATGAAG AGATTCAAGC 6360 

GGACTTACAA AAAGAGGAGC TAAATAACTG CCAGTTAGCA AAAGTAGCGA AATCAACTAG 6420 

TCAAGTCGAC GCATTATTTA TTGATCAATT GGCAGAAGCC GTTTTGATTT TTGATGCGGC 6480 

GGGTCACTTA TTAATTACCA ATCACAATGC ACAAGAGCTT TATCGCAAAC TTGGTTATCG 6540 

GGATAATATC ATTGGAATGA GTTATGACAA TTTGTCCATC GATTATACAA CTTTTGAATA 6600 

TGTGTTGTAT CAAATGAAAT ATAAAATGAG TAATCAACCA ATTGAAAGCA AAACGACCTA 6660 

CCTTAATTAT TATTTTAAGG TTCGGAAAGT GTGGTTGGCA TCAGAAGAAC AACTGATTAT 6720 

GATTATTCAA GACAATACAG AATTTAAAGA AAAAGAAGCG GAAATTATTT CCAAATCTGT 6780 

TGcGATTCGT GAAATTCATC ATCGTGTTAA AAATAACTTG CAATCGGTGG kTTCCTTATT 684 0 

GCGTATTCAA.GAACnGGCGA ACGCAAAGCC CCGAAGCAAA GAAAGTCCTT CATGAAAGTG 6900 

TCAATCGAAT TATGGCGATT GCAGCCACCC ATGAATTGCT GTCAAAACAA GTCAAAGATG 6960 

ACGTTGCTTT ACGGCAAACA TTGGAAGCAG TGATGTATAA TTTTAGACAT CTTTTTCAAG 7020 
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GCGCACAACC CATTGAGATG ATGATGGATG TTGATCCAGC CATTATGGTT TCTAGCGAAC 7080 

AAATGGTTAC CATTTCCCTT GTTGTCAACG AACTATTACA AAACATTTTT GATCATGCAT 7140 

TTGAGCCACA AACAAGTGGC GTTGTTAAAC TTAGTGGAAC ATTAGATAAT AAAATGATTA 7200 

CCATTACTGT TACAGATAAT GGTAAAGGCT ATGATGTTCA TCAAAGTAAC GAGACCAGTT 7260 

TAGGTTTGAT GATTGTCAAA AGCTACGTAA AAGATAAACT AAAAGGAAAA ATTACAATTG 7320 

AATCCAATAA ACAGGGAACT AAAACCTGTT TCTATTTCGA ACAGAACACC AGTGATGTTG 7380 

TTCATTGATT AAGCAAAGGC GCTTAAAGAA AAGTAGAAAG AAATTTTCTA ACGATTCTTT 7440 

AAGTGCCTTT TTGTATGCAA AAGGAGGCGA GTCAGGCGAT GTCGAAGG7A ACACTTTTAA 7500 

CCGTAGGAAT TGATTTAGGG ACGTCTACAA CCCAATTAGT CTTGTCAGAA TTAACTGTTG 7560 

AAAATTTTGC CTCTGCGTTT ACAGTACCAC GTATCAGCAT TTCTGATAAA AAAGTCATTT 7620 

ATCGAAGTGA CATTATCTTT ACACCCTTAC TTAACCAATC AGAAATTGAT GCAGAACCTA 7680 

TCAAAGCGTT TGTAGCGGAG CAATATCGCC AAGCAGGCAT TCATAAACAA GATATTCAAA 7740 

TGGGTGCGGT AATTATCACA GGTGAAACTG CTCGAAAAAG CAATGCCAAC AATGTGCTAC 7800 

GAGCACTTAG TGGCTATGCA GGTGATTTTG TCGTTGCTAC GGCTGGTCCT GATTTAGAAA 7860 

GTATCATTGC CGGGAAAGGC GCTGGTGCTC AAACTTATTC TGAAACGAAA CGCT^CCAG 7920 

TGGTTAATTT AGACATTGGT GGTGGAACGA CAAATTTAGC TGTCTTTAAA GATGGCGAAg 7980 

TGATTGACAC GGCTTGTTTT GACATTGGCG GACGCTTAAT TAAATTAGAT CAACAACAAA 8040 

AAATCACCTA TATAGCTCCG AAGATTCAAG AAATTATAAA TAAAAAAGGG CTAACCTTGC 8100 

ACCTTGGTGA CCAAGCCACC GAACAAAACT TACTGCCAAT TATTTCgGAA TTAGTTGCCG 8160 

TGTTAGAAAA TAGCATTGGC TTGGGGACAC AAAGTCCGTT CTACCAGTTA TTAGTAACCA 8220 

ACCATCCACT ACGAAAAGGT GAGGAATTAC CAATTGTAAC ATTTTCTGGT GGTGTGGCTG 8280 

ATTGCTTAAA TACGACAAGT ACGAATCTCT TTAAATATGG GGACATCGGC TTACTTTTAG 8340 

GGAAATACTT GAGAAAGTCA CTGATTTTTT CGGAAAAAGA AGTCTTAGAA AGCGCAGAAA 8400 

CAATCCGAGC GACGGTCGTC GGAGCAGGTT CACACACTGC TGAAATTAGC GGAAGCACGA 84 60 

TCGCTTATCG TGAGCAAATT TTGCCGGTAA J\AAATATCCC AATTCTAAAA TTAGCGCAAG 8520 

AGGATGAAAC CCTTACTGTG ACAGAATTAG GCCAGCGAAT CCAAGAAAAA CTCAATTGGC 8580 

ATCGTATTGA AGAAACACCC CAGATAGCCC TAGCAATTAG AGGCATGAGC AATCCAACAT 8640 

TTGCTGATAT TCAACGCTAT GGTCAAGGGA TTGTGGAAGG ATTAGCTAGT TTAGTTGCTG 8700 

AACAAATTCC CATTATTGTT ATGGTGGATG AAGACATGGC GAAAGCTTTA GGGCATGCCT 8760 

TAAGTGCCCA TCTGCCAAAA GACTATCCGT TTATCTGTCT TGATTCGGTC AAAGTAGAAA 8820 

ACGGTGATTA TGTAGATATC GGTTTACCAG TGGCAGAAGG AGCCGTCCTT CCAGTCATTG 8880 

TTAAGACCTT AGTATTTAAT TAAAAAGGAG CGACACAAAG ATGATTTTAA AAACCAAACT 8940 

ATTTGGCAAA GTCTATCAAT TTACATCCGT TAAAGAAGTC TTAGCCAAAG CCAATGAAGA 9000 
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AAAGTCTGGC GACAAGTTAG CTGGCGTAgC TGCTAATTCA GCCGAAGAAC 


GCGTGGCAGC 


9060 


AAAAGTCGTT TTATCGGAAT TGTCCTTAAA TGACTTGTTT AACAATCCCG 


TCGTTGrTTA 


9120 


TGACGAAGAC GAAGTGACAC GCATTATTAT TGACCAAGTC aATATGCGAA 


TTTTTGAAAG 


9180 


TATTAAACAT TGGACAGTTG CAGAATTACG CGAATTTATC TTATCTTCGG 


AAACaACCGA 


9240 


TTTTGATATC AAACGAATTT CCCgTGGACT CACTTCAGAA ATGATTGCTG 


CTGTCTGTAA 


9300 


ATTAATGTCC AACATGGATC TAATTGTTGG CGCAAAAAAA ATCAATATTG 


AAAAAACCGC 


9360 


CAACACAACA ATTGGCCGTC CCGGAACTTT TTCAAATCGT TTACAGCCAA ACCATCCAAC 


9420 


CGATAATGTC GATGGCATTA TGGCATCAGT TATGGAAGGT CTTTCTTATG 


GCGCCGGTGA 


9480 


TGCCTTAATT GGTTTGAATC CTGTTGATGA TTCTACAGAA AGTGTTAAAC GGATCTTAAA 


9540 


CAAATTTGAA GAGTTCCGCA GTGAATGGGA AATTCCAACA CAAACTTGCG 


TGCTTGCTCA 


9600 


CGTAACGACA CAAATGGAAG CCATGCGTCA AGGCGCGCCA ACTGGCTTAG TCTTTCAATC 


9660 


CATCGCTGGT TCAGAAAAAG GAAACACTGC ATTTGGTTTA AATGCAGAAA 


TTTTAGCTGA 


9720 


AGCACAAGAT TTAGCGCTTC ACAGTGGTCA AGCAGCTGGT CCTAATGTGA 


TGTATTTTGA 


9780 


AACAGGTCAA GGCTCAGAAC TTTCTTCTGA AGCGAACTTT GGCGCAGACC 


AAGTCACAAT 


9840 


GGAAGCGCGT TGTTATGGCT TAGCCAAAAA ATTTGATCCG TATATTGTAA 


ATACCGTTGT 


9900 


CGGCTTTATT GGACCAGAAT ATCTTTACGA TTCAAAACAA GTCATCCGCG 


CTGGTTTAGA 


9960 


AGATCATTTC ATGGGCAAAT TAACTGGAAT TyCAATGGGC TGTGATGTCT 


GCTACACCAA 


10020 


CCATATGAAA GCTGATCAAA ATGATATGGA AAACTTAGCA ATGTTATTAG 


CAACAGCTGG 


10080 


TTGTACGTAT ATCATGGGGA TTCCTCATGG CGATGATGTC ATGTTAAATT 


ATCAAACAAC 


10140 


TGGGTTCCAT GAAACTGCGA CGATTCGTGA AACATTAGGC TTACGTCCAA 


TTAAAGAATT 


10200 


TGAAGAATGG ATGGAAAAAA TGGGCTTAAT GGAAAACGGA AAATTAACAA 


GTCGCGCTGG 


10260 


CGATGCATCT GTATTTATTA AATAAACGAG GTGAGAATAC AAATGAACGA 


AAAAGAATTA 


10320 


AAAGAAATGA TTGCCGGAAT TTTAACAGAA ATGGTCGCTG ACAATCAAGC 


GGTTTCGACT 


10380 


GCCACTGTGA CTGCTGAAGA AAAACCAGTC ACAACACACG TCACTGAAAC 


GACAGAAATT 


10440 


GAAGAAGGCC TAATTCCTGA CATTACTGAA GTAGATTTAC GTAAACAATT 


ACTATTGAAA 


10500 


AATGCCGTAG ATCCTGAAGC ACTATTAAAA ATGAAAGCTT TCTCGCCAGC 


TCGTTTAGGC 


10560 


GTTGGTCGTG CAGGGACGCG TTATATGACT AGTTCAACAT TACGTTTCCG 


AGCAGACCAT 


10620 


GCGGCAGCAC AAGATGCAGT CTTTTCAGAC GTAAGCGAAG ACTTGGTAAA 


AGAAATGAAT 


10680 


TTTATTTCTA CAAAAACAAT CTGTAACAGT AAAGATGAAT ATTTAACACG 


TCCAGATTAT 


10740 


GGTCGTCAAT TTGATGAAGA AAATAGTGAA ATCATTCGGA AAAATACGAC 


ACCAAAAGCC 


10800 


AAAATCCAAA- TGGTCGTAGG GGATGGCCTT AGTTCAGCTG CTATCGAAGC 


GAATATTAAA 


10860 


GAAGTCCTAC CaGCmATTAA ACmAGGATTA AACATGTACm ACTTaGATTT 


CGACmATGTA 


10920 


GkTTTTGtTA AAtATTGtCg CGTGcCAGCA ATGGaTaAAA TTGGTGAAAT 


CACTGACGCT 


10980 
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GATGTTGTTT 


GCTTGCTAGT 


TGGCGAACGT 


CCTGGTTTAG 


TAACAGCTGA 


ATCTATGAGT 


11040 


GCCTACATTG 


CTTACAAACC 


AACTGTCGGT 


ATGCCAGAAG 


CACGGCGGAC 


AGTTATTTCA 


11100 


AATATTCACA 


AAGGTGGGAC 


ACCGGCTGTT 


GAAGCAGGGG 


CCTACATTGC 


CGAAATTATT 


11160 


AAAAAAATGT 


TAGACAAGAA AAAATCTGGT 


ATTGATTTAA 


AAGAAGCTGA 


ATAAAGGAGG 


11220 


ATTTCTCATG 


AAAAATGATC 


GTCTAGGCGC 


AAATGTCCTA 


AGTATGAAAG 


TCATTCCCAA 


11280 


TGTCGATCCG GCATTAGCAA AAGAATTATC ATTGAAACCA 


GAACATCGCT 


CATTAGGAAT 


11340 


TGTTACTTCC 


GATTGTGATG 


ATGTCACTTA 


TGTTGCATTA 


GATGAAGCAA 


CCAAAGCCGC 


11400 


CGATGTTACT GTAGTTTACG GTAAAAGTAT GTATGCTGGC GCAgCAAATG CTTCAACCAA 


11460 


ATTAGCCGGT 


GAAGTGATTG 


GCATTATTGC 


TGGTCCGAGT 


CCTGCCGAAG 


TCAATAGTGG 


11520 


CTTGAGTGTC 


ATTACGCAAG 


TCATTGAAGA AGAAGCCAGC 


TTCTATAGTG 


CTAACGAAGA 


11580 


TGACAGCATT 


GTTTATTTTG 


CACATGTAGT ATCAAGAACG 


GGTAGCTTTT 


TATCTGAACA 


11640 


AGCGAATATT 


CCTGAAGGTG 


AAGCGATTGC 


TTATCTGATt 


GCACCACCGC 


TAGAAGCAAT 


11700 


GGTTGGCTTA 


GATGCAGCAA 


TGAAAGCTGC 


CGATGTTCAA 


ATGGGTGTCT 


TTTATGGCCC 


11760 


ACCTTCTGAA 


ACGAACTTTG 


GCGGCGCCTT 


ATTGACAGGT 


AGTCAATCTG 


CATGTAAAGC 


11820 


TGCTTGCAGT 


GCGTTTGAAC 


AAGTCATTCA 


AAATATCGCT 


GACAATCCGT 


TGAGTTATTA 


11880 


AGGATGTGAC 


CTGAATGAAA 


GCATTAGGAA 


TGGTCGAAGT 


CAAAGGATTT 


TTAGGTGCCA 


11940 


TTAGTGTGGC 


CGATGCGGCA 


TTGAAAGCCG 


CCGACGTTAC 


TTTGTTAAAA 


GCTGAAATTA 


12000 


TTAATGGCGG 


TTTAACAACT 


GTGGAACTGA 


TTGGTGACGT 


TGCCGCGGTT 


C/^GCTGCCg 


12060 


TTGAAGTAGG 


TACGGAAGTA 


GCTAAAGAAT 


TAAATTGTTT 


AATTGCTCAT 


CATGTGATCT 


12120 


CACGGGTTGA 


TGCTCAAACG 


GAAGTGATTT 


TATCAGATCC 


AGAACCCAAG 


TCAGCACCTG 


12180 


AACCGATGGA ACAA6TAGAA 


GTAATTGAAG 


AAGAGATTGA 


AACACCAGAT 


TTGGAAGAAC 


12240 


ACACAGGCAC 


ACGCCAAAAA 


TTAGAAACAC 


AACGCGTGGT 


TGATTTACGC 


AAACAAGCCT 


12300 


ACAAAATGAA CCTAAGTAGT TTAAAGAAAA GTGAGATTAA ATTTGCCAAT AAAGAAGCTT 


12360 


TAATTCAAGC 


AATCATGGCT 


GAAATAGAAA GAAGTGAAGA 


CGAGTGGAAT 


TAGTAGATAA 


12420 


AGACTTACGT TCCATCCAAG AAACAAGAAA TTTAATCAGA ^U\AGCCAAAG AGGCACAGCA 


12480 


AGTATTAGCA 


ACATTTTCGC 


AAAAACAAAT 


CGATGCGATT 


GTTCAAGCGG 


TTTCAGAGGC 


12540 


TACATTTAAC 


CAACGAGAAA AGTTAGCGAA AATGGCCAAT 


GAAGAAACAG 


GATTTGGTAT 


12600 


CTATGAAGAT 


AAAATCATCA 


AAAATGCTTT 


CGCTTCAAAA GTAGTATATG 


ACGAAATGAA 


12660 


GGACAAAGCT 


ACAGTTGGTG 


TGATCCATGA 


TGATGCTGCC 


AAAAAAGTCA 


CTGAAATTGC 


12720 


TGTCCCAGTG 


GGTGT7VATTG 


CAGGATTAAT 


TCCTTCTACC 


AACCCGACAT 


CAACAGTCAT 


12780 


TTACAAAGCA. TTAATTTCAT 


TAAAAGCTGC 


CAACAGCATT 


GTTTTTTCAC 


CACATCCCAA 


12840 


TGCGTTGAAA 


TCGATTATTG 


AAACAGTGGA 


AATTATcCAA 


AAAGCGGCTA 


TTGCTGCAGG 


12900 


GGCACCAGAA 


GGTTGTGTCT 


CGGTCATCAA 


AACGCCAACC 


ATGCAAGCGA 


CCAGCGAATT 


12960 



wo 98/50555 PCTAJS98/08985 

368 



AATGAAAAAT 


AAAGAAACAA 


ATTTAATTTT AGCTACTGGT 


GGAAACGCAA TGGTTAAAGC 


13020 


CGCGTACTCT 


TCTGGAACAC 


CAGCAATCGG TGTTGGGCCT 


GGTAATGGAC CAGCCTACAT 


13080 


CGAACGAAGC 


GCAAATGTTC 


CTCATGCAGT GAAACAAATA 


ATGGATTCAA AAACTTTCGA 


13140 


TAACGGTACA ATTTGTGCAT 


CGGAACAATC AATTATTGTT 


GAAACTGTCA ACCGTGAAGC 


13200 


GGTCaAAGAA GAGTTT^TTA AGCAGGGAGC GTATTTCTTA 


AGTCCaGCCG AAGCCGACAA 


13260 


ATTAGCTAAA 


TTTATTTTAC 


GACCAAACGG TACAATGAAC 


CCGCAAATCG TGGGACGTTC 


13320 


CGTTCAACAT ATTGCTTCAT TAGTTGGTTT ATCTATTCCA AAAGATCGCC GCTTAATTGT 


13380 


TGCCGAAGAA ACAcATGTAG GCTTGAAATA TCCATTTTCA AGnGrAAAAT TGGCACCGaT 


13440 


TATTGCTTTC 


TATACAGTTG 


AAAACTGGGA AGcAsTTGCG 


CGTTATCAAT TGAAATTTTG 


13500 


AAAGGTGAAG 


GTGCGGGTCA 


TACAATGGGC ATTCACACAG 


AAAACAAAGA AGTCATTCGT 


13560 


GAATTTGGCT 


TAAGAAAACC 


CGTCTCTCGT TTGCTTGTTA 


ACACCTCTGG TACACTTGGC 


13620 


GGCATTGGTG 


CTTCAACTAA 


CTTAGTGCCA GCATTAACAC 


TTGGCTGTGG CGCAGTTGGT 


13680 


GGCAGCTCAA 


CTTCTGATAA 


TATTGGGGTT GAAAATCTGT 


TCAATTTACG TCGTGTTGCA 


13740 


TATGGCGTGC 


GTGATTTAGA 


AGAGATTCGT CAAGAATTTG 


GTCAAACATC AACCACATCT 


13800 


GTGGCTACTT 


CTTGCGAAAC 


AACCAATCAA GAAGAACTAG 


TAAATGCCGT GGTAGCTCAA 


13860 


GTTTTAGCTC 


GCTTAAACTA 


ACAAGAAACG AGCGAGTAAG 


AGTAAAACAA ATAAATTATT 


13920 


GGATTTAGGG 


TTTGGGAOCT 


AAGTCAAATT GCTAATTGTC 


CCAAATTTTT AAAAATAAAA 


13980 


CATACATCTT 


AGGAGGAAAT 


TATAATGTCA AGTACAAATG 


CTTTAGGAAT GATCGAAACT 


14040 


CGTGGTTTAG 


TAGGTGCTGT 


TGAAGCAGCC GATGCAATGG 


TGAAAGCGGC AAACGTTACT 


14100 


TTAATTGGAA 


A7VGAACAAGT 


CGGTGGCGGT TTAGTTACAG 


TAATGGTTCG CGGTGACGTT 


14160 


GGTGCTGTAA AAGCAGCAAC 


AGATGCTGGT GCTGCTGCAG 


CAGAACGTGT TGGCGAATTA 


14220 


TTATCTGTCC 


ATGTAATTCC 


ACGTCCACAT ACAGAAGTAG 


ACGCTATTTT ACCACACGCA 


14280 


ACAACTGAAT 


TATAATTTTG 


ATTTTTTGGG GCAGAGGTAG 


GTTCATTACT TCTGTCCTAA 


14340 


ACTTTCATTG 


AGGAGGGGTG 


TTATGACAGT TATTACAGAA 


GATATGCTTC GCATCAAAAG 


14400 


GCTAAAAAAA 


GAACTAGAAG 


ATGGCACGGA TTTTGTGTTT 


CCTGTAGGCA GTTTTCTGAC 


14460 


GCCAGCCGCC 


AAAAGCTATC 


TACGCGAGCA TCGGATTAAT 


AGCAGCTTCT CTTCAACAAA 


14520 


CCCATTGCTT 


TCTGAGGTTC 


ATTCAGTGAA GGCCTCCTCA 


CAAAAACAAG AACATTTATC 


14580 


GAT^GAAaCA 


ATTTATGAGA 


TTCGGCACTT AAGCCATCTA 


TTGTATTTGC CTTTCTTAAC 


14640 


GAATGAAGCG 


TTATCAACAG 


AAGGTTGGTT ATATTTCGAA 


CAACAACAAC AGTGGTTAGA 


14700 


GAAATTTATT 


TTAGAACAGT 


CGCTTGTAAT GGAACCAGTC 


ATTTTGGAAT CAAAGGTAAC 


14760 


TATTCTAACT 


AACCAACAGA 


GACAATGGCG CTATTCCTCA AAAGAAATTC AATTACAACT 


14820 


AGATAAAATT 


ATTCTCCAGT 


TAAAAGGAGA ATCACAACTG 


TTCGGACTTT TTCAAGCTTG 


14880 


GGCTACAGAA TTAATTAGCA CCATGGCTGA AATATCACGA AACTAGGAGA TGTUVCAAATT 


14940 
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GGATTCAATG 
TACTTTTGAA 
ATTATTTGGA 
CGTTTGT/^ 
TTTAGGCCCT 
AGGAATTAAT 
GATGAACGGA 
TCACATGACA 
CAAAGTGGAA 
ATTTGCAACG 
AGCTCGTGGC 
AATTTGATTC 
GTGGCTTTAG 
GTAAATTGGG 
ATTTCCCAAG 
TTAGCATTCT 
7U\AAATTCAC 
CAACAGTTTG 
TGTCAAAAAC 
TGATAAGAAT 
CGAAAGATTA 
GAAAAGTGAC 
TTATGATGGT 
ACAACGCTGG 
TTTCGGCAGA 
CAGCGGAAGA 
TTCTTTTCAT 
AGCAATTTGA 
TCACTTTAGC 
TACTAGGGGC 
TTGCGTTAGC 
TAGGTGCAAT 
AAAAAGACGA 



AATGAAATTG 
GTTGAAGCTT 
CCAGGATATC 
GAACGAATTA 
GAACGTAGCG 
GCCCCAGTTA 
TCAGCAGTTG 
CCCGAAGACG 
GGCACACGTC 
TACATGCACA 
TACATTCTTA 
AACAGATTAC 
CAGAAGCAAA 
TAACCmAGGA 
TAAATGCGAT 
TATTTGAAGG 
GATTGAAGTA 
GGCTGATTTT 
AAtTCGTTCT 
GACAGAATCA 
TGCTAGGAAA 
TGGTAGCCTT 
TGATATTTTG 
AGCTGGTGTC 
AGATCCGAAT 
ATAGGGAGGA 
GATTATTGGA 
GGAAGGGATT 
GCCTGTCTTA 
TGATCCAGCA 
GCAACAATTA 
GATGGGACCA 
CCAACAATTC 



TTGAAGTU^GT 
CTGGCCGACA 
AACTAACAAA 
CGGTTGCTGG 
AATCACAAGT 
GAGAGAGTGG 
TGACATTAAG 
CCTTGAAAAA 
CGTTAATCTT 
TTGACTACGA 
AGTT^AAAGGA 
TGATCAAATT 
AGAAGCATTT 
GCAGAAGCTA 
TGCGCATTTA 
TAAGCCgGTA 
TCGTTTGAAA 
TTATCATGAT 
TTGGCTGAGA 
GGTGTTTCTT 
CATCAATTGC 
TGGTCAACAA 
AAXCAAGAGG 
GGGGACAAAG 
GTCCCAATTG 
CGAGACATGA 
GGAATCGATA 
ATGGCTATGG 
GCCAATCTTT 
ATGTTTGCCA 
GCCAATGATC 
ACGTTAGTCT 
TTAGCAACAG 



369 
TGTTAAACGA 
TGTTCATTTG 
AGTGAAAGAT 
ACCTAAAGGA 
CGAAGTTTCT 
CAAGACAGAA 
TCACGGCTTA 
TAAAGTAAGC 
TGATGATGTG 
TGAAGCCAAC 
GGCAAGACAA 
TGTCAACAAT 
CCTGAAGAAA 
GCACGAGGCT 
CAGGAAACGG 
TTGGTTTTAA 
CAAACGATTC 
TCAGAAAATT 
CCAAACGAAC 
TAAGTAAAAA 
TGACGTAAGG 
GAAAAGATGA 
ACGAAAAGCA 
TGTTAATTAG 
ATGCAATGAT 
GCATTAATGA 
AATGTATTGG 
GTTCTCTAGC 
TAAGTCCGAT 
CAACCTTATT 
CGCAAGCTGG 
TTACAATTCC 
GtGTTTTgAG 



ATTCAGCAAC 
TCTCGTCaAG 
TTATCACAAC 
CTCTTTCAAA 
ATGACGGATA 
GGGACACCTG 
ATTGTTGCCA 
AATAGCCAAA 
GTTGTGCGAA 
GCTTGCGGCT 
TGaACGTTTC 
TGCTGTCGTC 
TGTTTGTTCG 
TGGTTGTAAA 
ATGAATTAGT 
CCCCAATACC 
AGGAAAACGT 
ACGCTGTGTT 
CTACATTACT 
CGCCTATTTA 
AGGAAAAACG 
AAAATTAAAT 
AGGATTTTTA 
TCAAGGCAAT 
TGTTGGTGTC 
7\ATTATTATG 
CAACAAACTA 
GCTTTCTATG 
TGTCGTTCCT 
GGCCAATGAT 
TTTATTTGCG 
AGTAGCGCTT 
CGGGaTTATT 



AACAACAAAA 
AGATTGATGC 
CTGGACAGTT 
ATGTGGTTAT 
CGCGTATTTT 
GCGTCACGCT 
AACGTCATAT 
TTGTGCAAGT 
TCAGTCCGCG 
TAACCAAAGG 
TGrAATgGAA 
GTCTTATTCA 
TTTTCCTGAT 
GAGGCTAACG 
CAAAAACATC 
TTCAGTGAGT 
GGACATGTGC 
TCAGGCGGCC 
GAAAAACAAT 
ACCCCGTTAG 
ATGCTTTTAG . 
GGCTGGAAAT 
ATTGCTGCTG 
GCTGCCCGTA 
ATTGATTCTA 
TATATTATGG 
GGTTTAGGGG 
GTAGGGATTA 
GTCTATGAAT 
ATGGGCGGCT 
GGGGCAATCT 
GGCATyATCC 
ACaATTCCTT 



15000 

15060 

15120 

15180 

15240 

15300 

15360 

15420 

15480 

15540 

15600 

15660 

15720 

15780 

15840 

15900 

15960 

16020 

16080 

16140 

16200 

16260 

16320 

16380 

16440 

16500 

16560 

16620 

16680 

16740 

16800 

16860 

16920 
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TTGGTTTATT AGCAGGTGGT TTAACAGCTG GCATGCCATT GTCATTGATT ATTCCTAACT 16980 

wAATTCCAAT TATTATTGTT GCCGCTTTAA TTATTttAGG TTTATGGTTA GCGCCAAAGG 17040 

GCATGATTAA AGGCTTCCAA ATTTTTGGTC AAGGCGTAGT GATTGTCGCT ATCTTTGGTy 17100 

TAGTTGTGGG GGCGATTCAA CTATTATTAG GAATTACAGT GATTCCTGGC ATTGCACCAG 17160 

TTACTGAAGG AATTGAAGTC GTTGGCGGGA TTGCTTTAAC TTTAGCTGGC GCTTTCTGTT 17220 

TAGTAGCTGT GATTACGAAA GTCTTTAATA AACCATTGAT GAAACTAGGA AAAGTTTTAG 17280 

GAATGAATGA AGTAGCGGCT GCAGGAATGG TTGCTACATT AGCTAACAAT ATTCCCATGk 17340 

kCCAAATGTT AAAAGACATG GATTVATCGAG GGAAAATTAT TAATATCGCT TTTGCTGtTt 17400 

CGGCTTCTTT CGTTTTAGGT GACCACCTTG GTTTTACTGC AGGCGTACAA AAGAAATGAT 17460 

TTTTCCAATG ATTGTTGGCA AATTAGTTGG AGGGATTACC GCCATTTTTG TGGCTATCTT 17520 

TATGGCAAAT CGAATGCTAG GGAAACCCGA AACAGAAGTG AAATAATGAG GTGTAAATAT 17580 

GACAGAATTA AATCGTGAAA TGATTGAAAC GCTTGTTCGA CAAATTGTCA CAGAAAAATT 17640 

AATGCCGACT AAACAAGTGG ATCCAAGTGG AATTATGTCC ATCAAATTAC CCGAAATCGA 17700 

CGTAACAGAA GACGACCGCT TAGATACAGG CAACCCAGCG GACATTGTTT ACACAAAAGA 17760 

TCTGTTTTCT TTAGAAGAAA GTCCGCGCCT TGGTTGCGGT TTAATGGTGA TGAAAGATAC 17820 

CACCTTCGAC TGGACACTAG AATACGATGA AATTGATTAT ATTATTTCAG GACAATTAGA 17880 

TATACTGATT GACGGACGTA AAATTTCTGC AACAGCCGGC GAAATTATTT TAATTCCGAA 17940 

AAGTAGCAAA ATTCAATTTT CAGTGACGGG TGATGCCCGC TTTGTCTACG TGACTTATCC 18000 

AGCAGATTGG CAAAGCCAAT AATTATTTTA TCGGCAGTCA GAGAATAATT TTGCCAAAAC 18060 

CCTTTACTCA GCGTTATAAT AAATAAGAAC GAAAAAAGGG GTTGAGTGAA TGACATTTGA 18120 

GAATCCAAGT CAAAATCAAA TTTTTGATAT TTTAAAACAA GCAAAAAATA TTGCGGTGGT 18180 

TGGTTTGAGT AACAAGCCCG ACCGCACAAG TTACAAAATT GCGGAAATTT TACAGCAACA 18240 

TGGCTATCGT GTATTGCCTG TCAATCCAGT TTTAAAAGGT GAAACGGTGC TTGGCGAAAC 18300 

GGTTTATGGG ACATTAACAG AGATTCCTGA AAAGATTGAT ATTGTTGACG TCTTTCGGCG 18360 

GAGTGAATTT TTGCCAGAAG TGGCCGCAGA AACATTGAAA ACAGAAGCGC CTGTCTTTTG 18420 

GGCCCAACTA GGTTTGGAAA ACGAAGAGGC AGCGAACTTG TTATTAAATG AAGGTCGTCA 18480 

AGTTATTATG GACCGTTGTA TTAAAATTGA ATTGGCAAAA ATGGCAGATG CTGAATAAAC 18540 

ACGTGTAATT TTATAAAGAA AGAGCGTCCT GAACGGAAAG CTCTTTCTTT TTTGACCCAG 18600 

ATAGAGTATT ATTGTAATGA TGGTTTTAGA AGGAGCGAAC GGAAAACGCA AGAAACCGGC 18660 

TCACTCTTCT TCTAATTCTT TAAGrraTAG TTAATGTACG CTTATTAACT AAACGAGAAT 18720 

CTArGTATAA- GGAGTTTTTT CTTTGGCTAA AAAAATTAAC AATGAGTACA ATGATGCCTC 18780 

TATCCAAGTC CTAGAAGGAT TGGAAGCAGT GAGGAAAAGA CCTGGTATGT ATATCGGTTC 18840 

AACCGATAGC CGTGGACTCC ATCACTTAGT CTATGAAATT GTCGATAACG CAGTCGATGA 18900 
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AGCGTTATCT 
TGTTGCAr:Ar 
AGAAGTAATT 
TTCAGGCGGA 
AGTTCATATT 
AGTTGGTACC 
TTTACCTGAC 
TTTGAGAGAA 
GGAAGAACCC 
TCTAAATGAA 
TATTGAAGTT 
CGTTAACAAT 
GACG/^GGCG 
TTTAGAAGGT 
AAACTTACTA 
CGTTGTCGAT 
GAGTCAATCA 
AGCCCGTGAA 
AAATTAACAC 
GACTCGGCCG 
CTACGAGGAA 
GAAATCAATA 
GATTGTAATT 
CAAGTGTTGC 
GTCTACATTG 
ATTGAGTATG 
TATATGCTCC 
ACGATGGATC 
GAACGCCGTG 
CAACATGTTC 
ACTTCTGCAC 
ACCGATGACA 
CGCCAAGAAG 



GGCTACGGGA 
TCTGGCCGCG 
TTTACTGTCT 
CTCCATGGGG 
GTTCGTGATG 
TT7UUVAAAAA 
GACACCATTT 
TCAGCTTTCT 
AAAGAAGAAG 
GAAAAAGATA 
GAATTAGCTT 
GTCCGCACAA 
TACAATGAAT 
AGCGACTTCC 
CAGTTTGAGG 
AACGTCGTAG 
TTAATTCGGA 
GAAAGCCGGA 
CGGCTCAATC 
GTGGCTCAGC 
AAGTCATTAA 
CTATGATCTA 
ATGATAAAAT 
TATTAACGTT 
CCTTACCACC 
CTTGGACCGA 
AACGCTATAA 
CTGAAACTAG 
TGACCACATT 
AATTTACGTT 
CAACAGGAGA 
CTGAAATCAG 
TTCAAGAATT 



ATGAAATCAA 
GAATGCCAAC 
TACATGCGGG 
TAGGTGCCAG 
GCGTAGAATA 
TCGGCAAAAC 
TCTCTACGAC 
TATTAAAAGG 
TCTTCCACTA 
CATTAACACC 
ACCAATACAA 
AAGATGGCGG 
ATGCACGGAA 
GTGAAGGCTT 
GCCAAACCAA 
GCGAACAAAT 
AAGCGATTAA 
ACGtAAAAAA 
CAGAAATCCT 
AAAACAAGGG 
TACGGAAAAA 
TACGATTGGC 
CATCATCATG 
CTTCTATCGT 
ACTTTATAAA 
TGGTGAATTG 
AGGTCTGGGG 
AACCTTAATT 
AATGGGCGaC 
GGAAGAAGAT 
AAGTCTTTTA 
TTTGTTTGAC 
AACCCTTGAA 
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CGTAACTATC 
AGGGATGCAT 
TGGTAAGTTT 
TGTTGTTAAT 
CATGGAACGA 
AAAAAAACGC 
AAATTTTTCT 
CGTCAAAATT 
CG7UVGAAGGT 
CGTTGTTTAT 
CGATGGCTAT 
TACCCACGAA 
AGTGGGACTA 
AGCAGCCGTC 
AGGAAAATTA 
GGGCTTCTAT 
AGCCCGTGAA 
CGGAAAAAAG 
AAGAAAAATG 
CGCGACCGGA 
GCGAAAATGC 
GCAGGCGTTG 
ACCGATGCGG 
TATATGAAAC 
GTCTCTAAAG 
GCAGAAGTGA 
GAAATGAATG 
CGTGTTCGGA 
AAAGTCgAAC 
GGCAGTATTT 
GATGCGGAAA 
ATAGAGTAGG 
GAAGTAATGG 



GAAAAAGATA 
GCTTCTGGTA 
GGCCAAGGTG 
GCCTTATCTA 
TTTGAGGACG 
AACGGTACCT 
TACGAAATTT 
ACTTTAACAG 
ATTAAAGT^T 
TTTTCAGGAG 
TCTGAAAATG 
gTCGGTATGA 
TTAAAAGAAA 
CTTTCTATTC 
GGTACACCTT 
TTACAAGAAA 
GCCCGTGAAG 
GTGAATCGTT 
AACTATATCT 
AATTCCAAGC 
AaGACATCTT 
GTCCTGAATT 
ATACCGATGG 
CATTAATTGA 
GAACTGGAAA 
TTGATAAAGT 
CAGAACAACT 
TTGATGATGC 
CTCGTCGAAA 
TAGATCGTTC 
AAACAAAAGA 
GAGTGACAAT 
GCGATCGTTT 



ATAGTATTTG 
TCCCAACAGT 
GCTATAAAAC 
AGTGGTTAGA 
GCGGTAAACC 
CCGTTACTTT 
TAGCAGAACG 
ATGAACGTGG 
TTGTTGCCTA 
CCAAAGAAGG 
TTTTATCTTT 
AAACGTCAAT 
AAGATAAAAA 
GAGTACCAGA 
TAGCAAGAAC 
ACAGTGAAAT 
CGGCTCGTAA 
ACTTTCTGGA 
TGTCGAAGGA 
CATTTTACCG 
GAAAAACGAa 
TTCAATTGAA 
TGCCCATATT 
AGCCGGCAAA 
AAAATCAGTC 
TGGTAAAGGC 
TTGGGAAACA 
AGCTCT^GCA 
ATGGATTGAA 
AGAAGAAGAC 
AGCAGAACAA 
TTTGGAAAAA 
TGGAAGATAT 



18960 

19020 

19080 

19140 

19200 

19260 

19320 

19380 

19440 

19500 

19560 

19620 

19680 

19740 

19800 

19860 

19920 

19980 

20040 

20100 

20160 

20220 

20280 

20340 

20400 

20460 

20520 

20580 

20640 

20700 

20760 

20820 

20880 
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TCCAAATACA TCATTCAAGA ACGGGCCCTA CCAGATATTC GAGATGGGCT AAAACCAGTA 
CAACGCCGCA TTCTCTTTTC AATGAATAAA GACGGCAATA CCTTTGATAA AGGCTTTCGT 
AAATCAGCAA AATCTGTCGG AAACATTATG GGGAATTATC ATCCCCATGG CGACAGCAGT 
ATTTATGAAG CGATGGTGCG TCTAAGTCAA GACTGGAAAT TACGGGAAGT ACTAATTGAA 
ATGCAC 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3222 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



20940 

21000 
21060 
21120 
21126 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
nAGAGAAAnC CGGTAGCTTT TTACGGTTTT CTCTTTTTTG ACTTTCAAAG TCAAATCAGG 
TACGACTTCT TTTTAAACAG ATAAGaATGT GGTATCATTG TAAAGGACTA TATAATAAAG 
GAGATGGACG TTTTTTGACA GATAATCAGT CAAAATCTTT GGAATTAATA CTTACTGATT 
CTGATGATGC ACACATGCTT TTAGGAACAC ATGATAAACA TATCAAATTT TTAGAAGAGA 
ATACTCACGT AACGATTAAT AGTCGTGGCG AAGTTATTCA GTTAATTGGC GAATCATCAG 
AGGTAGAGTT AGTCGCTTCC GTGTTAAGAG CGCTACAAAC ATTAATTCAA CGAGGCATCA 
AAGTGCATAC GCCAGATGTC GTTTCTGCTT TAAAAATGGC CAAAGCTGGA AATTTAGAGG 
CGTTTATTGC CATGTATGAA GAAGAAATTA TGAAGGACCA TCATGGTCGG GCCATCCGAA 
TTAAGAATGT TGGTCAAAAA AAATATATCG ATGCAGTTAA AACTCATGAC GTCATTTTCG 
GGGTGGGACC TGCGGGGACA GGAAAAACCT TTTTAGCCGT GGTGATGGCA GTAGCAGCTT 
TAAAAAAAGG CGAAGTACAA AAAATTATTT TAACTCGTCC AGCGGTGGAA GCTGGCGAAA 
GCTTGGGCTT TTTACCAGGT GACTTAAAAG AAAAAGTCGA TCCTTATTTA CGTCCAGTTT 
ACGATGCCTT GTATCAAATT TTTGGTATGG ATCATACAAA TCGTTTAATG GAACGCGGCG 
TTATTGAAAT TGCCCCGCTT GCTTATATGC GTGGTCGTAC ATTGGATGAT GCTTTTGTCA 
TTCTAGATGA AGCACAAAAT ACCACGGTTG CCCAAATGAA AATGTTTTTA ACTCGTTTAG 
GGTTTAGTTC AAAAATGATT GTTAATGGAG ATACTAGTCA GATTGACTTG CCACGTGGTG 
TAACAAGTGG TTTAGTTAAC GCCGAGCGTA CGTTAAAAGA CATTGAAAAA ATTGCCTTTG 
TAAATTTTGA AGCCAGTGAC GTTGTGCGTC ATCCTGTTGT AGCACAAATT ATTCAAGCCT 
ATGAAAAAGA ACAACAAAAG CATTCCTAAG CAGTTAGGGA AGGAGTGAAA GTACCTTTTG 
AAAAAAGTAC TTACAAAGTT AGGGGATCGT TTAGGTAAAG CCTATTTACC TATTTTATTA 
GCCGTTTTTT CTCTTCTTTT ATTTATGATT ATGTTTGGCA GCGTCCATCA GAAAAGAGTA 
GAAATTAAAG AAGGTCAACT GGCCGAGAAA ACCAttCGTG CAAATAAAAA TATCGAGAAT 
ACCTACGAAA CAGAACAAAG AAAAAAATTG GCAGCAGAAG CTGTCACACC CGAATATATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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TATCAAGAAG ATACGGCATC TGTCCAACAT AATCGTATTG ATAAATTATT TAAATTGATT 1440 

GATTCTGCGA ATGAAAAAGT AGACAAAGAG TACAGCAACA AGCAAGCGT^ GGCCAAAAAG 1500 

GAAGAAACGA TTCCTGCGCC GACCGTGGAA GAACGAGTAG CTAGCTTAAA ATCaTTGTTT 1560 

GAATCTTTAC CGCAAGATGA AGTTACGTTT TATCaAAGTT TTCCTAATGT TTTTTATCaA 1620 

ACAATCTTTA CATTAACCTC TGAGCAATTG GATAAGGTTC GTTCAGAWVG TTTGATGTTG 1680 

GTGGATGACG CGATGCAAAA TCACGTTCGA GAATCTGATT TGGATAAAAT TCGCCAAGAA 1740 

GCTAACGGTA AAATTCAATA CTTAGATATT ACGAGTACCA TGCAACAAGT CATTCGTTAC 1800 

ATTGTTAACC AAGGGATTGT GGTAAACGAT ATTGCCAATG AAAAACGAAC AGAAGAgctA 18 60 

GGCCAAAAAG CGATGAATGC TGTTCAACCC GCAATGATTT ATCAAGGCGA AATTATTGTT 1920 

CGTGAAGGTA CTCAAATTGA TGCAAAAGCG GTCGAAAAAC TTGAGCTACT AGGAATGACT 1980 

AGTCAAAATA CCTCTATTTT TCCGATGGTG GCATTAGCAT TAGCTATTTT ACTTCAAGTT 2040 

GAAGTGTTAA TTTTCTTTAC AAAACAAGTG ACTGAGCCTT CGCGACAACG CTCGTTTATT 2100 

ATCTTTTACA CGGGTGCTAT GCTAATCAGT GTTATTTTAA TGAAATTCTT CCAGATTTTC 2160 

CAAACAGAGC AGTTGATGTA TATTCCCTTG TTCTATCCGG CAGCCTTTGC GCCGCTGATT 2220 

CTAAATCATT TTGTAAATCG TCGTTCAGGG ATAATCGCTG CTATTTTCCA AGTCGTTTTT 2280 

GCGTTATTTA TCTTTTATAA TTCTATCGGC ACAAACTCGT TGACGGTTAT TCTGATTATG 2340 

TACTTATTCT CAGGATTTCT AGCAACAGTT GTTAAACGGA AACGGATGAG TGAGCAAGTT 2400 

TTCCCAGCTT TAATGTGGGT AGTGGTCTTT CCTGTTTTCA TGGCGGTTGT CTTAATGATT 2460 

TATCAAGGGA TGAGTTTAAC AGATGGTAAA ACGTGGACAG CTTTAATTTG TGCAAGTGCA 2520 

GGAACGGTAC TTTCATTTTT AGCAACAATG GGCTTGCATC CATATATCGA ATTATTAGTG 2580 

ACCGATGATA GTATGATTGT CTTGAATGAA TTAAGTAATC CGAACCATCC GTTGTTAAAA 2640 

CAACTGTTAG AAGAAGCGCC AGGTACCTAT CATCATAGCA TGATGGTGGC TAGTTTGAgC 2700 

GCTAACGCTG TTGCTGAGAT TGGCGGGCGG TCGCTATTAA CACGAGTTGC TTGTTATTAC 2760 

CATGACATTG GTAAAATTAA ACACGCCAAT TTCTTTGTGG AAAACTTACC TGCTGGTGCA 2820 

GAAAATCCAC ATAACTTTTT ATTACCAGAA GATAGTAAAC AAATTATTTT TGGCCACGTA 2880 

ATTGATGGCG CCAAAATTTT AGAAGAGTAC AATATGCCGC AAATGGTGAT TGACATTTGT 2940 

CGTCAACATC ATGGCACAAC ATTGATGAAA TTTTTCTATG TGAAAGCAAA AGAACGCAAT 3000 

CCTGaAATTA AAGAATCCGA CTTCCGTTAT CCTGGaCCAC GTCCACAAAC aCGAGaAGCa 3060 

GGGATTGTGa GTATTGcTGA TACTTGTGAG GCTGCTGTGC GGGCGATGGA TCATCCCACC 3120 

AACGAAAAAA TTnCAAGCCT TTGTGCATAA TGTGATTCAA GACAGnATTT CAGATGGCCA 3180 

ATTAGATGAA TGTGGGTTAA CGATGAAAGA GATTCGCATC AT 3222 
(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 7812 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GGACATGATT ACGTTAGACT TAAACTTACC TAGTATGGAT GGGATGGCTG TTTGTCGTGA 60 

AATTCGCAAA GTTTCAGCCA ATGTTCCCAT TATTATGTTG ACCGCTCGTG aTTCTGAAAG 120 

TGACCAAGTA ATTGGCCTTG AAATGGGTGC GGATGATTAT GTGACAAAAC CATTTAGTCC 180 

ATTAACTTTG ATTGCACGGA TGAAAGCTCT ACATCGTCGT GCCGAGGTGG CAGAGGCTGC 240 

GCATGATACG TCTGAGAATA CAGATGAAAC ATTTGACGTG ATTACGGATC ATTTTAAAAT 300 

GAATACGAAA ACACGAGAAA CATATTTAGA TAATCAATTA ATTGAAGGCT TAACACCTAA 360 

AGAATTTGAT TTGCTATACA CTTTAGcAAA AAAACCACGC CAAGTCTTTT CACGcGAACA 420 

ACTACTAGAA TTGGTTTGGG ACTATCAATA CTTTGGCGAT GAACGGACAG TTGATGCCCA 480 

TATCAAAAAA TTACGACAAA AAATTGAAAA AGTGGGTCCT CAAGTCATCC AAACGGTTTG 540 

GGGCGTAGGG TATAAATTCG ACGATTCAGG TGTTGCTTAA TGAAGTATTT GTATCAACAA 600 

TTACTTGCTT TTATAGGTGT GATCGCTCTG ATTATTCTCA TTGTCGGAAC GTCTTTCACA 660 

CAATTGACCA AGCGCACAAT GCAAGAAAAT AACTATGAAC AGCTGTATGG TTATGCAGAA 720 

TCGGCTTTAG AAACACGTGA CTTTTTTATT AATGTCGCAG GCGTTTCAGA TCGAGATGTT 780 

TTATCCTATT CATTtCAGTT GACTGAGCGC GTACTCCAGA AGCAAGATGT GCAATTTGTG 840 

TTTATTAACA AGGACCGCGA AGTGCAGTAC CCACCAGTTG ATAGCACAAA AAAATTAGAC 900 
TTTTCCTTAA TCGATATPAA TTGGGATCAA ATCATGAAAG GCAATCGTGT TaTGCAACAG 960 

AAAACATTGA TATTTACGGG GCCCGAAATA CTTCGTCCTA TGTAATGTTA CCGGTGTATG 1020 

CATCAAATCA GTCTTCTGAT AAAAAAGTGA TTATTGGTTC ACTAGTTATT ACACAACCTG 1080 

CCAAAAACGT TGACCGAAGT GTTCAGTCCG TGACACAAAA CTTGATTAAA GGCTTTATTT 1140 

TTTCTGGTGT GATTGCCTTG TTACTAAGCT ATCTATTTGC GACTTTCCAA GTGAAACGAA 1200 

TAAATCGGAT GCGTAAAGCC ACCAAAGAAA TTACTAGTGG AAATTTTGAT ATTCAATTAC 1260 

CGGTTCATGA CAAAGATGAA TTTGATGACC TAGCAGAAGA TTTTAATAAA ATGGCCGCGT 1320 

CTTTAAAAGA ATCACAAGAA GAAATCAATC GGCAAGAAGA GCGCCGTCGT CAATTTATGG 1380 

CGGATGCCTC TCATGAAATG CGGACGCCTT TAACAACTAT TAATGGCTTG TTAGAGGGAT 1440 

TGCAGTATAA TGCTATTCCT GAAAATCAAA AAGAGAATGC GATTAAACTG ATGCAAAATG 1500 

AGACCGCTCG CTTGATTCGC TTAGTCAATG AAAATCTTGA TTATGAAAAA ATTCGGACCA 1560 

ACCAGATTCA AATTGTTGTG AAGAAGTTCA ACGGAACAGA AGCTTTAGAA AATATTGTGA 1620 

CTCAACTAAC GGCCAAAGCT GAAGCCGCCG GTAATCAATT GTATTTAGAC ACCACCGAAC 1680 

CAATTGATGT GTATGCCGAT TATGATCGTT TCGTTCAAGT GGTTGTCAAT ATTGTTCAAA 1740 
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ATGCGATCCA ATTTACTGAA AATGGCGAAA TTCATATTGC TTTAGAAAAA GGCTATTTAG 1800 

AAACCATTGT CCGTATTTCT GATACAGGAA TCGGCATGAC GGAAGAGCAA ATTCTGAATA 1860 

TCTGGGATCG TTATTACAAA GTAGATCCAT CCCGTAAAAA TACGAAATAT GGAGAATCAG 1920 

GATTAGGCTT ACCAATCGTT CAACAATTGG TCCGTTTGCA TAAAGGAAAA ATTAATGTTG 1980 

AGAGTGAACT TGGTAAAGGC ACCACGTTTA TTATTTCATT TCCTGATGTT GAAATTACTG 2040 

AAAACTAAAG AAAACGTCTG GTCACAATCA GTGCCAGACG TTTTCTTTAG TTTAATAACG 2100 

CATGTAGAGA CAGGGGGTGC ACATAGGGAC TATCAATAAT TTGTGGATCT TTTGTCAGAA 2160 

ATAGCAAGCG CTTGTTTCTT TTTTGAACTG CTATTTTTAA TTGAGGTAAA ATTTCTTGAC 2220 

GTTGTCCAAT CGTTAATGAA GAAAAGATGT CATCAATCAC TAACGTACTT GTTTTTGAAA 2280 

GCAAAAGTTG TATTATGCGC ATTTTAATTT TTTCAAAAGT CGTAAAAGAA GTAAAGGAAC 2340 

GACCAAGTAA GGCAGGTTCT AATTGGAACA CCTGGAAAGC TTCGGCTAGA ACTACTTTTC 2400 

GATCCCGTTC TTTGATCGTA CTAGCGATAA ATAAATTTTC TTCTAATGGC AAGTAGGGCA 2460 

AAAAGGTTTC ACTAGAAGAA AGCCTAGTAA TGGTTAATTG GTTTGTTTGG AAAAAATCGG 2520 

CCAATGCGAG TTCTTGTTCT GGAGCAGAAT AAATGATTCC ATAAACTTCT CGCAAAGctA 2580 

ACGCGTCGAA TGAGGATTTT TTTTCGGTTA TTTTTTTGAA TGATGGTGTG ACATTCATAG 2640 

AAAATCCCTC CCTTATTTCT TATGTGTACG TAAATAAATC AACGTGAAAG GAGTCCCGCA 2700 

AACAAGAGGA ATGCCAAGCA AGATTAAGCA GTTAACAAAG CAATTACGGA TTGTATAAAG 2760 

TGACTGCATA CTTTTAAAAT CAAAAGAAAA CGCTTGGATC GTTTGAATCC CTAAATGCTG 2820 

GATATTTGTA ATCGTGACAT CCTGGTCGGT ACTTTTAGCT AAAAATGCTG GAGTTTCCAA 2880 

ATGAGACCAC TGTAAAATAG CATCACGAGC AGCCCAAAGA ACAGTATTAT ACGTTTCTTG 2940 

AAACATTAAA GTGAAAAGTA AAAAACTGAA AATTCCTAAT AGAGCCGGTA AACATAACTC 3000 

TAGTAAGAGC TGTTTGGTGA CATACATATT TGAAAAGTGC ATGTGATACC ATGTATGTAT 3060 

TTCTTCCTTT TTTATGCGTA ATGTCACATA CATCAACAGA AGAAAAGTAA CACCAAAGAA 3120 

AATGACACTA CCTAGATAGA GGTGCCAATA GTATTGGATC AGTTGGTAAT GGTAACTAGT 3180 

TTGTTTTTCA ATATCGACAA AGTCTGGCAT CGTTCGTAAA AACTCTTTTT CAAAATCAAT 3240 

TAAGTTAAAC AGCCCAGTCA ATAAGAAAGA GAACAGGAAG AAGAAGAGTG CGGTGATTAA 3300 

CGTGATTTTT TTATGATAAA AACTACTAGC TGTAGCGTAA CGGACACTTC TCATCAACTT 3360 

TCACCTCTCT AAACTATCTT TAGTATAGTG aAATAATATG AATTAwTTAk GAaAAAGtAG 3420 

CAAAAaGCTT GAGCAAATkG CTCAAGCTTT TTGAAGATTt ATTtACCCGT TTTTtGATAG 3480 

CCTtGATAAC TGTCAGtGTG ATATtCATCA ACGGTtGATT TATtATJcGTT TTTACTGTAA 3540 

ACACTCGTTGATTTTnCACC nTTTcTTTTT CAATTGCTTC CAGTTGTTGC AATTGGTTCT 3600 

TGTAATCATA GTCTTCTGGG TTAACTGGTT TTAAGCCGCT ATTGGTATAG AAACGAAGCA 3660 

AGTCACCGTT AGTAATTTGG TCAGACGTTT CTAATTGTTT CGTCGCTTTG GCTTTTAAAT 3720 
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CAGCGACTTC TTTCTTAACT TCTTCGGTTG GCTCTGTAAT TAAAGTGCCT GTTTTAGTAT 3780 

CGTAAATGCT_AGAACCTAAA ATAGTATATT TAGGTGTTAC AACATTGCCG TTGCGGAAAG 3840 

CAACGATTTG ATTGTGTTGT TTAGAGAATA AATCTTGGCC TAACTGAATG TAGTTTTTCG 3900 

TATCGACACC AAGCAAGTGT AGAAGGGTTG GTAAGGCATC AACTTGGCCG CCATACGTAT 3960 

GGTTCACACC GCCATTTTCT TGACCAGGAA CGTGAATCAT ATAaGGAACA CGTTGCATAT 4020 

TGGCGTTGTC AAAATCGTTC CAATCTGCTT TGGTTTTACC TACTAACTCT GCTAAGTTTT 4080 

GGTTTCTTGA ATTAGAAACC CCATAGTGAT CGCCATAAAG AACAATGACT GAGTTTTCAT 4140 

AGAGACCTGA AGATTTCAAG TAGTTAAAGA ATTCTTCGAC CGCTTTATCT AAATAATTCG 4200 

CTGTTGCAAA GTAACCATTA ATTGTTTCGT CAGAGGTTTT TGCAATTGGG AATCCTGCTT 4260 

CATCGTTTGT AAATTGAGAA TAAGGATAAT GATTGGACAC AGCAATAAAT TTAGAATAGA 4320 

ACGGTTGTTG TAAATGTTCT AAATATTGAA CGGATTGATT AAAGAATGGT TTATCGTGTA 4380 

ACCCATATTG GAATGAGTTA TCTGAATTAA CATCATAATA ACTAGCATCA AAGAAATAGT 4440 

CATAGCCTAA ACGTTTATAG GTTTCATTTC GATTCCAGAA GTTACCGGCA TTTCCGTGGA 4500 

AAGCAGCGCT AGTGTAACCT TGCGTTTGTT TTAAAATATC GGGTGCTGCT TCAAACGTAT 4560 

TTTTGCCACC AACTTGTGTA AAGAGTGAAC CTTGGTCTAA ACCAAATAGT GAATTTTCAA 4 620 

GTAACGTTTC GGCATCACTT GTTTTTCCTT GACCAACTTG GTGGAAGAAG TTATCAAAGC 4 680 

TAAAGGTACT TTTACTATTG TAAAGACTAT TGATAAATGG TGTGACTTCG TGTTCTACGC 4740 

CATTTTCATC TTTTAATTTG TAATTAACTA AAAATTGTTG GAAACTTTCT AAATGGATAT 4800 

T^TTACATT TTTGCCTTTT GCAATACCAA ACTTACTATC ATCGGGTGCC GCATATTGTT 4860 

GTTTCACGTA ATCTTCAACT TGTTTCATAT CGTTAGCACT GGCTTCGGCA CGGACTTGGT 4920 

TTGTTTGGTA AGTCGTAATG CCATCGTACA CGGTAAAGGC ATTTAAGCCT AAGAATTTCA 4980 

CGATATAGTC ACGAGAGAAT TGACGTCCTA ATAATTGGGG ACGTTCTGTT TCAGCCATAA 5040 

ATAGATTGAA TAAGAAAAAG ACAACAGAAA GGAGTGTGAC AGAGACTGCT ACACGTGCGC 5100 

GGACTGGACG CGCATCTGTT TTAATTTTTT TCGTTAATAA TAATACGCCA ATAATAATGA 5160 

AGTCTAAGAA ATAAAT6ACG TCATAAGGTC TAAAGAGACG TAAGGCACTT TCGCCAAGAC 5220 

CGCTAGCTAC TTTGCCAGCA CCAAGCATCG TATTAACTGT GATGAAGTCA GTAAATTCTC 5280 

GATAATACAC GACATTTGAG AAGAGCAGTA AACTCATCAA GAAGTAAATA ATCATCATTG 5340 
TAATATAAGA AGCTTTTTTA CGGCGAACAT ACAAAGCGAT GGCTAATAAA AAGACCGTTG 5400 
TGGCAATTGG ATTAATTAAG AGTATGAAGT ACTCGTTAAT ATTTTCCAAA CGTAAATGGA 5460 
AATCAACGGC GTACGCAAAG AGGTTTTTTA GCCACAGTAA GATGGCGATT AGCGAGAATA 5520 
ACCCCAGTCG.CGTGTTCAAA ATATTTGTTT TAAATAATTT TTTCAAAATA AACGTCCTTT 5580 
CTAAACGATG ATTCTATAAA AGTAAAATTA GGTAAATTTC GTATTTTACA ATATCGCTTT 5640 
AATTTTACTC TTTCACAAAT TTTATGTCAA TTCTTACCGA AGTGAGGAAT GGTGATGTTA 5700 
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AAGAAAACTT ACTATTTCCT TAAACTTTTT AGTTCGTTAA ATGACGTGAA GCTTGAATCC 5760 

TTTAAAGACA GATGGGCCTT TTTTTGCTAT ACTAGATAGG ACTTTGAAAA GTAGGTGTAC 5820 

AGAATGAAAC TTCAAGTAAC TAAAAAAGCA GAACATAAAT TTAAAAAAGG ATATCCTTTA 5880 

ATTCAGAAAG AAGATTTGCA ACAAGTGCCA GCGCCGTTGC CAACAGATTG GCTTACTTTG 5940 

ATTGATAGTA AAGGCCAGCG TTTAGCCGAA GGATATTTAG GCGAACAAAA TAAAGGAATT 6000 

GGTTGGTTGC TGTCATGGCA TGGACCAATT AATCAATCTT TTTTTCAACA ACTCTTTGAA 6060 

ATAAGTCGTG AGAAGCGGAC GTCTTTTGAA AAGGATTCGC TAACAACCGC GTATCGATTA 6120 

TTTAATGGAG AAGGGGATGG CATTGGGGGG CTAATCATCG ATCGCTATGC AGATTACGCC 6180 

gTTTTTTCAT GGTATAATGA AACCCTTTAT CAGAAAAAAG CCGAACTTTT GACGGCGTTT 6240 

CGCACAGTTT ACCCTGACAT TATTGGTGCC TACGAAAAGA TTCGCTTTTC CACAAAAGAT 6300 

TTACCAGAAT CACAGTTTTT GTATGGCGAA CAAGCACCCG AGCCATTACT TGTCACTGAA 6360 

AATGGCGTTC AATTTGCTAC CTATTTAAAC GAAGGATTAA TGACAGGCAT CTTTTTAGAT 6420 

CAAAAAGAAG TTCGTGGTCG ATTAGTAGAT GGTTTTGCAG TAGGTAAAAC AGTCTTAAAT 6480 

ATGTTTAGTT ATACAGGTGC TTTTTCAGTA GCTGCCGCTA TGGGGGGTGC CGTGGCCACC 6540 

ACTAGTGTGG ATTTAGCTAA ACGAAGCTTA CCGAAAACGA CTGAGCAATT TG7UVGTCAAT 6600 

CATTTAAATC TTGCCCCGCA AAAGrTTATT GTTATGGATG TTTTgACTAC TTTAAATATc 6660 

TTCGCGTAAA GGCTTQnGTT ATGACATGAT CATTTTGGAT CCGCCGAnTT TTGCTCGCAA 6720 

CAAAAAGAAA GTTTTTTCCG TTGCTAAAAA TTATGGGGAG CTAGTGAAAG ATTCTATTGA 6780 

TATTTTAACG GATAAGGGAA CGCTTATTGC CTCAACCAaT GCGGCAAATT TATCCTTAGC 6840 

CAAATATCAA AAAATGGTGA TTACTGCCTT GCAAGAGAAA AATGTCCGTT ACAAGATTAC 6900 

GGACACGTAT CAATTACCGG CAGACTTTCA AGTGAATCCT AATTTTCCAG AAGGTAATTA 6960 

CTTAAAAGTC CTGTTTATCG AAATTGAAAA ATAATTTTTT ACACC7VAGCA AAAGTCCCAG 7020 

CGCCGCTAAT CGGTATTGGG ACTTTTGCTT GGTnTTTATT TTATAGTAAC TTTTGGGCTT 7080 

TTTGAGTGTT TGCAAAATGc AGTTCGCGTA GTTTGCTAAT AAGTCTAGGC CGCCTTTTTT 7140 

CAAAATACGG GCGGCTACTT GGTCTGATTT GCTGCCAGCA CCAGAGGGCA CAGAGAAACC 7200 

CAATTCTGCA GATTCTTTGT CTAACTCTTC TAAAAAAGCA GCGCGGCTAA TAATAGAGGC 7260 

GGCAGCAACG GCAACATGAT ACTGCTCTCC TTTAGTGACG AAGAACAATT TTTCAGTGAC 7320 

TTGATTTTTT TCATTACGAA CATATTTCCG ATAATTATTT TCAGGGGTGA ACTGATCAAT 7380 

TAAAATCCCT TCTGGCTTTG TGGGAGCCAA TTCTTGTAGC AACAGATAAA TCGCTTGGTT 7440 

ATGCAAAGCG ACTTTCATGT GTACAGCATT GTAGTTTGGT TGGATCTCAT TGTATTTTTT 7500 

GGGTTCAACG.ATTAATAACT TGTAAGGAAT CAGTTCTTTA ATGACATGTG ACAATTGAAT 7560 

AATTTGTGGG TCGGTTAATT CTTTGGAATC ACGAACACCT AATGATTTTA GTTTGCTAAT 7620 

CATGnTTTTG TCAACATAGG CAGCACAGAC AGTTACTGGA CCAAAATaGC TACCATTaCC 7680 
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GACTTCATCG GAACCtATCA CAGgACCcAT TTGCCCAAAA TCCAGCAGGG CAGTTGGTAG 
TTAGTCTGGG GGAAGAAACT TTTTnTCTTC TTGGGTGTTG GTTACCTTCG GTTTCCCTTT 
CCCCCAAAAC CG 

(2) INFORMATION FOR SEQ ID NO: 26: 

|i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1769 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
TTCAGCGACA GGGATGTTAT TGATGATTAA ACGTTATCAA TCGCAATTCG 
ACTAGGi^AAT ATTTTTTCAA TTTTGTTATG GTTCCGAGCA GGGACGCACG 
TTATGCGATA TTTGTTATGT ATTGTATGTA TACTTTTAAC TCAATTTTTG 
TTGGTTGAAG ATAAAAAATA AAATGGAAAA AAATAGGTAG GGAGTTCGAT 
GTTTTAGCAG GTACGATTGG CGCTGGTAAA TCTAGTTTAA CAGCTTTAAT 
TTAGGTTCAG AAGCCTTTTA CGAATCAGTT GATGATAATG AGGTATTACC 
GCGGAACCGG AAAAGTACGC ATTTTTATTA CAAATTTATT TTTTAAATAA 
AGTATTAAAC AAGCATTAaC GCATGAAAAT AATGTATTGG ATCGCTCGAT 
TCATTACTTT TTCATTTAAA TGCAGATTTA GGTCGTGCCA ACGAAACAGA 
TACGATGACT TATTACAAAA CATGCTCCAA GAATTGCCTT ATGCTGCGCA 
CCTGATTTGT TAGTTCACAT TCGTATTTCA TTTCCCAAAA TGTTGGAACG 
CGTGGCCGTC CTTATGAACA AATTGAGACA GATCCAACGT TGTATGATTA 
TTGAACGAGC GTTATGATCA ATGGTATGAA GACTATGATG AAAGTCCTAA 
GATGGCGATA AATATGATTT TGTCGAAGAT CCTGAAGCGT GTCAATATGT 
ATTGAGAAGA AAATTGAAGA ATTAGAAAGA TAAATAAAAA GAAGCAAGTC 
TCTTTTTGAC TTAAAATGTG GATAGATAGG CTAAAGTTGC TACTAAATAA 
TGCTAAATTA TAGAATCGGA TTTTAACCTT CCCCTAAGTT AAAATCGGAT 
TCGACAGTTT AAGTTTGTAT AAAATAGCGG TAGCTATTAT CAACGAAGTC 
TGACCTGACT TCTCTAGAAG TATACCATCG AAAATAACCG ACAGTTGTTC 
CTTAGTAAGT GTAAATTTTG ATTGAAAGTT TCTGTAAATC AATCATTTAT 
ATGCTCGAAA GAAGGAAGAT AAATGAGCCA AATTTGGTTT GAACGCATTC 
CAAAAAAGCG CAACGATTTT TACAAGGACG TTATGCCCGG TTCGATGAAT 
GCTGTTGCTA ACAAGTGTCG TGcTGGCGCT CATCAATATT TTTACTGGCT 
TCGGTTATTG ATTTTAATCA CAGTCGCCTA CGTGTATTAT CGCTTTTTTT 
TCACCCGCGA TTGAATGAAA ATCAATGGTT TATTCTTCGA AAACAACGCC 



TGTTTTGGCT 
CTGGTGGTGA 
GTATGTTTAA 
GGCAGTGaTT 
TGCAAACCGT 
TTTATTTTAT 
ACGTTTTGAT 
TTATGAAGAT 
GGTCAAAGTG 
TAAAAAAAGA 
CATCAAAAAA 
CTATCAAATG 
AATTCAAATT 
CTTGGCGTTA 
CGGACTTGCT 
AGGAGTGTAG 
AATCTATCTA 
ATTCATCGGT 
AAAATTTGGA 
TCAAAAAATA 
TACGTTGGCG 
TGAATCGAAC 
ATCTTTGGGT 
CAAAACATAT 
TAATGCAAAA 



7740 
7800 
7812 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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GGTTCATGCT TTTCGCPAGC SACATArGAC TCAGWaSAS TATCGTTATT TTAAATGTCC 1560 
AAAATGTCAG CAGTCATTAC STSCGCCGAA AGGfiCG?JSGA ACAATCAAAG TCACTTGTTC 1620 
AAACTGTCAA AATCAATTTA TCSAAAAAGT CTGA.TCTTCT GTGTrrTTTG GTCTAGGCAA 1680 
CTTTCGGACT TGTGGTATAC tTGAGRAAAG AAACCACTGG ATATTCTAJVA AGGGGAATAT 1740 
CTTGkTAAAA AAGGAGr.CrA .-AAACAATG 
(2) INFORMATION FCX SEC ID NO: 27: 

(i) SEQDEWCE CHAKiCirERISTICS: 

(A) LENGTH: 1903 base pairs 
(3) TYFE: nucleic acid 

(C) STRan3ED:3SS: double 

(D) TOPOLOGY: linear 



1769 



60 



(xi) SEQOENCi DESCRIPTION: SEQ ID OT: 27: 
CnGTCGTTGA CGCATCRACG CCTCnCTCGA GTATATGnGT TCACCTTCAC CGCGTTTAAT 
AGCAAACGCT GTTaCCGTTA J.ATTTTCATT GGCATTAGCT AATCGATTTG TTACTTCTGC 120 
AATCGTTGCT TCGACATTTT CCCTTTCTTT GGCAACSATA AGACTGACAG AAGACTTATA 180 
GGTAATCACC GCAAARATGG CAAAAACTAC AAATATAAAG AAAGAACTTG CGAAAGCCCA 240 
CTTTATAGTT AAAGATSGSC CTTCCASTTC TTTTTTAATC GTTCTTTTCA TTAATTAGTT 300 
CTCACGAACG CATAACGTAG CCAGTTCCAC GGACaGTTTG GATGTAGCTT TCTTCTCCAG 360 
GTACGTCAAT TTTATTrj/T AASTAGCGGA TATACACATC CACAACGTTT GTTTCTACTT 420 
CTGTTTCATA GCCCC»aACT TTATTTaGTA ACACATCACG TGCCAAGACA ACGTTCyVCGT 
TTTCCATTAG CGTTAArAST AATcGTAtTC GCGkTTkSTt AATTCAATCA TTTCAGAATT 
ACGACGAACG ACACGArTTT CTTTTTCAAT TGTTAAGTCA C6ATATGTAA TCGTTGTTTG 
TTTTGCAACG TTTTTKTCGC CCTCAA7ATC AATACGRCGA AgTAACGCAC GTAAACGAGC 
TAACAATTCT TCAATTSCAA ATGGTTrAAC AATATAATCA TCCGCTCCAT GGTCTAAGCC 
AGAAACACGG TCAATTACTG .-ATCACGCGC AGTCaiCATA ATAATTGGTG TATTTTTCAC 
TTGGCGAACA CGGCGflCATA CTTCTAATCC ATTTAATTCT GGIAACATCA AATCAAGAAG 
GATAGCATCC CATTCGTTGT TAAGAGCGGC TTCCAATCCT GTACGACCAT TGTAGTGTAC 
TTCTGTCGTA TACCCCTCAT GTTTTAATTC AAGCTCAACG AATCTCGCTA AGTTCTTTTC 
ATCTTCAATA ATTAAAATGT IGCTCAtTTG ATTAATTGTC CTTTCTCTTT AAAAGCTATA 1020 
CTCRAGCTCA ACCTTTTTAT TCCTTACTAC TTATTAGAGA ATTACTACTC AATCTATCTG 
GTTCTTTTTT AAGCCRCAGT .yL^aCCATGA GCATG6TGAG CCACACTCAT GGTCTCCAGC 
GTGTTTATTC TTCGTiGTAC CASCTGTAAT GGTAGA7TCC TTCTTTGTCT GTGCGTTCGT 
ACGTAT6AGC ACCGAfiGTAA 7CACGT7GT6 CTTGAATTAA AITCGCTGGT AAGCGATCTG 
AACGATAAGA ATC6IAATAA GCAATTGCAG A7GAGAATGT TGGTACAGGA ACACCTGCTT 



480 

540 

600 

660 

720 

780 

840 

900 

960 



1080 
1140 
1200 
1260 
1320 
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GAACAGCaAT CGCaACAACT TCACGAACGG CTTGTT6ATA TTTTyTCGTA ATTTCTAC3A 1350 

AGTATTCATC TAACAACAAG TTTTCAAGTG CTGGATTTTT TTCATATGCA TCGGTAATTT 14<C 

TTTGTAAGAA TTGTGCACGG ATAATACAAC CTGCTCGCCA GATTTTTGCA ATTTCf.CC»A 15C0 

ATGGTAAATC CCATCCGTAT TCTTCAGAAG CTGCACGGAG TTGTGCAAAA CCTTGCSCaT 15€0 

AACTCATTAA TTTACTAAAG TATAAaTgTT CGCGAATTTT TTCAATCAAT TCTTTTrTAT 1623 

CGCCTGCAAA GTTAAAGGCT GCAGGTTTTG ATAAAATGCC ACTTGCTTTT ACGCGTTCrrT 16E3 

CTTTGTAAGC CGAAATAAAA CGTGCAAATA CTGATTCTGT AATTAATGGT AATGGTACGC 1743 

CAAGATCTAA CQCACTTPGG CTTGTCCAIT TACCTGTTCC TTTGTTGCCT GCTGCATCAA IBOO 

GAATAACATC AACGATTGGT TGGTCGGTTC CTTCATCATC TTTACGTGTC AAAATGTCaS 18€0 

CCGTGATCTC GATAAGGTAA CTGTCTAATT CGCtTCGTTC CATTCTTT 19C3 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7515 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

TCTTCTATkG TTAAtTGCaw CTTGTCCAAC TCCTTGyCTT CTTCTACATT ATAACTAAST 60 

TAGGTGACTT TTTCACTAAT AACCAATT6A AACTTTTTCT TGAAAACTTT TTGCATTT7T 120 

CATAATTTTG CGTTATTAGC GGATACATTA ATGAACTTAT GGTAAACTAA CATTCT3GAT IBO 

GGTTCATTCC AGCCAGCCAA TTAAAAACTA TTTTTAGAGC GTTCAACACG CTTATTCA7G 240 

^jTTTTTTAT AAAGGTGGAC TTATTTTGAT GAAAAACAAA TCGAAGCTTT TATGGTTAGG 300 

j^TTTTTTC TTGCCGTTTT TACTTCTGTT GCTTGTATGG ATGACATTAC AACTTGCGCC 360 

TTTTGGTGAC AACAACTTAT TAGTTAGTGA TTTAGGCACA CAATACATGC CTTTCTTG^G 420 

TTTTTTAAAA CGCTCTTTTC ACGAGGGAAT AACTACATTC TACTCTTTTT CCAATGAGAT 460 

TGGTGAATCG ATTGTTCCTT TGGCAGCCTA TTACTTACTG AGCCCTTTTA ATGTGTTGGC 540 

TTTCTTTTTT CCTTATGAAC AAITACCTAT AGCAATTTTA TGGATTATTA CCTTAAAACT 6C0 

TGCCTTGATG GGCACTACGA TGTTTAGCTA TTTAAAATAC ACCTATCAAA AAGTTGATGG 660 

CACCACGTTA CTTTTTTCAA CGTCCTATAG TTTCTGTGGC TTTGTCACCG TCTATAGCCA 720 

AAACTTTATG TGGTTGGATG CACTGATTCT ATTCCCQCTT ATCTTATTGG GACTCCAACG 760 

TTTATGGGAT CAACGCAAAT GGGGCTTATA CAGTATCACC CTATTTTTAG CGATTGTGAC 840 

GAATTACTAT ATGGGTTATA TGATTTGTCT CTTTGCTGTT TTGTACAGTA TCTATTGGTT 900 

CTTTAAAAAG AATACCAAAG CCCATGCTAT TCGGCAATTT TTTAAACAAA GTCCGTTATT 960 

CATCCTCGTT TCTTTTTTAA CAGGAACCGC AACTAGCTTT TTATTATTAC CTGCAGCAGA 1020 

AGGGATGCTC TATACAAAAA AAGCAGACTT TGATGTCTCG ACGTTCTTTT TAACGCCTAA 10=0 
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GTTCAACACT TCATTCTTCT CGCAATTAGG CTTAGGTTCT ATTAATTATG AGTTGCGTCT 
AGACCATTTA CMACCGTTT TTGCTGGATT ATTTGTGACA CTCCTCTGTG TTGCCTATTT 
TCAAACGAAA CAAATTGCGT TAAAAGAAAG AATAGCTTCA GCAATTCTTT TATTCATTCT 
^^^,p.j.,p^QC TTTTGGTTAG AAGCCTTTAA TACTGTCTGG CACATGTTTC AAAGCCCAGC 
TGGTTTCCCT TACCGAAATG TCTTTATTTT TAGCTTTTTA TTGATTGTCT TTGCCTATGA 
AGTTTGGCTT AAAAAAGTGA CCATTCCTTG GACCGCACCT ATTATTTTTA GTCTGCTGTT 
AGTCATTGGC TACGGATCTT TGTATTATGG TCCACAAAAG AATCTCTTAA TTTCGATCAA 
TTATTTATGG CTAAGTTTAC TTTTTATTTG GTTGATTTTC TTTTGCTTGC GTTTAGCGCA 
CAAAAAGGCC TTAAGAAACT ATGTAGTCGT GGCTTTATTC TTACTGGTCA GCACAGAATT 
AACAACGAAT TTCTGGATTT CTTTTAAACA CATGCCCTTT GGTAGCCAAG CAACATTTGC 
CCAAGATTAC CGAAAACACA GCCAATTAAT CGATGAAAAA ATGGCCTCAG CACCAGAACT 
TTATCGCATG AAACAAGTCA TTCCTTCCAA AGAAACAGGG TTCCGTGAAA TCAATAACGG 
CTACAATAAT CCTTTACTTT ATGGCTATGC TGGCGTTTCT AGCTATACTT CGACTTTAAC 
TGCCACTACC CAAGACACAT TAAGTGCGCT AGGTTTATAT CGAAAAAATG ATCGCCGGAT 
TGCTTATGTG GATAATTCAC AACTAACTAA TTTACTTTTA AATGTGAAAT ATGACTTTTT 
ACCAATTGAA AAACCTACCA GTGAAAAATT GCTAAAAACa GTTGGTTCCA CGAAAATCAT 
GGaAAACGAT GAAGCGATCG GTATGGGCTT TTTGGCGCCA ACCGCATTAA CCAAGTTAAA 
ATTAGCAAAA AATAATCCTT TAGATGCCCA AGAAGaACTC CTGCAAACGC TCGTGCCGAC 
GGATAAACCT TACTTTAAAA CAGCTTCGTT GATAAATGAA CCTCATCATA CCAACGAAAC 
AATTGAAGCA ACCTTTAAAG TGAATAGCAC TGGTGACTTG CATCTTTATA TTCCGAATTT 
AAAATGGAAA AAAGTTACAC AATTGAAGGT AAACCAACAA GTTATCTCGA CGCCGATTTA 
TATTGCAACC AATCAACTGT TCAATTTAGG GCATTTTGAA AAAGGAACTA CCGTGACCCT 
GTCACTAACT GCTGAACAAG TGGTTGATTT AACCAATTGG CAACTTCAAA CTTTAGACCA 
AACAGCCTTT AATCGTGCGG TTGACAAATT ACGCCAACAA GCTCTTCACG TGAATGCAAC 
TAAAAAAGGC CATTTAAATG GCGCACTGAA TGTACCTGGA AATGACACTC AGTTACTGTA 
TACGTCCATT CCTTATGACC AAGATTGGCA AGTCAAATCG TCGCTACAAA AAGAACCCTT 
AAAAACACAA CGCATTCTAG GTGGCTTTTT AGCTGTTGAA GTCCCAGCTG GCAAACAGCA 
GTTAACCTTT GCCTATCATC CAAGAATGAT TTATCTCGGC ACTGCCGTCA GCGGAACGAT 
TTTACTGGGA ACGGCTGGTT ATCTTGGGTT TAAAAAATAC CGTCGAT^C GTCAGGAGGC 
CACTCATGAC TAATCAGAAA AAATTTTTAA ATGGACTCAT TCGGTGGTTA CCCCTAGTTG 
GACTGGTGCT TTTCTTCGGC TTAATTCTAT GGGGATATTC CAGAGGAATT TTTCATTCTG 
TTGCTTCCTT GCAAGCTTTT ATTAAACAGT TTGGTAACTA TGCGGTTCTG TGTTTCATTC 
TTTTACAAAT CGTCCAAGTG ATTATCCCTA TTCTACCTGG AGGAATCTCT TCTGTTGCAG 



1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 
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GAATGTTGAT GTTTGGCAAT TTGCAAGGAC TTTTGTATAG CTATCTTGGA TTGATTAkTG 3120 

GTGAGTTTAT TGGCTTTTTA CTGGTTCGTT ACTATGGACG TAGCTTTGTA AAAATAATCC 3180 

TTTCACCCAA TAAATACAAA AAATTTGAAG AGATTTTGGA CAAAAATGAA CACAATGTCA 3240 

AAAAGCTCTT AATCTTTACT ATGTTAGTCC CCTTCGCTCC AGATGATATT GTTTGTTTAG 3300 

TGGCGGGAAT TACAGATATT TCTTTAAAAG AATTCATGAA AATTGTTCTT TTATTGAAAT 3360 

TCTGGTCTGT GGCTACTTAC AGCTATTTAA TGTTGTATTT GTTTCAATTA TTTGGCAAAT 3420 

TATAACAAAA AGACAACAAA ACCAACGTTG GTTTTGTTGT CTTTTCTTTT TTAGATTGAA 3480 

CGTAATTGGG CAATCCGTTC TTTGACAGCC TCTTGTTTTT CAAGGTAGTC CTTTTCTTTT 3540 

GCACGTTCTG CTTCCACGAC CTCATCTGGT GCATTTGACA CAAAACGTTC ATTTGAAAGT 3600 

TTGCCTTGCA CACGTTTGAC TTCTTTTGTC CATTTGTCTA ATTCTTTTTC TAAGCGAGCA 3660 

ATTTCTTCCT CGATATTAAT CAACCCAGCT AATGGTAGGA ATAGTTCTGC GCCTGTTAAA 3720 

ACAGCGGACA TGGCTAATTC TGGCGCTTCT ATTTCACGAC TGATGACTAA TTCCTCTGGG 3780 

TTACAGAAGC GTTCCAAGTA GCTAGTATTA GCTGTTAAAA ATTCTTCTAC TTCTGTATCG 3840 

TTTGTTTTAA TTAACAAGGT AATTGGTTTA GAAAGTGGTG TATTCACTTC TGCTCGAATA 3900 

TTCCGAACCG AACGAATAAC TTCTTTCAGA ACTTCCATTC CTCGAGCTGC TGCTTCATCG 3960 

TTAAATTCTT CATGAACAAC TGGGTACTCT GCCACAACCA ATGACTCACC TTGATGCGGA 4020 

^,pY.p^YTCCC AAATTTCTTC TGTTACAAAC GGCATAATTG GATGCAATAA GCGTAAGATT 4080 

TGATCCAATG TGTACACCAA GATGCTCCGT GTTGTTTGTT TAGCTGCTTC ATTGTCTCCG 4140 

TAAAGAATTT CTTTACTH^T TTCAATATAC CAATCACAGA AATCATCCCA GATGAAGTTG 4200 

TATAATTGGC GACCTGCTTC ACCGAATTCA AAGCGATCAA ATAATTCTGT CACACGGGCG 4260 
ACTGTTTCGT TTAAGCGCGT TAAAATCCAA CGATCAGCGA CAGTCTTTTC ACCACTAAAG 4320 
TCAATATCTG CAGCAGTCAT ACCTTCTACA TTCATGATAA CGAAACGACT TGCATTCCAG 4380 
ATCTTATTGA TAAAGTTCCA AGAAGCATCC ATTTTTTCAT AACTAAAACG AACATCTTGA 4440 
CCAGGTGCTG AACCATTTGA TAAGAACCAA CGTAACGCAT CTGCGCCATA TTTTTCGATG 4 500 

ACATCCATTG GATCAATCCC GTTCCCCAAT GATTTACTCA TTTTGCGTCC TTGCTCATCA 4560 
CGAATTAAGC CGTGAATTAA GACATTTTGG AAGGGACGTT CACCTGTAAA CTCTAAGCTT 4620 
TGGAAAATCA TGCGACTTAC CCAGAAGAAG ATGATGTCAT AGCCAGTAAC TAATGTGCTT 4680 
GTTGGGAAGT AACGTTGGTA ATCTTCGCTT GCTTCATTCG GCCAACCCAT CGTTGAAAAT 4740 
GGCCATAAAG CAGAACTAAA CCATGTATCT AAGACGTCAC TATCTTGAAC CCAGTTTTCA 4800 
CTGTCGGCTG GCTCTTCCAT CCCAACATAC ATCTCGCCCG TTTCTTTGTG ATACCAAGCA 4860 
GGAATTTGGT GGCCCCACCA TAATTGTCGC GAAATAACCC AGTCATGTAC ATTTTCCATC 4920 
CAACGTAAAA ATGTTTGATT GAAGCGTGGT GGATAAAATT CAACAGCGTC TTCTGTTTCT 4980 
TGATTTTTCA TTGCTTTTTC AGCTAATGGT CCCATTTTTA CAAACCATTG GGTCGATAAA 5040 
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CGAGGTTCCA CGACAACGCC TGTTCGTTCT GAATGGCCCA CGCTGTGATT CATTGTTTCG 
ATCTTAATTA AGCGACCTAG TTCTTTCAAG TCAGAGACAA TCGCTTTACG TGCAGCAAAA 
CGATCCATTs CCTcGTATTT TCCAGCCAGT TCATTCATcG TGCCGTCTTC GTTCATAACA 
TTTACTCGAG GTAAGTCATG ACGATTACCG ACTTCAAAGT CATTTGGATC ATGGGCAGGT 
GTGATTTTTA CAACCCCTGT TCCAAATTCC ATATCTACAT AGTCATCTGC AATAATTGGA 
ATTTCTTTAT CGACTAATGG CAAGACAACT GTCTTACCAA TAAGTTCTTG ATACCGTTCA 
TCTTCTGGAT GGACCGCAAT CGCCGTATCT CCCAACATTG TTTCAGGACG CGTTGTCGCA 
ATTTCAACAA CGCCAGAACC ATCTGATAAT GGATAGCTCA TGTGGTAAAA AGCTCCTTCA 
ATATCTTTAT GGATTACTTC AATATCAGAC AATGCTGTTT TCGCTTTTGG ATCCCAGTTA 
- ATGATATATT CACCACGGTA AATCAAATCT TTCTCGTAAA GAGAGACAAA CACTTTACGA 
ACGGCCTCAG ATAGCCCTTC ATCTAAAGTA AAACGTTCAC GGCTGTAATC TAAAGAAAGA 
CCCATTTTTG CCCACTGTTC ACGAATGTGA GAAGCATATT CTTCTTTCCA TTCCCACACT 
TGATCAACAA ATTTTTCACG ACCTAAGTCG TAACGTGAAA TTCCTTGTTG TGCTAGTTTC 
TCTTCTACTT TTGCTTGCGT GGCAATTCCG GCATGGTCCA TTCCTGGCAG CCATAACGTA 
TCAAAGCCTT GCATTCTTTT TTGGCGGATA ATCATATCTT GTAATGTTGT ATCCCAAGCA 
TGCCCTAAGT GTAGTTTCCC TGTTACATTG GGTGGTGGAA TAACAATTGA ATAGGGctTC 
GCTTTTTTGT CGCCACTAGG TTTAAATAAA TCTTGATCTA GCCATTTTTG ATAACGGCCA 
GCTTCGATTT CTGTCGGTTG ATATTTCGTT GGTAAGTTTT TTTCTTCTGA CATTTTTTCC 
CCTTCTTTCT TAAAATACGC GATTTGGAGC GGTAATTATT TTACAAATGA GTTTTTTAGG 
CTAAACACTA AAATACGCCC TAAAATACTT TCGTATTTTA GGgCsTAGaT TCATCTTATT 
TACGCGGTAC CACCTAAATt GCATTtGAAT tGAATtCCTT CAAATGCCAC TCATtCGTGA 
GATAACGTTC ACGAAACGAA TCGTTCTACT AACGTACTGT GTTCAAACCA TTGACTCACA 
AGCTACCTTC TCACTGGTTG TGTAAAAGTC TTGCACCTGA ACGACTTTTT CTCTAAAACG 
ACGCCAATGA TACTCCTCTT GTTCGCTGTC TTTGCTTCAT TCTACAATGA AGCGCAAgcA 
AAGTCAATTT GTAATCTAGA GGGATTACAC TTTTTTTGCA GATTCTAACG CTTGTTGATA 
ATCGGGTTCC GACGAAACTT CTGAGACGAT TTCTTCATAA ACTAACGTTC CTTCTGGGTC 
AATCACAAAA ATAGCACGTG CTAAACGGCC CATTTCTGGA ATATACAAAC CATAAGCTGC 
ACCAAATGAG TCTTCAGTAT CATGAAGCAT TTCCATTTCG ACACCCTCAG CGGCACACCA 
ATTTGCTTGT TCTTCAACTG TGTTATTGGA AATCGTGATA ATCTGAACAC CGTCTAATTT 
CGCCGCTTCT TGATTGAAAC GTTTCGTTTG CAGTGAACAA ACACGTGTGT CGATATCAGG 
AACCACACTA ATTAAAACCG TTTTACCTTT ATAATCTGCC AAGTTGATTT CTTGGTTGTT 
TAAATTTTTC AAAGAAAAGA CGGGAGCTTT TGTGCCAACT TTAGGCTGCT CACCTGTCAA 
TTCTAATACG TGCCCTTTTC TTGTAACATT CATTCTGAAA TCCTCCTTTT CCACTTTCAA 



5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
. 7020 
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TGCTTTGGGC TATTTGCTAG 
GTTGTTAAAT CAATATATTG 
AAATTTAAAA TCGCAGTCAC 
TGACTTGGTA CTGTAGAAAT 
TCAGACATAT CATGAACTAA 
TCAAAAGCAC AGACAATATT 
TTGCCTAAAT TGCCACAACC 
TTACTGAATA CCTCAATTAA 



TAATAAGTAT AGTAGAGGAT TTTTATGAAG ACAATTTTAT 
TGAAAAAGTT TCATTAAAAT AAATCAGTGT TTGCAGTTCC 
CACACTAACG TTAGCTGGCA CACTGACACG GTCTGGCGCA 
ACCCGCTTGT ACAATTTTGT CGATGGCTTT TTGAGCATGG 
GGCTACCGTG ACCCCTTCTT GTCGAACGAA TGCTTCTAAC 
TAATCCATTA ATCGTGGTGC CAACTAAAGC CGAATCATTA 
CAAATTCTCG TTGCGTCTAA AATTATTTTT TAATAATGCC 
AATCAGAGCA ATTCTTTTTT CTTCTTGCGT ATTTAAAATA 
ATACgGCACA TCGTn 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29; 
ATATGGCCTT CACTTTAGTG AAGAGATTTA ATTGTCTTTA CAGATAAACG 
GTCGATAAAC AAGGAATTAC AGGTAAAAAA GTTGATTATA AATCGATTCC 
ATTTCACGCT TTTCTGTTGA AACTAGTGGA CATTTCGACC TAGATGCAGA 
TGGATTTCAA GTACTGAGTT ACCTTCAGTA AGTCTTCAAT TTAGAAAAGA 
GTTGCTATTC AACAAGCTTT AGCAGCTGCT GTATTATCTT GAATACACAA 
GAAAGCGACC AACTTTCCTG TCTTTTTAAT TIctAGATAAA ATAAAAAAGG 
TTGATACGTT CCTTATTCAA TCAGACACAT GGCGGCACTT GCTTAGTGCT 
GGATCTGACA CAATTCACTA GCCCGCCATT GTCCTAGCCT TTCGGCAAAA 
CGTATTTTTA TTCTTTTGAC TAGTCTTTCT TTTGATTTTT TAAAAGTTTC 
TCTTTGCCCG TAACTTTTTA AAGAACTGAA CTAGCAATAA GCGACACTCT 
CGCCTGCTTC GACATAGGCC ACATGATTGA AGCGTTCATC TTCTAATAAA 
TGCCAGCAGT GCCTGCTTTA GGATCAGTTG CGCCATAATA AACTTCTGGA 
AAAGCATCGC CCCACTACAC ATTGGACAAG GCTCTAAAGT TACAAATAAT 
CTAGACGCCA ATTTTCTATC CCGCGACACG CTTCTTGGAT TGCATACATT 
CCGTTGCTTC TTGCCGTGCT TCTCTCAGAT TATGCCCGCG GCCAATAATT 
GAACAATTAC AGCTCCAATA GGCACCTCAG CCAAACCTTC TGCTTTTTTC 
TGGCTTCTCG CATAAAAAAT TCTTTTTCTT CTTGTGTTAA CGTTGTTTCT 
ATTAATTTCC ACCTTTCGGC TTTTTCCCAC TAAACTTCTG TCTTGCTTCA 



CTTAATTTTA 
CTACAAATCT 
ACTAAAGATA 
TAAAGATATT 
AAAAAGACAG 
CTTCTCAGCC 
GCTTCCTTCC 
CCTAGTATAA 
TCTTCTTTTT 
TCTTCTAAAA 
TTCATCAATG 
ATTCGTGCCA 
TGGGTTTCTT 
TCTGCATGCG 
TCCCCTTGCC 
GCTTCTGCAA 
TTTTTTCCCA 
TGGTACAATA 



7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7515 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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AATCAGTGAG GTGTGTCTAT GATTATTGAA AATTTATCTC AGCTTTATCC CAAAGGCTAT 114 0 

TTATCAAAAA CAATTGGTTC CGAAGCCACC TATTCAATTC CTGTTGACGA CCAATTTTTC 1200 

ATTGTGAATA AAGCTTTCTT AAGTATTAAA GAACAGCATT TGTTGGAAGC GCTTTTCCCA 1260 

ATTTCTGAAA aCCCAACAAT TACTGGTAAC CATCCTTGGT TCAGTTACTT ATTTCAACAG 1320 

GCAAAATTAC CTGCCGAAGG GACCTTTCGT ATGTTGCAAA TTCAAACGAA TGTCACAAAA 1380 

GAATTACAAG CTGAATGGCA ATTTAATTTA ACAAAGATGT TTCCAGATAC AGTTGATTGT 1440 

TTCAGTCCAA GTAATAATAT GTATATTTTA GTTGAAGAAC AGTCAAAAAA TACCTTTCAA 1500 

CAAGAAGAAA TTCAAG;3AT CTTTTTAACA TTAGATACAG ACTTTGATTG CACTAGTGCT 1560 

GTATTTGTTG GCAATTTTTA TTCCTCTGAA GACATTCTCC GGCGTTGTTT CCATGTVAGAA 1620 

CAACGTATTT TTTCAGAAGA ACTTAATTCC TCTAGTCGGA CGACTGTTTT TAGTTTAACT 1680 

GATGTTGCCC TCCACTATTT TACCAAGGAA GCCATGAGCC AAAATGTACT TGTTGAGTAT 1740 

TATCGTCGTT TATTAAATAA AGACACCGAT ATTCAGCCAA TCATCAGCGC TTTATGGAAA 1800 

AATCAAGGCA ATATCAGCTC CACTGCAAAA GATTTATTTA TGCATCGAAA TACTTTACAC I860 

TATCGGTTGG AAAAGTTTTT TGAACAAACT GGCTTATCTT TAAAAAAAAT GGATGATTTA 1920 

ATCTTTTGTT ATTTGTTATT AAGAAAGTAG TACGTCCATT TTTTACTCGT ATGTTTTTTA 1980 

TTCCCAATTA GAAAGTAGGC GTTCCAATGA AAATTACCGC ATATGTCGCT AGTGCATTTA 2040 

GTAAAAACCA CGAAGGTGGC AACAAAGCAG GCGTTGTTTT TATGACACAG CCCTTAACGA 2100 

GAACCCAAAA AATGGCCATC GCCAAAGAAC TAGGATTTGC AGAAACTGCT TTTATTTCAG 2160 

ACTCAAAGAT TGCTGATTTC CGCCTTGAAT ATTTTACACC AAAAGAAGAA GTCGACTTAT 2220 

GTGGCCACGC GACAATTGGC GCTTTCACCA TTATGAAGCA CTTAAATAAA TTATTGCAAA 2280 

CAGCATATAC AATCGAmACG AACAGCGGTT TGCTCGCAAT CACGGCTAAC GGTGAACAAC 2340 

TATTTATGGA ACAAACTGTC CCAACTTTTT ATGATACGCT CACTTTAGAA GAATTGAGCG 2400 

GCTGCTTTGA TTTAAACGCA GTGgCAACTG CTTTTCCTAT CCAGATTGTG TCAACGGGCC 2460 

TAAAAGATAT TTTGGTTCCC ATTAAAAATG CGCAATTATT ACAGGATTTA CAACCAAACT 2520 

TTGAAAAAAT TAAGTTACTT AGTCAAAAGT ATGGAGTTAT TGGGATGCAT CTTTTTGCTT 2580 

TGTCTGAAGA AAATATCATT TGTAGGAATT TTGCACCCCT TTATGATATC GATGAAGAAG 2640 

CCGCTACTGG TACATCAAAT GGTGCGTTAg CTTGTTATCT GCACCATCAC CAAATTTTAC 2700 

AAAAAACCAA TTACACCTTT GCCCAAGGTT ATACCTTAAA CGCTCCTTCA GAAGTCTTAG 2760 

TGCGTTTGAA CACTACATCC CAAAAGACAA TTGATCAAGT TCTAGTTGGT GGCAGAGGCT 2820 

ATTATGTTGA AACAAAAGAA TTATTTTTAT AAAAAAAAGA TGGcCTAGGC CAcTTTTTTa 2880 

ACTTCaTTGC. ATCTTCAATA AATGTGAAAA AGCGTTCTTT TAAAGAATCT GTATTGTCTT 2940 

TAACTTTTTC GCCATATTCA GCTAATTTTG AACCGCCATC TTTCACACGA TCCACAACAT 3000 

TTAAAACAGA ATCTAATTCA TCATCGGATA AATCATCGAC AATCGATAAT AATTTTTGGT 3060 
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TTCCATCAAA TTTATCGTCT ACAAACTTTT TAACTTTATA ACGATTGGAA ACATGAGATA 3120 

CCTTCTTAAT AATTTTTTCA GAAGCGATGA CTGCAACAGA GACACTTGCa ACTGCAGcAA 3180 

TGCTTAAACC GATACCAATT TTTGTTGATG CTTTCaTTTC CTTCAGCTCC TCTGTTATTT 3240 

TCTACTTTAA TCATAACATA TTATTACTTT wAkGCGTAwT rGAAaGrCCT TTACTAGCAC 3300 

TTAGTCTCGT CGGTTATTGC CACCAAATAA AATAACCAAA CGTAAAAGTT GTAAAAAGGT 3360 

CGAAAGCGCC GCTGCTACGT AGGTAAGTGC AGCTGCAAAA AGAACTTTTC TGGCCATTGG 3420 

GACTTCTTCT TCTGTTAATA ACCCACCTTC AGAAAGAATA CTTAATGCAC GTCTAGACGC 3480 

ATTAAACTCC ACCGGTAAAG TCACCAATTG AAAAATCAAG GCTAATGAAA AAGCCAAAAT 3540 

TCCAATGTTG ATTAATGTTT GATTCCAACT AAATAAAACA CCAAGCAAAA TTAGTGGAAA 3600 

GGAAATGGTT GAACCGATGT TGGCAACTGG AACAATAGCT GCACGAAGAC GTAAAGGCAC 3660 

ATAGTTCGTA TGATCTTGTA CCGCATGTCC GCATTCATGA GCCGCCACGC CAATAGCCGC 3720 

GACCGATGTC GATTGAGCAG TGGCTTCGGA TAAGCTCAAC ATTTTGTTTC CAGAATTATA 3780 

ATTGTCGGTT AAATCACCCG CAATTTGCTG AACTCCAACA TCATTAATTT GTTCTTTTTG 3840 

TAAAATATAT TGTGCTGCCT GCGTGCCTGT AACATGTTTA CTGCTTCGTA CTTGATCATA 3900 

TTTACGAAAG GTACTGTTAA CATATGCGGA GGCCGCACCA GAAATCGCAA TTCCGATAAT 3960 

TACCAATATA TAGGTTGGAT CGATTCCAAA ATAAAAGGGC ATCATTGTCC TCTACCTCCA 4020 

TCTTTTTTTC TTTATTGTAG CATGATAGTG AAAAAAAGGG AGCGACAAAC GTTGTCTTTT 4080 

GTTTAAAAAA ATGATGAATT TTTCAAGAAA GTGCCGAAAC TAACAAAATT GTCACTCAAT 4140 

TAACGGAAAA AAATATGGCA AATTTCCTAA AAAAAAGGCA TACTATTCCT TGAGGTGAGG 4200 

AAAATGAAAA CATTACTAAA AATATTAATT CCCATAGTCA TTTTAGCCGG TGGTAGCTGG 4260 

GGCGTTTATA ATTACTATTA CGGTGGAGAA GCCTATTATA CACAAATCAC AACATCTGGT 4320 

GAAAAGAAAG ATGAAAAAAC AAATTCTGGT GAAGCAATGA CCATTTATTA CTATCAACAA 4380 

CCTGCGTTTA ATAAAAACGG TGAAGAAAAA ACCGTGGAAT TAAACGAATC CCGTGATCAA 4440 

CCTTTGCGTA TGAAAGCCTA TCTAAAATTG AAAGTTAATC CCCGTAAAGG CGTGATCAGC 4500 

TGGAATGAAG TGACAGAAAA AGAAGTCCCT GAAAAAGCTT TAGAAAAATT AAAATAAAGA 4560 

AAAAACCGTT GGCAAGCGAC CAACGGTTTT TTCTTTATAA TTGATTACTA ATTTTTTCGA 4620 

TATTTTTTAG CATAATGGAC ACTGTGCCAT CTGGATTATT AACAAATTCA ATTAAATCGC 4680 

CATTGCGGTA AACATCTAAC GGAACAATTA ATTCAATACC GTTATCCATT TTTAACTTCT 4740 

GTTTGCCAAA TTTCTTTTCA GAAATTTCGC GAGATTCTCT GACTGGCGGT GTTTTGTCGA 4800 

CAAACTTCGA TTCATTCACT TCTTCTTGGT AAGCTAATTT AGCTGATACA TTTTCTTTAA 4860 

ACACTTGCTCAGCAATCTGC TCATTGTCAA TCGTCCCCGT TTCCTCAATG CTATCATACA 4 920 

CCGCTTCTTT TACAGAAGCC ATTAGTTCAA ACTCATCTTC GTTAAATTTC TTTCCAATGC 4980 

GTTTGACTGC CTTTTTAATT TCTTTGATAT TTTCTTCCAT GGAAGGTGCC GCTTGCGTCT 5040 
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CAATCACTTT TTCAGAGAAA TACCAGATTT TTTCTCCTGA AAATTCATAT CTTTTCTCTA 5100 

GTAATTCAAA CGCTAATGTA TCTAAGTCAA CCGT/V^TCCC TTCATCTGGT TTTTGTGATT 5160 

TTGAAGACAA AATTGCTCGG TTGATAATCA GTTTGTTATA AACGGATTCT TCTTCATAGT 5220 

CGACGTAATG CGTGTAACTT TCTTTATAGT TTAATTTAAT CATTGCGACA TGTAACACCG 5280 

TATCTAATTC ATATAAAATG AACAAAATAT CAGCACTAGG CGCATCTTCG CTTCCTGAAT 5340 

AAACATCATA CCAATGTTGG GTAACTTCTT GGCTTTTCTT TATAAAATCA CTAGGAATCC 5400 

CACGTAACGT GTCAACAAAG GAACTATTTT CCGCCAATGT TCCAGTCTTT GTTTGTGCAG 5460 

TCGAAAGTTT TGAAATTTTT GCAGTTAAAT ACGTTCGAAT ATACTCGGTC GTTAAATCTA 5520 

GTTCCTTTTC AGAGAAAACC GGCGTCCCAG TTTCTC6ATC AATAATGTGT AAAATTGCTT 5580 

TTTTTAAATA AATATCCATT TTTACGCTCT CCTTTTCTCA ACCTTTTTAG TCTAACTGAA 5640 

AGAGAGAAAA AAGCCAATAG TTTCTATAAA AAGArGAAGA ATTCGTAGAC AAATGTCTAC 5700 

GAATTCtTCT TTACTTCAAA AAGCGATTAC CCTTCTTCGC TGTATAACAT CCCTGTTGCT 5760 

TCATCAACTG GATCGACTTT ATTAAAAATG CCTTTCTTTC TGAAAATATA AGTTAAGCCC 5820 

ATTGGTACGA TAATGGCAAT CACCATTGCG CCTAAAAACG GCAAGTAATA TTGTGGCACA 5880 

ATAGATAAAA TTCCTGGCAA GCCACCGACG CCGATACTGG TCGCACGAAC GCCCATTAAA 5940 

GTGGCAAACA TTCCTGCCAT TGCTGAACCA ATCATGCCCG CAACAAATGG ATAAACATAT 6000 

TTTAAGTTGA TCCCAAACAT CGCTGGTTCA GTCACACCTA GGTAACAAGA AATcATCGCT 6060 

GGAATCGAAA CTTGTtCTTC TTTTTkGTTT CCmCGATGCA TAAAGaTAAC CGCTAAAACA 6120 

GCTGAACCTT GTGCAATATT TGATAGAGCA ATCATTGGCC ATAAGTTGGT TGACTGAAAG 6180 

TCAGCAATCA ACTGTAAATC AATCGCATTA CTCATATGAT GCAACCCAGT GATGACTAAT 6240 

GGTGCGTACA TAAAGCCAAA GACGGCGCCA AATAACCAAT TAAATGTTGA CGTTAATCCT 6300 

GCATTTACGA TATCAGAAAC CCAACTACCA ATCGTCCAAC CGATTGGTCC TAAAATGACA 6360 

TGTGCAGCCA ATACTGTTGG TACTAAAGCA AACAATGGGA CAAAAATCAT TGAAATTGCT 6420 

TGAGGAATCA CTTTTCTAAA GAAAATTTCA AGATAGGCAA GCAAGAAGCC GGCTAACATG 6480 

GCTGGGATAA CTTGCGCTTG ATAACCAATC ATGTTAACTT GAGCAAACCC -AAAATCCCAG 6540 

AAAGGAATTT CTGCTGCTTT AGTCGTCGCA ACTGAATAAG CATTCAGAAG TTGTGGTGAG 6600 

ACTAATGTAA TCCCTAGGAC AATCCCCAAG ATTTGTGTGG TTCCCATTTT CTTCGTAATA 6660 

CTCCACGTAA TCCCAACTGG CAAGAAGTGG AAAATTGTTC GCCGATTAAC CATAAGAATG 6720 

AATTGACTCC ATTCCAAAAT TGAGAAACTT CCACGATTGT TTGACCATCA AGAAAGCCAA 6780 

ATGGAACCCC TTCCAAAATA TTGCGGAACC CTAAAATCAA TCCCCCGATA ACAAGTGCAG 684 0 

GAATCAATGG. TGTAAAAATC TCAGCTAAAA CAGAAACAAC TCGTTGAACA GGACTCAAAT 6900 

TACTTTTGGC AGCTGCTTTC CCTTCTTCTT TTGAAACCCC TTCTAAACCG GAAATTGCAG 6960 

AAAAATCATT ATAAAATTGT GGGACTTCAT TCCCAATAAT TACTTGAAAC TGTCCAGCAT 7020 
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TTGTAAAGGT TCCTTTTACA GAAGGAATAT CTTCAATCAC AGGTACATCC GCTTTTTTTG 
GATCTTTTAA TACAAAACGC ATTCGAGTTG CACAATGAGA AACAGCTGAA ATATTTTCTT 
TGCCTCCTAC TGCTTT 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



7080 
7140 
7156 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
ATTCGTTGTA ATTCCAACTG GAATTCCTAA TACGAACTTA GAAACGTCGT ATCCATTTCT 
TGCTTTAATG TCAGTTGTTT TTGGACCA6T AGCTGGAGGG CTAATTGGTT TGATCgGTCA 
CACgTTAAAA GATTTTACCA CGTATGGTAG CGCTTGGTGG AGCTGGATTA TTTGTTCAGG 
AATTATTGGT ATTATTTTTG GCTTCGCTGG TCGTAAAATG GATCTTCAAC ATGGTGAATT 
CACAACAAAT GACATGGTGC GTTTCAATAT TTTCCAAGCC TTTGGCAATA TCGTCGTTTG 
GGGCTTAATC GCGCCTAGCT TGGATATTTT GATTTATAGT GAACCCGCAA GTAAAGTGTT 
TACACAAGGC GTATTCGCGA CTGTTTCAAA TATTGTCGCT GTGGGGATTA TTGGAACATT 
GTTGAT6AAA GCTTATGCAT CAACAAGAAC AAAGAAAGGC AGTTTATCAA AAGATTAAGA 
AACTGTCGAA ATAATTTAAA TGGTTAAGGG TCTGCCCTAG TAATTACTTT GCTAATGCTC 
AGACCCTTTC TTTTATGGAG AAACTGAGGT GAGGCCGAAT GAAAAAACCA ATCATTACTT 
TTAATAATTT TTCCTTTCAA TATCATAGCC AATCTGAACC AACATTAAAG GGGATTCAAT 
TAACCATCTA TGAAGGAGAA AAGGTTTTAA TCGTTGGGCC TAGTGGTAGT GGTAAATCAA 
CATTGGCGCA ATGTATCAAC GGGTTAATTC CCAATATTTA TGAAGGTGAG ATTCAAGGAA 
CAGCTACGGT TGCTGGTAAA AATATCCAAG AAACAAGTCT ATTTGATTTG TCTTTTGATG 
TAGGAACTGT TTTACAAGAT ACCGATGGGC AGTTTATCGG TTTAACGGTT GCTGAGGATA 
TTGCTTTTGC TTTAGAAAAC GATGCAGTTG AACAAGCAGA AATGAAAAAA GCGGTTCAGA 
AATGGTCCGA GATTGTTGAG CTGAATCAGT TATTGCAACA CCGTCCACAA GATTTGTCAG 
GTGGTCAAAA ACAACGTGTA TCTATGGCGG GCGTTCTAAT TAACCAATCA AAAATTTTAC 
TTTTTGATGA ACCTTTGGCT AATCTTGACC CACGAGCGGG ACAAGAAACA ATGACTTTGA 
TTGACACCAT TCAACAAGAA ACAAAAGCGA CTGTTTTAAT AATAGrACaT CGCTTAGAGG 
ACGTtCTTTG CGAATCAGTT GATCGAATTA TTGTCATGAA TGAgGGCACa ATTATTcTGA 
TACAACGCCA GATGAATTGT TA 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5840 base pairs 
(3) TYPE: nucleic acid 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1282 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

TAAGGATTGT TACTGCTTTG CAGGCARAAC CAGAGTCTTC CATTAACACA ATCAGTGGAG 60 

GGATTCTGGG CTTTTTTTAT AAAAAAGTGA GGTTGCATTC ATCATGAGGA TTGAAAAAGT 120 

TTTAAATAAT AATGTCGTGC TTTCAAGAAA TGAGCAAGGC GAAGAAATTG TTTATATGGG 180 

ACGCGGCTTA GyCTTTCAAA AGAAGATTGG AGACACGATC AATCCAGATT ATGTTGAAAA 240 

GGAATTCGTG TTAAAGGACT CCGCTATGGC AGGGCAATTC CAGCAGCTGT TTGAAGACGT 300 

CCCCACCAAA GAAGTAGAAG TCGTCAAACA AATCGTTGAT TTAGCAGAAA CAACGCTAGC 360 

GATTGAGCTT TCATCAAATA TTTATTTAAC TTTAACGGAT CATATTCACT ATGCGTTACT 420 

TCGAGCCAAA GAAGGAATTG ATATTCCGAA TCCTCTTGTA TTCGAAACAC GCAAATTTTA 480 

TCCTAAAGAA TTTGAAATCG CAAAGCAAGC ATTAGCGATT ATTGCTGAAA AGCTAGGGGT 540 

TCAATTTTCA GAAAATGAAG CGGGCTTCAT TGCTTTTCAT ATTGTGAATG CTGAACAAGG 600 

AAATGGCAAT ATGGAAGTTA CCATGGAAGC AACCAAGATG GTTCGTGATA TTTTAACGAT 660 

TATTAGTCGC TACTTTGGAC AGGTGTTTGA TGAAGATTCA TTGAACTATC AACGAATTGT 720 

TACACATCTT CaATATTTTG CGCAACGTTA TTTAAAACAA GAAGCGCATG aTGAGGAGGA 780 

CGAATTTCTT TTtGCATtGG TCCAAGGAAA GTATCCTAAG GCGTTTCAAG CGGTCCAACG 840 

TATTAATGAG TATTTATTAA AAAGCTATGA TCGACCAATT GATCAAGCGG AAATGATTTA 900 

TTTAACCATT CATATCCAAA GAGTGGTCAA TGATAAAAAA GCATAACAAC CAATAATCGC 960 

GGATTGTTAC TGATAATGCA GGCAAAACCT AGATTGAAAA AACCGATAAC GATTTTCGCA 1020 

6ACTGCGCAC TGTGAAGAGA AGGATCTG6C TCTTTTTCAA TGTAGGTTTT TTTGATATTA 1080 

AAAATTAATG GAAATGAGGt CTAGAGATGA GTAAAAATCA AGAAATTGCA GCACGTGTGT 1140 

TAAAAGCTGT TGGGGGCGAA GsCAmCGTTA ATAGkGTTGk TCACTGTGCy ACACGCTTAC 1200 

GtTTTAAATT AAAaGrTGAA AATAAAGCAG ATACAGCTGC ATTGAATGCT GACCCAGATG 1260 

TTATTCAGGT AGTTCAAAGT GCAGGACAAT ATCAAGTGGT TATTGGCAGT CACGTTAGTa 1320 

TGTCTATAAG GATTTGATGG CCAACAGTGG ACTAGGTAAT GATTCTGATA ATAGAGAAAA 1380 

AGAAAGTGGC GGAAATATCT TTAACCGTTT AATTGATATT ATTTCTTCTA TTTTTACACC 1440 

GTTTTTAGGT GCAATGGCCG GCGCTGGGGT TCTAAAAGGC TTCTTAACAT TGGCTGTAAC 1500 

TATGGGCTGG TTAGCT6ACA CTTCAGGCGT ATATCTTGTT CTCTTTTCGA TTGCAGATGG 1560 

TTTATTTACT TTCTTACCAA TTATGTTAGC GTTTACGGCT GCTAAAAAAT TCAATACTAA 1620 

TCCCTTTTTA. GCG6TTGCTT TGGCGATGGC CTTGGTTCAT CCCAGCATTA CTGCTTTAGC 1680 

AGGCAAAACA ATTAGCTTTG CTGGTCTTCC AGTGATTATT GGGCCAAGTG GCTATACTTC 1740 

TTCGGTTTTA CCAATTATTT nGGCTGtTTT TGCACAAAGT TATGTAGAAC GTTTCTTCAA 1800 
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AAAAGTAATT CCGAGTTTTC TACAAATTAT CTGCGTTCCG TTAGCTGTTT TCCTTATTAT 1860 

GGCACCTGTT ACCTTCTTAG CTATTGGTCC TATCGGGACG GTCATCGGCG ATTGGTTGGG 1920 

ACAAGGATAC AACGCAATTT ACGCTTTTAG TCCAATTATC GCAGGGTTAT TAATGGGTTC 1980 

GTTGTGGCAA GTCTTGGTAA TGTTTGGTAT GCATTGGGGC TTTGTACCAA TTATGATGTT 2040 

AAACTTAACA CAAGGTGGCG ATACGATGGT ACCGATGTTA TTACCAGCCG TTATTGCACA 2100 

AGGCGGGGCT GCTTTAGCTG TCTTTTTCCT AACAAAAAAT GTGAAACTAA AAGGTTTGGC 2160 

TTTGTCTTCA AGTATTACGA CTATTTTTGG AATTACTGAA CCAACTGTAT ATGGCGTGAC 2220 

TTTACCATTG AAAAAACCAT TTATTGCAGC TTGTATTGGT GGCGGTATCG GTGGTGCATT 2280 

TGTGGCTATG AATCACGTGA AAAACTTTAC GTTTGGCTTG GTTAGTATGT TGAGCTTGCC 2340 

TGGCTTTATT CXTGCAGAGA CAAAAGATAC TGCACCGATG ATTACTGGTG CAATTGGTGC 2400 

CGGAATTGCC TTTATCATTG CGTTTGTCTT AACGTTTGTC TTACGTTTTG AAGATCAACC 2460 

TAATCCAGAA ACAGCAACTG AAAAAACAGA AACTGATAAG ATGGTGGCAC CTGTAAAAAC 2520 

GAATCAAGAA GACAAAATTA TTTTAGCAAG TCCACTTCAA GGTGAAATTT TACCGCTAGA 2580 

AAAAGTACAA GACCCTGTTT TTGCTTCAGG TGCTTTAGGA AAAGGTGTTG CAATTGAGCC 2640 

GACTGAAGGC AAACTGTATG CACCCGCAGA TGGTGAAATC ACCACATTAT TTCCGACAGG 2700 

ACATGCTGTT GGCTTGACGA CAACAGAGGG CGTTGAATTA TTAATGCATA TTGGCATGGA 2760 

TACGGTCGAA TTAGATGGTA AAGGCTTTGA ATTATCAGTG AAACAAGGTG ATTCTGTTAA 2820 

AAAAGGAGAT TTGCTAGTTA CTTTTGATAT TGCTGCCATT AAAGAAGCTG GTTATCCGGT 2880 

AGTTACACC6 ATTGTGGTAA CGAATACGAA TGATTATTTA GATGTTTTGG ATATGAACCA 2940 

AACAGACGTG TTACATGGTG AAGACTTTTT AGCCATTATT AAATAAAAGA CAATTGGAGG 3000 

AGGAACAAGT ATGACAACAA CGACAAAATT TCXJ^AAGGA TTTTTATGGG GCGGTGCGAC 3060 

CGCAGCCAAT CAATTAGAAG GGGCTTACTT AGCCGATGGT AAAGGGTTAT CTGTGGCAGA 3120 

TGCGATGCCG GGAGGAAAAC AACGTTTCGC TATTTTAG6A GATGAAGCGT TTGATTGGAC 3180 

GATTGmCGAA ACGAAATATC GCTACCCTAA TCATACAGGA ATTGATCACT ATGATCGCTT 3240 

TAAAGAGGAC ATTGCGTTGT TTGCTAAAAT GGGCTTCAAA TGTTATCGCT TTTCAATTGC 3300 

GTGGTCGCGT ATTTTTCCAC AAGGAGATGA AACACAACCG AATGAAGCCG GCTTGAAATT 3360 

TTATGATGCG GTTATCGATG AATGTTTAAG CTATAACATT GAACCAGTAA TTACGATTTC 3420 

TCATTATGAA ATGCCACTGC ATTTAGCGAA AGAATACGGC GGGTGGAAAA ATCGTCAATT 3480 

AGTTGACTTT TATGAACGTT TTGCGGAAAC TGTTTTAACA CGTTACCACC AAAAAGTTAA 3540 

CTATTGGATG ACATTCAATG AAATTAATTC AGCTTTTCAT TTCCCAGCGC TTAGTCAAGG 3600 

ATTGGTAAAA.AGCAATGGAG CCAAT6ATTA TCAAAACATT TTCCAAGCAT GGCATAATCA 3660 

ATTTGTTGCA AGTAGTAAAG CTGTAAAAAT TGGCCATGAA TTAAATCCAG AATTACAAAT 3720 

TGGATGTATG ATTATTTATG CCACAACGTA TGGCATTGAT TCGAATCCAG TCAATCAAGT 3780 
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TGCTACAATG 
TGAATATCCA 
TGAAGAAGGA 
TTATATGTCG 
TTTAGGCGGC 
CGAAGGCTTA 
TGTGGAAAAC 
CTATCGGATT 
CGTCGATTTA 
TGAAATGAGT 
TAAAAACCGT 
ACAAAACTTA 
TTCTGTTTCC 
CCCAAGCTCT 
TGAGAGGAGC 
CGAGTGGTTC 
TCTCGCATAC 
AAACAAAAGC 
ACCATGGAAA 
CAGCCTATAA 
CCGTCATTCC 
GAGAATCTGA 
TCAAAAACCA 
AAGAGGTCGC 
GAGACAGAAT 
TGCATTCTGA 
GGAGGAACAC 
GCTTATTAGA 
CAAATATTGT 
AAGAAGATTT 
AGAAACAAAT 
AAATGATTGT 
ACATGGACTA 



ATTGAAAACC 
GCATACGCAG. 
GATTTAGAAT 
ACTGCTGTCG 
GTTAAAAATC 
CGAATCGCCC 
GGCTTAGGTG 
GATTATTTAC 
ATGGGCTATA 
AAACGGTATG 
TATGAAAAAA 
GACTAGTTAT 
GCTGTTTATT 
TTTTTGTCAA 
GTTTCTAATG 
TGGCAAGACC 
TACGAGGAAA 
AAGTTTTCAA 
CTATTACGGT 
TGCGTTAACT 
AGTTTTTTTT 
TCCTT^AAATT 
ATTGGAACAA 
TGGCTTGTrA 
CTAACATCAA 
ATAGAGAGTG 
AAATATGCCT 
TGCTGCGCAA 
TAAATTAGCG 
GTATTTTTAT 
TATTGCGGAA 
CGAAGTGCTA 
TCGTGCtTCA 



AAGAGTTTAA 
AaCGCATGTA 
TATTAAAAGA 
ATGTTACTGG 
CCTTCTTAGA 
TAAATGAATT 
CAATTGATAA 
GTCGTCATAT 
CGCCGTGGGG 
GGTTTATTTA 
AATCGTTTAA 
TGTAATTAAA 
AATTTTTAAG 
TTTTTCAAAA 
ATTAAGCAAT 
GCAATTACAA 
AAACGATTGG 
GCCTTGATTG 
GTCGGTGTAG 
TTTCCAGGCT 
GACGTTTCAA 
ATTGAAGAAC 
TATCCTAATT 
CAAGAATGCA 
TTTTGTCTTG 
CTATAATGAC 
ACGAATACGT 
ATTGAATTTT 
GGAATTCCGC 
TACTTTGCGA 
AATGGTGATT 
ACTGGaGAGA 
cGTCGGGTGA 



391 
TTATTATTGT 

TCAAAAATAT 
ATACCCAGTG 
TACTACCAAC 
AGCAAGTGAA 
ATATAATCGT 
AGTAGACGAA 
TGAAGCAATG 
CTGTATCGAT 
CGTTGATTTA 
TTGGTACAAA 
CAAAACAAGA 
CTTGGACCAA 
TAGTGCACAA 
TACAACATCC 
GTAAGGTTTT 
ATGAAGTGGA 
AAACCAATCA 
CTGCCATCGT 
TTCAAGCGGT 
AAGAAAATAT 
GTTTAAACCT 
ATCAACGAAT 
TCAAACACTA 
CTAGGTGCTT 
ATGGTGTCAT 
TTTTTCATTT 
CGCGCCATTC 
GCGGTAGCTT 
CATTAAGAAA 
TAATCGAAGC 
ATGCTTCTTT 
CGGGATnAAT 



ACAGATGTTC 
GCTGTGAAAG 
GATTACATTG 
GATACAGCAA 
TGGGGTTGGC 
TATCAAAAAC 
AATTTCTATG 
GCAGAAGCAG 
TTAGTCAGTG 
GACGATGAAG 
CAAGTTATTG 
TAAGCGTAGT 
AAATCTAAAG 
TACGTTATAA 
ATTTTTTGTT 
TCCGAAAAAC 
TGGCGTGGAT 
GTTAGTGGAG 
TGAAACAACG 
CTTTGAGCGT 
TTATCAACGT 
TTATGATCAA 
AGATGCCAAT 
TTATTAATCA 
GATTTTTTTT 
ATGACAATGT 
GCCAGAAGAA 
TTTACAAGAA 
TTATCAATAT 
AAATAGTGAA 
CATGGATGTC 
CTATCGGAAT 
TTTAGCCGAC 



AAGTTCGTGG 
ACTTAGTTAT 
GCTTTAGTTA 
ATGGGAATCT 
AAATTGATCC 
CTTTATTTAT 
TGGCAGACGA 
TTGCAGACGG 
CTTCCACAGG 
GAAATGGTAC 
AAACTAATGG 
AGGTTGTTGC 
TGATTTTTGG 
TAGAAACAAC 
ATTATGGGTC 
TATAAAGTTA 
TACTATTTTG 
TATGATTTTT 
AAAGAGCATC 
TTCGGCGAGT 
TTAAAACAAC 
GAAATTCTTA 
GGACCTATAA 
AGCAACTGGT 
GTAAAAAACT 
GTCATAAAAG 
AAGCAACAAC 
GCGTCAATTG 
TTTGAGT^CA 
CGAGATTTAG 
TATTTCTCCA 
TTATTTGTCA 
TGGGGGAAGA 



3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

54 60 

5520 

5580 

5640 

5700 

5760 
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GGGAAAAAAA ATAGGGAAAC CAGCCACTGT CCATnAAGCC TAGAnGGGAG GAAAAAGGnC 58 
CATGCCTGCC CCATGGCGGA 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6590 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



CGATCTTTTT 


GACGACGTTC AATTTTACGT TTTTTCTTAT 


TGCTGTCTTC 


AATTGCCCAA 


60 


TTAATTTTTT 


TCTTGTATCC TGGTTTAATT TTTTTCTTTT 


TCTTTTTCAC 


TAAACCAATT 


120 


AACGTTGGGT 


CTAATTCTTC TCGAGATTTT TCTCGTTTTG 


TTCGACGATT 


ACGATCATAC 


180 


GTCTCTACAA 


TCTCACCATT TTTAATTTCT TTTGGTTTGA AAGAAACACC 


TAACTGCTCA 


240 


ATCTGAGTAA tCGCTTCATC ATCTgCTGGT GAATATAATG 


TTATGGCAGT 


CCCATTcAAT 


300 


CCATTACGTC 


CTGTGCGGCC AACCCGATGG ATAAAGAAGT 


CCAATTCATG 


TGG7UVCTTCT 


360 


GCATTGATGA 


CATGTGAAAC CCCTTCAATA TCAATCCCAC 


GTGCGGCTAA ATCAGTAGCT 


420 


ACTACATATT 


GGTAGTCAAG ATTTTGAACT TGACGCATCA 


CACGTTTTCT 


TTCGCGTGGT 


480 


GTAATGTCTC 


CATGAATTTT GGCAACTTTT AGGCCCTGAT 


CTTTTAAGTA 


ATCGGTGATT 


540 


TCATCAACTC 


TTTGTTTAGT ATTTGCAAAA ACAATCGCTA 


AATAAGGATG 


CCCGATAGTC 


600 


AATAATTGAT 


AAATTATTTT ATTGCTGTTT TTTCCTTTGG 


TGGAAATCAA 


CCAATTGTCA 


660 


ATTGTTTCTG 


AAATAACTGC TTTTGGTTTG ATATGTTCAA 


TCACTGGATT 


TTCTAAATAT 


720 


TTTTTTAAAA 


ATGGTCGTAA TTTTTCTGGA ATGGTTGCAG 


AAAAAACGAG 


CATTTGTAAT 


780 


TTTTCTGGCA 


AACGACCCGC AATTTGATCG ACTTCTGCTA AAAAGCCCAT 


ATCTAATGTC 


840 


ATGTCTGCTT 


CATCGACAAC AAAGGCAAAA GCCGTATGCA 


CTTTCAAGGC 


TTGTTCATTC 


900 


ATCATATCTA AAATACGCCC TGGTGTACCA ATCACCACAT 


GAGGTTGCTG 


ATGCTTTAAT 


960 


TTATTTAATT 


GACGTTGTTT ATCTGTTCCG CCAACAAAAT 


TTGACACTCG 


AATTTCTGkT 


1020 


TGACTGAAAC 


GAGCCAGTTG tGCGCTTCTT GATAAATTTG 


ATTGGCTAAT 


TCACGACTAG 


1080 


GCGCAGTAAT 


CACAATTTGT ACTTCGTCGA TAGTTGGCTT AACTTTATCC 


ATTAATGGAA 


1140 


GTAAAAAAGT 


ATGCGTCTTT CCACTTCCTG TTTGCGATTG 


TCCAATAACA 


CTTTTCCCTT 


1200 


TCTTAATGAT 


TGGGATTAAT TTTtCTTGGA CTTCTGTTGG 


TTCTTCAAAC 


CCTTTTTCTG 


1260 


CGAGTGCTTC 


GTTGATAAAA GGTTGAAATT GAAATTGTTT 


AAATGAAGGC 


ACTTGCCTCA 


1320 


CCTCTTCTCG 


TTAAATTCTT TATGCACTTT GGCATTATAA CACAAAAACA 


ACTTTTACTC 


1380 


ACCTGCTGAT 


TCCTTTGTTT TCGCAACTTC TAATGTACTG 


ACTGTATAAG 


TGTGGCTGGC 


1440 


TTTTTCATAA 


AACAAACGCC GTTTTTTACG AATCAAGGTT 


AACAAAACGT 


TGATCAAAAA 


1500 


TAGTAAATAT 


AAAATTAAAC AGAACAAATA CAGAAGCAGT 


GACACCATTT 


GCAACATCCT 


1560 
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ATTAGAAGAA 


TTCAATAACT CAGTGGCATT GGCTAGATAA ATACTAATTA AACCATACAC 


1620 


AAAATACAAA TAACCGTAGC GAACAAACAA GGCTTTTAAA CGAATTTTTT CATGTCCTGT 


1680 


TTCAACCACC 


CGAATGCGAA CAACTTTTTT 


CCCTAACGTT 


TGACCATTTG TCAGTTTCAT 


1740 


CATGACGATA AAGACCGCAA ACACCATGAT AAAGAACCAC ATCCGATCTG AAAAAATCGG 


1800 


ATAATTGGTT 


GCCCAATTTT TATAAGCTGG 


AATGAAATTT 


AGTCCAAAAA CGAGCAGGCC 


1860 


TTGCAC7\AAG 


CCAATCACCA ACCAATCAAT 


TCCCCATGCA AAgAAaCGTC TTAACAGGCT 


1920 


AACTGTCTGC CCTTTTTCAT AAGAgGTGGC ATCAATTTCT 


TCTCTTGTTG GCAATAAAAA 


1980 


GGTAAAGAGC 


GGCGTCAACC AATAACCAAC 


TATGCCTCCG 


ACGGTATTAT TAATTAAATC 


2040 


ATTGACATCC 


GCCAAGCGAT ACGAACGTGG 


ATAAATAAAA 


TAAAGGCCAG ATAACTGTGT 


2100 


TAATTCAnAA AATAATGACA ATAAAAAACT 


CGCAAGAACT 


GTTTTTAAAA ATGAACATTT 


2160 


GAAATAATAC 


CGTAAATACA CACCAAAAGG 


TAAAACTAAT 


AAmACATTAA ACAGTGGTTC 


2220 


TAAAACGGCA TGCTGTTTTA ATGCTGGCAA ATACGTACTA GGATCTTTGA TCGAAAAAAC 


2280 


AGTTTCGTTT 


AAAATCCCTG AAATAAAATG 


AAAAGGCCGC 


AGTTCCATTG TTGGTCCTGT 


2340 


GTATTGAGCC ACTTCTGCTC 6AGATGGCAA CGGTAAAATT ACCAAGAAAT AGGCACATAA 


2400 


TAAATAAAAA 


ACAAATGAAT ACAAAATCAT 


TGCTCGTGTC 


CAAACAAATG AACCATATTT 


2460 


GCGGTATTCA 


TAGATTAATA ATGCCCCAGA AATAAAAAAT 


GCTAGAAACG GGAAAATAAT 


2520 


GATTGCCCAT 


TTAATTGGTT CGCTATATAC 


TGCCATAAAA ATCCCTCCAA GATAAACAAT 


2580 


TTCAAGTATG 


AATGCCTCAA TAACACTATA 


CTTGTTTCTA 


AAAAAATGTA CCAACAATAA 


2640 


ATTTTATTTT 


GCAAATCCTC TCCTTTGCAG 


TATCATTAAG 


AAAAAGGAGG ACGTTCTCAT 


2700 


GTTACAGCCA 


tattttv;::gt ttggagtgcc 


GCTATTTTTA 


CTTGTGCTCT ACTTACTCTT 


2760 


TGCACTGATT 


CACCGTCAAA CAACGATTCA 


TTATTTACGG 


TTTATTTTGT TACTTATTTC 


2820 


AACATTTCTC 


ATGGTTTTTA GTTTCCAAGT 


ATTGCAAGAA 


TCATGGACCA TCAACCCTGA 


2880 


AACGTTAAAA 


GATGCCGCTT ATTCTCCGCA 


ATGGTTATGG 


ATTCCTTTAG GTATCGGGTT 


2940 


AATCCTCACA 


CTTTACAACG CTTGGCACGG 


CCTTCGTACA 


ATGATAAAAT ACAAGACCGA 


3000 


TAAACATTAG 


TAAGAGTCTA GGACGATGGT 


TGTGTCCTAG 


ACTCTTACTG TCTCATCATT 


3060 


CAAATGAAAT 


TTAATTTTTC TTAACAAACT 


CTGATTTTAA 


CTTCATTGGA CCAAAGCCAT 


3120 


CTACCTTACA 


ATCGATGTTA TGATCACCTT 


CCACTAAACG 


AATGTTTTTC ACTTTTGTGC 


3180 


CTTGTTTAAT 


TGCGCCACTG GCACCTTTAA CTTTCAAATC 


TTTGATTACA GTAACACTGT 


3240 


CACCATCCGC 


T7U\AAGATTG CCATTTGAAT 


CCTTAACTAC 


TAAGCCTTCT TCTGC7VACTT 


3300 


CTTCTGTTGG TGACCATTCA TGACCGCATT CTGGACAAAT AAATAATCCG CGATCTTCAT 


3360 


AGGCATATGC 


CGACCCACAT TCAGGGCAGT 


TTGGTAATTG 


CTCTGTCATT AACGTTCTCC 


3420 


TTTGTAAAAA 


ATAATTCTTT TTCATTTTAC 


CTTATTTTCT 


AGTGAATGAA AAGTAGTCCT 


3480 


AATGATTTAT CAACGTCCGA TAACTACTTT CTAATTACTT ATTTATTTTT GAGCCCACTC 


3540 
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TTGGCAAGCT GCTTGGATTT CTTGGTAAAT CACTTCAATT TCAGCCGCAT CACGTGCATT 
AGCGCCACTT GCCAAAGGAT GGCCGCCGCC ATGATGTTTT TTAGCAATTG TATTAATGAT 
TGGTCCTTTG GAACGTAACC GAACGCGATA ATAACCTTCC GGCTGTTCAA CAAAGATTGC 
CCATGAGAGA ACATCTTCAA TTTTTCCTGG CATTGAAACA ATGGCTGCCG TTTCAGAATC 
GACAATATCA TAACTGTTTA AAATACTTTG TGGTAAAACG ACACGAGCAG CGCCATTTGC 
GTCGATTTCT AAATTTTGAT AAATATAACC CGCTAATTTA GCAACCTTTA AAGGCATTTG 
GTCTAACTCA CGATTTAAGT CTGCGGCATT AAAGTTAAAC GTACGCAATT GCGCACGACA 
GCGAGtGTAT GCGCCGAaGT GGaAGGATAA AGAAAACGTC CTGTATCCCC CACAATCCCA 
GCATACAATA AACGAGCTGC ATTATCTGTC ATCGTTAATT CTGCTAAATG CTGTgCCAAA 
T^TCAACAAT AATTTCACTA CAACTACTTG CATTTGTATT CACCCAAACT AAATCTCCAT 
AAGGTTCATC ATTCGGATGA TGATCGATTT TAATTAATTG ATCGCCTAAA GAAAAACGCG 
CATCACTAAT TCGTGGAGCA TTTGCAGTAT CTGTTACAAT GACTAATGCA CCTCGATAGA 
CATCGTCAGT AATAACGTCC ATTTCAGCTA AAAACTCTAA GCCTTCCACT GGTCCGCCTA 
CTTGATAaAT ATTTTTTyCA GGAAAGCTCG CTCTTAAGAG TTCTGCTAAC CCAACCTGTG 
AGCCAATTGC ATCTGGATCC GGCCTTTGAT GTCTATGAAT AATAATTGTT TCATATTGTT 
TAATTGCTGC CATAATTTCT TTTACTACGT CCATTCTTTG TAGCCTCCTA CGTTCTCTCC 
ATCACTTGGC AAACAACAAT TGCTTTTGCC ACAATGACAT TTTCTAAATA AACTTCAATA 
TCTT^TTtTG CAGAGCGACG GCCAATTTCT AAAATCCGTG GACGAATGTC CAATTCACTT 
TCCAACTGGA TTAGCCGCAA ATAATGTAGA TTCACTTGTT CAATTAGTAC ATTTCTTTTT 
TGATTCATCA ACATTGTTCG TTGTGTCACG TTAGAAATAA TTTCACTTAG GACACCGAAC 
GAGATGGTAC CAACACTGTT AACCATTTGG GGGGCAACCG AAAAACGAAA ACGGGTTGTT 
TCTTTGCCAT TTTCTTCAAC AGGCATCACT TCCCCGGAAA TTTGATCTGA AATTGTATCT 
GCAATTTGTG GTTGTCGTTG CACCAATTGC ATAGCTTTCA TCACATCTTG GCGAGAAACA 
ATCCCAACAA GCGCTAAATC ATCTTCCACC ACCGGCATGA CTTCTAGACC ATCCCAAATC 
ATTTGATGAC TGACACTCGC TACACTCATC ATCTTTTTCA CAACATTGGG ATCCTTTGTC 
ATGACTTTAT CCACGGTTAA ATGTTCACTT TTACCAATGA CATCTTTGGC TGTGACAATC 
CCCACTAAAC GTAATTTCTT ATTTACTACT GGGAAACGAG AATGATGGGT ACTTTCAGAT 
AATGCCTGAT AATCTGCAAT CGTCTGAGTT GTCAATAAAT AATTGGtTTT TTCAAGCGCT 
GTATAAATAT CACTGACGAG TAAAATGTCT TTTTTAATCA GCTGATCACT TAAAGCACGG 
TTAATCATAG TAGCCACAGT AAATGTATCA TATGTGGTTC TCAAGACAGG CATACCCAGA 
CGATCTGCTA AcTGCGCAAT CTCTTCAGAT GTTTCAAATC CACCTGTAAT TAAAACGGCT 
GCCCCATTTT CTAAAGCAAG CTTTTGCACA CcTTGGCGGT TACCAACAAT CATTAAGGAT 
CCGGGAGTAA TATATCGCGT CATCGCGGTT TCTGTCATCG CACCAATTAC AAATTTATTA 



3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 
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AGGACTTTGT CTAGACCAGA GcTACCGCCT AACACATCCC CTTCAATAAT 
TCACCAAATG TTAATTTTTC AATATGTTTT TTTAACTTCC GCTCAATTCG 
ACACGTTGAA TGGTGGAAAC AAGCCCAATA TTTTCTGCAT CTTTGATGGC 
GTTCCTTCAC TAACACTTAA ACTTTTGGCA ATGCTTCGAA CAGAGATTCG 
GGTAAGTCTT CAATATGTTT TAAAATTTGA TCATGTTTCG TCGCCATAAT 
ACTAGGATCG TCTCTCCTGG GTTTAATACT TTCCCGCCAA CTACTTGCTC 
GCTGGATTTT GTTGGATAAC AGGAAAGGTA TTGTAATGAA TTGGAATGGT 
TTTAAGTAAG CGATTGCTTC TAGCGCATCC TCTGGTCCCA TCGTATAATT 
GGTAAGAACG CTACATCAAT GGATTTGTCT TTTGCAAAAA GGCGCATATC 
GCCGTATCAC CAGCGTGATA AATTTTTTtA TCTTCTGCTT CTAAAATAAT 
TCGCCCATGT AGGTCATTAC GCCGTCTATT TCATAACCGG AACTATGCAA 
AATTTAACCG T^CCAAAAGG AAAGACATAA CGCCCACCTA GATTCATTCC 
ACACCTTGCG ACTGCGCATA CATAGCCACT TCAGCTATCG CAATAATCGT 
GCTTGAGCAA TTGCTAGCAT ATCGCCAATA TGATCGCTGT GTCCGTGGGT 
TAATCAGGGT GTAAATCTGC AAGTGAAACG TcCGCTAAAG GGTTGCCATT 
TCAAATAAAA GGTTCGTGCC GTCGTTTAGA CGAATTTCAA TACATGAATG 
ATCAGTTCCA TTTGTATTCC TCCTACGGTA TTTTCTACAC AATCTTATGA 

kCAAATTCAT yCTaAAAGTA GkaCaGTTaG TACAGTTACC TTCCTTAATG 

(2) INEX>RMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
ATAAAGAGGT AAATGAAGGA GAAATGCGAT GAAAAAAAGA AAGGCAAGAT 
GGTATGTATT GGAATTTTAG GGATTTATGC GGGCGCCAGT TACTACAATT 
AATCGCTCAA GAAGAGACCG AAACGAGGGA CGAAAGTCGA CAGCTATCGC 
AAGAGCGGAC CAAGCCAATC AAGACCAAGG AACCCTTACT TCTTCAACGC 
AATAATAAGT TCGTCTGAAC AACAAGAAAG TTCAACACAA AATGACAGCA 
GGAAAATAAT GTGGAACCAT TAGAATCTGA AAATAGTTTA CCGCCACAAG 
TGCTAGTACT.AAACCAACCG AATCGTTACA AAATGAGCAA CCA6TATCTG 
AGCGCTCACT GAGAAGTCAC CTCATAATCA GGAACCACAA AATTCTACAT 
GACCAAAGAG CCGTTGGTAC ATTCAGAGGT AGACAATAGT CAAGAAAAGG 



CCGGACCACT 
TATCGTACCA 
CCGATACGCT 
ATCGCCAATT 
TTCCTCCTAA 
TACGAATATT 
TATTTCGGGA 
ATCGCCAaTT 
TGAAAACAAC 
CCCACTTGCT 
AGCTGGCACA 
ATGCGCTTTT 
TGCCTTGTTG 
GATTAAGATG 
AATAAAAGGA 
TCCATGTCCT 
TAATTAATCG 



ATTTGAAGTT 
CGGAACCATT 
TGGAAGAAAA 
TGAACGAAAC 
GTGACCATCA 
AAAAGACAAT 
AAGAGCGTAA 
TAACTGAAAA 
AAAAGCAAGC 



5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6590 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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AGGAATTCAA CAACAAGCCG AGGCAAATTT TGTTGTCCAA GAGAATGCTT CTACTTCTGA 600 

GTTTATTAGA AAAATCGGTG AAGAAGCACG AGTTATTGGT CAGCGACATG ATCTATACGC 660 

TTCTGTAATG ATAGCTCAAG CTATTTTAGA GAGTGGCTCG GGC7\ATAGTG CGTTAGCGGC 720 

GGCGCCAAAT TATAATTTGT TTGGCATTAA AGGAGCTTAC CATGGTCAAA GTGTCAGCTT 780 

TCCCACACAA GAAGATGATG GTAAAGGGAA GATGACCACC ATTCATGCCG ATTTTCGTCA 840 

GTACCCATCC TATAAAGAGT CATTAACAGA CTATAGTAAA TTAATCATCA ATGGGCTGGC 900 

GGGTAATCCA ACGTTTTATC ACGGCGTTTT GAAAGCAAAC ACGACGAATT ATCAGCAGGC 960 

AACAAAATTT TTAACAGGTC GTTATGCAAC GGATACATAT TACGATAAAA AATTAAACGC 1020 

GTTGATTGAA ACCTATCAAT TAACTGAATA CGATCAAGAA AAGAAGAAAC CAGTAGTGAC 1080 

TAATTTGGCA GAAGAAAAGA AACCTTCGTT TGATAAGGAA GCAGTCAAAG AACAGCTAAA 1140 

AAAAGAAGCA GTCGTCTATG AAGTAAGCAA AGGAGATTCT CTCGCCACCA TTAGTCAGAC 1200 

GTTTGGTGTG TCAGCAACAG CCATCTTAAA ACAAAATTCA AAGACTCAAG AAATGTTTTA 1260 

TATTGGCCAA AAGATCACGA TTCCTCAACA TACAGCGGCG ACTTCTCTAG AACCGAAAGA 1320 

ACAAGCGTTA CTTCAATCAC TAATAATCAG TAAATCGGTG ACTAATGCTT TGAAGACAGC 1380 

AGAGCAATCA AATGGTAAAG CGAATGACTC AAAAACAGCG TCTACTGAGT ATTATGAAGT 1440 

AAAAAGAGGA GAGACACTCG CAAAGATTGC TAAAAAAACG GGCTATTCCT TAACCGCCCT 1500 

CAAGCAGGCA AATGACTTAA CGTATTCTGT ATTAACCGAA GaCAAACGAT AGCACTTCCT 1560 

AAGCTAAAGG ACTAATGCTG ATTTAACTTA TCGGTCGATG GAGTCAATAT ATTATTAGgG 1620 

TAAGAGAAGA GGAGAAGTTC ATCCATGGAT GAACTTCTCC TCTTCTCTTA CTATGTTTTT 1680 

CATAAATACC TTAATAAATA GTAAAATTAG CAAGAGAATT GTGTCTTTAC ACAAAATAGA 1740 

TAGGTTTTCT TGCCGAAGTT TGATAAAATG ATTAGTTGAA AGGATGCGTG AAAATTGAAA 1800 

GAATTATTTC GTAAATACCG AGAAGTACTG GCTTATTTAT TTTTTGGTGG TGCAACGACG 1860 

GTCGTCAACT TAGTTGTGTT TTTTGTTTGT CAAAACGTGT TAGGGCTGGA TTACAAGATA 1920 

AGTAACGCAA TTAGTTGGTT TTTATCTGTT CTATTTGCCT TTTTCACGAA TAAATATTGG 1980 

GTATTCGCAA GTAAGCATGA AAGCATA6CT GGTTTTTTTA AAGAAATGGG CTTATTTTAT 2040 

TGGTACCGAA TTTTATCTTT TGTGGCTGAT ATGGGTTTAA TGATTCTCCT AATTGATGGC 2100 

ATTCATTTTT CATCGTTTTG GGCGAAAATG ATTACACAAG TTGTAGTAGT GATTTTAAAT 2160 

TATTTCTTCA GTAAATTTTn CATTTTTaAA GAAAAAGAGG tCTAAGCAGA TTTtAGGCTA 2220 

^^^^^^^^^^YYT TTTTGTTATA ATATAAACAA GTTTTTAATA CAAAGAATAG AGGTAGAAAa 2280 

CGATGGACTA TACATATGCA GATGATAGTT TGACATTGCA TACAGATATG TATCAAATCA 2340 

ATATGATGCA. AACATACTGG GAATTAGGGA GAGCAGATTT ACATGCTGTT TTTGAATGTT 2400 

ACTTTAGAGA GATGCCCTTT AATCATGGct ACGCAATTTT TGCAGGTTTA GAACGTTTAG 24 60 

TCAATTATCT CGAAAATTTA ACGTTCACAG AATCTGATAT TGCATATCTC CGAGAAGTTG 2520 
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AAGAATATCC 
GCTCTGCTTT 
CTTTGGCACA 
TAATTGCGAC 
TTGGAACACG 
TTGGTGGGGC 
GTGGGACTCA 
CGTATGCCAA 
CGGGAGTGCC 
GTGTGCGGAT 
ACGAAGCAGG 
TTTTGAATCT 
TTACCGCCTA 
AAGATGGTCA 
CAGGAAAAAA 
ATGTGACACT 
TTCATACCTT 
TTTTCGTTGA 
CAAAAGAAAA 
ATCCaGTGGA 
GTAAAGACGT 
TCAAGAAAAA 
TCGTAAAAGT 
TGTCTTGGGA 
GATGACTGAA 
TCCTTACGGT 
AGATGTTTCT 
AAACGCTGGC 
GATTACACAA 
AGCCGAAAAT 
GTTATTTCGT 
AGCCtTGTAT 
TGAAgTTGCC 



AGAAGATTTC 
AGAAGGGGAT 
ATGTCAATTG 
AAAAGCGGCT 
CCGGGCACAA 
GGACGCGACA 
CGCCCATTCA 
AACACATCGG 
AAGCGCAATT 
TGATAGTGGC 
CTTTACTGAA 
AAAAATGCAA 
TGATCAACCG 
AATGAAAGAT 
ACAAGTCTGG 
TTGGAACGAA 
TATTAATAAG 
AGGAAAACGT 
TTTAGATTCA 
CCTTTCAACA 
TAAACATTTA 
ATCATTCAAG 
ATTGATTTTC 
ATTAGTGGCG 
ATGCGCGAAG 
GAGCAAGCAG 
TTACGAGTGG 
GTTCAAATTA 
TATGCAGTAG 
GTGACAGCTT 
TTAAATAAGC 
CTAAAAATAC 
TTAkGTGTCA 



TTGACGTATT 
TTGGTTTTTA 
GTAGAAACTG 
CGAATTAAAT 
GAATTAGATG 
AGTAATGTTC 
TTGGTTCAGT 
GATTGTGTTT 
CGAGTAGCTC 
GATATGGCTT 
GCAAAAATTT 
AAATCAAAAA 
GCTTTAGGAG 
ACCATTAAAC 
CGCATTACAC 
GATCCTCGCC 
TATGTCCGTG 
GTGTACGAAT 
CTTTGGGAAG 
GATTGTTGGA 
ACGGAAAC6G 
AGTTAGGTGT 
TGAAAGCTTA 
GACAAGATTC 
AAACAGGAGA 
ATGAAGCTGA 
ATATTAAACC 
GCGATTTTAA 
CTGGCGAAAA 
TCTTCACCAA 
GCCAAGGAAA 
CAACAGCTGA 
CGTATGATGC 



397 
TAGCAAATTT 
ATAACGAACC 
CCTTGCTAAA 
CCGTGATAGG 
CTGCCATTTG 
GTGCTGGTAA 
CGTATGGAAA 
TCCtTGTTGA 
GAGAAATGGG 
ATATTTCTAA 
ATGCATCGAA 
TTGATGTATG 
CGGTGTTTAA 
TATCTAGCAA 
GTAAATCTGA 
AAGAAGAAGA 
ATTTTGAAGC 
TACCAACTTT 
AATACAAGCG 
ACCACAAAAT 
TGAATAAGGA 
GTTACCGACA 
TTTAACTAAA 
AACATTAGCT 
TATGTCGTAT 
TGCACAAGCC 
AGCAGTGGAT 
TAAAGGGAAC 
TGCTGGCGCA 
ATATGGCGAT 
AGCTTTGCTG 
TCTAGAAGAT 
CATTGATGAC 



TGAATTTAAA 
GTTAATTCAA 
TATGGTGAAT 
CGATGATCCT 
GGGCACAAGA 
AATTTTTGGT 
TGATTATGAA 
TACGTATGAT 
GGATAAAATT 
ACGAGTTCGT 
TGATTTAGAT 
GGGCGTGGGT 
ATTAGTTTCA 
TGCTGAAAAA 
TAAAAAGTCA 
AATTTATATG 
ACGGCCAGTA 
AGATGAAATT 
GGACTTGAAT 
GAATTTATT.G 
GGCCTAGAAA 
ATTGATCCAA 
CATCCTTTTT 
GGGCGCTTAG 
CAATTTATTG 
GCGCTAGCCT 
GCAATGGTTG 
ATGAAAGCAC 
GTAATTGGCA 
GGCGGAgcGG 
AAAGAACTTG 
GACAAACCGT 
TATTTGGAAG 



TGTACTGTAC 
ATTGAAGGTC 
TTCCAAACAT 
TTATTGGAAT 
GCCGCCTATA 
ATTCCGGTTA 
GCATTCATGG 
ACTTTGAAAG 
AATTTCCTAG 
GAAC7VACTGG 
GAAAATACGA 
ACCAAGCTGA 
ATTGAAGGTG 
GTGACAACAC 
GAAGGGGACT 
TTCCATCCCG 
CTCCAAGATA 
AAACAATATG 
CCGCAGAAAT 
GAAAAAGTGC 
TGACTU^CACT 
AAGAAGAAGT 
TAAAAACCTT 
CACAATTAGC 
CGATTCGCCT 
TTATCCAGCC 
GTTCGCTAGA 
GTCAACGTAT 
CAGACCACGC 
ATATTTTACC 
GGGCACCAGA 
TAGTTGCAGA 
GCAAAAAAGT 



2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 
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CTCGGAAACA GATCAACAAA 
TTTACCCATT ACAATTTTTG 
ACGATGGTCT GTGTTTTTTT 
TATCCCATGA TAATGAAAGT 
AAAAGGTACG CTCAATAATG 
ACAAAATtGG GGATTACAAT 
CATCAGCCGA ATTGGGCGTA 
ATGTTCTGGG AAATGATTCA 
ATTACCGCTA TCGTTGCAAC 
ATTATGTTGA CTGGTGGTGA 
CTGAAGTCAT TATTGGATAA 
GAAGATGTTT CTGTTGAGGA 
CTAGTTCCAG TTGATGGCTT 
ACAGGAGAAT CAAAACCAAT 
AATGGTGACG GCTCTTTGAA 
ACAATTGTGA ACTTAGTGAA 
GATCGCTATG CGGTACCTTT 
GTTTCAAAAA GTCCGACACG 
ATTCTATCTG CCCCAATTGC 
GTTATTAAAT CGGGAACGAT 
AAAACAGGCA CGATTACGCA 
GGAATAACTG CTGCTGAATT 
ATTTTAGCTA GATCAATTGT 
ACAGATCTAG CGGAAGTTTC 
CGGGTAGGTA AAAAGAATTT 
ACGACTATTC ATATTTCACG 
GTACGCCCAG AAGCAAAAGA 
TTAATGCTGA CGGGGGATCA 
ACCGAAGTAC ATGGGGAATG 
AAAGAAAATC ATCCAGTCAT 
GCTGCAGACG. TAGGTATTGC 
GACGTTGTTA TTTTAAAAGA 
GATACCATGA AAATTGCCAA 



CAATTGAAAA 
ATGATTTTTG 
ACATAACTAC 
CGTTTTCATT 
TAAAGTGAAT 
tATAACAGGA 
TGGCATTATT 
AACCTTACGT 
CTTAGCTGTG 
TTCATTAGAA 
CTCGCCACAA 
AATCAATGTT 
GGTAAAAACC 
TGAAAAAAAT 
AATGGTTGCT 
AGAATCTGCG 
TACACTAGTT 
TTTTGCGGAA 
TTTAGTGGCA 
GGTCGAAAAA 
AGGACAACTT 
AGTGGGATTG 
TGCTTATGCC 
TGGTGCTGGC 
TGTGACACAA 
TAATGGCACA 
GACTATGGAA 
AGAATCCGTT 
TTTACCACAA 
CATGGTAGGA 
TATGGGTGCT 
TGACTTAAGT 
ACAATCTGTA 



398 
CTGGTATAAA 

GAAATAAAAA 
TTTTTAGAAA 
TAGGCGTTTT 
GATGGGAGAG 
GTTTTGGCAT 
TTAATAACAG 
GAAGGAAAAT 
GGAGAATACT 
GACTATGCCG 
AAAGCTCATC 
GGCGATGAAT 
GGGACATCAA 
CCTGGGGATG 
GAAAAAACTG 
GCGCGTCCAG 
GCCTACCTAA 
GTCTTAGTTG 
GGGATGGGTC 
TTAGCTTCTG 
TCTGTTGATC 
GCAGCAAGCG 
AGAAAGCAAG 
GTGAAGGCAT 
GAGTCTCAAG 
TATTTAGGCC 
AAATTACACC 
GCAGAAACGA 
GATAAATTAA 
GATGGTGTAA 
CATGGAGCTA 
AAAGTCAGCC 
TTAATCGGAA 



AAAGGACAAC 
CACAGACCAT 
GTAAACGTTG 
CACTTTCTTT 
AAAAAGAGAT 
TATTATTTGA 
GTTCTGTAAT 
ATGGTGTCGA 
GGGCCAGTTT 
CTGGAAAAGC 
GCTTGAATGG 
TAGTAGTAAA 
CAGTCGATGA 
AATTAATGTC 
TAGCAGACAG 
CTCATTTTGT 
TTGCAGGTGT 
TTGcTTCGCC 
GTTCAAGTCG 
CAAAAACGAT 
AAGTCCAACC 
TGGAACAAGA 
ATGTCCCATT 
TTGTGGATGG 
AAACTGAAAA 
GAATTACTTT 
AATTACATCT 
TTGCTGCAGA 
CTATTCTAAA 
ATGATGCACC 
CTGCGGCTAG 
AAGCGGTCGA 
TTTTTATCTG 



ACAAACGGCA 
CGGCGAATGA 
CTACAGCAAC 
TTCAAAACAG 
GAAGCATGTA 
ATTTATTTTA 
GGCGTTTU^TG 
TATTTTAGCG 
GATGATTTTA 
TAACCAAGAG 
CGAAAATTTA 
ACCAGGGGAA 
ATCTTCATTA 
GGGTTCCGTG 
TCAATATCAA 
ACGTTTAGCA 
TGCTTGGTTT 
GTGTCCTTTA 
TCATGGGGTC 
TGCGTTTGAT 
AATCAATGCT 
ATCAAGTCAT 
AAAAAATATT 
TGCTGAGATA 
AATTGATAAA 
TACAGACACT 
TCAACGAATT 
AGTAGGAATT 
AGAATTGCCT 
TTCGCTTGCT 
TGAAACTGCT 
AATTGCCCAA 
CGTTTTACTA 



4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 
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ATGTTAATTG CTAGTACCGG GATCATTCCG GCGCTAATCG GGGCTATGCT 
GTGGACACTG TGTCAATCTT ATCTGCTTTG CGTGCTCGTC GAATTGGCCA 
ACAATTAATT GCCTGAATGT CCAGCTTAGT GGTAAAwTAA AGGTACGTTT 
TAAGTTaCAG GGGAGAGATT GTGCAATGAA AGGAAACCAA ATAGATGGGC 
GGAAGAACAA CTATTAAAAG AAGGCTACCG TAAATATACT GGGGAAGGAA 
TTATAATAAA GATATTTJTG AACATATTGG TAATTGTGTT CGTGGAAATC 
TGAAGTTGGT CGGAAACCAT GGATTATTCC AGATAACGGA AGTGTTGAAA 
TGTGGTAGAT TCATGTCCAA GTGGTGCTCT AAAATATATT CGGAAAGCAG 
TGCAAATTAA AGAAGAAAAC AATCGTTTTG CTCTATATAA CGATGAAAAC 
GTGAAATGAC TTGGTCAGAT GCTGGCGAAA GCATGATGAT TATTGATCAT 
ATCCAACTTA TCGTGGACAA AAATTAGCTG AAAAACTTGT TTTAGCCGGT 
CGAGAAAAGA GCATAAAAAA ATTATTCCGC TATGTCCATT TGCCAAAAAA 
CTAAACCTGA ATACGGAGAC GTATTGAGAA AATAAAGAAG AAACTacAAC 
AGAAATTTGA TTCCTACTTG CGGCTGTTGC TATTAGAGAC AAATAAGGAA 
TTTCACTGGT GTTTACAGAT AAAATATGCT ACATTAAAGT AGATGAATTT 
TATAAAAAGG AGAATTAACG CATGTATAAT CTAATCTTAA CGtTGGTCAT 
GTTTTAATCG TCATTACTGT TATGAnGCAA CCAAGCAAAC AAAACAGTGC 
TTCACTGGCG GTGCAGATAA ATTATTTGGG AAGCAAAAAG CGCGCGGTTT 
ATGCAACGCT CAACAGCAGT TCTTGGCGCA ATTTGGATGA TTTTATTATT 
TTCTTATCTT CAAAATAAGT ACGTAATCTT TTAAAAT 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7434 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
TGTCAATGAA ATTTTCTTGA CAGCCCTTTC TTATTTCCTG TTTTTCTTTC 
TAATTGTTTT TCTGTTGAAG ATAGCTTAGC GCTTATTTAA TTAAATAACA 
GTGAAATAAC ATAGATAATT TAAAACTAAG ACATAATTTT ACTATTTGCG 
TATTTACTGA ATACGTGTTT TCTGAACATT TCGTACTACT GTCTTCCTTG 
TGAGACAAAC TGACGAATCG GTTTGTCCCA GCTTTGCGTC CGAGTACATA 
TCAAATGATT GAATTAATTA GACTTAGATA GCTGTTGtTG CTCCGCGGtA 
CGACGAGGaT CaACAGTAAT CACTTCGTCA TTATTAATTA ATGAkGTkGC 
CCTACGATAA CTGGAATATT TkGCGCAATC GCtACaACAG CTGCATGAGA 



ACAAGAAGTC 
GTAAGGTGGA 
TTAATATTAA 
AACCAGTCAC 
TTGATATCTT 
CAGAAGTCTT 
ATGACATGAT 
GGAACGAATA 
CAAGAAATTG 
ACGTTTGTTG 
GTTGAAAAAG 
GAATTTGATA 
AGCCGCAAGT 
AGACATCAAA 
CGCATATTTG 
CATTTTATCT 
AGCAAGTGCC 
TGAAGCTGTT 
TGTTTTGGCG 



GTTTTTTTTA 
AAATTTTTGT 
CCCAACTCTT 
TTUVAAAAAGC 
TTGCTTAAAA 
AACAATACCA 
ATCTGCAGCA 
AGTTAAACCA 



6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7657 



60 
120 
180 
240 
300 
360 
420 
480 
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CCTTCTTCAA 
GTTITIGTAA 
GCCACAACGG 
GAACCGATCA 
GCAGTGATAA 
GCTAAATTAA 
CATGATAATG 
TnTGGACGGT 
GTTTGGATAC 
TCTGTTTTGC 
CGAACTGCAA 
TCGCCTGATA 
TCCGCACGTG 
ACTGGTTTAC 
TCTGTTGGAA 
AAGATTTCAT 
TGTGTTGCGT 
ACGAAGCTAG 
TCTTTTTCAG 
TTTTTCAAGA 
CAGTTACTTC 
ATTGATTAAT 
AGAAATACGT 
GCGGATTTCA 
AGCTTCACGA 
CAAACGGCAA 
AGCTGGTCCA 
AATTTGTTTA 
GATTTAACTC 
TATCAGCTGC 
TTAGAAGTTC 
GCGAGCCCCC 
CAGAAAGTTT 



CAACTAATGC 
CTAAAATAGC 
CTTTTGCAAT 
TTTGAATTTT 
TGATTAAATC 
ACATTtCATC 
ACAAGCTACG 
ATTTAGAGAT 
CCAAGTTACG 
TATATAATTT 
TACGAGCCAT 
ACATTACTGC 
TTGGACGTGG 
CTAAAGCATT 
TTTCAACACC 
CGATATTATC 
TTTCTTCTTC 
CTGCGATAAA 
TGATCCCTGG 
CGCCTCGTTT 
TAAATCGATT 
TCTGGATAAG 
ACTGAGTCAC 
GGACCTTTTG 
ATGTTTTTAA 
ACATTCATAC 
ATCGTACATA 
TACAAAGCTA 
ATAAAGAGAT 
AACTACTTGG 
AACAGCATAT 
ACGAACCACG 
ATCAGCAAAT 



TGATGCTTTT 
TCCTTCAGTC 
AATTGCTTCT 
CATTAAGTTT 
ACCTTCTGAA 
TGTGCTTGAT 
TGCTTTTTGT 
CATACGTGCT 
TGCTGTATGG 
CAATGCGAAT 
TGTTTGAACA 
ATCTGTTCCA 
GTTACGTTGC 
ACATTTTTTA 
CATGTCACCA 
GATACCTTCT 
TAAAATTTTA 
GTCAATTCCT 
TAAGTTTACA 
TTAACAACTG 
AAACCATCAT 
TGATTGAGAA 
CAATTTTCAT 
TATCTAGTAA 
TACGAGCACC 
CTGCATTAAT 
CGATTTTCGt 
ATAAGATTAG 
AAATCTGGTT 
TTATCTAACA 
GACCCAAATT 
TGTCCTAAAA 
TCATTTCCGC 



TCAATTGCTG 
GCTTTAGCAA 
TCGCCGACAC 
GTTGTCCCTT 
GCATAACCTT 
GGkTTGTCAG 
TCTGAGAACG 
GTATGGCCTG 
CCAACAGATT 
GAATCTTGGT 
GCTTCTAATG 
TCGTAGATTG 
ATTGAATCTA 
ATTAAAGCTT 
CGAGCAACCA 
TGGTTTTCGA 
GTGATTTCTA 
tGACCAATTC 
GAAACGCCTG 
TTACGATTTC 
CTAAAAGAAT 
TTTTTCATTT 
TGTGATTGCG 
GATCGCTACG 
ATGCTCTTCG 
TAAGTTTACT 
TTTTTTCATT 
CTAATATACA 
TATGTTTATT 
TACCGATACA 
TACTTGCTAA 
TTGATACACG 
CCATTACGCC 



GCATGTATTC 
CTGCTTCTTC 
CTTGACCTTG 
TTTCGCCAAC 
CTTCTTGTGA 
CAACTGTTGC 
TAATTGCAAC 
ATTCTGTTGC 
GTCCGATTGC 
TAACTAATGT 
GGTAATCCCC 
CGTTTGCTAC 
ACATTTGTGT 
TTTGGACAAC 
TTAAACCATC 
TTTTAGGAAT 
AAACGTCAGA 
CGAAACGGAT 
GTACGTTAAC 
GTTTGCATCA 
GTGTGAACtA 
GTTCCTAAAA 
CCATTTTCCA 
CGTTTCCCTG 
TAGTCACCAT 
AGCATATCAA 
ACTAAAACGC 
AGATTAAAAA 
GTTTTCTAAC 
TAAACCGCCT 
TACACGGTCA 
CGTATGGAAA 
TTCTGCTAAA 



TTTATCTGTT 
AGCAGTTGCT 
AACTAATTTA 
TGGAACACCA 
TACTTTTGAA 
GTAAACACCC 
GATATGTGCT 
CGCAACGATT 
TTCAGTCATA 
TTCTTCTGTA 
TGCTGCAGTT 
GTCATTTGCT 
TGCAGTGATA 
TGGAACATCT 
AGAAACTTTT 
GATTTGGATG 
AGCACGACGA 
ATCGTTAGCA 
GCCTTTTTTG 
CGGTCGATAT 
CGTTTACATC 
CTTCTGTCAT 
TGTCATTTGT 
TAATTTTTAC 
GTGAAAAGTT 
CGGATTCACT 
TCCTATTCCA 
GAAATTTCAT 
GTGTCAATAA 
TTGCCTTCTT 
CGAGCACTTG 
TCACCGTATT 
ATGATTAAGC 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
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AATGTTTTTT 
AATCATGTTC 
TATCACCAGC 
TATCACGAAT 
TGAAATCTGT 
CACGTTTTGT 
CTTCAATACC 
ATTCAGGATA 
CAGCAACATC 
CGTAAACTTC 
TTCCAGGGGC 
CATATAATAT 
ATTACGTCAT 
GAAAAAAGTT 
TCGCCTAAAA 
TCATTTAATA 
GCGTGACGAC 
GTAATTCTGA 
ATTCGTTGAA 
ACATAATAAA 
AACGtTAGTG 
TTTTTGGtCC 
TCAACAaGTT 
GTCGGATGTC 
AGCGGATAAT 
CCACCGCTGA 
TTAGGCGCAA 
AACCACTTAC 
CGGCGTTCAT 
CCAAATAAAA 
GGCTGCAAAA 
GGATTATGAC 
AAGTAAGCCA 



CCCGCGGTCG 
AGGAATAATG 
GTTACGTCCC 
TCGGTCAATT 
TCCAGGAATA 
CAAAGCCATT 
GAATTTTTTC 
ACGAGCAGAA 
TTWUVCGACGG 
GATTCCATCA 
ATCTCCCCCG 
ACTTTCAACT 
TCATTTTACC 
GAAAACAACT 
CGTAAGCTAA 
CTATTTTCTT 
TGAGTACCTC 
TAAAACACGT 
CCAATATTTG 
CTTGCCCTTT 
AAATACTTCC 
GAATCGTCCG 
CATGAACAAG 
CTGATAAATA 
CGGCAATTTC 
6AAGAATACT 
TTTCGTCAAA 
GGTCAATTCG 
TTAAAATATT 
TTTGCTGTTG 
TCGTCTGCTT 
GAACAGAATG 
TTTGAAAACC 



CGGCCTTCTT 
ATTTCATCTG 
ATTACTTcGA 
GATTCCAAAA 
TCATTATCAA 
GCGCCATGAT 
AATTGTTCGA 
TACAAGAATG 
ATGTCGCCTG 
AAAATACTTT 
CTGGTTAAAA 
GCCAAAAGAA 
ATTAAAGAAC 
AGTATTTTTT 
TTGTTGCTGA 
ACCTGTTTTT 
TTTCATTTTC 
TTCGTCACTA 
CAATTCTTGA 
CTCGACACTC 
TGTTGtATCA 
AATGTTTTTG 
TTGCGTTTGC 
GACCCCTAAA 
TTCTTCCTTG 
TTGGATATTA 
TGCACCGATG 
CAAAAGAAAC 
TTGAATGAAG 
CTGATTTAGG 
GTTTTTTCTA 
CAACAATGCG 
AATAAAGGAA 



GAATACGTTT 
CTCCGCCTGC 
TAACGAATGT 
CAGTGTTAAT 
TCGTTCCAGG 
AAGAACCGTC 
TTCCTTTTAA 
TTCCACCACG 
CGACTAAACC 
TGCGTACGAC 
TACCAATGCG 
ACCTTTTCGC 
CAAAATGTCT 
ACCGTAAATC 
CTAGCGCTTG 
TCAAAATAGA 
TGAAAAACGT 
ATTGATGTTT 
TTGTACGTAC 
TGTCTTAACT 
TCCCCATCAA 
ACATAAAyCA 
TTGGCTAAGC 
TATTGTTCTT 
GGCTTCAACG 
CTCTCTAAAC 
GCAATCAATG 
TCATCGAACG 
TCTCGGCGGA 
TAAAAGCTGT 
GcTTCGCCAA 
ACATAAAAAG 
TAGGCAAATG 



PCrAJS98/08985 

GGCAACGTTT TTCATATCAA 2520 

CACGCCTGAC CACAATGCGA 2580 

ACGAACATGA GAAGTCGCTG 2640 

CGCTGTATCA AAACCGATTG 2700 

AATTCCAACA GCGGGGAAAC 2760 

TCCACCAATA ACAACTAAGC 2820 

TTGCCCTTCT TCAGTAGCAA 2880 

TTGAATCTTA TCGCCAACGT 2940 

GGCAAATCCA TAGTTAATAC 3000 

TGCACGAATT GCAGCGTTCA 3060 

TTTCATTCGC TATTTCACCT 3120 

ATUUyVTTGAT ATACAAAAAT 3180 

TGAGTTAAGA AGAAAAAAAT 3240 

ATTTGAAAAT GACGTTTTGC 3300 

TATGAGCGAC CCAATTTTCT 3360 

CAATCACTGG AATGTGTCCA 3420 

CCGTTTGTTC GACATCTTTT 3480 

CTACTTGTTG CGCTTTTTCT 3540 

TGCGTTCCAC TTTTCCTTCC 3600 

GGCGAAAAAC GGTTGGGaAC 3660 

CAAAGGCCAT TTGTTCTCCC 3720 

ATAATTTCgT GGGcTGATTT 3780 

GTGTTTTTTT GAACTCTTCT 3840 

CTTGTTCTAG TCGCTCTTCT 3900 

TTTCTAACAA ATCCATACTA 3960 

CTACCGCTAA TTGACGGCGA 4020 

GTTCGATATT TTCAACTTTC 4080 

TTTTGTACGG TCCATTTTCT 4140 

TTCCTTTTAA AGAACTAAAA 4200 

AGTGACTTTG GTTAATTGAC 4260 

TATATTCTTT AATTTTTGTT 4320 

CACTCGGATA ATGAACCTTC 4380 

CATGCGAGCG GTTAAAGCCA 4440 
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TAATTAGCAA 
TGTTGTAACG 
TTACTAATGG 
GCAACTTGCA 
GCGAGACTAG 
TCAATATTTT 
CTAGTTGGCC 
CCACTTGTTT 
TTTAAACGAA 
GATAAATTTC 
ATAGTAAATT 
AGATTTTCAT 
TCTAGCTGAA 
TGCATCCgTT 
GCTGCCGACC 
AAAACCATTT 
TATTTTTGAC 
ATATCGGGCA 
ATTGGATCTA 
CGGCCAGCGC 
AAAAAGTAAT 
TGGTAAACTT 
AAATAAGAAC 
AAAGGAATGT 
CCAGCTATTT 
TCTATTTCCG 
ACAGTTACGG 
GGTAATTGTT 
AAAGAATGTT 
TGAACCGCTT 
GTCACGGCAA 
GTCGTCATTT 
AATAAAAGCA 



AGCGCTCTAT 
CACCATTAAC 
CCCGACGTAA 
TAATCTGCTC 
GCTCTGGATA 
GCATCGGACC 
CTAATTTTCT 
CCCCTTTTCT 
TGGTTCGATT 
tTAAGCCCAa 
GGGTCAATAA 
CaCTAATTAC 
CAGCTGTTTT 
TTGACTCTTG 
AACGATTCGC 
TCGCAGCCAT 
TGACATAAGC 
TCGTGTAGCG 
CATCTGTAAT 
CTGTAACAAT 
CATCAAAGCC 
CATTTAGATG 
CAGCCGTTTG 
CAACTCGGCA 
GT^GCATTTG 
GTGTTTGGCT 
CAAAGCCTTC 
CTCTTTGGTA 
CAACAAATAA 
GTTGTTCATC 
ATAAATCACT 
TGCTACTAGA 
ACTCGGAATG 



ATAATCGTAC 
AAAATGATTT 
AATGTCAGCT 
TTGGTAAACA 
ACTAATTTGT 
AGGACGATAC 
TAATACATTC 
AAATAAAGCC 
ATAGACGCGT 
GaAATCCATT 
AATTTCATTC 
CACACCTGCT 
ATAGAGCAAT 
GTATGCTTCT 
TTCACTCTGG 
CGTACCAAAC 
TAAAACTTCC 
TTCCGGATTC 
TGATAGCACA 
TTGATTGTTA 
CATATTGTGA 
AGGCACTCTT 
ATTATCAGGC 
ATTTTCAGCT 
TTCCATCGTC 
TGTAATATTC 
TTCCCTTGTT 
AAAAGAAAAG 
CGACTGGGGA 
TTCAAAAAAT 
TAATAAGGAT 
TA6TTGCATT 
CCCTTCTTTT 



ACTTGGTTAG 
CGTTCTTCAT 
TGCCCTAAAG 
ATAATTCCGT 
TCCAAGCCTT 
AAGGCATTGA 
CGAATTCCAG 
AGCGTTGTTT 
TTAACAGCTG 
TTCAGAAGAC 
GATCCTGGTT 
GCGTGCGTAG 
TGATTATTTG 
TCTAACGTAA 
GACAACCCGA 
GTGGCaATTy 
TCCCGCCGAT 
AAAAAACGTT 
TAGGACACTA 
TGACAGAAGT 
ATGATGGTTA 
TCTGGTAATT 
ACAGGAAAAT 
AGCTGAATCG 
GTCTCTGCTT 
AATTGATTAT 
AAATGACGTA 
AGCGCTGGAT 
TCAAAATATG 
GTTTGGGCGA 
TGATGTTCCG 
AATTGCTGAT 
TCTGCTGGGG 



CTGTTTCTTG 
CTAGCACATC 
AGAAGCCTGC 
ACGTATTTTC 
TTTTCCGGCG 
CAGCAGCAAT 
CAGATTCAAA 
CGTCATCTAA 
TGAGGGTATC 
CAATTTTTTC 
GTAACGGAAC 
AAACATGGCG 
GTGAAAAATT 
TTTTCAGCTT 
AAACACGTGC 
SCGCCATGTG 
TATCAGGAAT 
CA7VAGAGCAA 
AAGAGCCTGC 
CCATCACATC 
ACTCTT-TTTC 
TTTCTAAACA 
GCGGTAATAA 
TATTCTGGAC 
CACGCAAGTA 
TTTCTTTGAT 
CCTCTTGCAA 
TACGTTGTTG 
AAGCAAGTTG 
CTTCTCCTTT 
CTAAGTAAAA 
ACCCTTTTAA 
TATATTCTAA 



GGGATAGCCT 
TTTT TTCT TT 
CATTTTCGCA 
CAAAATGGGG 
GATGaAAACA 
ATCTTCAATG 
CTGGAAAACA 
TGGAATCTGA 
ATCawkGATG 
AACATCATTC 
CAAATTCAAA 
AGGCAATCCT 
GACCAATTCT 
GTTGGGGACA 
AAcGTCCCGT 
ATAATGACCA 
ATCTAAATCA 
ATCATATTTG 
AGCTGAACCT 
CCAAACGATT 
TAATCGTTTT 
CAATGTGCGT 
TTTTTGATGT 
CACTTCTGCA 
ATTCAGACCT 
GCTCTCAAGA 
AGCCACGCCA 
ATTCATTGAA 
TTGGTVflAGCG 
AGCTGACAAA 
GGGACCCTCT 
ATTTTTAGCA 
TCTTAAACCG 



4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

57 60 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 
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ATAATTGGTT GAATTCCCGC TTCTTGGCAT GCTTCATAAA ATTCAACCGC 
ACATTGATAT CGGTAATCGC TAATGTTTGA TAACCTAACg CTTTCGCTTG 
GAACGAATCC TAACCGTACT AGATAAAAGC GAATAGGAAG TAACGGTATG 
AATGACATTG GACAGCTCCT TTTTGAATCC TTGCTATTTT TTCCTCTTCT 
TATCGCGGAT TCTTTACTTT TAAAGAAAAT ATGCTAAACT AAGAGCTAGA 
TCATATGCGT TATCTTGTTG TATTTTTCTG GACCTTCGCA TTAGGCCAAG 
TATCGGCGGT GCATTATCTG GTGGTTCATA CGACTTTAAA ATGACAACGA 
GGCAACAGCC GTTATTATTT TATTAATTGC aCACTTTGCA GTACCGAAAG 
TGTTAAATAA GAAAAAACCA GTTACTAAAG TTATTGTAAC TGGTTTTTTT 
CTGTTTAGAG CTTTTTATTT AAAACTTTTC GGAAATTAAA GAAGGACAGC 
CTAAAACGAA ACAAGCGGTT ACCAGAAAAA CAGAAGAATA GCCAAAACCA 
TCGACCCTAC CATCGGTCCT AAAACTTGAC CGAAATTACT ACACATCTGA 
AAACTCTACT TACGCCCTCT GTTGGCGTTA TTTTACTAAT TAACGTATTA 
TTAAAGCACC TGTTGAAAAA CCTAAAAGGA ATCGTAAAAC ACCTAACTGA 
GCACAAACGC CATGGGAATG TAACAAATAA AAGAAAGAAT TAATCCGCCT 
TTTGGTTCCC AATGCGGTCC CCAATTTTTC CCAAAGTTGG AGAAGAAATA 
CGCCGGCAAT CGAAACAATC AAACCACTAA CAAATAATAC ATTCTCTGTA 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2732 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
TCGGAACAAG CTTTTAAGTG AAATTGAGAT TTTATATAGT GAAGAATTTG 
ATGGGCTGTA GAATATTTAA CACAAACCTT AGAGATTCCA TTTAGTTATG 
GTATATTGCG ATTCATATCC ATAGTGCTCG CAGCGGGCGT ACTGATAATA 
CCGTGAAGTT ACAATCGTTT CTGAAATTAT TCATTTAATC GAGCAGGAAT 
TATTCATGAT GATAAAAATA GTCTCAGTTA TTCACGTTTG GTGAATCATT 
TATTCATCGC TTCCAACAAA ATCAATACGC TGTTTTAGAT GAAGAAATTT 
AAAGAAAAAG TATGCTGAAA GCTATGAAAT CTCAAAAAAA GTACAAGTAT 
AAATTTTCAT TATCAAGTAC CAAATGAAGA ACTTGGTTAT TTATCGATTC 
ATTAAGAATG ACCAAATAAT AAAAATTAAA TAGTTTATGT GAGGAGTTAC 
AGCGTATATG CAAAGAATGG GACGTTCGTT AATGTTACCA GTTGCTGTAT 



CCCATGCAAG 
TTTTACATAG 
CAATTGTGGA 
CCTATTTTCC 
AAAGAGGTGC 
TCGTTGGATA 
TTCTATCGTT 
AAACCAAACC 
ATCTTTTCAA 
CCAATGTTTC 
TGGGCAACTG 
TTGTAACTAT 
ATTGATGGCA 
AAAGGTGTTT 
AATAAAACTT 
ATTnnTGAAA 
TCGC 



CTATTGCTCA 
ATGAAGCGGG 
GTAAAAGTAT 
TGGCTATTGA 
TACGTTTGTT 
TGGAAGTCGT 
TATTAATGAG 
ACATTGAACG 
GTATAATGAA 
TACCGGCGGC 



6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7434 



60 
120 
180 
240 
300 
360 
420 
480 
540 

600 



wo 98/50555 PCrA]S98/0898S 

404 

TTCTTTATTA GTCGGTATTG CTAACTGGAT CGTGGGAACA ATCGGTGCTA GTCCAGCCAC 660 

AACTTTTCTA ATGAATGGTG GTTTGGCTAT .CTTAAACAAC TTAGCATTAT TATTTGCGGT 720 

TGGTTTAGCG TTAGGAATGT CAAAA6ATAA AGATGGATCG GCAGCATTAG CTGGTTTAGT 780 

CGCTTATTTA GTGCCAAAAA CTGTTTTAGC GCCTGCTTCT ATCCAAGCCA TTAAAGGCTT 840 

CAAGGATATT GCTGAAGTAA ACCCAGCCTT CAACAGCATG GACAACAATG TCTTTGTCGG 900 

GATTGTCGCT GGTTTAGTTG CCGCGGCAAT GTATAATCGT TTCTCTGGTG TGAAATTGCC 960 

AATGGCTTTA TCATTCTTTA GTGGAAAACG TTTAGTTCCT ATTATGTCAG CTATTTCGAT 1020 

GTTAGCAATC TCAGCTGTCT TATTCTTCTT CTGGCCAGTT GTTTATAACG GATTA6TTGC 1080 

ATTTGGTAAA GGAATCTCTA GCTTAGGTTT CGTAGGAGCT GGCTTGTATG GTTTCTTTAA 1140 

CCGACTATTA ATTCCAACAG GATTACACCA TGCCTTGAAT TCTGTTTTCT GGTTTGATGT 1200 

TGCTGGTATC AATGATATTG GGAACTTTTT AGCTGGTCAA CAAGCGTTAG ATACTGGTAA 1260 

AGCAATTGTA GGACAAACAG GGATGTACCA AGCTGGTTTC TTCCCAGTAA TGATGTTCGG 1320 

TTTACCAGCA GGTGCTTTTG CTATTTATCA ATGTGCACGT CCTGAGAAGA AAAAAGTTAC 1380 

AGCTTCATTA ATGTTAGCTG CTGGTTTTGC GGCTTTCTTT ACAGGCGTTA CTGAACCTTT 1440 

AGAATTCTCA TTTATGTTCG TTGCTTGGCC TTTATATGTT TTACATGCGG TCTTTACAGG 1500 

TATTTCTTTA GCATTCGCTG CGTTTATGCA TTGGACAGCT GGCTTTGCCT TTAGTGCAGG 1560 

TTTTGTAGAC TTCTTCTTAA GTTTGAAAAA CCCAGTCGCG AATCATCCAA TGATGTTGGT 1620 

GGTTCAAGGA CTTGTCTTTG CAGCAATTTA CTACTTCGGT TTCCGTTTTG CAATCACTAA 1680 

GTTTAACTTA ATGACACXyVG GACGTGAAGA AGGCGATGGC GAAGAAACGC CAGATGTTGC 1740 

AGAAGGTGAC AACAAATTTG CTTCTCTAGC TCGTCGTATT TATGATGGTT TAGGCGCTGA 1800 

TGCTAATGTA ACTTCTATTG ATAACTGTAC AACTCGTTTA CGCTTAACAG TGAAAGATAC I860 

AGGCAAAGTG GATCAAGCAA AAATTAAAGC AACTGGGGTT CCTGGTGTGA AAGTCATTGA 1920 

TGACACCAAT ATCCAACTGA TCGTTGGTAC GGAAGTACAA TTTGTTGCTG ACGAAATGCA 1980 

AC6ATTATAT AACCACCAAG CGCCAGCAAC ACCAGTCAAA GAAACACCTG TTTCTCAATC 2040 

AGTTGTT6AA GAAAAAGCAC CAGTTTCAAC GAAAGAAACT GAATTATATT CTGTAGCGAA 2100 

TGGGAAAGTT ATTCCTATTT CTGAA6TACC AGATGATGTT TTCTCAGCAA AAATGATGGG 2160 

TGACGGTTTT GCTGTCGTAC CAACAGACGG C6AAGTTTCA ACACCCGTTG CTGGTAAAAT 2220 

TACTAGCATT TTCCCAACAA AACATGCGTT AGGTATCCAA ACAGATTCAG GCATTGAAGT 2280 

ATTATTACAC ATGGGTCTAG ATACTGTTGA GTTACAAGGT GGACCATTTA CATTACATGT 2340 

TGAAGAAGGC CAAGTTGTAA AACAAGGCGA TAAAATTGCA ACCATTGATT TAGCGGCATT 2400 

AGAACAAGCTGGCAAAAAAT CAGATTTAAT T6TTGTTTTC ACAAATCAAG ATATTGTTGC 2460 

TCAATACGAT TTACAAAAAG CAGGTCAAAC GACTTCAATG AATGATGTAA TCGGAAATGT 2520 

TACCGTAAAA TAAAAGTAAA AAAGGGCTAG CCGTAAGGCA GCCCTTTTTT TGATGAAATT 2580 
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CGATAAAATT ACTTGTTATT CTATTCTAGG TGAAAACGTT CA/VTCTTGCG AAGTTACGGT 2640 

TTGTTCACAA AAAAGTTACT TTTATTAAAT AAAACTAAAG AAGTAACAAA AACTTCGCnT 2700 

TTTTCAGTTA TTTCAAGTAA CATAGAACGT AT 2732 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CCAATGGCTG GATGGAACAG AACAAGACGT GGTATCACGA TTACATCCGT GCGACAACTG 
CACAGTGGAA GGCTACCGTG TAATATCATC GATACTCCAG GACACGTGAC TTCACAAnTG 
AGTACACGTT CTCTACGCGT ATTAGATGGT GCGGTAACAG TTCTTGACTC TCAATCAGGT 
GTAGAaCCTC AAACAGAAAC TGTTTGGcGT CAAGCGACTG AATATAAAGT TCCTCGTATT 
GTATTCTGTA ACAAAATGGA TAAAATCGGT GCGGACTTCT TCTATTCAGT AGAATCATTA 
CATGATCGTT TACAAGCAAA CGCTCATCCA ATTCAAATTC CAATCGGAGC TGAAGAAGAC 
TTCACTGGAA TCATTGACTT GATTAAAATG AAAGCTGAAA TCTACACAAA TGATTTGGGA 
ACAGATATTC AAGAAACTGA TATTCCTGAA GATTATTTAG AAAAAGCGCA AGAATGGCGC 
GAAAAATTAG TTGAAGCTGT TGCTGAAACT GACGAAGACT TAATGATGAA ATATCTTGAA 
GGTGAAGAAA TCACTGAAGA AGAATTAGTCGCTGGTATCC GTCAAGCAAC AATCAACGTT 
GAGTTCTTCC CTGTATTAGC TGGTTCAGCA TTCAAGAATA AAGGGGTTCA ATTAATGTTG 
GACGCTGTTC TTGATTACTT ACCATCTCCA TTAGATATCG ATGCTATCAA AGGGATCGAT 
ACTAAAACAG ACGAAGAAAC AACTCGTCCA GCTGATGACG AAGCACCTTT CGCTTCATTA 
GCATTTAAAG TTATGACTGA CCCATTCGTA GGTCGTTTAA CTTTCTTCCG TGTTTATTCT 
GGTGTCCTTG AAAGTGGTTC ATACGTATTG AACGCTTCAA AAGGCAAAAA AGAACGTATC 
GGTCGTATCC TACAAATGCA CGCCAACACT CGTCAAGAAA TTGACAAAGT GTACTCTGGT 
GATATCGCTG CTGCTGTTGG TCTAAAAGAC ACAACAACTG GTGATACATT ATGTGCATTA 
GATGCACCAG TTATTCTTGA ATCtATTGAA TTCCCTGACC CAGTTATCCA AGTCGCTGTT 
GAACCTAAAT CAAAAGCTGA CCAAGATAAA ATGGGCGTGG CTCTACAAAA ACTTGCTGAA 
GAAGATCCAT 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6729 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1150 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GAGCCTAATA AAGAAGTAAA AGCCTTCTTA GACTATATGG TCACCGATGA rGTTCA?.-JlA 60 

ACAATTGTCA AAGACTTAGG TTATTTAGCC ATCACAGACA TGCAAGTAGA ACGTGATGTA 120 

AATGGAAAAG TAACTGAAAA ATAAACAAAA AAACTCAGCC AATAAACAGG AT7GGCTGAG 180 

^.prj^TTTAGG CTTCTTTTTT TGTTAAAGGC AAGGTGrTTG T.ViAAGTGGT CCCTAAGCCT 240 

AAGTGACTAT CAATGGCGAT TTTACCGCCT AATAATTCTG TGTAATTTTG GACAATAGCT 300 

AGACCTAGAC CGGTGCCACC AGAATGACGG AnACGTGCTT TArCGACACG ATAGAACCGT 360 

TCAAAGATCC GTTCTTGGTC TTTTTGACTA ATGCCAATGC CAGTATCTTG AACS.GATAAA 420 

CGTAATTGCT GATTTTCTAT AGCATAAGAA ATAGTTU^TCT GACCATTTTC TTGGGAATAC 480 

TGAATGGCAT TTTCAATTAG ATTCTTCACG ATTGGATAAA ACAATTCGTA TTTGGTGAAA 540 

AACTGGCTCT CTGGCAGACC ATGAACGACT GTTTTCAGCT GT7TTGCTGC CAATTGTTGC 600 

TGATAAGAAC CGAGAATTTC TGTAATAAAT TTTTCTAAAG TAATTTCTTG TTCTGCA7AA 660 

GGAATTGTCG CGCTGCCCCG AGAAAGTTCT AAAATTTCTT GAATTAATTG TTGCAGACGC 720 

TGGGCATCTT TTTGCATGAT GTGCAAAAAT TGATTAAGGA CTTCCGGATC TTCTTTGGCA 780 
CCGTCTATGA GTGTCTCGGT AAAGCCCAAA AGAGAAGTGA C^.GGCGTTTT TAATTCATGa 840 
GACACGTTTG ATACAAAATC TTTTTGCATT TTTTCTAGTT GGCGAACGCG CGTTAAAtCA 900 
TAGGCAATTC CTAACACTTG ATAAtCATTG GcGCTTTGAT CATCCAAGTA ACGAAGAGAA 960 

AGATCTAATC GTAAAGGTGT TGGTCCTTGT GTCACAGTGA TTTCTTGATG TAAACTGTTT 1020 

TTTTCAGCGA CGACTTGATG GATTAAATGA ATCAAATCGG CGTCTTGAAT CACTTCGAGA 1080 

TAATTGCTTT GCATATTTTT AGCAGAGAGT CCTAACATTT CCTGCATTTT AGGGTTCAAC 1140 

AGACGAAGCC GTCCTTGCGT ATCAATGATA AAGACACCAA TCATCAACTC ATCTAAAAGT 1200 

TCGTGAAATT GCTCATCCGC AGACGTATAC GCTTTATACG 7TTGACTTAC TTGTTGaCTC 1260 

AGTAGGTTAA TAGTTTCGTA AAGTTCATTC CATTCAGAAG ATTGCTGGaT AATGGATCGA 1320 

TTTTCATCTG GATGTTTAAG CATTCGTTTC AGAATCGGTA AGACAGTCAC TAGCGGTTGA 1380 

,j.Y.j.^.j.Yri.Qcc GTAAC7\ATAA AAAGATAAAC AAGGTAATTA GTAGGAACAA GATGCCAAAT 1440 

GTAAAGATAA TATAGCGACG AAAAGATTGA ATATTTTTAG AAAATAGCGC GGTTTGTTCC 1500 

GAAATCCGAA TAATGCCAAC GATTTCTCCT GATTTTAAAA CAGGCGTGGC AACGTACAGA 1560 

AGTTCTTTAT TTAACGTTTT ACTCTTTCGT AAAGCTGAAC CATAGGTAGC TCCTGATAGA 1620 

ACCGCAGCGA TTTCAGGTCG TTTTTTGTGC GACTCTAGCG GTTCATTTTG GCTGCTGTCA 1680 

AAAAAGACGG TACCATTTTT TTGCAACAAA GTTACCCGTT CTTCTGCATC ATCTAACGCA 1740 

tGGGTGaCAA GGAGTTGTTG TTGGTTTGTC AGTTGCCCTT 7TTCGAJWVC GGAAGGGGGC 1800 

AACTGGTCTG TTAAAAGGGT ATTTTrPCTT CTTAAATAGG CTTCTTGCTG AGCAACCATT I860 

TCTTTTTGAA AGAAAAAATT GGTCGCTGCA ATACTCCCGA CAAATAATAA AAGCATGACT 1920 
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GCCGCAACTA 
GGTAGCCAAA 
TATCACGTAA 
GCCAAATTCG 
AAAC6AGTAA 
CATAATTTTG 
CTTGAAGCGT 
GGACTTCTCT 
TGATTTTATC 
CTCGACGAAG 
TTATAAAATC 
TAGTCACCTG 
CATCATCGAC 
AAGTGTATAT 
TAAGCGAGAA 
GGAGAAGCGA 
AAAATCTCAA 
TTGTCCCGAA 
GGATGGCTTT 
GACTGAGACA 
TACAACCTTG 
ACAATTTTCT 
AGGAATGACC 
AAAAACTTAG 
AaGATGGGAG 
TTTTATTCGA 
GACTTTTCTT 
AACATTAAAC 
AACTATGTTG 
CCCAAAGATA 
TATGGCTTTT 
GGGAAAAGTT 
AGAAGCCGAC 
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AAAAATATTC 
GCCACGGACA 
ATGACTAATG 
TTCCAACAAG 
TTCAAATTCT 
TTCATCCACA 
TTCGGTCGTG 
GGGACTAAAA 
TACTTGTTCA 
TGcTTTGGTA 
ATATTGATTA 
ATAGCCTTCT 
GACAAGAACT 
TTCACTATTG 
TGCTACTTTT 
ATGATTGGTA 
GAACAAGAAG 
GTTTTAGGGG 
GATGTTTGGG 
TTTAAAGCAG 
ATTTTGAAAG 
GGCATTAAAA 
GTTTATTCTG 
TAAAAAAAGA 
CACGGCTCAT 
CATTATTAAC 
TAGGAGAAAT 
GAACCGAAAA 
TACGGAAAGA 
GTGAAATCAA 
GAGAGTTTAA 
TCCGAAGCCG 
GTTAATTTAC 



AATCCGCAGT 
GTTACTAAGT 
TGGACATCGA 
GTTTCCCGAT 
TTCGGTGTTA 
CGAATCTCGC 
GTAGGTTTTA 
GGTTTTGTTA 
TCTTTGGCAG 
ATTTCTAAGC 
GACAAAGCTA 
TTTTCCAAAT 
TTTTTCATTT 
TTAGTTTAAC 
GAGAAAAGTC 
TTAGTAGTTG 
CGTTAAAACA 
AGTTGCCTAT 
CGAATACTGC 
GGGCACAACG 
AAAAAAGTCC 
AAGCTGGAGT 
AAGAAACATG 
TTGACTATTC 
GGAAaTGGmA 
GGAAAAGCAG 
TGCGGAAGAA 
AATTTTAGAA 
ATTTCTTGAA 
AAAATATATT 
CAAACCGCCT 
ACGTAAAAGA 
AAGTGGTCAA 



407 
CTCTTTTTCA 
AAACGGGCCG 
CGATGCGACT 
TAATGACCCG 
GCTCAATGGG 
CAATCACAAG 
AACGACGGAA 
AATAATCATC 
TTAAAATCAA 
CATCCATACC 
ATTCGAAGCC 
TAAAGGTTAA 
ACATAACTCC 
ATATTTTTAG 
AACTCATGGT 
CTTAGGTGGG 
ACTTGTAACT 
TCCACGTCCG 
TTATGTATAT 
CGCGTATCAA 
CagCTGTGGC 
TGGAGtCGCA 
GCAACTTGCG 
CCTATGAAAT 
AAAACAAACC 
ATGAATTATA 
TATGAGGTTA 
GAATACGAAA 
CAACTACAGC 
GAACAAATTC 
ACAGCAGGCA 
AATGATGCGA 
AGATTTCACA 



TC.GTTTTGGC 
TTTGGGATCG 
TTGTCCAGCG 
ATCTTTGCGT 
CTGATTGCGC 
CGGAgCTTTA 
AATAGCTTTC 
TGCCCCAATT 
AATAGGCGTG 
AGGAAGCATT 
GTTTTTACCA 
CAAAGTTAAA 
TTCAAGAAAT 
AGGAAGAACG 
ATAATAGAGA 
ATTTGTTGCC 
AATGGAAAAG 
CCAGCAGAAA 
ACGGATAACG 
AAATTAAAAG 
AGCCAGCTGA 
ACTGCTTATT 
GACATTTCTT 
CGGTAaCCTA 
GCATGAATGC 
TGGAGCTTTA 
GTCGCCAAGC 
AAAAGTTACA 
AGTATGTTCA 
AAGAAATAGA 
ATGAGTAAAA 
GAAATCCGTT 
AAACGTGTCA 



TCCTGAAAAC 
GGCTCGATTT 
AAGTCATATT 
TTCATGAAGT 
ACAAAGACTT 
GGTGTGTCCT 
ATGCGTGCTA 
TCTAAGCCGA 
TCAATTTTTT 
ACATCTAAAA 
TCTTCGGAAG 
ATCGATGGTT 
TTCTGAAAAT 
TACTTTAAAG 
AATGAAATGA 
GTTACGATGG 
CAATAATGAT 
TCaTTGGCGG 
GGGAAGATGT 
CCCTACAAAT 
TTTATGATGG 
tTATTCAACA 
TTTAAAAGGT 
TAAGGGATTG 
GTTATTTGAA 
CTATGCTGAT 
GGTTTATGAT 
TCTTTATTCA 
AGAAACTTAT 
GGAATGAAAC 
TCCGTCGTAA 
TGGCTTTATT 
GAGAACGGGC 



1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

24 60 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 
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AGTAGGAGTC GAAGTATTAG AAAGCTTATC ACCAGCCCAA Ci^TTGTAA AAATTGTTGA 3960 

. TGAAGAATTA-ACGAAAACGT TAGGTTCAGA AACGGTTGAA CTGAATAAAT CTCCAAAAAT 4020 

CCCGACAGTG ATTATGATGA CAGGGTTACA AGGGGCTGGT AAAACAACTT TTACTGGTAA 4080 

ATTAGCAAAA CACTTAATGA AAACTGAAAA CGCTCGTCCG CTTTTAATCG CTGGTGACGT 4140 

TTATCGTCCA GCAGCGATTG ATCAGTTGAA GGTTTTAGGT CAACAATTAG AAGTTCCCGT 4200 

TTTTGATATG GGaACAGATG CTAATCCAGT GGAAATTGTT CGTCAAGGGT TAGCATTAGC 4260 

AAAAGAAAAG ArAAATGATT ATGTCTTAAT TGATACGGCC GGCCGTTTAC ACATTGACGA 4320 

AGCTTTAATG GACGAATTGA AACAAATTAA AGAGTTGGCT GATCCCAATG AAATTCTGTT 4380 

AGTTGTTGAT GCGATGACGG GGCAAGATGC TGTCAACGTT GCAGATAGTT TTAATGAACA 4440 

GCTTGGAATT ACTGGGGTTG TTATTACCAA AATGGATGGC GATACTCGTG GGGGGGCTGC 4500 

GCTGTCAATT CGGGCAGTAA CGGGTGCTCC GATTAAATTT GTCGGTTCTG GTGAAAAATT 4560 

AACCGATTTA GAAATTTTCC ATCCCGATCG TATGTCGAGT CGTATCCTAG GTATGGGGGA 4620 

CATGTTGACG CTAATTGAAA AAGCGCAACA AGATTACGAT GAGAAAAAAG CAGAAGAACT 4680 

TGCTCAAAAA ATGAAAGAAA ACAGTTTTGA CTTTAACGAT TTCaTTGAGC AATTGGATCa 4740 

AGTTATGGGC ATGGGACCGA TTGAAGACTT ATTAAAAATG ATCCCTGGAA TGAGTAACAT 4800 

GCCTGGTATT GAAAATGTCA AAGTCGATCC AAAAGATGTG GCACGGAAAC GGGCGATGGT 4860 

CTTATCAATG ACCCCTGCAG AACGTGAAAA TCCTGATCTA TTAAATCCTA GTCGCCGCCG 4920 

CACAATTGCA GCTGGTTCAG GAAATAGTGT GGTTGAAGTC AATCGTATGA TTAAACAATT 4980 

TAAAGAATCC AAAAAAATGA TGCAACAAAT GTCCAAAGGG GATATGAACA TTCCTGGTAT 5040 

GGATCaAATG TTAGGTGGCG GCGTTAAAGG CAAGCTAGGT AAAATGGCCA TGAATCGTAT 5100 

GATGAAGAAA AACAAAAAGA AGaaAAAGAA AAAAAGATAA ACAGGCTTTA GTTAATTATT 5160 

GGAAGAAACa GAAAGTGCGT CATCAGTCTC TTCGACTGAT GGCGCACTTT CTGTTTAAGC 5220 

GTAGTTTTGG ATAATTGCGT TTAACGCTTC GGCAAAACCA GCGCCGTGTT GTTCATAATA 5280 

TGACGTAAAT CGTTCATCGC CAACATACAT AAGGCCTAAC CCTTTGTGTG CTTCTGGCGA 5340 

ATAGCTTGAC CAAGAGTACA TTAACCAATC TTTGTGTAAT TGGAAAATTT CTTGCGCAAG 5400 

TTGGCTAGTC AACTTGGGTG ATTTTTGATA CATGGCTAAC TTAGAAAATA AATCTTTTTC 5460 

CACGTTCTGC ATTGCTTGCA TGTCTTTTTC CGTCAAGTTT AAATATTTTT TGTTGGCTTG 5520 

TTCAAtCGTT TCGTTGCCAT ATTTTTCCCG AATCTCTTTA CCATATTTTT CTtCGTTTTC 5580 

TTGAACTTTT TGTTGTTTGA ATGCTTCGAA TTTTTGCTGA TCTGACATGT TTATCTCTCC 5640 

TTTGtAATGG TGTAAGGTTT GTTGAACAGA GGCTAAAAGA CGATCAATTT CATTGCGCTT 5700 

TTCCAATAGT TTTTGTTGAT GCTCGTATAA AGCCTGTTCA ACATTAAAGT CAGGTTGCTC 5760 

CAAAATTTCT TTAATTTCAT CTAGTTTCAG ATCCAATTCA CGGTAAAAAA GAATTTGTTG 5820 

CAAACGATTC ACCTCGGCTT GCCCGTAAAT TCGATAACCA GAAGAATTAA TTCTGGCTGG 5880 
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GCACTGACGC 

TGACAAAATC 

TGAGACAACC 

CGATTAAAGA 

ATAATTTTAC 

TGTAAAGTTG 

CCGCTAATkG 

TCgCCATACG 

GTTATATTCT 

TGTCCAAGCT 

ACCTATGTTA 

TACCAATTTG 

ATGT/UVATCC 

CAAACGTTTT 



TTGAAGAAGA CCAATTTCAT CATAATATCG TAACGTCCGT 
AGCCATTTTT TTAATTGTAT ATTCCATCAT TCATTCCTCC 
CTTTACGTAA CGAGAGGGTC AAGCAAAAAA ATAAAAAAAG 
TGCCTCACTT TTTCTCGATT GATTCAATTA GGCAGTTAAG 
AGCATCTACA ATTGATTGTA AGAAGCCAAC AGTGCCATCA 
AATTAAGTTC GTAATACCGC CAATATAGGC TTCAGGTTGT 
GAAGACTAAT GATTGTCTTA AATGATGGTT CGCACCaAAT 
AACACTCaCC aCAAGTCCtG GTTTTTayCC CCAAACGcTT 
GACATCCCAA TGCATTTTTT AAAACAGCTG GTACTGAACG 
ACATTACGCC ATCTACTTGT TTAACTTCTT CACGGAATGT 
TTCCTTCaTC ATCCmAATCT TGATTGTATA ATTCTAAATC 
GTTCGAAATC AGCTGGAAAT AAGTCCGCCA TTGTTTTTGC 
CTTTTCTTAA ACTCCCTACA AAAATACCAA TTTTTTTGTC 
CTTTTACTTA CTTTATGATA GTACTTTTTC AAAAAAAGCG 
ATTGTTGAA 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 707 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
AACtGTTCAC TTGtTCCATT AACCAGCAAT GTACCTAACG CTGTTTCCAC 
TAGCTTTTCC CTAAAAATGT TCGGACAGCG ACAGTACCTG CGACAGTATT 
TCTGAAACAA TCTCAATGTC CTCTGGAcGA AtCGTTGCCA CCGTCGTTGT 
TCAGGATTAG TCGTTTTCAC TATTTGATTT GTTTGTGTCC GATAATGACC 
TTTTCGACAG AGAAAAAGTT TTCAAAACCG ATAAATTCTG CAACAAACTT 
AAACGATAAA TCGTTTCTGG TGAATCATAT TGTTCAATCA CGCCATTATT 
ACTTTATCTG AAATAGAAAA ACATTCTTCT TGGTCATGTG TGACAAAAAC 
CCTAATTGTT GTTGAATCCG TTTGATTTCA ATCCGCATTG CCACTCGCAA 
AAATTACTCA ACGGTTCATC TAGTAGTAAT AGTTTAGGTT CGATAATCAG 
AGCGCAACCC GCTGTCTTTG ACCACCAGAT AATTGTTTTG GATAACGGTC 
TCTAAACCAC ATACTTTTAG CATTTCCGTG ACTTTTTGGT CCATGGTTTG 
TTGCGTAATT TCAGCCCAAA AGCGACATTT TCTGCAATCG TTAAATG 
(2) INFORMATION FOR SEQ ID NO: 39: 



CAGATAGTGA 
TATCATAAAC 
TGTAAAAAGT 
AATCCACGTA 
CATCTTCACC 
GAACATTTAA 
CaCCTGGtGa 
GACGTGAACC 
GGTGTTACAA 
TCGGTGGGGG 
ATGAAAACAG 
TTAAATGAAT 
CTCCATTTTT 
ATCCTTGTAC 



TTCATATTGA 
AGTAGCAGCT 
AACATTTGGT 
GGCTTCCATT 
AGTACGAGGC 
CATAATGGCC 
TGTCGTAATG 
TTTTGCATCT 
AGCACGTGcT 
GCCCAAGCTT 
TTTGTTTTCT 



5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6729 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 985 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

nCTCGGTACT TnCCCC^/^GG GTGGATGTGC CCGGTTCCAT TCnAGGTGAT TTAGCCAAAC 60 

GTTCCTTATG CGTAAGCCTC TGAAGTCAGC CACCTTTACG AACGAAAAAG AGAGCGCAGC 120 

AACGATTGCT TATTTAAGGG AATTGATTGA AAAAGTTGGT TTCAmCCCTA AATATTATAC 180 

TGCCATTAAT TCAAGCTATG ATTTACCTTA CGATTTTTAT CGTCCAAATA AAGACCGTCA 240 

TCGTACACAA ATTGAATTGA TGCAAAAAGA TGGCTCATTA GTGGAATTGG CAACGGTCAG 300 

TCCCCTAGTT GCAGCACTTG CTGGTCAGAG TCAAGGAGAT GAACGTTTCT ATTTCCCTAA 360 

AGAAATGCTT GACCAAGGAA ATAAAAAACA TTACGATTTA TTTGATGAGA CGTATCGTGA 420 

GTTTTCAAGT TACATTCATA ATGGTGCGTT AGTTTTAATVA AAATAGGTTT TCTAGGAGGA 480 

TTCTATGTCA ATTAAGTTAG TTGCTATTGA TATCGACGGA ACATTGCTAA ATTCACAACA 540 

CAAGATTtAC CCCACGGGTC AAAGAAGCGC TCCAAAAAGC AAATGAGCAA GGTGTTCGTA 600 

TTGTTTTATG TACAGGCCGT CCTTTGCCAr gCGTGAAAGA ACAATTGGAT GAATTAGCCT 660 

TATATGGTGA AAATGATTtC GTGATtACCT ACAACGGTTC GCTTGTCCAA GCAACCAAAG 720 

ATAATACGAT TATTTCACGC TATACCTTGA GTTATGAGGA TTTTTaGAAA TTGAAATGTA 780 

TTCTCGTAAA GTCGGCGCTC ACTTGCATAC AATTGATGAT TCCGCTATTT ACACTGCCAA 84 0 

TCGCAATATT GGTAAATATA CGATTCACGA AGCATCTTTA GTGAACATGC CTTTAAAATA 900 

TCGTACGGTG GATGAAATGA CACCAGAGAT GAACATTATT AAAATGATGA TGATTGATGA 960 

GCCGGAAGTT TTAGATCCTG CCATTGCAAA ATTACCATTA CATTTTACCG AAAAATATAC 1020 

GACTGTTAAA AGTACGCCTT TTTACTATGA AATCATGAAT AAAAATGCTA GCAAAGGCAA 1080 

TGCTCTAGCA AAATTGGCAG ACCATTTAGG CTTAAATAAA GACGAAGTGA TGGCCATTGG 1140 

TGACAATGAA AATGACTTAT CCATGATTGA TTACGCTGGG ATTGGTGTTG CGATGGGCAA 1200 

TGCGACAGAA AATGTTAAAA CAATTGCCGA TGTGCATACC ACTAGTAATG ACGAAGATGG 1260 

TGTCGCTCAA ATTATTGAAA AAATGGTTTT AATTTAAACA TTCTTTTCAA TAACAATAAC 1320 

CCTCAATCTA ATCAACATAG ATTGAGGGTT ATTCGTTATT TTATGATGCT GCCGCTAGCC 1380 

ATTTTTCAAC GGCCCAACGG TGGCTTCCTC TTTTTAATAA ACGGACTGCT TCTTCTGGTG 1440 

CAACCCAGTG TAAAGTATTG GTCCTTTCTA AGGGTTCGCT AAGTTGCCGC CAAGTATTCG 1500 

CTACATAAAA ATATCCAGGA TTATAATAGG CAGTTTGGCG ATGGTTCGAA TAAAAATATT 1560 

CATCAGCTTC CCCTAAATAA CAGCCAATTT CTACACTAAT TCCTAATTCT TCTAGCACTT 1620 

CTCGGTGGAT CGCTTCTTCT TTCGTTTCGG TGCCTTCAAT TTCGCCTCCT GGTAAAAAAT 1680 
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AAGCTCCATT TGGCGCTTGT ACCAGAACCA TTGTGTTATT TTCTGGCTTT GAAACAATAA 
TGTATGCAGC ATATCTTGTT TGATAGGTCA ATGTCTCTTC TCTTTTTCCA AACGTTGGCG 
TTTTCATTGG CGCACCTCTT CTCTTTTCAT CTATTTTACC TGAGAAATTC CTTTAGAACA 
AGTCAGAAAC CAAGCGTTTT cTCTACATTT CACGTTTATC TATGCTACTA TTAACGTTGG 
AAATATAATT AAGTTTGAAA GGGAGTTTTT AACTATGAAA ATGGCTCATA CATGTGTGCG 
TGTAAAAGAT TTAGAGGCAT CATTAGATTT CTATCAAAAA GCCTTCAATT TTGAAGAAAG 
TCGCCGTCGT GATTTTCCAG AAAATAAATT TACTTTGGTT TATTTAACTT TACCAGGAGA 
CGACTACGAA CTAGAACTAA CTTATAATTA TGATCATGAA GCATACGATT TAGGAAACGG 
TTATGGTCAT ATCGCCATCA GTACAGATGA TTTAGAAGGA TTACACGAAA AACATCAAGC 
GGCTGGCTTT ACTGTCACTG ACTTAAAAGG CTTACCTGGA ACTGCACCAT CTTATTACTT 
TGTTGTCGAC CCAGATGGTT ATAAAATCGA AGTCATTCGT GAACGTTAAA AAGTTTCACT 
TGTTCACTTA CTGAGCTTTT TGTGTTTGAG AGCTGTCCGA GACAAAACTT GTCTCGACTT 
CTTCTCAAAC ACTTTTTTCA AAAAAAGTGC TACAATAGAA CGTATGAATT TATTGAGGAT 
GTGATGTTAT TATGACAAAA AAAATTATTG GAATCGCTGG CAATCAACTT TTGCAGGCAG 
CTGAAGTGTT TCACGGTAAC CAAGTGACGT ACACCCCACA AGGTTTTGTC AGCGCTGTTC 
AAGCCGCAGt GGCGTTCCTC TCGTTTTGCC AATTGGCCCC AAAGAATTAG CCGCTACGTA 
TATACAACAA ATTGATAAAC TACTTTTAGC AGGCGGTCAA GATGTTGCCC CTGAATTTTA 
CGGACAAGAA CCCCATATAA AATTAGAAGA AACGAATCGT GATCGTGACG AGTTTGAGTT 
AGCCCTAATT GAAGAAGCCT TAAAACAAAA CAAACCTATT TTTGCGGTTT GCCGTGGAAT 
GCAATTAGTC AATGTTGCTT TAGGTGGCAC ATTATATCAA GATTTATCGA TGTATCCTCA 
ATGGTCAGTT AAGCACGGAC AACAACCCAC TCAACCAATC TTTGCGACGC ATCGTATTGA 
TGTTGAACCA GACAGTCAAT TATCAAACAT ATATGGTACA ACTGGTCAAG TCAACTCGTA 
CCATCACCAA GCACTACATA CTTTAGGAAA AGATTTACGC GTTACTGCTT GGTCCTCTGA 
TGGCCTTGCA GAAGCAGTGG AATCAACAAA TGAACAGCAG CCTTTATTAG CTGTTCAATG 
GCATCCTGAA CTAATGTACG CACGTGATGC CAAATCCCAA GCACTCTTTA ACTACTTTGT 
TCAAAAATTA TAGTAAAAAC GGmGGGACGT CTTTCGCCCC TCCGTTTTTA CTTGCtATCA 
TACTTAAAAA TCAAAATTAT CTGGAtCTGC ACCAAAACGA TGATCCTTAT TTAACTGATT 
AATTGCGGTT ATtTCTTCTG GTGTTAATTC AAAATCAAAG ACAGCAAAAT TTTCCTGAAT 
TCTTTTTTCG TGAACAGATT TGGGAATCAC GATAATATCA TTTTGTAAGT GCCAACGAAT 
AATCACTTGG GCCGCTGATT TACCATATTT TTCACCAATT TTCACTAGTA CTGGTTCAGA 
TAATAAATTA-CCTTGTCCTA GAGGAGACCA AGCCTCCACG GCAATCTTAT TTTCGGCTAA 
ATATTTTCGT AGCGGTTCTT 6TGTTAACGT AGGATGTAAT TCAATTTGAT CCACCATTGG 
CACAACATTT GCTTCTGTTA ATAAATCTTC CAAATGATGT TGATGAAAAT TGGAAACGCC 



1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 
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AATTGCTTTG GCACGACCAC TTGCGTAAAT TTCTTCTAAT 
TTTACCAGCA ACAGGCCAGT GAATTAAATA TAAATCAACA 
TAAGCTTTTA TCAAACGCTT TAATCGCATT TTCATAGCCT 
CGTCGTCACG AAAATTTCTT CGCGGGGAAT ACCCGATTGA 
TACTTCGTTT TTATAAACGG CTGCTGTATC AATCAAGCGA 
TTTTACTGAA TTTGTTGCTT CTGAGCCATC TTCTACACGC 
TGGCATTTCA TTGCCGTTTG CTAATTTTTT TGTTTTATCC 
TCCCTTCATT TTCATTACTT TTAGTATACT ACGCATTCTT 
CTTGTCAGGT CATTTTCTTT ATTTTTAATA AATAACGACG 
TTAGTGCCAC AACTACAATT GTTCGAaCCG TAACGCTTCC 

GTAACCAGAT ACAAAAGAGA TAAACACAGG ACCAATTAAA 
ATTCCGCAAT TTCGATTATC TGACTACTAA AAAAACGACT 

GATTATGG7VA GATGAATGTT CGTCGCTTGT CAGCCGTCAA 

CAATAAAAAT TGTTACAGGT AATTTAGAAA GTAAATCATT 

CTTTCACGCA GCACTCACAA CCTTTAACAA AATTATAAAA 

TTATAACATT CCTAAAAAAA CATGTTAAGT CTTAAAAAAA 

TTTCGCCGAG AGATTTAGTA TACTAAAAGA AGAAATTTTA 

TTGATACTTG GTTTAGCCAG TACTGAAACA GTCGATTCTT 

GCGACCGTAA AAAAAGTGAG CGCGCTACAA CGCTTTTGGA 

ATTGTCGCTA CCTTAATTGA AAAAAGTTTA TCAATTTTAT 

ATTATTCAGC GAATCGGGAA GTATTTAATT GATCGCACCT 

CAACATTTTA GTGAAAGCCG TTTAAAAACG TTACACACTT 

TATAC 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CGTCTGTCCT TATCTCGCAC AGCTATTTGG AAAGCGATTA ATGAATTGAA AAAAGATGGC 
TATCAAATTA CTAGCGTGCA AAACAAAGGG TATCGCCTTG AAAAATCAGA TGTCCTTTCT 
GCAGAAGGCA TTCAGTTGGC ATTATGGCCA CAAACACCGC CTTTAACAAT TACCGTCCTA 
GAAACATCTG AATCTACCAT GACCGATGCT AAGCGGGCGA TTTTAGACCA GACGCCCGAC 
AATACTTTGA TTGTTGCTGA CATGCAAGAA ATGCCTCGGG GCAGGTTCGG CCGGCCGTTT 



GCTTTCCATG 
TAGTCTAACC 
TGATCTTCAT 
CGAATGCCTT 
TAACCCACTT 
CAAACACCTA 
AAAGTCATGT 
CTTTCACGAA 
CCAATAAGTC 
AATCCACAGT 
GGAAACTTTC 
CCCTAATCGT 
GCGAGTAGCA 
TTTTGCCCAT 
TATATTATTT 
AACTTTCTAT 
GAGAAAGTTG 
CTGCTCCTGA 
ACAATATTAA 
TTTTAATTTT 
ATGCAAATTA 
TAATCATTAA 



ATTCTTTATA 
CTAATTTTTC 
TCCATAATTT 
CGCCCACCCC 
CTAAAGCCCA 
AACCTAGGCG 
GTTTCTTCAG 
ACAATCACGA 
AGCACTCCGA 
TTAACTGCAA 
AATGTACATG 
AAAGCGACTG 
CATCGCTCAT 
TCAATTGAAA 
ATTTACAAAA 
TTTCTTCCCC 
GATTTTTATG 
TTTAGCAGAA 
TTGGGATGCC 
TCTATTTTTT 
TAGTAAAAAG 
TGCATTTCAA 



3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

4985 



60 
120 
180 
240 
300 
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TTCGCTACGC CAGGTAAAGG GATTTACATG AGCATGGTGC TTCAACCGAA CCAAAATTTT . 360 

GAAGAAATCG CTCAATATAC TGTGATTATG GCCGTAGCGG TCGCTCGAGC AATGGATGCT 420 

CTGGCAGGCG TTCACACAGA AATTAAATGG GTCAACGATC TTTATTTAAA TGGCaAAAAA 480 

GTTTGCGGGa TTTTGTCTGw AGCCATGAGC AATGTGGAAA CAGGGCAGAT TTCTAATGTT 540 

ATCATCGGCA TGGGGATTAA TTTTTCAATT ACAGAAACCG AATTCCCTGA GGACATTCGG 600 

CAGAAAGTGA CTTCCCTTTT CCCAGATGGT GAACCGACGG CGACGCGAAA TGAATTAGTC 660 

GCCAACATTT GGAATCATTT TTACCAAATT TTAGACGAAC GTTCAACGAA AGACTTTTTA 720 

GAAGAATACC GGCAACGCTC CTTTGTGATT GGCAAAGACG TCGCTTTTAC CCAAAACGGG 780 

CAAGACTTCC GCGGCGTTGC GACAACCATT AATGGAAATG GGGAATTGAT TGTCCAGTTG 840 

CCTGATGGCA CGGCCAAAAC ATTGTCTTCT GGAGAAATTA GTTTGGACCA GATTGGTGAA 900 

TGGCGTCGAG GATAAGTGGG GAAGTTTAGA AAAAAGCAAG AAAGCACCTG TCTAAATTTA 960 

AACAGATGCT TTCTTGCTCT TTAAAAATCA ACTTCTTTTT GTGCTAACAT TTCGCACAAT 1020 

TCTTTGGGGA AATTCGTATA GGGATTTTCA TTGATTTTCA TTTCTTCTAA AGGAGACAAC 1080 

CAGTTTTTAT TCGTATTCTG AAAAGCAATA GCTACTTTTT CTTCAAGATA CTTACATACT 1140 

TCTGCGCCTT TAAAATTTCG ATAACTGTCT TTTAAATGCA GAACTTTCTC TAATCTCCCA 1200 

TATAAATtTT CTCTAGTCAT GTATCGATTG ACTTCCTCAA AGTGAAGTAG TAACCAGAGT 1260 

TCAAAGCATT CATTTGAAAA AATTGTCTTG ATTTCAGGAC GATGTCGTTG TGCACATCTn 1320 

ACCGCGTTTT CAATCGATTC nGTGGT ^^^^ 
(2) INFORMATION FOB SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1959 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CCAAAATATk GaTGCAATGG TTtTGkTTyC CTGaCAATAG nTGayCGATC GTCACAAATT 60 

CGTATCCTTC ATTTTTTAAC ATAGTGATCA ATCCTGATAC AGCTTTGACT GATTCATGAT 120 

GGATATCGTG GATTAAGATG ATTGCGCCAT TAAAGACATT CTGTTTGACA ACATTATTGA 180 

TTGCTTCAGG ATTTCTTGTT TTCCAATCTA GCGAATCAAT ATTCCATTGA ATAATTGGCA 240 

TGCCAATTGT TTCAGCACTT TTTTTATCAA TAGCGCCATA AGGAGGACGG ATATTGCGAG 300 

GCAAGACGCC CGTAGCCTTA AAAATTGCTT TATTAGCTTT ATTCATTTCA CTTTGTAATT 360 

CATCTGTCGA AAGTGTATTC AACTGTGGAT GAGAATACGA GTGACATGCT ACCtCATGAC 420 

CCTCGTCGTG GACTTGTTTT GCAACATCTG GGTTTTGATC CACCATCTGT CCTAACATGA 480 

AAAAAGTAGC CTTCACGTTG TTTGTTTTTA GTATATTCAA CAAATCAAGT GTTGAGCTAT 540 
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TATTGGGTCC 


ATCATCAAAA 


GTAAGAGCGA 


CATACTTTTT. ATTTTCATCT 


AATGCAGGTA 


600 


ACGCATCTTT 


TATACTTTCT 


TGCGAAAGCA GATGTGTATG 


AATAAAAGGC 


GCAATGTCTC 


660 


TGTATTTTAG 


AATAACTTTA 


GTAGTTCCAG 


TTGTGTTCTT 


TGGTAGGTCG 


ATGGTCAATT 


720 


CATCAGGTGT 


ATAAGTCATT 


TTACTCTCGT 


AAGAAATTCT 


ATCCATCGTT 


AATACAGCAT 


780 


CAATTATTTT 


TTTAGGCTCT 


TTGGCGTGAT 


CAAGAATGGA TTGCTGAATC 


ACCTGTTGAA 


840 


TACCTAAGAG 


ACTTCCTTCA 


TCCGGAATCA AATCCTTATT 


TGTGATTTCT 


TTACCAGTTT 


900 


TTTCGGAAAC 


ATAAATGGCT 


TTCTCAGAAA 


TACTGTCTGG 


CTTTTTAAAA 


TTTTCCTTAC 


960 


TGCGATCCCA 


GATAAAACTA 


TCTGCTTCAA 


TTCTATACTG 


ACCAATTTTA 


GAAGATATTG 


1020 


CTGTTGCTTT 


TATTCTGGCA ACTGTAAGCA CTTCTTTTTC ACGATGCTTC TTTTGTTGCT 


1080 


TTTGAGCAAG CTCGTCTAAG AGCTGGTTTG CATTTTTAAA AGGAACAGTA CTTTCATAAA 


1140 


CTGGACGGTA 


AATAAGGCAC 


TTATTTTCTC 


CGTCATTTAT 


AGTATCTATT 


TTCGTTGAAA 


1200 


TTCCACTTTG 


GTCTTGCTCT 


TCTTGGATTG 


AAGCCATGAG 


ATTTTTTACA 


GCAGTTTCAT 


1260 


ATTTGGCTTG 


AGCTTCTACT 


TCTTGCTGTT 


TTAATGTATA 


AACAATATAG 


CCGCCAGTAG 


1320 


CGACAATTAC 


TAGAATAAGT 


GCGGTAAACA ATGCGAATAT 


ATACTTGCTA 


TAATTAGATT 


1380 


TTTTCTTCGG 


TATTTTTGGT 


TGTTTGTTTT 


CGTTTCCGGT 


TTTGGGCTTT 


ACAGCTTCAT 


1440 


TCAATTGATT 


TGCATTTTCA 


CAGATATCAG 


GTTCTTGTTT 


GTGACGTTGA 


CTACGACGTG 


1500 


CATAGATATT 


TTTATCAGAC 


ATATGACTTT 


ATCCTTCCTT 


TTATCATCAT 


AAAGTAAAAT 


1560 


GAAACAGCGC 


AATTGTAATA 


AAAAATAATT 


TATAGATAAt 


TTTTACAAGA 


AAGTTTTCTC 


1620 


TATTATTCtT 


TTTCtTTTGG 


CGTAGTAATT 


ATCGCTAATT 


CCTCACCATT 


TTAGATATTG 


1680 


TGGCTACTTA 


ATGTTAAACT 


CAATTTTGAT 


ATCATTAGAA 


TCACTAGTTT 


TGATTTTAGT 


1740 


TTTTGGAATT 


GTACCCTTCG 


ATAGGTTAAT 


CACTATGGTT 


GCAGTCTTTG 


AATCACCAAT 


1800 


AATGGTCTAG 


GATTAAAAAT 


ATAAACTATT 


ATCCAATCTT 


GTTCTATTTG 


ATTAAATATA 


1860 


CTACTTTAAT 


ACCTTTAATT 


TGTGATCAAT 


TCCAACTTCA 


CCAAACTCAT 


TTACAATTAT 


1920 


AATAATTTCT 


TCCCTGGGAT 


GGGGTGAGAC 


TTGCAAGGC 






1959 


(2) INFORMATION FOR SEQ ID NO: 42: 









(i) SEQOENCB CHARACTERISTICS: 

(A) LENGTH: 3625 base pairs 

(B) TYPE: nucleic acid 

(C) STHANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

TTTTTATTTG 6AGGGATTTA TTGTGTTAAA ACCATTAGGC GATCGCGTCG TAATTAGAGT 60 

CGCGAAAGAA GAAGAAAAAA CTGTTGGAGG AATTGTTCTT GCATCCGTTG CAAAAGAAAA 120 

ACCACAAACA GGTGAAGTTA TCGCAGTAGG TGAAGGTCGT GTGCTTGAAA ATGGCACAAA 180 

AGTTCCGATG GAAGTAAAAA TTGGTGACAC AGTAATGTTT GAAAAATATT CAGGAACAGA 240 
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AGTGAAATAC GAgGCGTAGA ATACTTAATT GTATCAGCCA AAG;^TTAT 
GAATAATAAA CAACTTTTAC AGAaTAATAT AATAGAAkAA AAAAATTATG 
ATCATGGCAA AAGAGnTTAA ATTTGCAGAA GATGCACSTG CAGCAATGCT 
GATGTATTAG CAGATACAGT GAAAGTGACA TTAGGCC^TTA AAGGTCGTAA 
GAAAAATCAT TXGGTTCACC ATTGATTACT AACGATGSAG TAACAATTGC 
GAATTGGAAG ATCATTTTGA AAACATGGGC GCAAAAT7AG TTrCAGAAGT 
ACGAATGATA TTGCTGGTGA CGGWiCAACA ACAGCGACTG TTTTGACACA 
C6TGAAGGCT TAAAAAACGT AACTGCTGGA GCCAACCCAT TAGGTATTCG 
GAATTAGCAA CAAAAACAGC AGTAGAAGAA TTACACAATA TTTCATCTGT 
AAAGAAGCGA TTGCACAAGT CGCTGCTGTT TCATCAGGTT CTGAAAAAGT 
ATTGCCGATG CAATGGAAAA AGTTGGTAAC GACGGCGTAA TTACOVTTGA 
GGGATTGAAA CAGAATTAGA TGTGGTTGAA GGAATGCAAT TCGACCGCGG 
CAATACATGG TTACTGACAA CGATAAAATG GAAGCTGTTT TAGAAAATCC 
ATTACCGACA AAAAAATCTC AAATATTCAA GATATCTTAC CTTTATTAGA 
CAACAAAGCC GTCCACTATT GATTATTGCG GATGATG7TG ATGGGGAAGC 
TTAGTATTGA ACAAAATCCG TGGTACATTT AATGTTGTCG CAGTAAAAGC 
GGTGACCGCC GCAAAGCGAT GCTTGAAGAT ATTGCTATTT TAACAGGTGG 
ACTGACGACT TAGGGTTAGA GTTAAAAGAC ACAACTATTG AAAACTTAGG 
AAAGTAGTTG TCGACAAAGA TAACACAACA ATTGTCGAAG 3TGCTGGTTC 
ATTGATGCCC GCGTTCATTT AATTAAAAAC CAAATCGGCG AAACAACGTC 
CGTGAAAAAT TACAAGAACG TTTAGCTAAA TTAGCTGGTG GGGTTGCTGT 
GGTGCTGCAA CTGAAACAGA ATTAAAAGAA TTAAAATTAC GAATTGAAGA 
GCAACACGTG CCGCTGTAGA AGAAGGCATG GTTTCTGGTG GTGGTACCGC 
GTAATTGGTA AAGTCGCTGC GCTAGAAGCT GAAGGCGATG TGGCAACAGG 
GTCGTTCGTG CATTAGAAGA ACCAATCCGT CAAATCGCTG AAAATGCTGG 
TCAGTGATTG TTGACAAACT AAAAAATGTT GACTTAGGTA TCGGATTCAA 
GGTGAATGGG TAAACATGGT TGAAGCCGGT ATTGTTGACC CAACAAAAGT 
GCCTTACAAA ATGCAGCTTC TGTGTCAGCT TtATTATTAA CAACTGaAGC 
GACAAACCAG AACCAGCTGC ACCAGCTCCT ATGATGGATC CATCAATGGG 
ATGAT6TAAA AATAAAAAAG TATTCTAAAA GCTTACGATT GAAAAATCGT 
TTACATAAAg' nATAAATAAT CTAAATAATA TCTTTGCTTA TGGTGTTTTC 
ACTCAATTAA CAAAAAAGAT GATAATTTAA AGGCTTCGTT TTAGCTAGAC 
TTATTCGATA AAATTTAGTA ATAACAATCA TTTTTTGAAA ACTAGTAGTT 



?GCCAC?GT7 
AGGTGAAGAA 
ACGCGGAGTA 
CGTTGTTTTA 
TAAGGAAATT 
TGCTTCTAAA 
AGCCATTGTT 
TCGTGGGAT? 
AGTTGATTCA 
CX^GCCAATTA 
AGAATCAAAA 
TTATTTATCT 
ATATATCTTA 
ACAAATTCTA 
TCTACCAACA 
GCCAGGATTT 
TACAGTAATC 
AAATGCTAGC 
AAAAGAAGCC 
TGATTTTGAT 
CGTTAAAGTC 
TGCATTAAAC 
ACTGGTCAAT 
GATCAAGATT 
TTATGAAGGA 
7GCAGCTAAC 
AACTCGTTCT 
>JGTTGTTGCA 
CATGGGCGGT 
MGCTTTTTT 
7CAAATATTT 
aGCCTTGTTT 
GGTAAT7TTT 



300 
"360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
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ATTTATTAAT AGAATATAAA TAAAAATTAT TATATTTTGA 
GAGGTGGAAA AATGGAAAAA TATACAATTG GTGTCGTTCC 
AATACTGTCA AAATTGGTTT GTTAGTAATC AAGTAGAATG 
ATATACTTCT TCATTTGCAC CATATTGATG GATTAGTGAT 
AAGTAAATAC TTGTTGTGAG TTATTAATGA CGATTCGCAA 
GGTTATTTTC TCGGACAGAA GTAATTAGTA AAGTCAATCG 
GTGCAGATGG CGTTTTTGAT TATTCTTATG ATCGTCAAGA 
ATTTATTGCA ACGTGTAAAA AGACGTTTTT ATCCTAAATT 
AAACAGTTAC TAAGAATCTT TCTGAGAGAT TGTATTTAAT 
GCTTGGGAAG TGGAGAAGAA ATTCTTTTAA CAAAATTGGA 
TATACAAACA CGCAGGACAA ACAATTACAT ATGAAGAACT 
ATACTGCGAA TGAACG7U\AA TATCGTGTCG CTTUVTGTGAT 
TTGAACAAGA TGTGAACAAA CCACAATATA TCAAAAC7AT 
TAACGGTGTA AAAGGTACGA AGGAGGAGAA AAAGATGTGG 
TGAAGATGAG GAGTATGAGA AGATGCCAGA TTTTGAAAAT 
ACCAACGGTA ACAGAAGAAG CAATCTTTGA TGACGAGGTC 
CGCAAAAGTA AAACAGGTCC AGCAACAAGT GCCGACTGAA 
TTTTGTTGAA AAACTAGAAG AAGAAAATCA ACTATATCAG 
GGAGCAATTA TTAATGCAAA CCAATGAAGT GGAAGCATTA 
AGATGTTCAA CTTAAACATG TTAAAGAAAC ATTAAGTGAT 
TTTACAGAAA CAATTGTCTG AAGAAAAGAT GCAACAGAGA 
AGACACAGCC GTTACGCTTT CTC7VAAAAGA AATTGGCGAG 
CAAGCAAAAG AAACATTAGT CAGCCAACCA CCAGTTGCAC 
TGCACTTTAC ACGCTGAGCA GTGGn 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3758 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE5S : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
CCnCCCCCCT TTTTTTTCCG GTCCCnAATn CCTCCGGCCn TTATTTTAGG TTTCCATTTG 
GTAAAAATAG TATAGGrGsw AAgCatCaGT TATGTATtAt TAGTGamAGA AGGTTGCcGT 
GAAGTAAAwA TAAAmAGGAG AGCcAAAGGA TGTTAAmAAA TGTAGAAGTG TTCTGGCAAA 



AGGGGAATAT TAGGAAGATT 
TTTGACAATT GAAACAAAAG 
TCAGATCATC CAAGGAACAA 
TGAAGTTACT GACCACCAGT 
GCAGTCAGAT TTACCGATTT 
AATTATTTAT TTACAATTAG 
ATGCATGTTA AGCATGAGTA 
AGCTATTGCA AACGAGGAAC 
TGCGCCAAAT TTGAGTGTTT 
ATTTTTCACA ATAGAGTATT 
CTACAAAAAT GTCTGGAAAG 
TTTCCATTTG CGCAAAAAAA 
TCGTTCAAAA GGCTATATGT 
CAATTTTTGA CTAAAAACAA 
GAACAATCGA TGAAAGCACA 
GTTTATGCAA AAGAGCAATC 
CTGTCAGAAG AGTCATTTTC 
CAATCCATTA AAAAGCTAAC 
CAAAAACAAG TAGTCGAAAA 
AAAGAAACAA CTATCACTTC 
CAGACCAGTG aAGAGrATTT 
TGTTATTAGA AGCCAAACGT 
ATTCATGAGA ATGGACACGT 



2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3625 



60 
120 
180 
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ACTTTTTAGA TAAACATGAA TTAGATATGT TGATGCCCGA TQTGTGGATG TTTGGTGATG 240 

GCTCTTCTGA GATGGGTAAT CGTTTGGGTC AATTGGTTGT TTCTGGTCGT AAGACAGCGA 300 

CATGCTCTTC TTTAGATATT TATAAGATGG AAGAAGAACA ACTCCCTAAA GCTGGTCAAT 360 

ACGACATAAT TTTAGATGGT CAGTCTCAAC CGCTGGCTAT TATTCGAACA ACCAAAGTTG 420 

AAATTATGCC CATGAACAAG GTCTCAGAAT CTTTTGCGCA AGCGGAAGGT GAAGGAGATT 480 

TGACGTTGGA TTATTGGTAT GAAGAGCATG CGCGGTTCTT TAAAGAAGAA CTGGCCCCCT 540 

ATCAGTTACA ATTTTATCCC GATATGCTGT TAGTTTGCCA AAGTTTTGAA GTAGTGGATC 600 

TTTATACAGA AAAAGAATAG ACAGAGAAGC GGTGACTGAC ACGATTGAAA CGAGCAGTCG 660 

CCGCTTTTTT TTTACTGATT TTTTGAAAAA GTTGCGTATG TTTATATAGA CAGGGTGATA 720 

GAAAAGAGAT GTGTATTTTA AGTGTCGAGG AAAAGATGGT ACAATAACCT AAGGAGAATA 780 

CTAACAAGGA GTGAATGTTA TGAAAGAAGA ATTTTCATAT GAAATTTTAG AAGAAGTCGC 840 

TGTGTTATCT GAGAATGCTC GTGGCTGGCG TAAAGAATTG AATTTAATTA GTTGGAATGG 900 

TCGTCCACCA AAATTTGATT TGCGTGAGTG GGCCCCTGAT CACGAAAAAA TGGGTAAAGG 960 

CATCACCTTA ACTAACGAAG AATTTGCTGA ATTAraGCAAA ACAATTAmaT CCATGTAAAA 1020 

GGAGCACCCT GTCAATGAAA AGTATGACTG GATTTGGAAA AAAGACCATT CAAAATGAAA 1080 

ATTATCAATT GGATATCGAA GTCAAAAGTG TCAACCAGCG GTTTTTAGAT ATTCAATTGC 1140 

GTATGCCCAA GGAACTGAAT GCCTATGAAC TGGTGATTCG GCAAGTAATC AAACGCACGT 1200 

TAAAACGTGG GCGTGTGGAA GTCTATGTAA ATCTCCAAAA AATCGGCAAC AATCAAAAAG 1260 

AAGTGCGCGT GCAGTGGGAC TTAATTGATC AATTATTAAC TTCGGTAGAT CAACATTTAA 1320 

AGGAAAAATA TCCGGAAGCA ACGTTTGATG CAGGGGACAC GGTCAACCAT TTATTAAAGC 1380 

AAAATGATTT TGTCGAAATA GTGGAAGCAG AAATTGTTGA TCAGACATTT GAACCATTTC 1440 

TTGTTCAAGC ATTTGAAGCA GCCATCGCTA GTCTTGACCA AAGCCGTGTG CAAGAAGGAA 1500 

CGCAAATTAA ACAAGTCCTG CTTGACTATG TAGCAGTGTT GACCCAATCA ATTCAAGAAC 1560 

TGCAAGCGTT TGTGGGCGTT TTTGAACAAG AATACCGTCA ACGTTTTGAA GCGAAATTAA 1620 

ACGAATGGTT AGGTAGCCAA GTGGATGAAA CACGCTTATT AACAGAAATG GCCATTTTAT 1680 

TGGAAAAAGG CGATATTCAT GAAGAATTGG ATCGTTTGGA TATCCATATT GATAAGTTGC 1740 

ATCAGTTGCT TGACGAAACA GAACCAGTGG GCCGAGAACT AGATTTCTTA ATTCAAGAAA 1800 

TGAACCGTGA AGTCAATACG ATTGGTTCGA AATCAAGTCC AATTGAAATA AAAAATAGTG 1860 

TCGTTCAAAT GAAAACAACC TTAGAAAAGA TTCGTGAACA AATTCAAAAT GTCGAATAGA 1920 

GAAAATGGTT TCGTGTTTTT CCAGAAAACG CTATAATATA GAAGTAACGA ATAAATGACA 1980 

GAGGTGAAAG.CAATGACTAA ATTTTATGAT GTAACCTTTC AAGAATTAAG TGGTCGTTCT 2040 

GTGGTAAAAA CAGAAGTCGC TTCAGACAGA GAACCATTTG ACGTCTGGCA AGATGCTTGT 2100 

GCAAGCTATT CTGAAACAGA ATTAAATATC CAAATTAACG AAGACACGTT TGTTACTTTG 2160 
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AATCGTCACT TTGTTGTTCG CATTGACGTG AAAGAAGTTG 
GTTCGTCGTC GCGATGAATT GATGAACGTG GTCAATACGC 
AAAATACCAC CAAGCATTTC ACGCTTGGTG GTATTTTTTT 
AGCCAACGGC GCATAGGAAA GAAAGAACTG CGGCACTTAA 
CAGTTTTACT GCTTTCTTTG CCGTGTTTTT TCTGACGTTC 
GGGTAATTAC TGAGAAAATC AGCCCGAAAA GGACTAAAAC 
GAATAAAATG ATTCGGTTCT GCAACGGTGG TATCCCCAGA 
CAGCTTCCGC TGTTGTTTGT TTTGGCTTAA CTTCTTGGTA 
CCGTACTATT TAACAATGTC AAGGGCGTGT CGACTTGCAC 
AAGTAGGCTG ACTGCCtTTG GGAACGACAC CGTATAAATC 
TGTGATCAAT GGTTTGTGGT CCTTTGTCTA AAAGTTTTTT 
CATAGTCCAG CAAAGCATTC CCAGCCACAT GCCGTTCGTA 
AATTACCAAC GCCTAAAATG ACTTCAATTA AACGGAAATC 
TATAGTTA7VA GCCGCCTGTT GGACTAGAGC CTGTTTTTAA 
CATACTTAGC TCCTGGTAAA GAATAATTGT ACGCTTCAAA 
CGACCATTGT TTTCACCACT GGTTTATTGG TGATTTCTAA 
GAAAATGATA GGTTAGTGAA GCTAAATCTC TAGCTGTAGA 
CAATTCCTTG CATTTGATAC AGGCCGTTAA AGGCACTTGC 
AATAGGTAGT ATTTGTCATC CCTAGTTCAG CCGCTTTTTG 
CAGTAGGCTG GTTCCCTGAA ATTAAGTTTG CCAACATGAG 
GGACAGCAAT CATTTTTAAC AGTTCACGGA CTGGATAAGC 
TATTACTAAT GGCATATATT TTAGAAATAT CGACATCTTT 
TATCCATTGT AAATTTTCCT TGCTCCATTG CTTCAAAGGC 
TGGCAATACT GGCAGGATTC CACGCTAAAT CTGGTTGCTG 
AATCAGCATC AACAACGATG GCTGCTTTGG GAATGCCATC 
TTTTTTTAGC AATCGCGGTA ATATCTTCTT CTGCGTGAGT 
AGGCCCCACa ACTGACAACT AAGCCTAGCA GAAACAAT 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1359 base pairs 

(B) TYPE: nucleic acid 

(C) STEIANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
TCTTCTAATC CTGATGCATT aGCTGGCGTG GATGCACAAC GTGTAGCAGC TTTCCAAGCG 



ACGGTCCTGT 
TTTCTAATAT 
TAATTAGAAT 
AAAGATTGTC 
TAAACGTTTT 
ACTTGTGGCA 
ATGTTCGTTT 
AGCCACTTTG 
TTGCCCTGCT 
GTTAGCCAAA 
GCGTTCGTAT 
TTCACCATCT 
TCCTTGTTTG 
ACCATCCACG 
CGTTTCCTCG 
AATTTCTGGA 
GACGTTATCG 
TTGCGCTCCA 
ATTCATTAAA 
AGTCGCCACA 
AACACCTAAC 
TTGCGTAGCA 
CAAGTACATG 
CGACCAGAGA 
CGCTTTCAAA 
CACAGTAGGT 



CGACAAACAA 

GGGCTTATAA 
AAAAGAAACA 
CGCATGGATG 
TGAAGCTGAC 
ATCGATTTGG 
GATTCTTCCg 
GGTGCTTGAA 
ACTAATTTAA 
GTAACTTTTT 
GTCGTATAGG 
TGGTTTTGCC 
ACAGTCGCAA 
CCTTGGTAGC 
TAAGGCGTGC 
TAATTTTTCA 
CCATTTGGGT 
CTGCAGTTGT 
TGAACAAAGT 
TTAGAAGAGG 
GTAATTTTGT 
GTCACAGTCG 
GTCATCACTT 
ATTTCTCCAG 
GTCGTCCCCA 
TGGGCAAAAA 



2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3758 



60 
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GCTAATGGGA AAGCATTGGT CAATCTACGA AAAGCAACGC AAGCCAACAA AGTAAGTTGG 120 

ACTGTTGTAG CTGCTGCTAG TGAAGGCTGG GCAGCTAAAG TCTTTCCAGA ATTAGC7ACA 180 

AGTGAAGAAC AAGTTGATGC ACTTTGGAAT GAAATTTTCA AAACAACACG AATTTATGAA 240 

GAAAACCCAG TCATTGCTTG GGATATCCAT GATAAAAAAT TACAAGAAAA AGCGGCTGAA 300 

TTAAACGAAC AACAATTTAC TGCTTTACAC TACACAGCTC CTGGGACAGA TTTAACGATT 360 

GGTTTGCCTA AAAACCATTT ATGGGAAGGC GCTGGTAGTT ATAACGCTCG TGGGGAAGAA 420 

TTCATGGCTA ATATGCCAAC AGAAGAAGTT TTCACCGCCC CAGATAGTCG CCGCGTTGAT 480 

GGTTATGTTT CTAGTACAAA ACCACTAAGT TATGCTGGCA CAATCATTTC GGGAATGAAA 540 

TTTACATTTA AAGATGGAAA AGTTGTTGAC TTCTCAGCTG AACAAGGCGA AGAAGCATTG 600 

AAAAATCTCT TAGCTATTGA TGAAGGTGCC AAACACTTAG GAGAAGTTGC TTTAGTACCA 660 

GACCCTTCAC CAATTTCACA ATCTGGTTTG ATTTTTTATA ATACATTGTT TGATGAAAAT 720 

GCCTCTAATC ATTTAGCCTT TGGTTCGGCC TATGCCTTCA ACTTACAAGG TGGCACGGAG 780 

ATGAGCGAAG AAGAATTAGC TGAAGCTGGT TTAAATCGTA GCCAAACACA TGTGGACTTT 840 

ATGGTAGGAT CTGACAAAAT GAATATTGAC GGCATTAAGG AAGACGGGAC GATTGTTCCA 900 

GTTTTTAGAA ATGGCGATTG GGCATAGTCC TACCAAATGT TGAAACCAAA GTTTTTAGCT 960 

CCAAGAAATA AGAAGAAACT GCTTTTATTC AGTAGGTTCT TTTAAAATTT AGAAAGTGAG 1020 

GCCTAAGTTG CAGCGACTTA GGCCTCACTT TCTTATGCTA AAAAATCAAT TGCATTCGAA 1080 

GGACCTTTAA CGTTATAATC AAATTGAAAA CGGATTCTTT TGCACGATAT AAGTTTACTT 1140 

TTAAATAAAA GAATTTTTTT CAAAAAAGTA GAATTGGAGA ATTAACATGG GATTATTTGA 1200 

TGGATTATTA GGAAATGCAA CACAAAATAA TAATGAAACA GCTGAaAAAG AaTTACGGGA 1260 

TGTTTTAATC CCTAACGrAA AAGTAGATAT GGCCTTCACT TTAGTAAGAG ATTT7\ATTGT 1320 

CTTTACAGAT AAACGCTTAA tTTTrGtCGA TAACCAGGA 1359 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15614 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

ATTAAnATAn ACCAGGTTTT AGATAGTATT TATTCAAGTT TACCTCnAAG GTATTAATCT 60 

GAAACCCkGT TATtTtAtTa GAGAGTTCyT TTTCaAATAT ATGT/UUVTrG cTAACaACrG 120 

GGTTCTGTTT GTTGTTGGAC tATTTTTTTT GATCcCCAmC CGTTCTTGAA AAAAAAcAAA 180 

AAAATAAACA GTGyTCGCTT GATTAATCGA ACGTTTGTTC GTATAATGAT TTTGCaAGTG 240 

AGGTGATGTT TGTGAAATTT GATTATGAAA AAGAACCTAG TCGCGATATT CTTTGTATTG 300 
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ATTGCAAAAG TTTTTACGCT AGTGTAGAAT GCGCAGAACG TG^GTTAGAT CCTTTGAAAA 360 

CGATGTTGGT AGTGATGAGT AATGCCGAAA ATGCGGGGGG ATTAGTTTTA TCAGCATCGC 420 

CAATGGCAAA AAAAGTTTTA GGTGTTTCAA ATGTTACGCG TAAAAATCAA GTGCCCGCTC 480 

ATCCTGATTT ATATGTGGTT CCGCCACGAA TGAATTTGTA CATGAAAAAA AATCAAGAAA 540 
TCAATAATTT ATATAAAAAG TTTGTCGCAG ATGAAGATCA CTCTGTGTTT AGTGTAGATG 600 
AATCATTTCT TGATGTAACT GCTTCTTTAA GTTATTTTCA TTGTGAAACT GCCTATAAAA 660 
TGGCTCGGAT TATCCAAAGA GTCGTTTATA ATCATGTGGG TATTTATGTG ACCGTTGGGA 720 
TAGGGGACAA TCCGTTACTG GCTAAATTGG CGTTGGACAA TGGCGCCAAA CATTCGCCAG 780 
ATTTTATTGC AGAATGGCGG TATGATCGTG TACCAGATAC TGTCTGGCAG TTGCCTTCTT 840 
TGACAGATTT TTGTGGGATT GGTCGCAGAA TGGCTAAACG ACTGAATCGT TTAGGAATTG 900 
ATTCTGTATA TGAGTTAGCG CAACGAaTCC GCATCTTTTA CAAGAAACAT TTGGCATGAT 960 

GGGGCTTCTUV TTGTATGCTC ATTCATGGGG GATTGATCGT ACCTTTTTAG GGAAAAAAGC 1020 

ACCACATAAA GCTGAAAAAT CATTTGGGAA TAGTCAGATT TTACCGAGAG ATTATGCGCG 1080 

AAGAGATCAG ATTGAACTAG TCTTAAAGGA ACTGACAGAA CAAGTGGCTG CTCGTTTAAG 1140 

AAAAGCGCAT TGTCAAACGG AATGTATCAC AGTCTACGTT GGTTATTCCA AAGGTCAAAT 1200 

TGACCGTGAA GGACGAACTG GTTGGCGAAA ACAACAAACG ATTCCTGCAA CAAACAATAC 1260 

AAAAGTTTTG ATCACCTATG TATTAGCCCT TTTTAGAGAA CATTATCTAG CGGGAACCGA 1320 

TGTCCGGCAA TTAGGGCTTT CATATGGGAA ATTAGTTTGG AATGAATCTT TACAATTAGA 1380 

TTTATTTTCG GAACCAGAAG AGC/^TTAG TGAAATGGAG TTAAACTATC TGATTGATAA 1440 

AATCCGACAA AAATTTGGCT TTCAAGCTTT AATTCACACC TCTTCTTTGT TGGAAGGCGC 1500 

GACGGCAATT CATCGTTCAG GATtAGtCGG TGGACATGCT GGAGGAAATG TCGGTTTAGG 1560 

AGGATAACAT CATGAAACGA ACAAAAAAAG AmTTTACACC TTATAATGAG TACCATGaTC 1620 

GGCCCTTTgA ACTGAAATGG GCGACCGCTT TTGCATTGGG GGAGTTACAA GAAGGAATCG 1680 

AAGCTTCTGA CGAATATTCT AAGAGAGCGT TTGAACGCTT ACCGCAACAG ACAACAGAAG 1740 

AAATTGAAGA ATACTTAGAG CAGTCTTTAA AAAATAACCA AGTTTTAGAa ATCCAATTGA 1800 

ATACATTGGA TGATTTTGGC CGTGTTCAAC CGCCAATTGT TGGTGTTTTT GAAGGGATGA 1860 

GTTCAATGGA GTATGTAAAA GTTGGACATA CACATATTTT ATTTGAAGAG ATTCGGCATA 1920 
TTAAAAGTCA TGAGTTTACA AAGTGGTCGA GCCTTGATTA AAAAACTATT GTTATACGTG 1980 
CGTTTTGTGA AAACATATGA GATAATGTTC ATAGGTAATA GAAATAAAAA TTATGGAGGT 2040 
GGCTAGGATG GGACATCATT TAAAAGGACA CCATGGCAAA AAAACAAACG CAAATGGCGG 2100 
CTTTTTAACA AAGTTATTGG CAGGTGGCGC CTTAGTGGGC GTAGGAAAAA AGTTATACGA 2160 
CAATCGTGAA AAAATTAAAG AGCTGTTAAG TGACGACAAA GAAAAAGGCA ACGACACTGA 2220 
AAACAAATAA AGGACTGGAT TGACGGAATG GTAAAAGATG CAGCGTATAT TATTGAAGAG 2280 
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ATTACCACGA TTTTAGAAGG AACCTATTTT GTGATAACCA AGMAGCGAA TAAGCGCCGC 2340 

ATGTGTTTAC ATGTTTCAGA TACCGCGAAA GAATTTTTAC GTTGTCGACT GGAGTATGAA 2400 

aCATTCTTAG AGCAACAGTT TGATTATGCC aCAATGTTTT TAGAAACATG TACGGAAGAT 2460 

GTTTCTGTTG AAATTCAAAC GGGACTACTT GAAGAAACAA ATTATCAAGC ATTAACCGTG 2520 

GAAAAATTAC AACAGATAGC CACTAAAATG TAACACAGCT GTTGTGTTAG TTTTGATTCT 2580 

TAATTTGGTT AAGGAGGCGG ATAAAATGTC AGTATATCAA CATATTTTGG TTGCTCTTGA 2640 

TGGCTCAGAT CAATCTGAAA AAGCTTTTCA TGAAGCTGTC AGGATTGCCA AAGAGGAACA 2700 

AGCCACTCTT TATTTAGCTA CAATTATTAA TGATGCTGAA TTTACGACGA GTCCATTTTC 2760 

TTTTGAAGAA TTGTATGATT TAGAAAAACA CAAGTCAGAA GAGATGCTCA CTGGAAAGGC 2820 

AAAACAAGCG AGTGAAATTG GTGTAAAAAC AGTCAAAAAA ATTGTTGAGC TTGGGAGCCC 2880 

AAAACGTTAT CTTGCCAACA CGATTTCTGA AAATTATGCC ATAGATTTAA TTGTGCTAGG 2940 

TGCCACGGGA AGAGGTGCTA TTCAACGAAC CTTGATTGGT TCTACTACAG ATTATGTGGT 3000 

TAACCATGCT TTGTGCAATG TCTTAGTCGT TCGCTAATTG TGAGTAAATA ATCAAGGAAA 3060 

CAATGTTTTA TCGGGACGTT TTTTTAAATT AAACGAGAGG TGAGTTTATG AAAACAAAAA 3120 

TCGGAAAAAC AGTTATCTTG TCAGCATTTT TATTCACAAG TTTCCTTTTA CTGAGTGGTT 3180 

GTACCTCGGC TGGCGAAGAG ATGGAAAAAA CAATTGATCG ACAGAAAGAA AAAGTCGATA 3240 

AAACGGTCGA TAAGCAGAAA CATAAAAATG AAAATTCCAT GGAAAGTTAC GACGAAAAAG 3300 

TTGACCGTTC TTTAGATAGT CAAGAAGACA AAATCGATAC TACTGAGTAA CGGGAAAGCT 3360 

TTCAACTATC GGCTAGTAAA TAAATAATAT ACAAAAGGGA GCATCCACTG CAATTATTTC 3420 

TCACAGCGGA TCCTCTCTTT TGTGGTTAAA AAGAAAAAGT TAACTGCTCC CCAACAGTTA 3480 

ACTTTTAATC ATGTTGCACA TAGTTAAAAC ACTCTATATA TCATTTATAT CATTTGAAAC 3540 

GGGTAGTGTA AAAAGAAATC TCTTGAAAAG AGTCTCCATT TATAGGGAAG ATTTCCCATA 3600 

AAAAAGAGAA ACGTATAATC ACAGAGAGTT AGAGAAAGCA AAGGGAAATA AGTATGTTAA 3660 

AATAAGAAGC AAGAAAGAAT ACAAGTATGC AGAAAATAAC AACTAATTAG GTGAAAACAA 3720 

TGGAAATACA TTTTGAAAAA GTAACAAGCG ATAATCGTAA AGCAGTGGAA AACCTACAAG 3780 

TTTTCGCTGA ACAACAAGCC TTCATTGAAT CAATGGCAGA AAATCTAAAA GAATCAGACC 3840 

AATTTCCAGA ATGGGAATCG GCAGGTATTT ATGACGGCAA TCAATTAATC GGCTATGCCA 3900 

TGTATGGACG TTGGCAAGAT GGGCGCGTTT GGCTAGACCG CTTTTTAATT GATCAACGGT 3960 

TCCAAGGACA AGGTTATGGG AAAGCAGCCT GTCGACTTTT AATGTTAAAG CTAATAGAAA 4020 

AATACCAAAC AAACAAGCTA TACTTGAGTG TTTATGATAC GAACAGCTCa GCAATTCGGT 4080 

TGTATCAGCA ACTCGGTTTT GTATTTAATG GTGAATTAGA TACGAATGGT GAGCGGGTAA 4140 

TGGAATGGAC ACATCAGAAT AAGTAGATAG AACAGGTGAG ACAAAAGGGA TAGCGCATTT 4200 

TGTTTCATCT TTTTCTTTTA ATTGGCTTCA TTTGTTTACT TTTAGTAAGT AAAATGGTAA 4260 
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ATTTAGTTCA 
TTCCATGATG 
CAAGAATTTC 
GATTTTGAAA 
GTGCATGCCA 
CGATTTGATT 
GGCAACCAAG 
AGTTTAAATA 
TTTGCTTTAT 
TTTGCGCTAG 
TATCTAATTA 
GCAGAAGAAA 
TCGTGTGAGT 
CTTTGACAAA 
ATATAGAGAA 
TGGTGGCCAC 
TACAGATATA 
ACCATTATAT 
AGAAGATTGC 
ATCGGTACTT 
CCAGTGAATC 
AAAATAGGAC 
AAAATATCTT 
TTTGGTAGAT 
TTGTTAGAAA 
GGTTGATTGG 
GTAATTGCTA 
CCAATATATT 
AAACTCTGGT 
TTTAATAGAT 
TATAA6TGAA 
TCGGAGGTGT 
GCGAAGTCGA 
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ATCTGTTTTG AAAAAAGAGT 
TTATTTGTAG TTTTTGTTTT 
CTGAAATTGA ATTGATTCAT 
CAATGTTTGG CTCACGAGCA 
ATCAAAACGA TTCAAAGCAT 
TTCCaACATC AAAGAATGTA 
ACACCTATTG GTTACTGTTT 
TTGAAGAGCC AGAAGTAATT 
GGAAAGAAGC TGTTGCTTCC 
CTTCTGCTTA TGGTTTACAA 
ATGGTGCAGT ACCGAAACAG 
AACAACAGCA ACCGTTAGTT 
TTAAGGAAGG CACATGGACG 
ACGAAAGACT TCTGTAATAG 
TACTGGAATA ATATTACAAC 
AATTAATTAA TTATTTGGTG 
TTGAAGTAAC ACCCAATGAT 
TTTTCTTGCT GACTGCTTTA 
ATGATCGCGA CCACAGTAAC 
TAGTGATCTT CATTACATTA 
AATCAGAAAT TTAATTTGCT 
ATAAGTCATT TCGACTTATG 
CTGTATGATG GCACCTCACA 
AGGGAAAACC . ATTAAGACGT 
ATAAGATAGC ATCATTAATT 
TAATAATCAA GTCAAATTCT 
ACGTATTTGA ACGAATCCGA 
GACTATGATT CAGGCCCATG 
TAATTTCTGG AAAAGTATTA 
TTGTTTCTAA AAGAGCGGAA 
GCTGAGTAAA ATGAATTTGT 
CAAAAGGCAC TAATTGAGAA 
CGACTTCTTC TGTTAATTGT 



422 
TACTGGAGGG 
CCAATGTCTG 
CGGTCGTTTG 
GCAGTAAAAG 
CGTTTTAATA 
TTATTAGCAG 
GATAAGTTAC 
GAGAAGCTAG 
GAGGCTGTTT 
GGAGTTCCCG 
CAAATCAGCC 
TCATTAACTC 
TGTCATTAAG 
TTAATGCGGG 
AATGAATGCG 
TTAGGGATTT 
GGAACAATCA 
ATTTGGTTGG 
TGGTGGATTT 
ATTTTGCCAA 
GGTTTCgTTT 
TCCTATTTTC 
TTACGCGGTT 
TAAAAGTTTA 
AAAATACTTG 
TCGAGTCGAT 
TGCAAAGCTA 
TCGCTAATAA 
AGCAACCAAA 
AAGAT^ATGCA 
CGAGTGAAAA 
AACAAATAAA 
TCAATCTGTG 



TAAGGATGAA 
ACCGCATGCA 
CGTTAGGATG 
AAGAAGTTTT 
TTGAAGGGAT 
CAAAAGCAGC 
AAGAAGGTTT 
TAAAAGAGAC 
GGACGGCGGT 
CACTAATCAT 
AAACTATTCA 
CTTCTTCTGA 
CAGCTTAAAA 
GGTGATAATA 
TCACAACaCG 
ATTCAGCAAT 
TTAAAGAATG 
CAAACTTTAC 
TATTCTATTT 
GTAAAAGTCA 
AGCTGATTAT 
TTTTTTATTT 
CTTGATAAAG 
ATAAATCGAG 
GAACACTGGT 
GGTCACCTAA 
AAAtATCGTG 
TCAAAATCTT 
AAATATAAGA 
ATTCTGCTTT 
AGGTGGCCCT 
GACAGGTTAA 
AAAAAGCATT 



AATAGAATTT 
TGAGATTCAA 
GTCAGCAAAC 
GACGCATTGG 
GAGGAAGCAA 
TGGATACATT 
ATTTGTGAGA 
AACGATTGAT 
TCAAGAAGAT 
TAATCAAAAA 
AAAAATTCTC 
AGAGTCAGCT 
TAACGTTATT 
TGTTTAACGC 
AGCACAATAT 
TACGGGCGTT 
GAATACTGTG 
TGTTCGAGCA 
ACTTCCTTTT 
CACAAGATGG 
CTTAAGAGTG 
CAGAAAAATA 
GGGCTATCTT 
CAATTCTCTT 
TTCAATAATT 
TTGTTCTTCC 
GaCATAGTTT 
TTTTGAGTCG 
AGATGAATAG 
TAATCGTGCA 
TTGTGTCAAT 
CAAGTG7ACA 
AAATTTTAAA 



4320 

4380 

4440 

4500 

4560 

4 620 

4680 

4740 

4800 

4860 

4 920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 
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TTTTCACGTA 


TTCTTTGACT 


GATGACTTCT 


TGGTCTTCGG 


AG.TAGTATTG CTGAAAAGCG 


6300 


CCTCTTTGAA AATATCGGAG 


AACTTCCCGA 


TAGGTTTCAC 


GGCTGAGTCC 


AGTGAAATGA 


6360 


CGACTGAGTA AAAGATAATC 


TTCCATTTTG 


ATTGGACTCT 


TTTTATCTGT 


TTTTTTCTGG 


6420 


AAACGTTGTT 


GAAATCCTTG 


ATGCTCTCTT 


AATAAATCAA 


CAATAAAATA 


CATAATAAAG 


6480 


AAATCGTGTT 


CATAACTGTT 


ATCCATaCTA 


ATAATATTTA 


ATTGGCATTG ATTACGACGC 


6540 


ATTAATTGTT 


TGATTTCCTG 


CTTATCTAAT 


TGATAAGGCA 


ATTCATAATG 


ACCATAATAC 


6600 


ATTAAAAAGA AAAAGGTTGT 


AAAAATACGT 


AAAGGCAATT 


CTTCTGTTTG 


GCTAGTTAAT 


6660 


TGATAGCCTT 


GATCAACGAT 


AACGCG7VAGA 


GAGTMGTTT 


CCAAGACCTG 


GTTACACTGC 


6720 


TTCAATTTTC 


GAGCAAAAGA 


AGAGCGACTA 


ATATTGATGG 


TATCGCAATA 


ATATTCGATG 


6780 


GTTTTGCCAG 


GATTTTGGAT 


AAACTCATGG 


AGCAATTGGG 


TGGTCGTTGA 


CGTTTTTGCC 


6840 


ATATCCGTCA 


GCACGTTTAA 


ATATAAGCAC 


APATTAAAGT 


TTTCTAAACG 


ATAGCCTAAT 


6900 


TTATGCGATG 


TTTTAATGGT 


TAAACTTTPG 


rCCCAACGCT 


GTTTAATCTC 


CTCAATATAA 


6960 


CGAATAAAAG 


TAGACTCAGA 


APTArPAATA 


TATTTTGPAA 
XJnX X X X o^rin 


GTTCTTTTTT 


AGTCATCCAT 


7020 


TTATCAGAAA 


TGGATAAGAC 


ACGAATGATT 


TCT TPCATT T 


TCCGATAATT 


TTTTTCTAAA 


7080 


AACCTAATCA 


TATTGTTTCC 


TCCCAAGTCC 


TTTACTTTAA 

XXX XXX 


TGAAACAATC 


ATCGAAACAA 


7140 


TAACGCAATT 


TCAGAAATGA 


AACCGCTTGG 


TATGAAAGCT 


GGTGTGATTT 


AATGAGTTCC 


7200 


TTCTCTGTAT 


AATAATTATC 


AGATTAGAAT 


AATGAAATTT 

^V^XwrW^X X X 


TACGTGCTAG 


GCAATTCtaA 


7260 


TTTAACTGkT AAwTCmTaAA 


AGaAAAGGAG 


AGAACAGACA 


TTAAATTACT 


AAAGAmATTT 


7320 


GGGGTTTTCG 


GCGTAGTCTT 


TTTACTTGTC 

XXX mm\tm X X \7 X V# 


TCATCCTATT 

X X X •* X X 


TCATACCGCT CATTGGCTAy 


7380 


GCAGAAACAG 


CCAAAGAAGT 


TGAAATTACA 


TCCGCTCAAA 


TGATAACAGA 


TGAGAATGAT 


7440 


AAAACGAATA 


TTAATATCGA 


GTTAAATCTT 

\J X X X \^ X X 


CTCAACCAAA 


CAGAGCAGCC 


ATTACAACGA 


7500 


GAAATTCAAT 


TGAAAAATGC 


ACAGTTCATG 


GATACTGCTG 


TAATTGAAAA AGACGGATAT 


7560 


TCTTACCAAG 


TGACTAATGG 


TACGCTTTAT 


CTGACTTTGG 


ACGCACAAGT 


AAAAAAGCCG 


7620 


GTACAGCTTT 


CGTTAGCTGT 


TGAGCAAAGT 


TCGCTTCAAA 


CAGCTCAGCC 


ACCTAAGTTA 


7680 


TTGTATGAAA ACAACGAATA 


TGATGTTTCA 


GTTACTTCTG 


AAAAAATAAC 


AGTAGAGGAT 


7740 


TCTGCTAAAG 


AATCAACTGA 


ACCAGAAAAA 


ATAACTGTAC 


CAGAAAATAC 


GAAAGAAACT 


7800 


AACAAAAATG 


ATTCGGCTCC 


AGAAAAAACA 


GAACAGCCGA 


CCGCAACAGA 


AGAGGTAACC 


7860 


AATCCATTTG 


CAGAAGCAAG 


AATGGCGCCA 


gCTACTTTGA 


GAGCGAATCT 


GGCACTGCCT 


7920 


TTAATTGCAC 


CACAATACAC 


GACGGATAAT 


TCTGGGACTT 


ATCCGACAGC 


TAATTGGCAG 


7980 


CCCACAGGCA ATCAAAATGT 


GTTAAACCAT 


CAAGGGAATA 


AAGACGGTAG 


TGCACAATGG 


6040 


GACGGCCAAA 


CGAGTTGGAA 


TGGGGACCCT 


ACTAATCGCA 


CAAATTCTTA 


TATTGAGTAT 


8100 


GGCGGTACAG 


GAGACCAAGC 


CGATTATGCC 


ATCCGAAAAT 


ATGCTAGAGA AACAACAACA 


8160 


CCAGGGCTTT 


TTGATGTATA 


TCTTAATGTG 


CGTGGGAATG 


TTCAGAAAGA 


AATCACGCCA 


8220 
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TTGGATTTGG TCTTAGTCGT TGACTGGTCC GGTAGTATGA AT^SAAAACAA TCGGATTGGT 


8280 


GAAGTTCAAA AAGGAGTGAA CCGTTTT6T.T (SATACATTGG 


CAGATAGCGG 


TATTACCAAT 


8340 


AACATCAACA TGGGCTATGT 


TGGCTACTCA AGTGACGGTT 


ATAATAACAA 


CGCCATTCAA 


8400 


ATGGGGCCGT 


TTGATACAGT 


CAAAAATCCA ATTAAAAATA 


TTACGCCAAG 


TAGCACTAGA 


8460 


GGAGGAACTT 


TCACTCAAAA 


AGCATTAAGA GATGCTGGTG 


ATATGTTAGC 


AACGCCAAAT 


8520 


GGACATAAGA AAGTCATTGT ACTTTTAACG GATGGCGTCC CAACCTTCTC 


TTATAAAGTG 


8580 


AGTCGAGTTC AAACAGAGGC GGATGGTCGC TTTTACGGGA CACAATTTAC GAATCGACAA 


8640 


GATCAACCAG 


GTAGCACTTC 


TTATATCTCT GGTAGCTATA 


ATGCGCCAGA 


TCAAAACAAT 


8700 


ATCAATAAAC 


GGATTAACAG 


TACGTTTATC GCCACGATAG 


GTGAGGCAAT 


GGTCTTAAAA 


8760 


CAACGTGGGa 


TTGAAATACA 


TGGATTGGGC ATTCT^TTGC 


AAAGCGATCC 


ACGAGCTAAT 


8820 


TTATCTAAAC 


AACAAGTTGA 


AGATAAAATG CGTGAGATGG 


TGTCAGCCGA 


TGAAAATGGA 


8880 


GACCTTTATT 


ATGAATCCGC 


GGATTATGCA CCAGACATTT 


CTGATTATTT 


AGCGAAAAAA 


8940 


GCCGTTCAGA 


TTTCAGGAAC 


GGTTGTAAAC GGAAAAgTAG 


TTGATCCAAT 


TGCTGAACCT 


9000 


TTTAAATACG 


AGCCAAATAC 


ATTATCAATG AAAAGTGTGG 


GTCCTGTTCA 


GGTTCAAACA 


9060 


TTACCAGAAG 


TGTCGCT7WVC 


AGGCGCTACA ATTAATAGTA ATGAGATTTA 


TTTGGGTAAA 


9120 


GGGCAAGAAA 


TTCAAATTCA 


TTAtCAAGTA CGTATTCAAA 


CAGAGTCAGA 


AAACTTCAAA 


9180 


CCTGATTTTT 


GGTATCAAAT 


GAATGGTCGG ACAACGTTTC 


AGCCATTAGC 


CACGGCCCCT 


9240 


GAAAAAGTTG 


ATTTTGGGGT 


TCCTTCGGGA AAAGCACCTG 


GCGTGAAGTT 


AAACGTGAAA 


9300 


AAAATCTGGG 


AAGAGTATGA 


TCAAGACCCG ACAAGTCGGC 


CAGATAATGT 


GATTTATGAA 


9360 


ATTAGTAGAA AGCAAGT^C 


TGACACAGCC AACTGGCAAA 


CTGGGTATAT 


TAAATTATCA 


9420 


AAACCAGAAA ATGATACCAG 


CAATAGTTGG GAGCGCAAAA 


ATGTAACCCA 


ACTTTCCAAA 


9480 


ACCGCGGATG 


AAAGCTATCA AGAAGTTCTT GGGCTTCCCC 


AATACAACAA 


TCAAGGACAA 


9540 


GCTTTCAATT 


ATCAAACAAC 


CCGTGAATTA GCAGTTCCTG 


GTTACAGTCA AGAAAAAATC 


9600 


GACGATACTA 


CTTGGAAAAA 


CACGAAGCAG TTCAAGCCAT 


TAGATTTAAA 


AGTAATCAAA 


9660 


AATTCTTCCT 


CAGGTGAGAA 


AAACTTAGTG GGAGCCGTCT 


TTGAATTGAG 


TGGTAAAAAT 


9720 


GTTCAAACAA 


CATTAGTGGA 


CAATAAAGAT GGTAGCTATT 


CCTTGCCAAA 


AGATGTGCGC 


9780 


CTACAAAAAG 


GGGAACGCTA 


TACATTAACT GAAGTAAAAG 


CACCTGCAGG 


ACATGAGTTA 


9840 


GGCAAGAAAA 


CGACTTGGCA 


AATTGAGGTG AGTGAGCAAG 


GCAAAGTAAG 


CATCGATGGA 


9900 


CAAGAAGTGA CCACCACAAA 


TCAAGTTATT CCATTGGAAA 


TTGAAAATAA 


ATTTTCTTCT 


9960 


TTGCCAATCA GAATTAGAAA ATACACCATG CAAAATGGCA AACAAGTGAA 


CTTAGCAGAG 


10020 


GCGACTTTTG CGTTGCAAAG AAAAAATGCT gCAAGGAAGT TACCAAACTG TGGCAACTCA 


10080 


AAAAACAGAT 


ACTACAGGAT 


TGAGCTATTT TAAAATTAGT 


GAACCTGGTG 


AGTATCGAAT 


10140 


GGTGGAACAA TCAGGACCAT 


TAGGCTACGA CACTCTTGCT 


GGAAATTATG 


AATTTACTGT 


10200 
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TGATAAATAT 


GGGAAAATTC 


ACTATGCAGG CAAAAATATT 


GAAGAAAATG 


CGCCAGAATG 


10260 


GACACTGACA CATCAAAATA ATTTGAAACC TTTTGACTTA 


ACAGTTAATA 


AAAAAGCCGA 


10320 


TAATCAGACG 


CCACTTAAAG 


GAGCGAAATT CCGTTTAACA 


GGACCAGATA 


CGGATATTGA 


10380 


ATTACCAAAA 


GATGGCAAAG 


AAACGGATAC TTTTGTTTTT 


GAAAACTTAA AACCAGGGAA 


10440 


ATATGTTCTA ACAGAAACCT 


TTACGCCAGA AGGATATCAG 


GGGTTAAAAG 


AACCAATCGA 


10500 


ATTAATAATT 


CGTGAAGATG 


GTTCAGTCAC GATAGATGGG 


GAAAAAGTAG 


CAGATGTTTT 


10560 


AATTTCTGGA GAGAAGAATA ATCAAATTAC TTTAGACGTT ACGAACCAAG CAAAGGTTCC 


10620 


TTTACCTGAA ACTGGTGGCA TAGGACGCTT GTGGTTTTAC 


TTGATAGCGA 


TTAGTACATT 


10680 


CGTGATAGCG 


GGTGTTTATC 


TCTTTATTAG ACGACCAGAA 


GGGAGTGTGT 


AATCAATGAA 


10740 


AAACGCACGT 


TGGTTAAGTA 


TTTGCGTCAT GCTACTCGCT 


CTTTTCGGGT 


TTTCACAGCA 


10800 


AGCATTAGCA 


GAGGCATCGC 


AAGCAAGCGT TCAAGTTACG 


TTGCACAAAT 


TATTGTTCCC 


10860 


TGATGGTCAA 


TTACCAGAAC 


AGCAGCAAAA CACAGGGGAA 


GAGGGAACGC 


TGCTTCAAAA 


10920 


TTATCGGGGC 


TTAAATGACG 


TCACTTATCA AGTCTATGAT 


GTGACGGATC 


CGTTTTATCA 


10980 


GCTTCGTTCT 


GAAGGAAAAA 


CGGTCCAAGA GGCACAGCGT 


CAATTAGCAG 


AAACCGGTGC 


11040 


AACAAATAGA AAACCGATCG 


CAGAAGATAA AACACAGACA 


ATAAATGGAG 


AAGATGGAGT 


11100 


GGTTTCTTTT 


TCATTAGCTA 


GCAAAGATTC GCAGCAACGA 


GATAAAGCCT 


ATTTATTTGT 


11160 


TGAAGCGGAA 


GCACCA6AAG 


TGGTAAAGGA AAAAGCTAGC 


AACCTAGTAG 


TGATTTTGCC 


11220 


TGTTCAAGAT 


CCACAAGGGC 


AATCGTTAAC GCATATTCAT 


TTATATCCAA AAAATGAAGA 


11280 


AAATGCCTAT 


GACTTACCAC 


CACTTGAAAA AACGGTACTC 


GATAAGCAAC AAGGCTTTAA 


11340 


TCAAGGAGAG 


CACATTAACT 


ATCAGTTAAC GACTCAGATT 


CCAGCGAATA 


TTTTAGGATA 


11400 


TCAGGAATTC 


CGTTTGTCAG 


ATAAGGCGGA TACAACGTTG 


ACACTTTTAC 


CAGAATCAAT 


11460 


TGAGGTAA/U^ 


GTGGCTGGAA 


AAACAGTTAC TACAGGTTAC 


ACACTGACGA 


CGCAAAAGCA 


11520 


TGGATTTACG 


CTTGATTTTT 


CAATTAAAGA CTTACAAAAC 


TTTiSCAAATC AAACAATGAC 


11580 


TGTGTCGTAT 


CAAATGCGTT 


TAGAAAAGAC CGCTGAACCT 


GACACTGCGA TTAACAACGA 


11640 


AGGACAATTA 


GTCACGGACA 


AACATACCTT GACTAAAAGA 


GCCACAGTTC 


GTACAGGCGG 


11700 


CAAGTCTTTT 


GTCAAAGTTG 


ATAGTGAAAA TGCGAAAATC 


ACCTTGCCAG 


AGGCTGTTTT 


11760 


TATCGTCAAA 


AATCAAGCGG 


GGGAATACCT CAATGAAACA 


GCAAACGGGT 


ATCGTTGGCA 


11820 


AAAAGAAAAA 


GCATTAGCTA 


AAAAATTCAC GTCTAATCAA 


GCCGGTGAAT 


TTTCAGTTAA 


11880 


AGGrmTTAAA AGATGGCCAG 


TACTTCTTGG AAGAAATCTC 


TGCACCAAAA 


GGTTATCTTC 




TGAATCAAAC AGAAATTCCT 


TTTACGGTGG GAAAAAATTC 


TTATGCAACG 


AACGGACAAC 


12000 


GAACAGCACC GTTACATGTA ATCAATAAAA AAGTAAAAGA GTCAGGCTTC 


TTACCAAAAA 


12060 


CAAATGAAGA ACGTTCTATT 


TGGTTGACGA TTGCAGGCCT 


GCTAATCATT 


GGGATGGTAG 


12120 


TCATTTGGCT ATTTTATCAA AAACAA7U\AA GAGGAGAGAG AAAATGAAGC AATTAAAAAA 


12180 
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AGTTTGGTAC 


ACCGTTAGTA 


CCTTGTTACT 


AATTTTGCCA 


CTTTTCACAA 


GTGTATTAGG 


12240 


GACAACAACT 


GCATTTGCAG 


AAGAAAATGG 


GGAGAGCGCA 


CAGCTCGTGA 


TTCACAAAAA 


12300 


GAAAATGACG 


gatttacx:ag 


ATCCGCTTAT 


TCAAAATAGC 


GGGAAAGAAA 


TGAGCGAGTT 


12360 


TGATAAATAT 


CAAGGACTGG 


CAGATGTGAC 


GTTTAGTATT 


TATAACGTGA 


CGAACGAATT 


12420 


TTACGAGCAA CGAGCGGCAG GCGCAAGCGT 


TGATGCAGCT 


AAACAAGCTG 


TCCAAAGTTT 


1 2480 


AACTCCTGGG 


AAACCTGTTG 


CTCAAGGAAC 


CACCGATGCA AATGGGAATG 


TCACTGTTCA 


12540 


GTTACCTAAA AAACAAAATG GTAAAGATGC AGTGTATACC ATTAAAGAAG AACCAAAAGA 


12600 


GGGTGTAGTT 


GCTGCTACGA ATATGGTGGT 


GGCGTTCCCA 


GTTTACGAAA 


TGATCAAGCA 


12660 


AACAGATGGT 


TCCTATAAAT 


ATGGAACAGA 


AGAATTAGCG 


GTTGTTCATA 


TTTATCCTAA 


12720 


AAATGTGGTA 


GCCAATGATG 


GTAGTTTACA 


TGTGT^AAAAA 


GTAGGAACTG 


CTGAAAATGA 


12780 


AGGATTAAAT 


GGCGCAGAAT 


TTGTTATTTC 


TAAAAGCGAA 


GGCTCACCAG 


GCACAGTAAA 




ATATATCCAA 


GGAGTCAAAG 


ATGGATTATA 


TACATGGACA 


ACGGATAAAG 


AACAAGCAAA 


1 9Q00 


ACGCTTTATT 


ACTGGGAAAA 


GTTATGAAAT 


TGGCGAAAAT 


GATTTCACAG 


AAGCAGAGAA 


1 2Q60 


TGGAACGGGA 


GAATTAACAG 


TTAAAAATCT 


TGAGGTTGGT 


TCGTATATTT 


TAGAAGAAGT 


1 "^020 

X ^ V/ 


AAAAGCTCCA 


AATAATGCAG 


AATTAATTGA 


AAATCAAACA 


AAAACACCAT 


TTACAATTGA 


13080 


AGCAAACAAT 


CAAACACCTG 


TTGAAAAAAC 


AGTCAAAAAT 


GATACCTCTA 


AAGTTGATAA 


X X W 


AACAACACCA AGCTTAGATG 


GTAAAGATGT 


GGCAATTGGC 


GAAAAAATTA 


AATATCAAAT 


1 ^^200 


TTCTGTAAAT 


ATTCCATTGG 


GGATTGCAGA 


CAAAGAAGGC 


GACGCTAATA 


AATACGTCAA 


13260 


ATTCAATTTA 


GTTGATAAAC 


ATGATGCAGC 


CTTAACTTTT 


GATAACGTGA 


CTTCTGGAGA 


13320 


GTATGCTTAT 


GCGTTATATG 


ATGGGGATAC 


AGTGATTGCT 


CCTGAAAATT 


ATCAAGTGAC 


13380 


TGAACAAGCA 


AATGGCTTCA 


CTGTCGCCGT 


TAATCCAGCG 


TATATTCCTA 


CGCTAACACC 


13440 


AGGCGGCACA CTAAAATTCG 


TTTACTTTAT 


A T 1* 7171 1\ 

ItCATT f/VAAl 


GA/U^GCAG 


ATCCTACGAA 


13500 


AGGCTTTAAA AATGAGGCGA ATGTTGATAA 


CGGTCATACC 


GACGACCAAA CACCACCAAC 


13560 


TGTTGAAGTT 


GTGACAGGT6 


GGAAACGTTT 


CATTAAAGTC 


GATGGCGATG 


TGACAGCGAC 


13620 


ACAAGCCTTG 


GCGGGAGCTT 


CCTTTGTCGT 


CCGTGATCAA 


AACAGCGACA 


CAGCAAATTA 


13680 


TTTGAAAATC 


GATGAAACAA 


CGAAAGCAGC 


AACTTGGGTG 


AAAACAAAAG 


CTGAAGCAAC 


13740 


TACTTTTACA 


ACAACGGCTG 


ATGGATTAGT 


TGATATCACA 


GGGCTTAAAT 


ACGGTACCTA 


13800 


TTATTTAGAA 


GAAACTGTAG 


CTCCTGATGA 


TTATGTCTTG 


TTAACAAATC 


GGATTGAATT 


13860 


TGTGGTCAAT 


GAACAATCAT ATGGCACAAC 


AGAAAACCTA 


GTTtCACCAG 


AAAAAGTACC 


13920 


AAACAAACAC 


AAAGGTACCT 


TACCTTCAAC 


AGGTGGCAAA 


GGAATCTACG 


TTTACTTAGG 


13980 


AAGTGGCGCA 


- GTCTTGCTAC 


TTATTGCAGG 


AGTCTACTTT 


GCTAGACGTA GAAAAGAAAA 


14040 


TGCTTAATTT 


CTAGCATCAC 


CGAAGAAATT 


TTTAGAAAAA 


CAAAGAGCCT 


GGGCCAATCA 


14100 


CTGTCCCAGG 


CTCTCATGCT 


TTATTTTTAA 


GGAGGAAGCA 


ATGAAGTCAA AAAAGAAACG 


14160 
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TCGTATCATT GATGGTTTTA 


TGATTCTTTT ACTGATTATT 


GGAATAGGTG 


CATTTGCGTA 


14220 


TCCTTTTGTT AGCGATGCAT TAAATAACTA TCTGGATCAA CAAATTATCG CTCaTTATCA 


14280 


AGCAAAAGCA AGCCAAGAAA ACACCAAAGA AATGGCTGAA CTTCAAGAAA AAATGGAAAA 


14340 


GAAAAACCAA GAATTAGCGA AAAAAGGCAG CAATCCTGGA 


TTAGATCCTT 


TTTCTGAAAC 


14400 


GCAAAAAACA ACGAAAAAAC CAGACAAATC CTATTTTGAA AGTCATACGA TTGGTGTTTT 


14460 


aaPPATTPCA AAAATAAATG 


TCCGTTTACC AATTTTTGAT 


AAAACGAATG 


CATTGCTATT 


14520 


GGAAAAAGGA AGCTCCTTGT 


TAGAAGGAAC CTCCTATCCT 


ACAGGTGGTA 


CGAATACACA 


14580 


TGCGGTCATT TCAGGCCATC 


GTGGTCTCCC TCAAGCCAAA 


TTATTTACAG 


ATTTGCCAGA 


14640 


ATTAAAAAAA GGCGATGAAT 


TTTATATCGA AGTCAATGGG 


AAGACGCTTG 


CTTATCAAGT 


14700 


AGATCAAATA AAAACCGTTG 


AACCAACTGA TACAAAAGAT 


TTACACATTG 


AGTCTGGCCA 


14760 


AGATCTCGTC ACTTTATTAA 


CTTGCACACC GTATATGATA 


AACAGTCATC 


GGTTATTAGT 


14820 


TCGAGGACAT CGTATCCCAT 


ATCAACCAGA AAAAGCAGCA 


GCGGGGATGA 


AAAAAGTGGC 


14880 


ACAACAACAA AATTTACTAT 


TATGGACATT ACTTTTAATT 


GCCTGTGCGT 


TAATTATTAG 


X •» J •* w 


CGGCTTCATT ATCTGGTACA 


AGCGACGGAA AAAGACGACC 


AGAAAACCAA 


AGTAGTATGA 




CGAAAAGGCT AAACATACTA 


AAAAAAAGAG TAAAAAAATA 


GCTTTTCAAT 


TTTTAATCCT 


15060 


CCTTATCGTG CATAATTGAA CCAGAGAAAC AGAAGTATTA ACGAAATAAC TAAAAGAGCA 




AGCCCTGAAT AAAAAGCGAC 


AAAGGGCCAA TCAATCGACT 


GTTTAAATTC 


CTGCCAAGTT 


15180 


TGGATTTTTC TGTTTTTTTT 


CGCGCTATCC TCAAGCGTGA 


GTAAATAATT 


CAATAGTAAG 


15240 


AGGAGTAGCA ACACCGTGAA 


ATCATTTGTG GTAAAAAGCA 


CATGTAAAAA 


TAGAATGACA 


15300 


AAGACAACAC GGGATAACAC 


TCGATTCCGC AAAATTAAAA 


ATAACTTAGC 


ACGCATAATA 


15360 


AACCACCATT TCTTaTCAGA 


GATAATGAAT CTGTTTTTGT 


CTACTCTTTA 


GTTATATCAT 


15420 


AAAATTCTTA ATAATGAAAA AATGACTCGA GAAAATAATT 


GAAAAAAGTT 


TTTTTTCCTG 


15480 


AATCATTATT TTCGTAAATA 


AAGAATAAAC GTGTTACTCT 


TGGCTTATCA AATTTGGAAG 


15540 


GAGTGTTAAA AATGAAATAT 


CTGGATATTA TTGCTTTAAT 


TTTATTGATT 


GTCGGAGGTT 


15600 


TAAACTGGTT ATTA 








15614 


(2) INFORMATION FOR SEQ ID NO: 46: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3169 base pairs 

(B) TYPE: nucleic acid 

(C) STRT^DEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

TTCACAAAAA AATGAAAACA CAAATTTAAC TTGGAGGATG TTGAGTAGTG AGCAACGGGA 60 

AAAAGATTTT nATTAGTCAT AGTTCAAAGG ATcAAGAATA TGTGGATGCG TTTATCCAAT 120 

TATTGAAAAA GTTTGGGTTT CGAACACAGG ATATTTTTTA TAGTTCCACT ATTGAAACAG 180 
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GTGTACAGCC 


AGGAGAATTA ATTTTTGACA CGATTAAAAG 


GGAGCTAACC 


AATCAACCTG 


240 


TCATGCTGTA TTTTTTATCT 


GATCATTATT 


ATCAGAGTAT 


TCCATGTTTA AACGAAATGG 


300 


GTGCTTCTTG 


GATGCTATCA 


GATAAACACT 


ATCCAATTGC 


GTTAAATAAT 


TTTTCTATGA 


360 


AAGATATGAA 


AGGTGTTATT 


AGTAGTGAGC 


GTTTGGCGAT 


TGCGTTTAAT 


GATAAGACAA 


420 


GTACTAATGA 


AATAAACTGc 


TTATTAAAAA 


AGTTATCTCA 


TGACACAGAT 


GTACAAGCTG 


480 


AGCCAGATTT 


TGAACTAAAC 


GTTGAGAAAA 


ATATTCAGCC 


ATTCCAAAAT 


AAACTGACAC 


540 


AACTGATTCG 


GCAAGCAAGT 


TATTTAAAAC 


CTGATGAAAA 


GGGATATTTT 


GAAACGrTTC 


600 


tTAGCACTCA 


TCGTCCTGTG 


TATGGCACAG 


CAAAAGGGGT 


CTATGACTGT 


TTTAAATTAC 


660 


CTAGCTTAAT 


TGAGCCTAAA 


AGTTTAGGTT 


TAGACACGCT 


ATCAGAGGAT 


GAAAGCCATT 


720 


GGCTATTCTT 


TTTTCTTACG 


TGGGGAACTT 


TTCAAGAAGG 


CGAAAAAGTG 


CGGTTTAAAt 


780 


TGAAAAAGGa 


TAAGGcGTAT 


AACAATCGGG 


AATTCAGTGA 


TATTGGTAAA 


TGTAAAAATA 


840 


TTTACGTCTC 


TTATTTAGAG 


AAGGTAGAGT 


AATTGGAGGA 


GGTAATAAAT 


GAAAATGAAT 


900 


TGGcGTCAGT 


TGGTTGGTAC 


ATATCGTATT 


TCTGATAATG 


GTAAACAATA 


TCAAAATGAC 


960 


TGGTTAAATA 


AGCAAGTTGA 


TTTAGCCACA 


GAACTTCGTA 


AATCTTTTGA 


TAGTGACTAT 


1020 


CGTAGAGTAA 


TTAAAAGTGA 


TGCATTTaGA 


CGCTTwnCAa 


GAkAwAACAC cAAGTTTTyC 


1080 


CTTTwGrAnC GTAATGaTTT TgGTAAGAAC aCGTTTAACA CATAGCTTAG AAGTGGCAAT 


1140 


GCATAGCAGG 


GATATACTTA 


GATTAGTTAT 


TTCACAGTTG 


AACGAACGAA ACATTGAGGT 


1200 


TATGGAATTA 


TCAGAATGCT 


ATCGGTTGTT 


AGAAACAGCT 


GCATTAATCC 


ATGATATCGG 


1260 


CAATCCCCCT 


TTTGGACATT 


TTGGGGAAGA 


AGCCATTCGT 


ATTTGGTTTG 


AAAAAAATGG 


1320 


TCCTTTGTAT 


GCTTGTTGGT 


CTGATTTTTC 


GGAGCAACAA 


CAAAATGATT 


TTCTTAACTT 


1380 


TGAAGG7\AAC 


GCTCAAACCA 


TCCGACTTTT 


AACAAAACTG 


CATCATGATA 


ATGGGACTTC 


1440 


TAAA6CAGGA ATGAAACTAA 


CAGCCACGAC 


ATTAGATACA 


GTTATCAAAT 


ATACGGCATC 


1500 


TTCAGATCAA 


TTAAATAAGC 


AACAGCTTTT 


GTCTAAAAAA 


GTTGGCTATT 


TTTACTCGGA 


1560 


AAAACAAGTG 


TTTAACGCCA 


TTAAATTTGC 


GACAGGAACA 


CmAAATAAGC 


GACACCCATT 


1620 


AGTCtTTTTG 


TTAGAAGCTT 


CAGATGATAT 


CGCCTATACT 


TTTTCAGATA 


TCGAAGATGC 


1680 


CTAT7\ATTAT 


GGTTTATACA 


GCTATCaAGA 


TTTAAAGCaG 


TTTGTTGATG 


CGCAAACAGG 


1740 


AACAAAAAAA TTTTTAATGT 


TAGAAAAAGA TAAAACAGAA 


GAAGCTACCT 


TACAAGAATT 


1800 


TTTGAGAAAG ACACAAAGAG 


CTGTTTGTCA AAGCGTAGCG 


AAAAATTTTG 


CTAATCATTA 


1860 


TGACTTAATT ATGAGAGGTA 


TCTATCATAA TGAAATTATC ATTGAAGATT 


GTGATGAAGT 


1920 


GAAATGCTTT 


CAGGCATTAA AAAACTTTTC 


TAGAGTCTAT 


GTTTTTCAAA 


CAAAAACAAT 


1980 


TTTGGATCAA GAGGTATTGG 


GCTTCAATAT 


CATCAATCGT 


TTACTAGATG 


AATTTGTTCC 


2040 


AGTCGTACTG 


AAATATGAAA 


AAGTTTCCAT 


GAATAAGTAT 


GAGGAGCGAA 


TCTTTAATAA 


2100 


TATTTCGGAA AGCGCGAAAG 


CGTTGTATAG AAGAGAAGCA AAGAACGCTA 


CAGAAGCAGA 


2160 
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AAAAa^VCTAT 


TATCGTTTGA 


AAATGGCGGT 


GGACTTTGTA 


TGTAATATGA 


CGGATGGATA 


2220 


TGCCAAAAAA 


GTTTATGATA 


CATTATTCAC 


TTAAAATTTG 


AGAAAAAGCT 


TCTCAGCGGT 


2280 


TGCCGCCGCT 


GAAGAAAACT 


AGTTCATTGC 


ACAGGCTGAT 


CGTTTCAGTA 


AAAATTTTAA 


2340 


AGATGCTAGT 


AATTTCACTG 


AAAATCGGGT 


ACACTAAAAT 


TAGCTAGTGT 


AACTAGCAAA 


2400 


AAAGGCAACT 


GGAATAG?iTA 


GGTTGTGCTG 


TCTATGAAGG 


GCTTTGTTTG 


TCAGGTGGTG 


2460 


CTGACATGCA 


AAGTGCCTTT 


TTTCTTTTTC 


TCAACATTTT 


AAGGACAAAA 


AACGATTGCA 


2520 


ACATGTCACA 


ATCGCTAACC 


TCGGAGAAAA 


AGGAATTTTT 


CTCCAATGGC 


CATTTCGATA 


2580 


ACCTGTTGTG 


TGTTTCGAAA 


CATTCAGAAC 


GGTGGTGCGT 


TCTTTCAGTG 


CCTAATTATC 


2640 


CAATTCGTTT 


CTTGTTCCTC 


TATTTACAGT 


ATACTGAATG 


AAAGCCTATT 


CTGCAATGAT 


2700 


AGAATCAGGT 


TCCTGTGATT 


AAATAACATA 


AAAAATGGGC 


TCCTTATATA 


AAGAAAAATA 


2760 


ACAAAAGAAA 


TTTTGAATAT 


TGACGTTTTT 


TTTAGGTACG 


TTATGATAAG 


CGAGAGGTGA 


2820 


TTTTATGTTA 


AAAACGCAAA 


TTGTTGATTA 


TTTACGCAAT 


CAAACAGCTT 


TTTTTGAACC 


2880 


ATCTTTAGTC 


AGTGAAATTT 


TCACAGCAAG 


CAATATTGCA 


ACGACATTTT 


CTATCAAACG 


2940 


AAATACAGCT 


AGTCACTATT 


TAAATCAGTT 


AAACGAAGAA 


GGAATTCTAG 


TTAAGATCAA 


3000 


TACACGACCG 


GTTTACTTTT 


TTCATAAAgA 


AGCGTTTCAG 


CAACAAAACT 


ATCTTTTAAA 


3060 


AAGGAACTGT 


TTATCCAAGT 


TTCCAAGAAA 


TGATCGATGA 


ACAGCCTGTT 


TTTGGATCGC 


3120 


CAAAGnGGAn 


TTTTnTTCAG 


TCGGGTCATn 


GGGTATCGCG 


GGTAGTTTA 




3169 



(2) INFORMATION FOP SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17087 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

GTCTATACAA GTTCACAAGG AGGCGCCAAA CATGACCATT TTAGAATTAA CATTCGCAAC 60 

GAACCAACTA ACGACGACTT TTTTTAACCA TAAATTGGAA CCGCTTCACC AAAAAACATT 120 

CCCTTTTGAA GCCTTGACTA TCGAAGAAAC GGTTCCTGAA CTGTGCCAAT TGATTTTGGC 180 

AGAGGCGGCA CCGATTCTGG GGGGGATTGT GGCTGATTTC CCAGCGGACT TCTTTTCGCC 240 

AGAGGAAACG TTGGCGCTGG TTCCGCAAGC GGTGATTCAA GCATTCGCGG AACAGTTGAA 300 

CACGCCACTT TTAACGGCAG CCGAACAAGC GGCGGCACCG AATCTGCTAG AAGCATTTGA 360 

AGAAAAACGG cAGTATCTGG CTTGGCATTT GGACTATTAT GGCTACGTGC CTGGGAAAAA 420 

TGAATATGCG GTGGAGTCCC TGTTAACTGT CGGCAATGGC TTTTTAGGCT TGCGCGGCAC 480 

GACACCTGAG ATGACGATTT CTGATGATCA TTATCCAGCG ACCTACATTG CGGGGCTTTA 540 

TAATACAGCG GCTTCGGAAG TTGCGGGTCA AGTAGTGGAA AATGAAGATT TCGTCAATGC 600 
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TCCTGACAAC 


CAACACATCG 


CTTTAAAAAT 


TGGCGATGCC 


ACTGACTGGT 


TAACGATTTC 


660 


TCCAGACACG 


CTACAACAAT 


TACATCGTCA 


ATTGAACTTG 


AAAACGGGGT 


TGTTTGTCGC 


720 


AGAAATGATT 


TTGAAAGATG 


CGGACAATCA 


GCAAATTAAA 


TTAACGACGA 


AAAAAATCGC 


780 


CAATATGGCA 


CAGCCGAATG 


ACTATCATCT 


GCAATATACG 


TTTGAGCCGT 


TAAACTTTTC 


840 


AGCCCCGATT 


ACTTTGAAAA CCGTCACGGA TGGCAGTGTC 


TACAACTATA ACGTGGCGCG 


900 


GTATCGCAAC 


TTAACAGCGA 


AACATTTTCA 


GGTGACAGCA TT7UVGCGCGC 


AGGAAAACAA 


960 


AACCGTGATT 


GAGGTTTGTA 


CCAATCAATC 


CAACTTAAGC 


GTCCGAGAAA CGTCTTTGAT 


1020 


AACCGGTGAT 


TTTTTTGAAA 


AAGAAGCCAT 


TACGATTCAA GAAGAAGCGG 


AGAAAATCGC 


1080 


CCAAGTGGTG 


ACCGTGATGG 


CGCACCAAGG 


AACGCGCTAT 


ACGCTAGAAA AACAAGTCTT 


1140 


TGTACAAGCC 


AGCCACGCCG 


AACAAAGTTG 


GCAGGTACCG 


TTTACGCCGA AGGACTCTTT 


1200 


TGCGGCCGCC 


GCTCAGGAAA 


GCGCCCGCGC 


TTGGCAAACT 


CTTTGGCAAC 


AAGCCAACAT 


1260 


TACCGTGACA 


GGCGATTTGA 


TGTCTCAAAA 


ACTTTTACGG 


ATTCACAGTT 


ATCACTTGCT 


1320 


GGCTTCGGCT 


TCGCCTTTTA 


GTAACCAAGC 


GCAAGCATTG 


GACGTCTCGA 


TTACCGCCCG 


1380 


AGGCTTGCAT 


GGTGAAGCGT 


ATCGTGGACA 


TATTTTCTGG 


GATGAAATTT 


TTATTTTACC 


1440 


TTTTTATATT 


CAACATTATC 


CTGACACAGC 


GAAACAATTG 


TTGCTGTACC 


GCTACCACCG 


1500 


ATTAGAGAAG 


GCGAAGGAAA ATGCGGCGGC 


CAGCCAGTAT 


CGTGGCGCAA 


TGTATCCGTG 


1560 


GC AATCAGGG 


C6TGAT6GGC 


GTGAAACCAC 


GCAAAAACTT 


CACTTAAATC 


CGTTGAATGG 


1620 


ACATTGGGGC 


GAAGATCATA 


GTATTTTGCA ACGGCATGTT 


TCCTTGGCGA 


TTGCTTACAA 


1680 


TGCTTGGT TG 


TATTGGCATA 


GTACACAGGA 


TCATGAATTT 


ATGAAACAAT 


ATGGCGGCGA 


1740 


AATGTTGTTA 


GAAATCGCTC 


AATTTTGGAA 


CAGCGCTGCG 


ACGTTAGATG 


ACGCAACAGG 


1800 


ACGGTTCTTT 


ATTGATAAAG 


TGATGGGACC 


TGATGAGTTT 


CACGAGGGCT 


ATCCAGACCA 


1860 


AGCGGAAAGC 


GGCTTGAAAA 


ACAACGCCTA 


TACGAATTTG 


ATGGTGGTTT 


GGCTATTCGA 


1920 


AGAATTAACC 


AACATTTTGG 


CACTTTTCTC 


TGAGGAAGAA 


CAGGCACAAC 


TTTTTGCCAA 


1980 


AACGCAGACC 


ACTTCGGCTG 


ATTTAGCCCG 


CATGCAGCAA ATTCAAAATT 


CACTGGAAAT 


2040 


TGAAGTTAAT 


TCTGACGGTA 


TTATTGCGCA ATATGAAGGG 


TATTTCGGCT 


TAAAAGAGAT 


2100 


TGACTGGACA 


GCGATGAAAG 


AGAAATACGG 


CAATATTTAC 


CGCATGGATC 


GAATTTTAAA 


2160 


AGCCGAAGGA 


GAATCCCCCG 


ATGATTACAA 


AGTGGCGAAA 


CAAGCCGACA 


CGTTAATGTT 


2220 


GTTCTACAAC 


TTAGACAAAA 


CTCGGGTGGA 


TCAAATTTTA 


GAAGATTTAG 


GGTATCAATT 


2280 


GCCTGCGGAT 


TATTTGGAAA 


AAAATCTTTT 


GTATTATCTA 


AAACGGACGT 


CTCATGGTTC 


2340 


GACTTTGTCA 


CGAATTGTTC 


ATGCCCAATT 


GGCAGAAATG 


GCGCAATTCC 


ATGAGCTATC 


2400 


TTGGCAACTT 


TATCAAGAAG 


CGTTGTATTC 


CGATTATCGT 


GACATTCAAG 


GTGGCACGAC 


2460 


CGCTGAAGGG 


ATTCATACAG 


GGGTCATGGC 


TGCGACGATC 


CACGTTACTT 


TAGCGACGTA 


2520 


TGCAGGCGTT GACACGCGCC AGAACGAGTT ATCAATTTGT CCAAACTTAC CTGAACATTG 


2580 
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GCAAGCGCTG GCCTTCCAGT 


X XAX X WilW\ 


Alao^vi X urALrVS 


XAll^AAx X X A 




^ OH V 


AACCAGTGTC 


ACGATTACAG 


r*ar'aT arcana 
ViAiiA X AvsAU A 


r'ar^AfaaTTa 
WilJAUAAi XA 


X X uu X X ^«ALi\j 


/^Pf^PP TTP a T 
\3V>u^\3 X X uA X 


* vU 


TCCGTTAACC 


GCCGAACGAC 


r*zv ]\& iir*iv ivr»*p 
uAAAAvyAAux 


AlJAi lAIUAA 


Ta a a*^r* ar'^^r* 

X AAAvnaAvjLtVa 


a appapa a*pp 

AAwC AC AA X u 




ACGATTGGAT 


TTATTTTTGA 


m m m/* r» TV /* 'P 




ar*a r'p/^r^r' a a 
ALfAU^vjUCAA 


Al 1 XCAX lAi 




CAAGCTTGGA AAGCGTTGGC 


CGAT TCGCT A 


^•/■m jk fti^^^r^JV 

GGTATCCOCA 


■pvp^ IV TV TV n 

T xG ACGAAAC 


C I 1 CAACGAA 




A ^ a T T H IV H 21 


GPATTAGTPG 


GATGGATTCC 


TTG G ACCGT A 


TTTTAG X CCA 


rGGCCACCGC 






1 1 A^ X wwAva^ 


AGAAAAAGAA 


GCCTTAGCTC 


1V<*^1V TV IV IV IV IV IV 

AGCAAAAAAA 


TGAnCaCTAC 


3UUU 


GTTCAAl lAI 


xAoAA\^/l/\l X 


AACAACAGAA 


GATGTTTTAC 


CTGGTGTCGT 


GCCATTGCTA 


o A c rv 




AAuk^A^u X V^A 


CATTCCCTGC 


GCCGTGGCTT 


CGGCTTCAAA 


AAACG CGCCC 






AAAAAX XAuL) 


CGTGCGCGCG 


TAcTTCGCCA 


CAATTGTCGA 


TCCCGACTCG 


T 1 o rv 


•P'T'ATV/^'PATVA^ 


VjO AAAO^ X oA 


TCCTGAAATC 


TTTTTAGCCG 


CCGCTGACAG 


TATTGGCGTG 






a cncf* a TT^^r* 


CTTTGAAGAT 


GCACAATCAG 


GCATTGACGG 


CTTG7\AAGCC 


3300 




Ai uL-Uls 1 


CTTGTCGGCC 


AGCCAACCCT 


TAATCGGAGC 


CGACATGCAA 


3360 


V9 1 1 1 U 1 UAAA 


X la AV« X iaAA^ I 


AAGCGTTGAT 


GCACTGCTCA 


ATCGCTAAAG 


AATGACAAGA 


3420 


1 1 1 AvtUA 


/i/^ a ^2/^ r" a rT"P 
vyWivavjLfA^C i 


CTTTTCATCT 


ACATTGTATC 


TATGCTAGAA 


TGTAGATGAA 


3480 




TGPf2T fAAPA 


TACCTCTCTA 


GTGTAGAGCC 


GTTTGAGACG 


GTGACCAATT 


3540 






ACCGGTCAAA 


GTAGACGGTT 


ATTTTTTATT 


^m^^ mmik ^iv 

GTCATTACAC 


•JfiA A 

Jouu 




GATTAAAAAG 


GATTGAGTTG 


AACGTGGTTC 


TGTGGTAGAC 


TTTTATTAGA 






TGTTTf*TTGA 


TGATCAGGCA TCATAATAAC 


TCAATACTGA 


AATCGGAAGT 




7VATAGCCGAT 


AGTTGGCGAG 


CAACAGAAAA ACTCGATATA 


GAATTGGACG 


TAGAGCCAAT 




AGTTGGTGAG 


CAACAGAAAA 


ACCTGAATTA 


TATTTGAAGT 


TTAAAGAAGG 


CAAGCTAAGC 




TTGGCTTCTT 


TATTTATAGG 


AGAAACTAAT 


GATTGAGAAA 


AAAGATTTTG 


ACACAATTAG 




AAAGGCGTTC 


TTAATTTATG 


ACACCAACTT 


CAATAATCAT 


ATGTTCACTT ATACATATCA 




ACATCGGATUV 


ACAAAGAAAT 


TGTTAGAAAT 


GAACGTTCAA 


TTTGAAGCTG 


GGAATTTTAT 




GCATTTATGC 


GGAGTTGACT 


ATTATCAAAT 


AGATAAACAA 


gGGAGCTGTA AATTTCGTTT 


4 I/O u 


TAAGGCTACT 


CAATTTTACA 


GAGCTTTAAA 


AAATAATAAA 


GTTTCACTGC 


GCGGGATTAA 




ACCTAAAGAT 


GATGGAACCA 


CAGGCCAAAA ATTACAAGTA 


ATTCCTCTAT 


TAGAAATGCT 




TATTTCTCCA 


GGTGTACGAA 


TATGCGATGG AGGAAAATTT 


TATAATTTAC 


AGTATGAAAA 




AGCAATTCGT 


TCAGGCAAAA 


TGATTGTGGC 


GTTAACATGT 


AAAGAGAATA ACAAAAAGTA 


4320 


CGTTCCy^CAA 


TCTTTATTAT 


CGTTAATAAA 


TCAGCCACGA 


AAGAGCCAAT 


CGAAATCTTT 


4380 


AACTGAATCC 


CATGAAGTAA 


TTAAGATAAG 


TAAGTCTGAG 


CTGAATAGCA 


CCTCAGTAAT 


4440 


TGAAGTTTAC 


GATAAATTTT 


AACAAAAAAG CGGGCTAAAC 


AGACTATAGG 


CAAAGAGAAA 


4500 


AAGCCCAGAG 


GTTTTTCTCC 


TCTTTAACAG 


CTTTTAGGAt 


TTCACGGATT 


AACGCTAAAT 


4560 
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TTATTCGTAC TTTTCGGGAA AATGCGCTAT AATGAAAATG AAAAGAGAGA 
CATACCTCTC TGATGTAGAG CCGTTTAAGA CGGTGACTTT TTGAATTATT 
CGTACTCGGT CAAAGTTGAC GGTTATTTTT TATTGTCATT TTTAACAGCT 
CAAGGACCAA TACTTTACTA TTAGCAACGA AGTTCACGGT CCTTTGTGCT 
TTCATTTCTT TTGCTTATTG TTTAAATCCT ACTAACGGAA AAATGAAATC 
ATCCCCGCAT AAAAACATAG GGCAAACATC AAAATCTTAA TTAAGATTTT 
CGTTCAGTTA TCTTTT'iCAT TTGTATACTC CTTTCCaATA TTTTTAAAGC 
TTAATCTTTT TGACATCTAT ATTTTATCTA ATTATAATAA AACTCACAAT 
TCATTTTTTC ATTTCCTTTT TATTCCTTGA TTAGTAGTTC AATTTGTAAA 
TGACAAAATA GAAAAGGAGG TTGTTATAGT ATCGAAACAA AATTATCTAC 
ACACTTCTTT ATCTCCTGTT CACTTTTTTG ACAGCCCTAC CCAGCATAGA 
GGAATGAGCT TAATAGCTAT AAAAAATAAC GAAAAAGCCT TGATTATCTT 
CTATCTTTAA TTATTATTAT CGTGCTGTCA GTTTTCTATA AAAAAATAGT 
AAGCAAGCTA TCTTTAAAAG TCTATTACTA GCTATTGGCA TGTTCTTATT 
CTAAATTTAC TTTTCTCACG CATTCCTACT CCAGAAAATC AACATTATAT 
CGAAATGTAG CACCTGTTTT AATGGCCCTT CATACTAGAC TCTTTTCACC 
GAACTCCTGA CTCGTGGCAT TTTTATGAAC TTATTTTTTA TAAAAAACaC 
GTCTTATGTA AAATATTTTT TTCTGGTCTT TTTTTTTGGT CTTAGCCATG 
ATCAATTTCC ATGCTTTTTT ATTGTGGAAT GGGCTGGGTT TTAGCCTCTA 
CTGTAATAAC AATCTCACGT ATCCTCTTGC TATTCATTTG CTATTAAATA 
CACAAAAAaC CCGCCCACAT CAGAAAAATG ATGTGGGCGG GTTTTATAAT 
TkGTTTCAAA TAACGGACTA ACTGGTTTGT TTTCGTGGAT ACGTTTAATC 
TTAATTCACC AACGCTGACT TCGTCAATTT TaACAATTTT ACGATCATCA 
TAGAGTCTGT TACAACTAAG CGTTCAATTG CTGAATCTTC AATACGTTGT 
CTGATAAAAC GGGATGTGTA CAAGAAGCGT AAACGTCTTT TGCTCCGGCT 
CGTTTGCAGC AAGAGAAATT GTCCCCGCTG TGTCAATCAT GTCATCAATT 
TTTTGCCTTC CACATGACCA ATGATATTCA TTACTTCTGC TACGTTCGCT 
GTTTGTCAAT GATCGCAATT GGTGCTTTTA AGAATTCTGC CAATTTACGC 
CACCACCATG GTCAGGGGAA ACAACAACGA CGTCATCGCC TTGAATGCCG 
AGTAATCGGC AATTAATGGT GCACCCATCA AATGATCCAC TGGAATATCA 
GAATTTGGAC CGCATGTAAG TCCAATGTCA ACATTCTTGT TGCACCAGCT 
TGTTTGCAAC AAGTTTAGCA GTGATTGGTT CCCGGGCACG TGCTTTACGG 
CATACCCATA ATAAGGCATA ACCACGTTAA TTGTTTTAGC ACTCGCACGT 



TATGCTTCAA 
TAAAAATAAC 
TTTTAGTTTC 
TGTCTCCACG 
AACGAATTTT 
GACAATTGGT 
TCCAACCTTT 
AGAGGAAGCA 
CATATCATTC 
AT^AGAAAAAA 
AGCATTAACC 
TGATAGTGTA 
AAAGAAAAAA 
TTCTATTATT 
TAATGAAGTC 
TATAATAGAA 
CTCTCGAAGT 
GACTATTACC 
CCTATTATTT 
ATTTATAATA 
TATTTATTTT 
GCATCTCCGA 
GATAGGTAAA 
AAGGCTGGTC 
TCTTTCAATG 
AAGACACATG 
TTTGGACGAC 
GCACGCGTTA 
TGTTCGATGA 
AAGAATCCTT 
TTTTCAATCA 
TCTTGACGTG 
TTTAACGCAT 



4 620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 
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CAATCATGAT 


TAATAGTTCC 


ATTAAGTTAT 


CGTTAACAGG 


GCTACTTGTT 


GATTGAATGA 


6600 


CGTAAACATG. .TGCACCACGA 


ATACTTTCTT 


CAATATTCAC 


TTGAATTTCG 


CCATCACTAA 


6660 


ATTGTGTTAC 


AGAGCACTTT 


CCTAGCTCCA 


CACCCACAGC 


ATCCGCAATT 


TTTTCTGCTA 


6720 


AGGGACGATT 


AGAGTTCAAA 


GAAAAGATTT 


TCAATCTTGG 


ATCAAAATAA 


TGTTTCGACA 


6780 


TTAGAACCTC 


CACTTTCTAT 


TTCAGCCAAT 


TCGTTAAAAC 


TGaACAATAC 


TTnCCTTCTA 


6840 


AAATAATAGT 


AGTCCTAGCC 


niAATAAATCa 


aGAGaTTTTC 


tGtATTTCAT 


ACmAaGATAT 


6900 


TCmTcAAAAG 


tCGcTTTCTT 


CTAACTATTT 


TATCGTATTT 


GTCTAAAAAA 


TGAGGATTAT 


6960 


TTGAATAAAG 


AAAGGTACTC 


GCCATAGCCT 


TCTTTTTCTA 


AATCTGCCAC 


TGGCACAAAT 


7020 


CTCAAAGCTG 


CCGCATTAAT 


GCAATAACGT 


AAGCCACCTT 


CTTGGAGTGG 


GCCATCAGTG 


7080 


AAAACATGCC 


CTAAATGTGA 


ATCCGCTTCT 


TGACTGCGAA 


CTTCTACTCG 


GTGCATGCCG 


7140 


TGACTAAAAT 


CAGCTTTTTC 


TTTGACGCCA 


CGTT TTTCAA 


TTGGT T TGGT 

X X A X A, X^ A 


AAAGGATGGC 


7200 


CAGCCGCAAC 


CAGCATCGTA 


TTTGTCCAGG 


GAGCTAAACA 


ACGGTTCGCC 


ACTAACAATG 


7260 


TCTACATAGA 


TTCCGTCTTG 


GTAAAAGTCA 


TCATATTCTC 


CTGAAAAAGG 


GCGTTCTGTT 


7320 


GCGTTTTCTT 


GTGTGACGGC 


ATATTGAAGA 


TCCGTCAAGG 


TTTGTTTTAA 


TTCTTCTTCT 


7380 


GTTGGTTTTG 


TCATTCTGTC 


CACTCCTTTT 


CTTCTGTTCA 


ATTTATTGGG 


TTCAATGATA 


7440 


CCACGTTTGC 


CAAACAGGCA 


CCTaATAGTT 


TGCATAGTAA 

X X sX\^ X X^aX 


AAGACAGTCC 


TTTCTCTTTT 


7500 


TGACGAATGA 


AAAGGACTGT 


fPTTT ATT 
w X JL X X r& X X n w 


TTGAATTCTT 

X X X X %^ X X 


TTAACAGTCG 


AGCTACTTTG 


7560 


GCAATTGGTA 


AGCCCATAAT 


GGCGTT^TAA 


TCGCCAGCGA 


TTGCTTCAAT 


TAGTAATGCT 


7620 


CCTTGTCCTT 


GGATACCATA 


AGCGCCTGCT 


TTGTCGGCAT 


ATTCTGCGGT 


GTCTAAGTAA 


7680 


GCATGGATTT 


CCGTGTCTGT 


CAGTGGATAA 


AACGTCACGG 


TTGAATGAAC TGTCGCGGAg 


7740 


cGCTcTTTTT 


CCCCTTGTTT 


CAACGTCACA 


CTTGTATACA 


CGTCATGTGT 


TTTGCCACTT 


7800 


AGGAGGCGCA 


ACATGCGATA 


ACCGTCTTCC 


CGAGAAGTGG 


GTTTGCCGAG 


GATCTCTCCC 


7860 


GCCAATGCCA 


CAATTGTATC 


GCAGCCAATG 


ACTAGTGCTT 


CGGGGGATTG 


CTCTGCTATA 


7920 


GCCGCTGCTT 


TTTGGGCAGC 


CATTTGCGCC 


ACATATTCTG 


CTGGTAAgCC GTCTTTGCCA 


7980 


ACAGTTTCAT 


CGATGTCGGC 


TGGTGCAATC 


GTA/VAAGTTG 


GAACCACACG 


CTTTAATAAC 


8040 


TCCTGACGGC 


GAGGAGATTG 


GGATGCTAAA 


ATAATTTGCA 


TAAATTGATG 


ACGCCTCGTT 


8100 


TCTTTTTAAT 


GGGGGTCTTG 


AATGCGTTTA 


AACATTCGTT 


CCATATCTGA 


ATTCGTAAAA 


8160 


TGAATCAGCA 


CCGGACGTCC 


ATGCGGACAA 


TTAAACGGAT 


TTTCGCACAA 


GGCTAAATCT 


8220 


TTTAATAGCA 


CGCGTGCTTG 


TTGCTCATTG 


AGGTAATGAT 


TGGCTTTGAT 


GGATCGCTTA 




CAGCTCATCA 


TAATGGCAGT 


TGCTTCACGG 


AATTTTTTCA 


CGCTGACCGA GCCAGTTGTT 


8340 


AGCAACATGT 


CAATCATCTC 


GCGAACAATG 


CTTTCCTCTT 


CGCCTGCAGG 


ATACCAAGTT 


8400 


GGGTGCGCCC GCACAATGAA 


ACTATTTTGT 


CCAAAGTCTT 


CCAAATGGAT 


GCCAACTTCC 


8460 


GCAAGGGTTT 


CTTTCTGCTC 


TTTAATTTTC 


AAGGCATCGC 


TATTTGGATA ATCAATAACG 


8520 



WO98/S0SS5 PCT/US98/08985 

434 



ATTGGTACCA 


ATAGTTCTTG 


TAAATCATCA 


CTGACTTCGC 


CAATTTTTTC 


ACGGAAGTAC 


8580 


TCGTATTTAA 


TTCGTTCTTG 


GGCAGCGTGT 


TGGTCGATAA. 

XAJ^J X WW«X A^aCM. 


TAAACAGGCC 


ATCTTTGCTT 

aa A AAA A A 


8640 


TGTGCGAATA 


AATAAGTTCC 


GTGCATTTGA 


CCGAAGTATT 


CTAGCTCAGG 


AAAACGCTCT 


8700 


GTCGGTCGTT 


CTTCTTCCAA 


CTTGTTCAGG 

W X X W X * %««^ww 


GCTTGTTTCA 

X X W XXX 


GTTCTCTTTG 


ACTCGCAGCG 


8760 


PTATTGGCAT 


CGAATTCTGG 


GTGAAGTTTC 

\j X XXX v« 


ATTTCTTCTT 

<^X X X W X X X« X A 


GGAAAACCTC 

^7^Ja«aaaaCa%^%^ A 


GTCAGCACPP 


o w 


TCTGATGAAC 


GCAGAGAAGG 


TTCTGAATCA 


GGCACTTCAG 


TTATTTTCGT 


TTCTTCGGCT 


8B80 
woo V/ 


ATTArCTGTT 


GCTCATTACT 


AGCTGTTTCC 


CAATGATTTG 

Voaark X XXX W 


CGGAATCAGC 


AGCTGCCGAA 


8<)40 

O 7^ W 


GAGGTTTPGT 


AAGTTGArGT 


TTr'TTPf Gf*T 

X X V« X X X 


Ar!ArTrGTTG 


ATTCTGGCAC 


TGGTGGrCPA 


?WM w 


AATGGATPGT 

nn X U\3n X X 


PfATTGGGTP 


X X ^ X X 


TPTGTPTGGT 

X V« X U X ^ X \3U X 


TGAPAGPTTP 


AGGGGPGPTT 




\31/aV70V# X \3\3 X \3 


X X X \A3 X ^ X 1 


CfZT' TXTf^TTCr' 
v^oV^Al^x Xv«l^ 


r*r*a T a a a nr* 


X V3 X X 


rz^^caTTaTap 


w 


r'TT a a firTr: r* 

v^i lr\noV^xV3^ 


\^\^\a\3X\,\3X XKJ 


TTTTarT'cr' a 

X 1 i 1 AL>v^oV>/\ 


X X V3 X 1 w^o 


TPr^aTaaann 


a aTTTPpaTT* 

i\i\X X XV^^iiX X 


Q1 An 


T 1* T fTZ/Z/^T T 
1 o X X L^^tV:!^ i X 


TTxezfZTTr^Tnn 

Xrx\j\jX JLk^xou 




TTTTTaaaar* 

X X X X XruxnriU 


/tpaaaTTrsTP 


TPPPPPaTTP 


n 




O 1 X k« X X 1 


paar'ar'TTr'T 


X oixH X ^VsV^V^ X 


/^TTPaaTTaa 


TfZPPaTTaaa 




TCXTTTTCTT 


T a HTT a a a r*<^ 


X 1 X 1 V7V^ 


X X r\V3 X X 


i^i^apaT'P<^ap 

X X o 


aTPPaPTaap 


^ J DU 




X X X\.rf«rinX XXV* 


TAar*Ar*acpA 

X /irl\.«nV.*rVl9Vi*rl 


aTTnnaAAar* 


\sy^\^\*\^n\f\^ft\x 


Tar:TTTPf:aa 

1 AVj XXX k^VSAA 






PTAfAATTGr* 


X X X AX X Xnti\^ 


GPfiAAATTTT 

\9wl3fV\nX XXX 


TAATATAGPG 

Xnt\Xt\Xt\\3\^\3 


APPATTGATA 
A^V^AX XOAXA 


74 O W 


ATPGTTGATA 


AATAGTTGPG 


X uw«v*^o X 




GPAATf^aGAP 


ATAPPPTGTP 


7«I4 w 


AAAGTAAAAT 


\«nn\9%3xv^x X X 


TPPTTCAATT 


TTPAAPATTT 
X X \^t\n\^t\ X X X 


XV^X X XUWrOOX 


GPTAATGPPA 

VJ\Mr X AAX l7^VrA 


7 Www 


TAAATCCCCG 


CAATCGTTTG 
^■nn X ^ w X X X u 


TTTTAAATPA 
XXX xnnnx^n 


PPATTGPPTG 


TTGTAPTCAT 

X X O X Aw X WA X 


P ATT TTGTGG 

\^M\X X X XOXUw 


9660 


CCATCATGAA 


CTAAGnGAAA 


A Gr* A AOTTT A 


GGGTCGPTTA 


APGPPAGAPG 


ATTGACAATG 

t\X X wAW/M^X w 


9720 


TCACCGATAT 


X W^^/W%X X v« 


v«ox X xwnxVaf 


\3 X X X X unv^u X 


ATTTPAAAPG 

AX X XVrAr^%V«V3 


ACPGGGPGTA 


^ ' w w 


TTATAAAATA 


AATTAGAAAC 


GGTPATTTTT 

wUX VriXlX X X X X 


GTGPPTTTAP 


GGAGCGPTGP 


TGGTPGGTTT 

X X V« wVI XXX 


9840 


TPTTf*GAPr*T 

X V« X X X 


X w\«\<rX«W^ XXX 


TAGAATCATA 


TAGPTGPPTT 
X X OwV« X X 


PTTPTTPPTP 

Vrf X X ^ X X X 


AGPTGTCGPA 


9900 


GTTTCGACAA 


TCATTTCAGA 


AACTGAGGCA 


ATACTTGGtA 

t^X t^K^ X X 


AnGcTTCGCC 


TCGGAACCCC 


9960 


AAGCTGCGAA 


TTCGGAACAA 


ATCATCTCTT 


GTATGAATTT 

%7 A aa A ^JU^aA A A A 


TGCTTGTGGC 


ATGGCGCTTG 


10020 


AAGGCATTCA 


GTAC!ATC!TTC 


TTTGGCAATT 


CCTTCACCAT 


TATCAATAAT 


TTGAATGGTT 

X X \^nT\ X X X 


10080 


TTTAAGCCAG 


CTTCTTCAAT 


AAAAATATCA 


ATTTGAGTGC 


TTCCTGCATC 


CAGTGCATTT 


10140 


TCCACCAATT 


CTTTGACGAC 


AGAAGCGGGA 


CGTTCAACAA 


CTTCACCAGC 


AGCAATCTGG 


10200 


TTGGCTAGTT 


GTTCGGATAG 


TTCTTGAATT 


TTCCCCATTT 


CACGACCTCC 


TAACCGGTTC 


10260 


GGTTCCTAAT 


TAGATACGTT 


TTTGCAATTG 


ATGAAGCATG 


TTTAACGCAT 


CTAGTGGTGT 


10320 


CATTTCTAAT • AGATTCATTT 


TCTTCAATGT 


GTCAATGACA 


CTTAATTCTT 


CTGTTGAAAC 


10380 


TTCTTTAAAT 


AATGAAAGTT 


GCTCGGTTTC 


TTCATGTACT 


TCTGAAACTT 


CTTCATGATG 


10440 


AACTGAAATT 


GGAATCGTTG 


TTTCTTCCGC 


TTCTAAAGCG 


GATAAAATGG 


TGGCTGCTCG 


10500 
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TTCTAAAAGC 


GGGCTTGGTA 


AACCAGCGAT 


TTTAGPAAPA 


TKTATPPPAT 

X Ax ^V«^#AX 


AfiPTPTTATP 

AO^ X \^ X X AX ^ 


X.U30U 


AGCAGGTCCT 


TCCATCATTT 


TATGAAGAAA 


GACCACTTPC 


PPaTPTTTTT 

\»\^t\ X V« X X X X X 


PfiAPAHPRPP 


.1 tJD^'*' 


CACGTGAATA 


TTCTTCAAGC 


CTTTTAAGGT 

w X X Jk X X 


TTCATPTAGA 
X xwnxv^xnun 


APAf3TTACTT 


PATnATAATP 
W^XVsAXAAXo 




T AH AC AAT 


ACCGTTTTGG 


PTTGTAPTTP 

\^ X X U X tl\» X x\^ 


APr^&Tf^l^aTa 

AVitVJA X \30n X A 


TATTPaaT/ia 

XAX XV«AAXl3A 


T/IPPT TOOO 


1 AT Jl rt 

lU /4U 


TUCPfSPP ATT 


rPATCATAGG 


X X \9V(\.» w X 


RprsTPPTaaT 

OV^la X X AA X 


TPaTPaaaTa 

X V#A X \.nAAA X A 


a a a T I* a o/T"p 
AAAX X AGGLx 


lOoUO 






flTf^PTTfZ/ZTT 


uu\«X X\«v,*AXl^ 


aTTTPTappa 

AX X X C X Al.^^ A 


T a a a oon» a on* 
X AAAGG r A(^T 


10860 




aTTa/iaTpaT 

Al XA\3A1^«AX 


pa na /^rspppp 


a a TPPPTPT a 
AAX V^CGiGI A 


aaaaTTTPaT 
AAAAX X XGAX 


f*f* a TV a TV fi>n*oo 
GGAAAATTGG 


10920 


V^X X XV^AVtWI 


^X X Xv«V^u«. Lo 


fZTapaaaapa 

l3 X A\m> AAAAI.«A 


rr^r* a a*PT»nr*r" 
uUV«AA 1X1 iaK, 


GL>uA xGACuA 


CGGTwiAAGC 


10980 




aTPTapr'Tar* 

AX o X Also X AU 


1 X i iul^L. IbA 


a ^•p 'nn* ^ 'p 
UAibrTTGbi 


a oi* a a n»o TV 

UtAGTAATCA 


T\ /"•7k O TV TV 1*TV n*/^ 

AGAGAATATC 


11040 


X V9 X X X L> 1 


X XUAXVai^lsVjA 


IAUXV9X XlUl? 


AATG X ATTCT 


rT>/^ TV tfr^r^r> fornix 

1 GAxGCCL xA 


Tk TV Ik /^mmmfnfny** 

AAACTTTTTC 


11100 


U/\^oA^ X Qs^TVi 


loXCVjiUd 1 


CuAUAATCGC 


CAAATTTTTC 


GTGTTACTTC 


TTAACGTTGG 


11160 


1 LCxAACU 1 A 1 


1 (aAl ACCGTT 


CACTAATTGT 


TGCAAAaGAT 


TGTAAAACAT 


CAACCGCACT 


11220 


An. 1 X 1 i X A 


oC^AolVjX X I 


b L.AAC,CCjATC 


AATGTTCGTT 


TTCACTTGTT 


CTCTCACGGC 


11280 




ioAlAx id A 


Li rXLAACTGA 


TTTTTCTTCG 


GCTTCTAAGA 


TTA7UVCGTTC 


11340 


X AAl 1 i 1 


A(jC X L X (tItUO 


TAaTAAAGCG 


CtCCGCATTA 


oofnvk Ik fi*/ »iiiinm 

GCTAATGTTT 


GctTACGCTC 


11400 




ru 1 1 L> i AAAT 


TCGCTAAATT 


TGaCTTaGTG 


ATTTCAATGA 


IV Tk m Ik Ik Ik Ik Ik 

AATAACCAAA 


11460 




1 AviCuAATUi 


TCAAGTTTTT 


AATCCCTGTT 


TCTTGGCGTT 


CTTTCGCTTC 


11520 


m a a r*Tpr*/ir*'p 


a a f ^ a TTrj^i* 
AAUUAi L\j\/k 


TA^CaTTGCG 


CaTGGCaTCA 


f^f^r^rn 7k 7k rn 

CGGTATTCaT 


CTAATTGGTC 


11580 


fiT T a T a fippp 


iUX 1 lAAlAA 


CCTTTCCTTC 


AGTAATTTGT 


7\ 7\ 7v r^r^f^r^tnrn 

AAAGGGGCTT 


CTTCATTAAT 


11640 


X V^ULpA 


a Tr*a a TfiT'/^a 
A X UAA 1 vtUViZA 


CI AAA! CTTC 


AACAGGGTTT 


AAGTCGACTA 


7k ^ Tk TV TV TV 

ACAAATCATC 


11700 


L#L#AX Xl,»V^V^X 


TP aTTP aTPo 


r*r*a/^a a»i»»pa a 
v^lsAUAATTAA 


Ul GTC^GAATC 


GTTGGTAUTT 


#^mm/^mTk TV Tk OTV 

GTTCTAAAGA 


11760 


V>l9 X V,*L»V7 X AAX 


TPaaTTaaaT 
il?AAX lAAAi 


WiCoCCCAx x 




AATGCGAGGG 


GTCCTGCTAA 


11820 


rrcrTPTAAA 


TPATaaapcT 

X v«nX /innlrfU X 


TPf^TPa aT^2p 

X X ^^AA X uL* 


T/ir^'pr'pa a a 
XuOX lisV-AAA 


X U WAAijd? X X 


o a a a a n* a /v^o 
UAAAAxAGGC 


XlooU 


ATTTAATAAG 


CATTCPAPPA 


TTTPTT/I aP^ 
X X X\.«X X\3A^V9 


pfiPPTTaaTT 

^O^^X XAAX X 


T^iaPflTTPPT 
X oACo X X Wv. X 


pa aTPaf'P^io 

GAA X L^GCGG 


1 1 A 


ACGATCCAAC 


CATTGTTTTA 

X«/IX XUX X X x^\ 


ATAAAPf^nPP 


flPPPATA^^PT 


fITTTTPfiTt- T 
V3 X X X X X L. X 


paTPTa aTap 

v«A X L. X AA X AU 




CCACAATAGA 


GTPPPTTTTT 

V3 X X X X X X 


TPmrppTnT 

X \^ X \3 X X 0 1 


TPfZAATPfJAP 


Tf^APTPAATT 

XoA^X^.^AAl X 


PTaaaTTPaa 

V^l AAAX XGAA 




TTTAGAATAA 


TGATCCATTT 


TP AAfi AAATf^ 


rix X kjvjv^ X or\ 


TATTPPAPAH 

XAX X^\^A^AW 


PTTTPT^aaT 
K« X X Xt^XuAAX 


i.^ xZU 


ATGGGACAAG 


CTCPCPTTTT 

V> X v«V«V3V^ X X X X 


fii^TAAPr^nA 
kji\3\j x/vu^u\9!r\ 


TAAATAAfiAT 


AATAATTTHP 

AAXAAX X XO\^ 


PTnT a aTTTr* 

V^XvjXAAX X Xv^ 


XZ lOU 


AATTTCTAAT 


GGATTAATCA 


GTTCATTGGT 


TAAAAAACTA 


AACTCGGCAT 


TTTCTTCCAC 


12240 


CGTTTCTTGT 


GTCGAAAAGA 


TGATATTTAA 


ACGTAAACTT 


AAGTTCTCTT 


TTAACGATTC 


12300 


AGGGATCCCT 


GAACCTAGGA 


CCATTTCTTT 


TGTTTGCAAC 


GCAGAAGCTT 


CGTTCAAAAC 


12360 


GCCCTCTTCA 


TCAGCCAAGA 


CTGCTGTTTT 


TAATTCGCCT 


GTACTCAAAT 


CTACATAGGC 


12420 


AAAACCGAAC 


TGATTACCAT 


CTGTTAGCAC 


AGCGGTTAAG 


TAATTATTGT 


CTTTGGCTTC 


12480 
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CAAGCCTTTA 


CTATTCATCA 


CCGTTCCTGG 


CGTTACCAAC 


TGGACCACTT 


CTCGTTTGAC 


12540 


CATGCCCTTG 


GTAGTCTTTG 


GATCTTCCAC 


TTGCTCACAA 


ATCGCTACTT 


TATAGCCTTG 


12600 


CTCAATCAAC 


GTATCAATAT 


AGCCTTGCGC 


GGCATGATAA 


GGAACGCCAC 


ACATAGGAAT 


12660 


CGGATCATCC 


GCATTGCGGT 


TACGACTGGT 


TAACGTCAAT 


TCTAAGATTT 


GCGCTGCATT 


12720 


GATAGCATCT 


TCATAAAATA 


ATTCATAAAA 


ATCACCTAAG 


CGATAAAATA 


AAAACGCATC 


12780 


CTTGTATTGG 


GCCTTAATGG 


ATAAATATTG 


TTCCATCATG 


GGTGTATTTT 


TTGTCTTTTG 


12840 


TGGCATTTAA 


TTCCCTTCCT 


TCTCTAGCTC 


TTCGTTCACG 


CGGTATTGAA 


TGAGCGCCGT 


12900 


GACATGTTGG 


ACTAAGTCAT 


TGGCTTCGAT 


CAGTTGTTCG 


CGGTAGGCTA 


CAACGAGTGG 


12960 


GTGCTCATCA 


AATTCTTTTG 


TTTTCGCATC 


GGCTCGTTGA 


ATGGCTTCTT 


TTTCAGCAGT 


13020 


TGGTTTGCCA 


TAGTGAGCAA 


ATTGAACCGC 


ATCTTTTTGC 


GCAGCTTTAA 


TTTCTTCCAC 


13080 


TAATTCTGTC 


AGCTTTTTGT 


TTTTCTTTAC 


CTTTTCTTCA 


ATCGCTTGGT 


AGCGCTGAAT 


13140 


GACTTCATCT 


TGTCCAATTA 


AATGTAACAA 


TTTTTCTAAT 


TCTTTATTAC 


TTTGTGCATC 


13200 


TAAATTCAAC 


GGCTCCATTT 


TTTTCACCTT 


CATCTTTCAC 


CGGCACTTTT 


TCTATAAGAG 


13260 


CAGAAAAAGT 


TAACCTCTTA 


ATTATCCGAG 


ATTAACTTTT 


CTTCTATTAT 


TCTTGAAACG 


13320 


GGCGATCTTC 


GCTAATCTGA 


ATCGGCTCAA 


TCATTTTCGC 


TTGGCCCGTT 


TGGTCATCAA 


13380 


TCTCTAAAAT 


ACAGGCAGAT 


AAAATTGTCC 


GCCCTGTTTC 


GACTACCTCA 


AAGCGCTTCG 


13440 


GCAATGCTGT 


CATAAACTTC 


TCAATAATCG 


GTTCACGCTT 


CATTCCTAAA 


ATCCCATCAT 


13500 


AAGGACCGGT 


CATACCGACA 


TCACTTAAGA 


AGGCTGTCCC 


TTTCGGTAAA 


ATCCGAGCGT 


13560 


CATTTGTTTG 


AACATGCGTA 


TGTGTTCCAA 


CAACGGCTGA 


AACTTGGCCA 


TCTAAAAACC 


13620 


AGCTCATGGC 


TTGTTTTTCA 


CTCGTCGTTT 


CGCCATGAAA 


ATCCACAAAA 


ATGAGCGGTG 


13680 


TCCGTTTGCG 


AGCGATAGcT 


ACTAACTCGG 


CGGCTTTCTT 


GAAAGGATCT 


TCCAATGGCA 


13740 


CCATAAATGT 


GCGGGCCTGT 


AAATTAATCA 


CCGCTAGTTC 


CAATTGATTG 


ACTTTCACAA 


13800 


AAACCATTCC 


TTGACCAGGC 


ACATCTTCTG 


GAAAGTTCGC 


TGGACGAACC 


ATTTTCTTCG 


13860 


CATCATCAAT 


GAATTCAAAG 


ATATCCTTGT 


TGTCCCACGT 


ATGGTTACCC 


ATCGTAACAA 


13920 


CATCGACACC 


ATCTTGCAAA 


AATTTCTTAT 


AAATCTTGCC 


AGTAATTCCC 


CGACCAGATG 


13980 


CCGCATTTTC 


ACCATTGGCA 


ATGGTCACTT 


GTGGACGATA 


TTTTTTCTTT 


AATTTAGGGA 


14040 


CATACGTTGC 


TAAGGCTTCT 


CTTCCTAAAG 


AACCGACAAC 


ATCTCCTATA 


AATAATATAC 


14100 


GCAAAAAGGT 


TCCTCTTCTC 


TTTTTAACAT 


CTCTTGTTTA 


TTATAGTTGT 


TTTCGCTTTG 


14160 


AAAAGAAAGT 


TTTGGTTTTA 


AAAGATTAAA 


AAAAGtGTCT 


CAACGAACAA 


TCCTTGCAAT 


14220 


TGTTCGTTGA 


GACaCTACAC 


GCGATACTTA 


TTCGCTTTTT 


ATTTTATTTC 


GCATAGTCTA 


14280 


CGGCACGTGT 


TTCACGGATA 


ACCGTTACTT 


TGATATGTCC 


TGGATAATCC 


AAGTCATCTT 


14340 


CAATTTTCTT 


cx:gaatgtct 


CGAACTAAAC 


GAACAGATTC 


TAAATCIAGAA 


ATTTCTTCTG 


14400 


GTTTCACCAT 


GACACGAACT 


TCACGTCCTG 


CTTGAACAGC 


AAAACTTGAA 


TCAACGCCTT 


14460 
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CAAAACTATT 


AGAAATATTT 


TCTAAGTTTT 


CTAAGCGGCG 


AATGTAGTTT 


TCTAACGATT 


14520 


CcTACGTGCA 


CCTGGACGGG 


CTGCAGATAA 


CGCATCTGCG 


GCTGCCACTA 


AAACAGAAAT 


14580 


AACTGAGGTT 


GCTTCCACAT 


CACCATGATG 


CGAGGCAATG 


GCATTAATAA 


CTACTGAATT 


14640 


TT CTT TATAT 


TTCGCAGCTA 


ATTCTGCGcg 


AATCtCAACG 


TGAGAACCTT 


CAATTTCATG 


14700 


ATPCAATGCT 


TTCCCAATAT 


CATGTAGTAA 


ACCTGCACGT 


TTGGCTAATT 


GAACATCTTC 


14760 




GCGGCTAAAA 


CGCCAGCTAG 


TTTTGCTACT 


TCGATTGAAT 


GGTTCAAGAC 


14820 


ATTTTGACCA 


TAACTTGTTC 


TGAAACGTAA 


ACGGCCTAAA 


ATTTTAATTA 


AATCAGGATG 


14880 


TAAf^STATGA 


GCACCTACTT 


CAAAGGCTGC 


TTGTTCTCCG 

A A \3 A A A w W 


TATTCACGAA 


TCCGTTCATC 


14940 




PGTGATTTTT 
X X X X X 


CTACCATTTC 
oxn\«%«/^x A x^ 


TTCGATACGA 


GCGGGATGAA 


TCCGACCATC 


15000 


TTf^ATTAAT 
X XV9V9iriX Xnnx 


TTTTCCAAGG 


TCATTCGAGC 


GATTTCTCGT 

UVl X X X O X OO A 


CTGATTGGAT 

^ A W** A A >JW«* A 


CAAATCCGGA 


15060 


TAAPAPAAPf 


GT'TTPTGGfG 


TATC!ATCAAT 
X n X x<iX& X x 


GATTAAGTCA 


ATCCCTGTTA 

r» A A W A A «1 


ATGTTTCCAA 


15120 


TGTAPGAATA 


TTACGTCCTT 


CAPGACCAAT 


AATGCGACCT 


TTCATCTCAT 


CATTTGGTAA 


15180 




GAAAPAGTTG 


TTTPAGAAAP 


AGAATPGGPG 


GPGPAACGTT 


GAATTGCTAG 


15240 


r^r: A A A A Ar: 


X X O X X ^^V9l^ X X 


TGPGATCAGA 


TTPTTPTTTG 

X XOX XV^XAAx3 


aPACGTTGTT 

^\_#Xlw\3A AwA A 


CTGACTCCTT 

X XJaIV^ X V^X^ X X 


15300 




U X X r\\3 X X 1 


nATTPAAr*TP 


TTPTTPTnTT 

X XV^X AV^XOX X 


GAPTTPATAA 


TTUVTPGATTT 

A A V* VJX& A A X 


15360 


X \9W<» 1 1 0 1 X V> X 


TTTnAAAfZf^ 


AAKPAATTPG 


TTPTAATTPT 
X Lm\x xv«x 


TGATGTTGPT 

X \3n X VJ X X W^xt X 


GATTAATTAA 

ws* A A A A 


15420 


u X X X X V.^ X r%k» X 


1*/ ••|H|M|M|H|l/ Mil/* 

XVivX X X X Xn,*LO 


CPTf!ATPAAT 


TAATTfiPTfST 

XrVXX XOWXwX 


CTAGCACCAA 


GTT TCTCTTC 

\S AAA X« A \^ A A Vi« 


15480 


XXXXXOXX \^v« 


AGTGAGCCTT 


PAPrtTTTTTP 

V^AVi^Vi X X X X X 


mAGyGAGTPA 


TPTTTAPGAT 
X V* X X X nx^vtn, x 


CCAACGTTTG 


15540 


X X v«v«v«0 X X \3t\, 


ATTAAACGAT 


TTTPTTGAGA 

AAA \^ A A V3/^\3n 


TTTAACTTPT 

AAA XV^U A A \^ A 


AATCTGCTTT 


CTTTCAACTC 


15600 


ACTTTcAATT 


TCTGAACGAT 


ACTTCTGGTT 


CTCTTCCTTA 

X X X X^V^ X X 


GCTTCTAGCA 


ATGCTTCTTT 


15660 


TTTCAGTGTT 

AAA VoJaKJ A ^7 A A 


TCTGATTCTT 

A S« A ^9r* A A \^ A A 


TcCGAGCGTT 


CTCGAGAATA 


cTCGACGCTG 


AAATTTTTGC 


15720 


ACC AT TT AT T 

rlV^WJ^A A A XI A A 


GCTTTATCGT 

AAA A WW A 


GACGTGACTT 


TGCAAT7\ACA 


AATCCTAAAC 


CAAGACCGAC 


15780 


AATCAAACCG 


ACGATAGCGA 


GGAGAATATT 


TAAAACCATA 


TTTCCACCTC 


CATACTATCT 


15840 


TTTTGTTAAT 


TCATATCTTG 

ANm^A«*A%^A a V7 


TAGTTTTTTG 

X X X X X X X 


ATAAACTAAA 


ATCAATTATC 


AATATGCCTA 


15900 


CTTGCATATT 


TGTGCATACA 


CACAATTTTA 


TTCTAAtGTT 


TGTCGTGAAA 


GGTGTCAACT 


15960 


TAAAAAAGCC 


CTTTCTTTTG 


GAAAGGGCrG 


CAAATATTAG 


ACAACGTCAC 


ATTTTTAGGT 


16020 


TTCCGCCGTC 


ACGAATTGTT 


TAAAT TTT AG 


AACAAACGCC 


GCATAGTCTG 


CTGTTTGACT 


16080 


AAGCTGCTTG 


ATGTGGATGG 


GTTCGGACAT 


GGACAACAGT 


CTATTTAACA 


ATTTAGTTCT 


16140 


AAACAAAAAA 


AGACAACACC 


TGTCCTGTTC 


AGTAAGTGTT 


GTCTTTCTTC 


TTAACGAGTC 


16200 


GTATTCACTC 


GTTATCTATT 


CATCTAAGGG 


TAATTCTTCT 


TGACCTTCTG 


CTTCTTCTGT 


16260 


AATGGTTGAA 


CCGTCACCGA 


TGCCGTATGC 


ATCACGGACC 


AATTTAGACA 


CTTCTGCCAT 


16320 


CATTTCAGGA 


TGATCCGCCA 


TGTATTGTTT 


GGCATTTTCA 


CGGCCTTGGC 


CAATTCGTTC 


16380 


TTCTTTGTAG 


CCATACCAAG 


CACCACTTTT 


ACTGATAAGG 


TCTTTTTCCA 


CAGCCATATC 


16440 
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TAGTAATTCG 


CCTTCTTGGG 


AAATACCT lis 




IV *P IV T*/^ IV & r^TT 

A TAi UAAL. J, 1 


UKaCUAUri r 


IbDUU 


AAAT.GGTGG.C iJCCACTJTGt 


TTTTAACGAC 


TTTaATTTTT 


GTGCGfe Tl Av* 


CGACGATGtC 


1d5d0 


TGTTCCTTGT 


TTTAACTGTT 


CT6CACGACG 


GACTTCTAGA 


CGAACCGTTG 


/^^i* IV IV IV iwiniii 

CGTAGAATTT 


16620 


CAATGCACGT 


CCACCAGGAG 


TTGTTTCAGG 


ATTTCCGAAC 


ATCACGCCAA 


CTTTTTCACG 


16680 


AATTTGGTTA ATGAAAATAG 


CAATTGTTTT 


TGTCTTATTA 


ATTGAGCCTG 


ATAATTTACG 


16740 


TAGTGCTTGA 


GACATTAGTC 


GAGCTTGTAA 


GCCGACATGG 


CTCGCTCCCA 


TCTCACCATC 


16800 


AATCTCTGCA 


CGAGGAACTA 


ACGCAGCAAC 


CGAGTCGATG 


Ik ^ Ik Ik w 

ACAACGATGT 


CAATCGCACC 


16860 


ACTTGAAACT 


AAGGCATCGG 


CAATCTCTAA 


GCCTTGCTCG 


CCCGTATCTG 


GTTGAGATAA 


16920 


AAGTAATTCA 


TCGATGTTAA 


CGCCTAGTTT 


CTCCGCATAT 


TGAGGATCCA 


ATGCATGCTC 


16980 


AGCATCGATA 


AAAGCGGCTG 


TTCCACCATT 


TCGTTGCACT 


TCTGCAATTG 


CGTGTAAAGA 


17040 


AACAGTTGTT 


TTACCTGAAC 


TCTCAGGACC 


ATATACTTCA 


ATAATTC 




17087 



(2) INFOEIMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11764 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



AAATGAATAT 


TTATTCTGCT 


TTATTnTCAA 


TACTTTGTTT 


TCATAGTTGC 


TTTAGCTGTT 


60 


CCTTTTTTGA 


ATAAGATTAA 


AATGAATAAA 


TAACAAGCAA 


AATATAAGAT 


TTTTTTGTTT 


120 


AACTCTTTTT 


GTTTCAATAG 


GAAGTGCTTA 


GTGTTTTTTC 


AGAATTGAAA 


AATGT^TACG 


180 


ACAAAAAATT 


TATCTATATA 


ATAGAAGGAA 


AAATAAAAAT 


AGTGAATGAT 


AAAAAAAATT 


240 


CAAAGAAAAC 


ACCCGTTCTT 


TTTAGGCCTG 


AGTTTTTGAA 


AAAAACATTC 


TCATGTCTAG 


300 


TATCTTTAAC 


ATTTTGTTAC 


TTTTTGACTG 


AAACGAATGT 


AAAAAAACCG 


ACATTTAAGG 


360 


GCTAATCAAG 


GGTTAGAAAA 


CGGTTTCTCG 


AAAATAAAAT 


CCTGTTTTTT 


TAACTTTTTT 


420 


TCGCAAAATA 


TAAACAGGAT 


AATATCAAGC 


TTCCAGCAGC 


TTTTTATTAC 


AAAATGTAAC 


480 


AAAACCGACA 


TTCTAGTAAT 


AAATTGTCAC 


GA6ATATTCA 


CtACGGCCGG 


CCATTCGATG 


540 


ATAAAGTATG 


TGAAGTCGTT 


TCGATAAAGC 


GAAGCAGACG 


TGGTTGTTGC 


CGAGTGCTAA 


600 


GAGCAACAAT 


CAAATATTTT 


TAATTACAAA 


GGAGAAACAT 


ACATGAAATC 


AATCAAAACG 


660 


ATTTTACTTG 


GAACAACTTT 


GGCCGCAGtT 


TAGGATTATT 


CTTAGGAACA 


GACGCAAACG 


720 


CAGAAAGCTT 


ATATACAGTT 


AAAGCAGGAG 


ATACTTTATC 


AACAATTTCT 


CATCAATTTG 


780 


CAGGAGACAA 


TAGCTTAATT 


CAAAAAATTG 


CTTCTGATAA 


CAAATTGCCA AACCTTGATT 


840 


TAATTTTTGA 


AGGAGAACAA 


TTAGTTATTC 


GTTCAGAAAA 


AGAAgTTGCT AATACTCCAG 


900 


CACCAGCTGT 


AGAAGTTGCA 


CCAGTTCAAC 


AAGTAGTTGA 


ACAACCTGTT 


GCACAACCAG 


960 


TACAACAAGA 


AGTACAACAA 


CCTGTTGCTC 


AAGAAGTAGC 


GCAACCAGCA 


GCACCTGCTG 


1020 
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CAAGCAGTGA 


TGCAAAAGAG 


TGGATCGCAC 


AACGTGAATC 


TAGTGGTTCT 


TACGATGCAA 


1080 


CAAACGGTCA 


GTATATCGGT 


CGTTACCAAT 


TATCTGCCfC 


TTATTTAAAT 


GGTGACTATT 


1140 


CACCTGCCAA 


CCAAGT^ACGC 


GTGGCAGATC 


AGTACGTAGC 


AGGTCGCTAT 


GGTTCATGGG 


1200 


ATGCAGCTAA 


ATCATTCTGG 


TTAGCAAATG 


GTTGGTACTA 


AAATTAAATA 


AAGGArTAAT 


1260 


CTAATTACTT 


ATTAAAAACC 


ATAAACAAAA 


AAACAGTAAC 


CTGTAAAGAT 


TACTGTTTTT 


1320 


TTTGTCGATT 


TGTTAAGGTA 


CAACGCCCAA 


CGCTCGTCAA 


TTGGTGGTCT 


GCGTTATAAA 


1380 


TTTTTGCTTC 


CCAAACTTGA 


AGGGTCTTTC 


CCGAGTGATC 


GGGCGTCGCA 


ATCACGGTAA 


1440 


GGCTACCATC 


ATGGACACTA 


CGAAGATGAT 


TTACTTGTAA 


GTCTATGCCG 


ACAGCAAAAG 


1500 


TGTTTTCAGG 


AACGTTTTCG 


TTCGCACCTA 


AACTACAGGC 


AGTTTCAATT 


AAGACGCCGT 


1560 


TTAAGCCCCC 


GTGGACAAGG 


CCATAGGGTT 


GTTTATGCAC 


GTCTTTTACC 


GTTAAAGTAA 


1620 


GCTGACATTT 


CTCAGCAGAA 


ACCTGTTGAA 


TCGTGATGCC 


TAAATACTCT 


AAAAGATGAA 


1680 


TAGTAATCaG 


TCCTTTCTAA 


TAAAAGAGAG 


AATAAAAGGT 


TTTTaTTaTT 


ATACAGGAAT 


1740 


GAAGAAAATT 


kGAAATAGCC 


AkGTTTTTtC 


TTAAAATtCa 


AGCGAmTTTT 


TCyCtAAAGg 


1800 


CCGATTCTgk 


GkATAwTrGA 


AAaGATGrGA 


TtAATGAAGG 


AGGTTTCCTA 


ATGAGAAACT 


i860 


GGACAACAAT 


TAAAGAATAT 


GATGAAATTT 


TATTTGAGCA 


AGCAGGCAAA 


GTAGCAAAAA 


1920 


TTACTATCAA 


CCGCCCGCAT 


GTTCACAACG 


CATTCACACC 


AAAAACAGTC 


ATGGAAATGA 


1980 


TTGATGCATT 


CAACATTAGT 


CGTGATAAAG 


AAGATGTCGG 


GGTTATTATT 


TTAACAGGCG 


2040 


CAGGTGACCA 


AGCATTCTGT 


TCTGGTGGCG 


ATCAAAAAGT 


TCGAGGTAAT 


GGTGGTTACG 


2100 


TTGGCGAAGA 


CAACATTCCA 


CGTTTAAATG 


TGTTAGATTT 


ACAACGGTTA 


ATCCGTGTCA 


2160 


TTCCCAAACC 


AGTCATCGCA 


ATGGTGAAAG 


GCTGGTCCAT 


TGGTGGCGGC 


AATGTTTTAC 


2220 


AATTAGTTTG 


TGATTTAACA 


ATTGCGGCGG 


AAAATGCTAA 


GTTTGGGCAA 


ACAGGACCAA 


2280 


ATGTCGGTAG 


CTTTGATGGT 


GGATACGGCT 


CAGGTTACTT 


AGCGCGTGTT 


ATCGGCCATA 


2340 


AAAAAGCAAA 


AGAAGTTTGG 


TTTATGTGTA 


AACAGTATTC 


TGCTCAAGAA 


GCTTTAGATA 


2400 


TGGGCTGGAT 


TAATACAGTT 


GTTCCATTGG 


AACAAGTAGA 


AGATGTGACA 


ATGGAGTGGG 


2460 


CAGAAGAAAT 


GTTAACCAAA 


AGCCCGATTG 


CTTTACGGAT 


GATTAAAGCC 


TCTTTAAATG 


2520 


CCGATACAGA 


TGGCTTGGCT 


G6TGTGCAAC 


AATTAGCTGG 


TGATGCGACA 


CTTCTTTACT 


2580 


ATACAATGGC 


TGAAGCACAA 


GAAGGCCGAG 


ATGCATTTAA 


AGAAAAACGG 


ACACCAGATT 


2640 


TCGATCAATT 


CCCTAAATTC 


CCATAAACCA 


ATGACTTGGT 


TAAATAAACA 


AGTACAAAAA 


2700 


CGTCCCGATC 


ATCCTGCTTT 


TTATTTTCAG 


GATGAATCTT 


GGACGTTTTT 


AGAAGTGcAA 


2760 


CAAGAAGTCA 


GCCATTGGGT 


AGCCACATAT 


CAACAGGTGC 


TTGCACCAGA 


AGAAAAACGT 


2820 


GTGGCTTTAT 


TCAGTAAGAA 


CAGCAAAGAG 


TTGTATTTTT 


CAATTCTAGC 


CTTATGGGAA 


2880 


TTGGGGAAAG 


AGTTATTATT 


TTTAAATACG 


CATTTGACAC 


TTGCTGAACT 


GACTTTTCAG 


2940 


TTAAAAGATG 


CGCAAGTTAA 


AACGATTATT 


GGTGCGCCTG 


AAACGCAGGC 


TTTGTTAGAA 


3000 
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GAAATTTCTT 


TTGTCGACGT 


TCAGCCAATG ATTAAAAAAC AGCATAGTCT TTCACATCAG 


3060 


GAGTTTCAAC AGCCAAGTGA 


CTTAGAGTCG 


GTGGCTTCTA 


TTATGTATAC 


ATCTGGAACT 


3120 


ACAGGACAGC 


CTAAAGGGGT 


CTTGCAACGG 


TTTAAAAATC 


ATTTGGCCAG 


TGCTAGAGGC 


3180 


ACTCAAGAAA ATATGGGGAT 


TACTGCTGAA 


GATTGTTGGT 


TGTGTGCAGT 


CCCATTATTT 


3240 


CACATTAGCG 


GATTGTCGAT 


TGTTGTTCGA 


CAACTCGTCT 


TAGGTTGTAG 


TATTCGCCTT 


3300 


TACGATAAAT 


TTGATGAACA 


GCAAGTGACA 


CAAGATTTAC 


AAGAAGGCCG 


TGGAACCGTC 


3360 


ATTTCAGTGG 


TTGCGACAAT 


GTTACAACAA 


TTATTGTCTG 


TCTATCCTGA 


AGCTGGCTAC 


3420 


AGCGCCAgCT 


TTaAAGGAAT 


GCTATTAGGC 


GGCGGACCGA 


TTGCTCCGGA 


TAAATTAGCG 


3480 


CAGTGTGAGG AAAAAGGGAT 


TCCAGTCATT 


CAATCTTACG 


GGATGACGGA AACGTGTTCG 


3540 


- CAAGTTGTTG 


CCTTGAAATT 


TGAAGATGCG 


GCACTGAAAA 


TCGGCTCTGC 


TGGACAACCG 


3600 


TTAAAAGATA 


TGCAAATCAA 


AATTGTTGAT 


GAACTAGGAC 


AAGAACAGCC AGAAAAGCAA 


3660 


GTCGGGGAAA 


TTTTACTTAA 


GGGACCAAAC 


GTTGTTTCAG 


GTTACCTTAA 


CCAGCGTCAA 


3720 


CCTGAAAAAT 


GGACAGCAGA 


TGGTTGGTTT 


AAAACAGGCG 


ACATGGGCTA 


TTTAGATGCA 


3780 


CAAAGTTACC 


TGTATTTGGT 


TAGTCGTTTA 


AGTGAACTCA 


TTATCTCTGG 


TGGTGAAAAT 


3840 


ATTTACCCCA 


CTGAGGTTGA 


ACAGGTGTTA 


CAGGCGATAA 


CAGGAATTAA 


AGCAGCGGCA 


3900 


GTTGTAGGAG 


AACCAGATGC 


GCAATGGGGa 


GCCGTCCCAG 


TTGCATATGT 


GATTAGTGAC 


3960 


CAAGAAATCA 


CCTTAGCGCA 


AATTCAAGAC 


CAGTGTTCAC 


GAAAACTGGC 


AAAATATAAA 


4020 


CGACCGAAAC 


GCATTTATTT 


TTGTCATTCT 


TTCCCGCAAA 


CAGCAAGTGG 


AAAGATTGCC 


4080 


AAACATCGTT 


TCATGACAGA 


AGAAAGAGAG 


GCGTTTTTAA 


TCAGATGAAA 


TTGCCTGATG 


4140 


AATTACGACA 


AGCCTATCAG 


AAGAATGCCC 


GTCAGTTCAG 


TTGGGTTTTT 


CCATTAGAAA 


4200 


"CAGAACAAAC 


AGCCTTGGCA 


ATTTTTGCCG 


CTGGTGAAAA 


GGCATACCAA 


GGAGAACGTT 


4260 


TTTTTTGGCA AACGCCGAAA 


AAAGATTTTG 


CTTTGGTTGG 


CTTCGGATAT 


GAAACTATCT 


4320 


TAAAAGGCGC 


AGAAACAGAA 


ACGCATCAAT 


TGAACAACTT 


TTTAAAAAAA 


GAGGCGGCAA 


4380 


CTCGTTTTCA 


AAATATGTCT 


ATTTCTGGTA 


CGGGGGCCCT 


TCTGTTTGGC 


GGTTTGCCTT 


4440 


TTGATACAGA 


ACAAACGCCT 


ACTGAGGCCT 


GGGGAGAGTT 


AGGGGAAGGT 


TGGTTTTATC 


4500 


TACCAAGTAT 


TCTATTTACT 


TTTTCAGGTT 


CACGTATTTA 


TGGAACGCTG AACTTTTCAG 


4560 


GTTCTAGTGA 


ACAAGAAGTC 


GAGGAACGTT 


GGTCGACTTT 


GGTGGCTCAG 


TTTGATCGCT 


4620 


TGTTAGCTAC 


CTGTGAGGAG 


TTGCCAGCTG 


TAGCGGAAAC 


AAAGATTGAC 


AAGGAAGAAG 


4680 


TTGCCGTGAC 


TGAATGGCTA 


CAAGCTGTCA ACGAAACAGT 


AGCAGTCTTA AGAGAAGATG 


4740 


GCCCTTTAAA AAAAGTGGTT 


TTGGCTCGAC AACTTGCTGT 


GAGCAGTCCT 


GAAAAAATTC 


4800 


GAAGTAACCA AGTGCTCGTT 


AATTTAATGG 


CACAACAACA 


AAATACGTAT 


TTGTTTGCTT 


4860 


TAGAAGCCAA 


AGACACAAGT 


TTCATTGGTG 


CAACACCAGA 


ACGACTTCTG 


TTGGGTACTA 


4920 


AAGAAACGTT 


TGCGACCGCT 


TGTATTGCAG 


GAACCATTAA 


AACGGGTCAA 


ACACCAGAAG 


4980 
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AAACGAAAAC 


TTTAGGAGCA 


CAACTTTTAC AAGACCGCAA AAATACCGGA GAACACCAAA 


5040 


TTGTGGTGGA 


ACGTTTAGCG 


AAAGAGTTGG 


CGAAGATGAC 


GACTTCTGAA AACAGTATTC 


5100 


AAGCACCCAT 


TATTTTGGAA 


AATCGAGATG 


TACAACATCT 


TTATGTGCCA 


ATCTCGGGTC 


5160 


AACGTAAACC 


GGGAATTTCA 


TTTTTGGAAA 


GTGTGATGCA 


GCTACACCCA 


ACGCCAGCTT 


5220 


TAGGTGGTGA 


GCCCAAAGAG 


TTGGCAGTCG 


AATGGATTCG 


ACAATATGAG 


CCAGGGAGTC 


5280 


GTGGGTTATA 


TGGTGCGCCA 


ATCGGTTGGA 


TTTCAGGGAA 


TGATGATAGT 


GGCGAGTTCG 


5340 


CTGTGGCCTT 


GCGTTCAGGT 


GTCTTTGCTG 


GTCAGCAAGG 


CGTTCTTTAT 


GCAGGTTGTG 


5400 


GTATTGTTGC 


GGATTCCCAA 


GCAGAGCTAG 


AAAGAGAAGA 


AACGAAAATA 


AAATTTCAAC 


5460 


CGATGTTACG 


AGGAATTGGA 


GGTCAGGTTT 


AATGAATCAT 


CAAGAAACAA 


TGACCGATTA 


5520 


TTTAATGGCC 


TTTATTGAAG 


GCTTAAAAAA 


TAGTGGGGTT 


GAACAAGCAG 


TCATTAGTCC 


5580 


AGGGTCCCGG 


TCCACACCAC 


TGGCTTTATT 


ACTTCATCGG 


GAAACCGCAA 


TCCAAACTTT 


5640 


TGTCGATGTC 


GATGAACGTT 


CCGCTGCTTT 


TTTTGCTTTA 


GGATTAAGTA 


AAGCGAGTCA 


5700 


AAAACCAGTG 


GTCTTACTTT 


GTACCTCTGG 


AACAGCAGCT 


GCAAACTATT 


ATCCAGCAAT 


5760 


TTGTGAAGCC 


AATATTAGCC 


ATGTGCCTTT AGTCGTTTTA ACAACAGATC 


GGCCGCATGA 


5820 


GTTACGTCAA 


GTTGGTGCTC 


CTCAAGCCAT 


GGACCAATTA 


CAAATGTATC 


AAAATCACGT 


5880 


GAAATTATTT 


GTTGAAATGG 


CTTTGCCAGA AGCAACAGAA GAGATGTTGA ATTACGCCTA 


5940 


TTGGCAAGGC 


GCTAAAGGAG 


CAGCATTCGC 


GCAACAAACG 


CCTGCAGCAC 


CTGTTCATTT 


6000 


GAATTTTCCT 


TTGCGGGAAC 


CATTATTACC 


TGACTTAGAG 


AGAAAAACAA 


AGAGTTCTCA 


6060 


ACAAACGGCT 


TTATTCGCTG 


GTCAATCAAT 


CCTTTCTACA 


GAACAAGTGC 


AACAGTTAGC 


6120 


GGATCAATGG 


TACCAGAAAA 


ACGGCGTCTT 


GGTTGTCGGT 


GGCAGTCATA 


CAGAGGAAGA 


6180 


AGCGACGTTA 


TTTATTCAAT 


TAGCCGAAGC 


TTTGCAATGG 


CCGTTACTAG 


CGGATCCTTT 


6240 


AGCAAACATT 


GTGACACATG 


GGCAAAATAG 


TGAAGTTGTT 


ATTGCACACA 


GTGATCTGTT 


6300 


TCTAAATGTG 


GCGACCTTGC 


CACAAGAACC AGAAGTGGTA 


GTGCGTTTTG 


GTTCGCTGCC 


6360 


AATTTCTAAA 


AACATCATGT 


TATGGTTGAA ACGCTTAGCA ACGACAGAGA 


CTGCTTTTTA 


6420 


TTTTGTGGAC 


GAAAATGGGC 


AATGGCAAGA GCAATTGAAA AAATCACAAA 


CAGTTATTCA 


6480 


GGCAAAGGAA 


ACCACGTTTG 


TAGAACAACT ACTAACGGTT 


GTCAAACCAA CAGAAGCGAC 


6540 


TTGGTTAGCA 


CAATGGCTCC 


TGCTGGAAAA AACGGTCTCT 


GAAGTTCTTC 


TAGAAACGCT 


6600 


GAACGCGACA 


GAATTAAATG 


AAACAACAGC CAGCTTAGCT 


GTTCACCAAA CAATGAAAGA 


6660 


AAATGGTCAG 


TTATTTGTAT 


CTAATAGTAT 


GGCTATCCGC 


TACTTAGATC 


GATTTATGGA 


6720 


TAGTCGCCCT 


TATCGGATGT 


TTGGAAATC6 


CGGCATCAAC 


GGGATTGATG 


GGATTGTTTC 


6780 


AACTGCTTTA 


GGAATGAGTG 


CCATAGCACC 


GACACAGCAA 


AACGTTCTAC 


TGATTGGTGA 


6840 


TTTAGCGCTA 


TATCATGATA 


TGAATGGTTT 


ATTTTTGGCT 


AAACGTTATC 


AGTTGCCGTT 


6900 


AACAATTGTT 


TTATTAAATA 


ACAATGGTGG CGGGATTTTC 


TCTTTTCTAT 


CTCAACGAAC 


6960 
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TTTACGTGAG GACGATTTCG AGCCTTTATT TGGAACACCG CTTGATTTAG ATTTTTCGTT 7020 

AGTTGCTGAA CTTTACGGTG CTTCGTACCA AGAAGTCAAA ACAATCGCGG AgCTCAAGCA 7080 

AATTTTACAA GCAGCG6CTG AAGAACCACA ATTTCAAGTG ATAGAAGTCA AAGGAAATCG 7140 

CCAAGAGAAT GTTCATTTGT ACGAATCAAT TTTAGCTGAA ATTGGTAGAC GAGTAGAAAG 7200 

ACAGGGGATT TCATGGAACG GTTAATTCGG GGGATGCAGT ACCATTATCA ATGGTTAACA 7260 

CCTTTTGATG CAAAGCGAAC GACAGTAGTC TGTTTACACG GTTTCACAGG AACGTTAGCC 7320 

ACGTTTGCAG CCGTTTTTCC CAGTCAAACA CCTTATAACG TTTTAGGAAT TGACTTACCT 7380 

GGGCATGGCG CGACTGCTAG TTTGGTAGCG CCAGAGCGTT ATACCATGAA ACAGGTTTGC 74 40 

CATGATATTG CTGAACTAAC CGAATCGCTA AATTTGCCTT GCTTTTGTTT ATTAGGCTAT 7500 

TCAATGGGCG CTCGAACAGC TTTAGGTTTT GCATTACATT ATCCACAAAA GGTACAGCAT 7560 

CTTTTATTGG AAAGTGGATC ACCAGGTTTG GCCACTGCCG CAGAACGTCA GGCTCGTATC 7620 

TGTCAAGATC ATCGTTTGGC GGAACGTCTT TTAGAGGAAC CGCTGGTGGA TTTTATTGAT 7680 

TTTTGGCAAG AATTACCGTT ATTTCAAACG CAAAAAGCCT TGTCTGTGGC ACAGCAAATG 7740 

GCCATTCGTC AGGAACGTTT GAGCCAATCG GCCTTTGGAT TGGTTAGTAG TTTGTGGTAT 7800 

ATGGGCACAG GAGCGCAAGA GAGTTACTGG GAACGTCTAG CAGAATTGCA GCCGATTCCT 7860 

ACCGATTTAT TGGTAGGCGG CGAAGATCAA AAATTTATTG GAATTGCGAA AAAAATGCAG 7920 

GCACGTCAAC CGTTGTTGAG GCTGACAATT TTTCCTGAAG CAGGACATTG TATCCATTTA 7980 

GAACAACCAA CGATTTTCTA TGAAAAGGTG ACAGCGTTAT TGGAAGGAGC CATCTGAATG 8040 

AACATTCAAT CAATTGAAAC CTATCAGGTT CGTTTGCCTT TAAAAACGCC GTTTGTCACG 8100 

AGCTATGGTC GCCTGGAAGA AAAAGCGTTT GATCTATTTG TAATAACGGA TGAGCAAGGA 8160 

AATCAAGGGT TTGGCGAATT AGTGGCtTTT GAGCAACCTG ATTATGTGCA AGAAACGTTG 8220 

GTAACCGAAC GCTTCATCAT CCAACAGCAT TTAATTCCGT TACTTTTAAC AGAAGCGATT 8280 

GAACAGCCGC AAGAGGTTTC AACAATTTTT GAAGAAGTTA AAGGGCATTG GATGGGAAAA 8340 

GCTGCGTTAG AAACTGCCAT CTGGGATTTA TATGCCAAAC GGCAACAGAA GAGTTTAACT 8400 

GAATTTTTTG GACCAACGCG GAGGAAAATT CCTGTAGGGA TTAGTTTAGG GATTCAAGAA 84 60 

GACTTACCTC AATTACTTAA ACAAGTTCAG TTAGCTGTAG AAAAAGGGTA TCAACGAGTG 8520 

AAGCTGAAAA TTCGTCCCGG TTATGATGTG GAACCTGTTG CTTTGATTCG TCAGCATTTT 8580 

CCTAACTTGC CTTTAATGGT CGATGCAAAT TCAGCGTATA CATTGGCAGA TTTGCCTCAA 8640 

TTGCAACGTT TAGATCACTA TCAATTAGCG ATGATTGAAC AGCCTTTTGC TGCAGATGAT 8700 

TTTTTAGACC ACGCACAATT ACAAAGAGAA CTGAAGACAA GAATTTGTTT GGATGAAAAT 8760 

ATTCGTAGTC TCAAAGACTG CCAAGTGGCG TTAgcATTGG GCAGTTGCCG TAGTATCAAT 8820 

TTGAAGATTC CTCGTGTAGG TGGGATTCAC GAGGCGTTGA AGATTGCCGC ATTTTGCCAA 8880 

GAAAATGATT TATTGGTTTG GTTAGGTGGT ATGTTTGAGT CTGGTGTTGG TCGCGCATTG 8940 
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AATTTGCAAT TTGCGTCACA GCCTACCTTT TCCTTTCCAG 


GTGATATTTC AGCAACGGAA 


9000 


CGCTATT TTT ATGAAGATAT 


CATCACGGAA CCTTTTATTT 


TAGAGCAAGG 


GACAATGACC 


9060 


GTGCCTCAGG GACTTGGTAT 


TGGTGTTACA CTTTCCCAGA CGAATTTGCT AAAATACAGT 


9120 


CAGTATCAAA AAATCATGTA ATGATAACAA AATTGCGCCT 


ATGACTATGC 


AGAGAAGCAT 


9180 


AATCTAGGCG CAATTTTTTA 


TTTTTCTAAT TGACTCGCTG 


CGGCTTCATC 


CAAGAGAACC 


9240 


GTTAAATTTG GATGGAGTTG 


AAGAATAGAG GCAGGATACG 


TTTCTGTGAC 


AGGTCCTTGG 


9300 


AGCACTTGTT TGACCATCTG 


AGCTTTTTGT TCACCATTTA 


CAATTAGGAC 


AAGATGCTTA 


9360 


ACTTTCATGA TGCTGGCTGG 


CCCCATCGTA ACAAAGGTCA TGCCTGGCTG 


CATCATTTCA 


9420 


GGAACAAACC AAGGCTCTGA ACCAGTGACG GTCACTTGGT 


AGGTTTCATT 


TTTAAAATGA 


9480 


GTGGTAGTGG GCATGTTCCC 


GCAAAAATGA CCATCTGCTC 


CTAAGCCAAT 


TAACATCAAA 


9540 


TCTAAGCCGC CTGCGTCAGC 


AATCCGTTGT TCTTGTTGTT 


GAAAGTTTTC AACAGTTAAT 


9600 


GGATGGATGT TTTGCTCGGA 


TATTTCCGCA GGTGTCAAAT 


ACAGCGTCCG 


CAAATCTGTC 


966.0 


AGAGTAATGC CTTCCTTTTG 


ATGAGGAACG GGGATTTCAT 


CAAAGTTATA ATAATGAACA 


9720 


TTATCAAAGT CAGATGAATT 


TTTCACAATC GAGGCTAATT 


TTTGGTAAAC AAGTGCTGGT 


9780 


GTTTTGCCAG CGGTAATTGA 


TAAATTCACT CGTTTATCTT 


GGCTCATCGT 


GCTTAGAAGT 


9840 


AACATTTTTG CCCATTCGCT 


CATCGTGTCG AAATCTTGTT 


TAATTAAAAT 


TTTCATACTT 


9900 


GCTCACTCCA TTCTTTGAAT 


AAGCTAAGTA TAATTTGTGG 


GAACCATCCC 


ATGTCAAGCT 


9960 


CTTCACAAAA AGGATTTAAT 


TAACGGTAGG CAAAAGAAAG 


AGGAGCAGCC 


GTCAAAACGG 


10020 


CGACTGCCCC TCTTTTTAAA ACTTAAGATA ACCTACAACG AGTTAACTTT 


TTAGCTTTCT 


10080 


GTTCCACTTG CTCCAACGCT 


GGTCTAAACG TTCTTTTGAC 


CAACACTGTT 


TAACAAAGAA 


10140 


TTTTATTTTA TTGTTCTTTC 


AACCATTTTT CTAGTTGTAA ACGTTGCACT 


TTCATAGTAG 


10200 


CTGTGTGAGG GATTTCATCA 


AAAGCCATAA TGATAGGTTC 


GTTTAAATGT 


GGTAAATCGG 


10260 


CTACTTGTGT CCACCAAGCG 


TCCCAATCCA TTTCTTCCCC 


AGGGACAACC 


GCTAAAATTG 


10320 


GCTGAGGTTC ATTTTCTTTG 


CCACGGACAA TAATCACTTC 


TTCTAAGAAA 


TCTAATGCAT 


10380 


CCAATAAATG ATCTTCAATA 


GCTAAATTAC TGTCGATGTT 


TTCAATTAGA 


TCTACTTGGC 


10440 


GATCTTTCAA GAATAAGTGG 


CCGCGTTCAT CTAACATTCC 


GTAATCGCCG 


CTGTCCCACC 


10500 


ATTCGCCATA GACATTTTCT 


GCAAAGCGGG CATCTTCTTT 


GTAATATGTT 


AAGGCCCGTC 


10560 


CTTTTGATAA GAATTGGATA 


TGGCCGTCCG TCATCGCAGG 


TAAAACTTGT 


CCTTTTTCAT 


10620 


CGGTAATTCG AGCTTGTGTT 


AAATCCTCTA AACCAACGCC 


CATGTCACGA GCATCTGAAT 


10680 


CTTTTAGGGA TTCTAAAGTA TGCGCTCGTA AGATCATTGG CCCGCATTCA CTTTGTCCAT 


10740 


AAACTTGTAG GAAAATTGGT 


TCATTGGCAG CAGATGCTTT 


TAAGAAAGCC 


ACCATTGTAG 


10800 


CGTTGTTAAT TGCATCAAAA 


GTAGAATGAT AATAATGGAC 


ACTAGCGAAA 


GCTTCAGGTT 


10860 


TTTCTTTGGC CAAACGAACC 


CATTGAACGA AGTTATTGGG 


GTGTGTTTCT 


ACAGCGATTG 


10920 
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GTTGATAGGT GCTGAACATT TCAGCGACTT TTTCGCTCGA AGCATTTGCT AAAGGCATCA 
TAGGGAAGCC CATGGCCATT AAAGAAGACA CGCCAATGTT AAAACGAGAA TGAACTGGCG 
AAATATGAAA GCCAACTAAT TTCTTTTCAG CAATTTTTGT AAAAATGGTT TTTTGCCATT 
TTGTACGCCA ACCCATAGAG TGATTGGAAT GACAGATAAG TTTAGGAATA CCTGTTGTTC 
CAGAGGTGTG GGTCATGTAG GCAATTTCGT CTTTAGTCAG TTCGTCTTGA GGAACTTCCG 
TGGCATCAGC TTCTAAAAGA GAAGCAACCG AAAGTTTTTT ATTGGCAGAG CTATTACGAA 
CAGCTTGAAC TTTTTCAGCA GTCACATCAT CAAATAAAAT ATAAGGATCT TCTAAACGAT 
CAACGAATAC TTCAATTGTT TCAAAAGGTA AATGGTATGA AACCATTACA GGAACGGCTG 
CTAGATATGA GGCTGCAACA GCTAGTAAAT ACGTGTCGAA TTTAGGACTT TTATAAATAA 
TAATTTTGTC GCCTTTTTTT ACACCTAAAG TGGCTAATTG ATATGCCCGT TTCAAAATTT 
CTTGATGGCT TTCACCATAA GTTGTTTCTA AGCCAAGTGC AGGAAAAGCA GGCAAAGATT 
CATCAAAAAT AATTGGAACA GTTGGTGTTT TTTCTGCGGC TTCTTGATAG TTTGTATAGA 
GGTTTAATGG TTGATAATCA AGTAAGTGTG TCAAGGGAAA GTCTCCTTTG TAAnATAGTT 
TGTGTACGAA TGTACAAGTT CACTTTATCC TATCATAGAA TGGGTCTTTT TCTACGTCAn 
AAAT 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11307 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
TCCAATATAA AGACACAACA GACAAACGaT GAGGAGTGTG AAwTAAtATG 
TTATTGGGTT TATCTTTTGG ACAATGACAC TAATGTTAGG TTATCTAATT 
AAGGCCGTAA ACATGAGTAA TTTAACAAAA CGTAAAAAAG ATTTATTTGA 
GTTGTATTTA AAGATATTTC AAAGCAACAA AGCGAAAAAG CACAAAAAAG 
TTACAACTAA TGAATCAATA TCCCGATTGG GCAAGTCAAA AAAATAAACT 
GAAATTCAAG AATTAGGACA AGCAATCGGT AATTGGTCGA TGGATCAATC 
CAATCCATCA AGGCCGCATC GTTTACAAAA AGCGAGTATC TCTATTTAAT 
TATTCAGATG AAGCGATTCG TCACGGCTTA GACATGTCGA AAGAGTGTTA 
CGATTAACAC TTTTAAATGA ATAAAAGTAA AGGAGATTAA CCAATGCGTA 
TAATTATATC AAAGATATTT TAGCAGACTT TTATAAAACA GAAGAGTATA 
GGAAGAAGAA TTACGGCACC CTTATCAAGA AGCAGATTTA AATGCTGGTA 
AGGACTTCAC TCTGTAGTGA CCGAACGAAT GGCGATTACG ATAGCTATGG 



TTaACAATTA 
GGTGAAAGAG 
AATGAAAAGC 
AAAACGACTC 
TATTATGCAG 
AAGACCCATC 
TTGGCTCGGT 
TTTTATTTAT 
CGTCAACATT 
TCCGTCAACG 
TTAGAGGACA 
aTCGTCGTCT 



10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11764 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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GTGGAACTTa 


GAGAGAAATC 


GAGACATTAT CAAAAATTGT 


TTAGCCGAAG CGGaTGAACA 


780 


AACGCGCGTG ATTATTGAAG AACTATATAT GAAAAAACGG CCCTCTTTAA CATTAATTGG 


840 


ACTTGCCCAG 


CAATTATTTA 


TTAGTAAAAG CCAAGCCTAT 


AAATTAAGAA ATCATTTCTT 


900 


TGAAGCGGTG 


GCGGATGAAC 


TAGGGATGTA AACATGGAAA AAGCGTGGAA TTTTTTCAGG 


960 


TGTCAACATG GTAAATTAAT AGTGTCGAAA GAGATAGATA AACGTGAGGC AACCAAAAAA 


1020 


ATG/^GACAC 


GGAATTCTAT 


GATTTTGACT GCTTTCTTGT 


GTCAGCTATG AAGGAGCAGA 


1080 


AAATGCCGGC 


TACTTTCAAG 


ATCCTTCATT TTGACTAGAA 


GAGAGCCAAT TTGTTAACCA 


1140 


ATCCTGAATT 


TTTTGAATGG 


AAAGGTGGCG CTAAAAATGA ATGAAGCGGA ACAAGAGTTA 


1200 


TATGAAGCCC 


TTGTTGCAAT 


CTGCCAGACG TCaGGATTTT 


TGTTGCTAGA GGAACTGCCG 


1260 


ACAGATTTAC 


CAGATCAGCC 


ATTTGTTTAC TTAGGTGATA 


GTAAAGAATT ACCTAAGCCA 


1320 


ACTAAATCAG 


CTATTTTGGG 


AGAGATTGAA TTAATAATGC 


ATGTTTATGG TGCGTTATCT 


1380 


GAACGACAAC 


AAATTTCTAC 


AATTAAAGGA ACGATTTTAC 


GGCAGGCAAC CAGTAACTTA 


1440 


AAACGAACGG 


CTCATTTTAA 


TTGGGGTATC AAACATCAAG 


AAGTCAAAGC ACAAATGGTA 


1500 


AAAGATACCA 


AACAAATGAA 


AAAAACAATT TGGCATGCTG 


TACTACCATT ACACATGCAA 


1560 


TTTTACTAGG 


AGGAATTATC 


AATGGGAGAA GTTATGCAAG 


GAAAAGACCG TATTTTATTA 


1620 


GTTCGTCGCT 


TGGATGAAGC 


AGCGACAAAG AAAGCAATGA 


AACCCTTATT TCAAATTGAA 


1680 


CATGAATGGG AATTCTCACG 


TGAATCGAGC GGTACGCAAA 


CAAAAGATGG CGTCGCGAAT 


1740 


GCTGTTTCTG 


GTTTAGAAGT 


TACGTTATCG TTAAGCGGTT 


TAGCCTCTCG AGATGATGAA 


1800 


AATTTATACA TGaAAGACGC AgcGAaGATG GCATCTTAAT 


GGAATTTTGG GATGTTGATT 


1860 


TAAAAGGTGA 


AAAAAATGCG 


GAAGGTAAAT ATCCAGCAAT 


TTATGCCCAA GGTTATGTAA 


1920 


ATTCATGGAG 


TTTACCAGCC 


AATGTAGAAG AATTAGTAGA 


AATCGAAACA GAAGCCTCTA 


1980 


TTAATGGCAA GCCACAAGAT 


GGCTTTGCAA CAGTAGAAGC 


AGATATTATT GCAGAAGCAC 


2040 


AATATGCGTT 


CCAAGATACC 


GTTCCAGATA AAGCACCACA ACCTGGCGAA TAATCAAAAA 


2100 


GTGTTGAATT 


TTAGGAGGAT 


AAAAAATGAA TTTAGAGATT 


AACGGAAAAA CAATTGAAGT 


2160 


GAAATTTACG ATTGGCGCGA 


TTCGCGAATT AGATAAACGT 


TACCAAATTG AAAATGGCGC 


2220 


TGCCAAATTC 


GGCATGGGCA 


TCAGTTCAGC AATGATTTAT 


TTACGCCAAT ACAATCCAGT 


2280 


AATCTTAGTT 


GACATCATGG 


AAGCTTTACA AAGTGGGCAA 


TTAAAAATAG GTAAGTCGGA 


2340 


AATTGAAGCA 


TGGTTAATGA 


CCCAAGATGT CAAAAAACTT 


TCAGATGATT TGCTTAAAGA 


2400 


AATGGGAAAG 


CAACCTCTTA 


CAAAACCAAT GATCGATCAG 


TTCAGCAAAG AAGCGAAGAA 


o A cn 
Z4 bU 


AGCAGAAGCG CAAssgACCA ACTAATTAAA ACGAGCGATG ACGTGTATCA CGACATCGCT 


2520 


CTTTCTGCTT- TTCGCTACTT 


AGGCTGTCGT TCATTTGAAG 


AAGTGGATCA GATGACCATG 


2580 


TCTGAATTTG AATTACGAAT GATTGCTTTT AATTTAGCAG AAGTAGATGA AGAGCGGAAA 


2640 


AGGCACGAGC 


TTGCCTACTT 


AAATGTTAAA GCGCaAGCGA CAAACAAAAA AGGAAAACCC 


2700 
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GTTTTTGAAA 


GCTTTAAAAG 


TTTTTATGAT 


TATGAAAAAC 


GAGTTGCTGA 


AGTTCTGGCA 


2760 


GCTAACCAGC 


CACAACGAAC 


AAAATTAAAT 


GAGCGGAAAA 


AAACGCAACT 


TGCCACTGTG 


2820 


GCAGAgCGTC TACGCCGCTA 


TCGAGAAGGG 


AGGAGAGTAG 


ATGGAGAATG 


ACAAAGAAAA 


2880 


AACGCCGTTA 


TCGGAGGCAA 


AGAAAAGCcT 


TGCAGGCGTC 


CAACAAGCAT 


TAAAAAGTAT 


2940 


GAGCGGTGAG 


TATGCCTTAT 


TAAGTGGATA 


TTTAGGGAAA 


ATTAGTGCGG 


GTGTCAATCA 


3000 


GTCAGCCACG 


GTCATGAACA 


CATTTAAAAC 


CGTCATGCAA 


CAATCTGGAG 


AAACAGTG/^ 


3060 


AAAAACAGGA 


GACGAAACAG 


CAAAGGCAGC 


AGATCAAATG 


AACACAGCGT 


TAACAGATTC 


3120 


TGCTGAACAA 


GCCGGTGaAG 


CAGCTAAAAA 


AGCGGGGAAA 


GAAACCTCTG 


ATGGCTTTAC 


3180 


TAATGCACAA AATAATATGC 


TGAGCTTTGG 


GACGGCCATG 


ACTAGTGCCG 


TTTCCTTACC 


3240 


TATGCTGAAC 


GTTTTAAAAA 


CAGCTATGGG 


CGTCGGTGCT 


GGGGTCAGTG 


GCGAATTTCA 


3300 


AGGAATGCAA 


GGACTGATTA 




AGGAGGGATT 


TCTGATTCAT 


TGCAAGGCGA 


3360 


GTTGCAAGGG 


GCATTGACTC 


AGATGAATCA 


ATCATTTGAA 


GCGGCGGCAC 


AAGTGATTCA 


3420 


AAGCGTGATG 


GCTCCAGGAA 


TGG AAAT T T T 

X wOJTiiVrV X X X X 


GGTTCAAGTG 


GTTATTACAG 


TCGTCAAAGG 


3480 


CATTACAGCT 


TTGGTTAATT 


TATTTATCAA 

X *y X X X AX X v^^xxxA 


ATTACCAAAA 


CCCGTCCAAG 


TTTTTATTGT 


3540 


TGCCATTATG 


GGCATTTTAG 


CCGCCATTGG 


GCCCATGTTG 


ATTATGGTAA 


CGATGGCTCA 


3600 


GCTAAAATtT 


CAACAGTTTA 


V7 X wV« x \i>\3 X X X 


GGCTCTTGTA 

w X X X \7 X rk 


CAAGGAAACA 


TTGGGAAGTT 


3660 


AGGTGGTGGC 


TTATCAAAAC 


TAAGTGCTAG 


TTTTAGTGCC 

X X X X X ^^K'V^ 


TTAGGTGGAG 


GACCATTAAT 


3720 


TTTAATTGTA 


GCAGCCGTTT 


TAGCAGCGGT 


AGCAGCGTTT 


ATTTATTTCT 


ATAAAACCAA 


3780 


TGaAACaTTT 


AGAAATAGTA 


TCAATAGCTT 

X X^XVIX «V9V> X X 


AGCTAGTGCC 


ATTCmAGGaG 


CTGTTTCAGC 


3840 


GGCGTTTGGC 


AAATTGGTAG 


GATTGPTACA 

UtrX X X \3 V« X £^\*Fx 


ACAGATCCAG 


CCGGCCTTTC 


AGCAAGTAAT 


3900 


GGCAGTTTTT 


AAACAATTTT 


TTGCAGTAGG 


CTTAGAGAAA 


ATGGCGACTA 


TTTTTTCAAC 


3960 


AATTGGTCGT 


GTGCTAGCAG 


GCGTTTTTGC 


CAGCGGTTTG 


CAATTAGGTA 


GTAACTTATT 


4020 


AGGGCAATTT 


GGTGGCACCT 


TTGACAAAGC 


TGGTTTAGCG 


GTTGGTCTTT 


TGGTAAAAGT 


4080 


TCTGACAAAG 


GTTGCACTGG 


CTGCATTAGG 


AATTCTGGGC 


CGTTTGGTCT 


AATTATTTCC 


4140 


TTGATTGTTT 


CATTCGTGAC 


GGCCTGGATG 


AAAACCGGTG 


ATTTGAGTGC 


GGGTGGTATT 


4200 


ACCCAAGTCT 


TTGATAATTT 


AGGTAACACG 


ATTACATCGG 


TTACAACAAT 


GCTGGCAACT 


4260 


AATCTACCGA 


AAGTTATACA 


ACTTTTTACA 


ACAGTCTTAA 


CCAGTATTCT 


CGGGAAT^TA 


4320 


ACAGAAGCTA 


TTCCAAGCAT 


CGTAACCGCG 


TTATCTAGTT 


TAATTACGTT 


AATTGTTGGT 


4380 


GCGATCGTTG 


CCAATTTGCC 


AGTCTTAATT 


GAAGCGGCAA 


CACAAATTAT 


TACTACGTTG 


4440 


ATTCAGGGGA 


TTACAACAGT 


CTTACCAATG 


TTGATAGAAG 


TTGGTTTGAG 


CTTATTAATG 


4500 


ACTTTAGTTA ATGCGATTGT 


CACCGCCTTG 


CCAACAATTA 


CAACTGCAGC 


GATTAATATC 


4560 


ATCACTACAT 


TAGTGACAGC 


TTTTGTCACA 


GCGTTACCAA 


TGCTAGTTAC 


AGCAGGTGTT 


4 620 


TCAATTATCA 


CGGCCTTAGT 


CAATGCATTT 


GTTACTATGT 


TACCGTTGAT 


TTTGACTGCT 


4 680 
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GGTTTACAAA TTTTGATGGC ATTAATCACT GGGATTATGA CGATTTTACC TCAGTTAATT 4740 

CAATCAGCGC TGACGATTAT TCTAGCGTTA GTGACAGCGT TGATAGGTGC CTTACCACAG 4 800 

ATTATCAGCG CAGGTGTCAA ATTGTTAATG GCGTTAATTC AAGGAATTAT TTCGATTTTA 4860 

CCAACCTTAG TTGCGGCAgC TATTACCTTA ATTTTGACAT TGGTAAATGC CTTAATTGGT 4920 

GCCTTGCCAC AAaTCATCAG CGCaGGCGTc AAATTGCTAA TGGCTTTGAT CCAAGGGaTT 4980 

ATTTCAATTT TACCGCAACT GGTTACTGCA GCAATTACGC TAATTACCGC TTTAATGGGT 5040 

GCGTTAATCA ATGCGTTGCC ACAGTTGTTA AGTGCTGGGA TTCAACTGAT TCAAGCCTTA 5100 

ATTAATGGTG TACTCAGTCT ATTGGGTGCC TTGCTGTCCG CAGCAGGAAC ATTAATCTCA 5160 

CAAATGATCA CGAAGATTGG TTCTTATTTT GGTCAACTGT TAGCTTCGGG CGGACAGTTA 5220 

GTTGAAAATA TCAAAAATGG GGTTACCAAT GCAGCCAATC AGGTAAAAAA TGCCATTGGT 5280 

TCTGTAATTG AAGGTGCTTG GCAAGCAATC CAAGGTTGGT TTTCAAAATT CACCGATGCC 5340 

GGTGCGAATA TTGTCGGCAT GATTGCTGAT GG7VATTACAG GCGCTVATTGG aAAAGCCAAA 5400 

GAAGCAATCG ATGGAGTTGT CAGTAAAATT CGTAACTTTT TACCATTTTC ACCAGCAAAA 54 60 

GAAGGTCCCT TATCTGATTT GCATAAATTG AATTTCGGCG GCACGATTGC CACGGGGATT 5520 

TATGCAGGCG AAACAGCCGT TAGTAGAGCA ATGGCTTCTA TTTTAGATTT ACCGCTGTTA 5580 

AATGATTTTG CCTTGGACTT AGCTGGTCGA GGAAACTTCA CGGCAACGAT TGACCATCGT 5640 

TTAGAAAATG ATGCATACAA TCGACCATTA TTTGTGACAG TAGAGTCAAC GTTAGATGGA 5700 

AAAGTTGTCG CAGCAACTAC GGCGCCTTAT TTAGCAACAG AGTTACAACG ACAACAAGTG 5760 

AAACAAAATA ACCGCTTAGG AAGGAGAGGA TAACATGTAT AAATTTGTTG ATACCAATCA 5820 

AGCAACTCAT TCAACGCCTC TTCCTTCAGA AGCGTTGAAT TTTAACGGCC AATTTTTAGA 5880 

AAAAGTCATC CCTGGCTATC AAACATTATC AGTTTCAGGA CGAGAATTAG TTCCAAGCGA 5940 

AATTGAAAGC TATCAATTAG GGATTCGTGA TGGTAAACGT CACGTTTATG CGCGAATTCC 6000 

AGAACGAGAA TTAACAGTCA AATATCGCCT TTCAGCTGTG AATAATGAAG CATTTCGAGA 6060 

TGCATTTAAT CATTTAAACG TTGCTTTGTT TACGGAAAAA GACGTTtCTA TTTGGTTTAA 6120 

- CGATGAACCG GAAATGCTGT GGTTTGGCAG TAAGTCTTCA GTGAGTGATG TACCCGAAGt 6180 

GTTAACCAAG TAACAGGCAC CTTTACTTTA TTGCTTTCTG ATCCGTATAA ATACACACGG 6240 

AGkATgGCGA CTAGTGTGAT GTGGGGTTCG CCAACCATTA CATTTCAAGC GAATTACTTA 6300 

ATGGGGAATA CAGGCTCAGG TGCATTTGAT TTTCCAATTT TAATTGAAGG CGGGGCTTAT 6360 
TGGGGATCAA CCATGATTAC CTTTCAAAAT CGGGCTTACA CGATGGGGGA TTTAGGCAAA 6420 
GAAGTTCGGC CAATTGAAAT TTATCCTACG GTTGAAGGAT TAAAAGTCAA ACCGACCATT 6480 
ATTTTAACAG GAACCGGACG TGGTGTTTGG ATTAAAACAC GGAACGATAC AATTAACTTA 6540 
GGAGACTTTG ATCGTTCGGA AATTATTATC GATACTGAAA ATTTTTATCT GACaAAAAAT 6600 
GGTGCACCGA TGATTC6ACC AATGAACGAT TTTTATCTAT ATCCCAATGA ACCGCTGTAT 6660 
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ATTCAAGCCA AAGATAGCGA CTTCCGCTTG ACGATTCGCT ATCCTAACCG ATTTGTGTAG 6720 

GAGGGTGATT AAATGTTAAT GGCGCTGGAT TTGAAAAGAA CATATACGGC AATCTTGGAT 6780 

AATGCCTATC AAGTCAGTTA TGAAAAAATA GAGAACAAAA TTGGCAGTTT AGATTTTACC 6840 

ATGCCACTAG ATGATCCTAA AAATGAATTT ATTGCAGAAA TGCAATGGGT GGAACTGACC 6900 

GACAATGAGA ATGAATAVAT TGGTTTATAT CGCGTGATGC CAACCACAAT TAAGAAAGAT 6960 

GCGAACAATA ATCAAATTCA TTACTCTGCC ACAGAAGCAT TATGTACCTT AGGTGATACT 7020 

GTCCTTTTTG GTTGTCACGA AATTAAAAAC AAAACAACGA AAGAGGCCAT TCAATTTTTA 7080 

TTGAATAAAC AAAAAACAAA GCATTGGGTC CTAAAAAAAT GTGATTTTTC AAGGAAATTA 7140 

ACCTATAAAT GGGAAAATGA AAATGGGTTA GTTGAGCCTT TATTTAGCAT CCCAGCCGAT 7200 

TTCGAAGaGG AATATCTTTG GCAATGGAAT ACAGAGGTCT ATCCTTTTGA ACTTTCATTA 7260 

GTCAAACCGC CAACAGAACC AGTTGCGCGA ATTCAAGAAG GTTACAACAT GCAAGGATTT 7320 

GAAATAGAAC GTAATCCCAA GATGCTAATC AATCGGATTT ATCCATTAGG TTCAGGCGAA 7380 

GGTGTTAACA AAGTCAATAT TCGCTCGGTC AATCAAGGGG TTCCGTATTT AGAGAACAAG 7440 

GCCGCAATTG ACCGCTATGG TTTATTGGAG TCTUVTTTGGG TGGAACAGCG TTTTTCTGAT 7500 

CCCAAGGCAT TAAAGGAAAA TGCTTTGCGA ATGTTAGAAG AATGGACCAA GCCACAAGTT 7560 

TCTTGGGTAG TGACTGCAGC TGATTTAATT AAATTAACAG ATCAACCTTT GGCAATCGAT 7620 

CGTTTGCGGT TGGGCACGGT TATCATGATT AATACGAATG AATTTGGGAG TGTCAACCTT 7680 

CGTATTAAAA AAGAAAl=J>A AAAAGATGTC TTTGGTGCCC CCCAAGACAT TCAGCTAGAG 7740 

TTGGGAAACC TGCAAGAAAC AATTCATAGT ACCATGACAG CTTTCAGTCG GAAACAAGAG 7800 

ATTAACGAAA CTTACGCACA AGGGGCGACG ACACTTTTM ATCGTTCAAT ACAAGGAGAA 7860 

CTTAGCAAGA CACAGCCAGT GGAGCTGAAT TTATACTTTG ACGAGGACAT TCTTTATATA 7920 

AACACCGCAG AATTAACGTT CAAGGCAACT GCTAAAGGAC CTTCGCATTC TGTAACGAAC 7980 

ATTGATTTGG TAGTGGATGG CAAAAAATTA CCCCAACTAT CATTGCAACA ACAACGGCTA 8040 

AACATTTTGA GTTATTTACG AAAAACAACA GATGAAAAAA TCGAACGCGG CAATCACACG 8100 

CTTCAATTTT TCTCTCATCA GCCACTATGG TTGGATGCTT CGGTCATCTG TCGTGTGTAT 8160 

ATTCAATCCC AATTGGGTGG CCAGTTTTAA TAAAATAATG AAAACTAGAG GAGTGTGACG 8220 

AAATGTCAGT AGAACATATT GAAGAATTAG ATACCCTGAA TCAAGGTCGC CTTAAAATCA 8280 

ATGCAATCTT GGATCAGTCG AATGCATCAG CTGAGAAAGT AGATGCTTAC CAAGTCCAGT 8340 

TAACGAATGG AATTTCTGAA GCGAAAAACA TAGCAGATGA AGCTGGCAAA GAAGCCGTAC 8400 

AAATTGCCAC CGATGCAGGC AATCAAGCraA aTGAAACAGC CAACCAAGCG aTGaACaATG 8460 

CCAAAACAGC-AaTCATGATT GCAGGaAATG CAGTTTCAAC GGCAAATAAT AATAAACAAG 8520 

AATTTGATAC TTTACGAAAT GATTTCGATC AATTAGTAGC AGAAGCGGGT GATAGTAATC 8580 

CAGAAATTGT CCAAGCAC6T ACAGATACAC AAGGCATCAA ACAAGCTACC TTAGCGAATC 8640 
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GTCTTCAAAT TGATTTGAAT GACCGTATGA CAAAAGCAGA CGGTATTTCT TTATTGGCTA 8700 

AGCCAACTAC TGTCAAAATG AAGTTAGACT TTAACGGTAA AACGGCCGGC AATACAGCCA 8760 

CCAATGCAAA CAGTTATTCC ACTGATTTTA CGGCAAAAAT TCTTAAGAAG CCAACAGAAG 8820 

TTTGGGAGGA AGTTTCCCAA GCGGACTACA ATAAAATGGC CAGCCGTGAT GATGAGGGCG 8880 

TGAAAACAGG TTCCACCCAA AGCGGTGTGA TTCCGCAACA ATTAGCGGCC TTCAATCTCG 8940 

TTGAA6CCGC TAAAAAATTA ATTCCACAAA TGTTTGAAAC AGTCACAACT GACGAGGCGG 9000 

TGGCATTTAT TCGCCAGAAC GTrCAATTTT TTAC6ATTAA TCAACGT6TG AAAGCCGCTG 9060 

CGCCCAATAA TCAAACG.nTT AAAATCGCTA CGTATTTACC AACTACGGAT AATTGGGTAA 9120 

CTCAAATCCA AGAATCAGCA AAA6AGTTTG GCGATTTTTC AATTCAAATC AATGATCAGA 9180 

ATTTTATCAC AGATGAAGGT TTCATTTATT TAATGAGCTA TACAGATTCA TCGAATGGGG 9240 

TAACGCCAGC TAGCTTAGAA GTTGATTACG TGGGGCTTCA TATTG6TCTG TCTGTTGATG 9300 

CCCAAGCGGT TTTAGCGAAG AGTGGTTTTG TTCAAGCA6A GCAACTCAAT ACCCATATGG 9360 

AAAATCAAGA TAACCCGCAC CAAGTAACCG CTGAACAAGT GGGGCTAGGC AATGTAGAAA 9420 

ATTATGGnTT CGCATCAGAC AGCGAAGCAG TCGCGGGAAC TTTAACGAGT AAATATATGC 9480 

ACCCGAAAAA CGTTGCGGAA GCGATTAAAG GTCAAGCTGT GACACAAACA GGTGATCAAG 9540 

AGATTGCTGG GGTGAAGAAT TTTGTAACTA TGCCAACCGT CAATGGTGTG CCTTtGGAAT 9600 

CCTCTAGAAT GGCCATTTAT GAAGCTAnTG GAGTTGGTGA AGyCGAGGCA AAATATCAGg 9660 

CGGCCTTTAA TAAGGAtAAT ATGAAATTTG TATTAATTAG GGTAGGAAAT CGTGTCGATG 9720 

CATTTGTAAG ATGTAATTTG AGTGATCCaA CGAAATTGAA TAATAATTTG GTTAAAGTGT 9780 

TTACTGTTCC AACAGGATAT ACATTaTCGA CGAAGATTAC AAAGGGAATA TGGAATyTGG 9840 

CGTTAACTGC TATGCAATAT ACATTCCCTC AACCGAATTG TGCTGGTTTA TATGAGATGG 9900 

GAAATCAAGG AATTCTTTTT GGTGCTAACC GTGCTGGAAA TATTTACCTA CAAGGAAGTT 9960 

6GTACACGGA CGATCCGTTT CCAACAAAAT AACAAGCTAG TTTAGGAGTC TTTTTATG6A 10020 

ACGTTATCTC AACACAATAA CAATGCTTTT AAGCATTTTC GGTGGGATTG TCGTACGTTT 10080 

ATTA6GCG6A TTAGATCAAT TGTTGGATGT CTTCCTCTTT TTAATTATTG TCGATTTCAT 10140 

CACAGGTTGG ATTAAGGCAA TCGCCACAAA AGAATTGTCC AGTCGGATTG GTATGCTCGG 10200 

AATTGCGAAA AAAGTGACGA TGTTATTTGT GGTTGCCGTA GCGGTTCGTG TT6AAAAAGT 10260 

TGTGGGGAAC AATTTGCCAA TTCGGGAAAT GGTTCTGATT TTTTACATTG CGAACGAAGG 10320 

ACTTTCTTTT TTTGAAAACA TTGCGACCTT TATTCCTATG CC6AAAAAGT TAAAAGAGTT 10380 

ATTTATTCAG TTAAAAAATA AAGATGATTA AGTA6AAGTG GTCGGGACAA ACGTAGAACT 10440 

TTCGcTGATT.GCCGAAGAAA TTACTTCTGT CCCGCCATTT ATCTGCAGGT TTAAGCCGTG 10500 

GAAGGGAAGT TATTTTGACT TTCCTTTCAT GGCTTTTTTA AGAAAGGAGC ATGCTATGTT 10560 

TAAAAAATTA ATGATTCAAC TTGCTTTAGT GATTGGTTTA AGTTTAACGA TTCCGATGAC 10620 
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ACJTATTTTC 

CGGAACCTAG 

AATGCTTATG 

GGCCAAATTC 

TTGGCTCGAA 

GCCCGTGATT 

GGCaTAGTCa 

GGtTATTTAG 

TCyGrAACAg 

AGTAGGACAA 

ACATCTATnA 



GGctTrCGCT TACACCATCG AAGCGGATCC AATCAACTTT 
AAGCAATGAA TTAATtGTTX - tACATGAATC TGGAAACGAG 
TTTAGACAAT GAAGTGGCCT ATATGAAACG AAATTGGTCA 
TGTCGGATCT GGTGGACGAG TGAAaCAATT AGCTCCTGCT 
AGgTTCTTTA GCTAATCAAA AAGCCTATGC GCAAATCGAA 
GGCGACATTT AAAAAAGATT ATGCTGCCTA TGTTAATTTG 
CATTGGTGCT GaTTTTTCTC TGGaCGATGG AACAGGTTAT 
GaTTACAAAA AATTGGTGGG GAGATCATAC AGATCCTTAT 
ATtAGTAAAG CGCAtTGGcA CmAGATTTaC aAcGGGCGTT 

TCATTATTCA GCCAGGTAAA CCTAATGCGC CAAAATATCA 

tCACTTCAAT CTATCCAACA CCAGATGCTT TAATCAATGA 

TTTGGACACA AGTAGGAACA ATTACAG 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2866 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
AAAGCACAAA TGCAACAAGT TGTGATAGCT ATTCAAAATA CGGAAGCCTT 
AATGTAATGA AGGTTATAGG AGATGAGTAA ACATGTATGA AGGTTTGTTA 
AAGCGTATTT GCCCGTAACA GAAAAAACAC CAATGATTTC TTTAGCCGAA 
CACTGATTCC TTTACCTAAT TTATCCAAAG AATTAGGTAT TCAATTATAT 
AAGGCTTAAA TCCCACTGGC TCATTCAAAG ATCGCGGGAT GGTGATGGCT 
CGGTAGAAGA AGGTGCCAAA GCGATTGTTT GTGCTTCTAC AGGAAATACT 
CTGCTGCGTA TGCCACAAGA GCAGGCATTA AAGCGTATGT AGTGATTCCA 
TCGCACTGGG AAAATTGGCG CAAgcTATTA TGTATGGGGC GGACATTATT 
GTAACTTTGA TGAAGCATTA AAAGCGGTGC GAGAGATTGC GAAAACTGAG 
TAGTTAATTC AGTGAATCCT TATCGCTTAG AAGGGCAAAA AACAGCAGCT 
GTGAACAATT AGGTCAAGCG CCCGATGTCT TAGCGATTCC AGTTGGAAAT 
TTTCCGCTTA CTGGAAAGGC TTTAAAGAAT GGCATGAAAA ACAAGGAACA 
GAATGCATGG TTTTGAAGCA GAAGGT6CCG CAGCGATCGT GAAAGGCCAA 
AACCTGAAAC CGTGGCAACA GCGATCCGGA TTGGGAATCC TGCTAGTTGG 
AACAGGCACG GGATGAATCC GGTGGTTTCA TTGATGCCGT GACAGATCAA 
CGGCTTACCG AAAAATTGCT GCTCAAGATG GGGTTTTCAT TGAACCTGGT 



CCGGCTCTGC 
GACCACACAG 
TCTCATATTT 
AATATGGCGC 
CGAATAATGC 
TGGCTCAGAA 
CTCATGATTG 
CGCGTgGGGG 
GTGaGACTGG 
GCAATTCGTT 
GCAGAGGCAC 



TCAGTTATTG 
AAACAGTATC 
GGAAATACGC 
GGAAAATACG 
GTAGCAAAAG 
AGTGCCGCCG 
GAAGGGAAAA 
TCGATTCCAG 
GCAGTTGCGC 
TTTGAAGTTT 
GCCGGCAATA 
ACTTTACCAA 
GTGATTGAGC 
CAATTAGCAG 
GAAATTTTAA 
TCGGCTGCTT 



10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11307 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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CATTAGCTGG AGTTATTCAA CATGTTAAAA GCGGTAAAAT CaXaGCAGGT GAAACCGTCG 
TGGCAGTCTT TACTGGTAAC GGACTGAAAG ATCCTGATAC "AGCAATGGAG"ACAGAAGTAG 
CTATTTCAAA AATGAGTGAC GTGGAAGaAA TGCGTCTGcA CTTACgAAAA GGAGTCGCAA 
CACTATGaAA ATAAGAGTAC CTGCCaCtAG TGCCaACTTA GGACCTGGCT TTGATTCATG 
TGGCATTGCA TTATCCGCTT ATTTAACAAT CAACGTTTTA GGTGAAAGTG AATTTTGGGA 
AATTCAACAT ACTCTGGGAG AAGAGATTTC TACTAATGAA GAGAATTTAT TAATTCAAAC 
GGCATTGAAA ATTGCACCAG AATTAACACC AAAAGTTATT CGGATGGTTT CTGACATTCC 
CCTTGCAAGA GGACTAGGCA GTAGCTCAAG TGTGATTGTA GCAGGCATTG AATTAGCTAA 
CCGACTAGCA CATTTAAATT TGTCCCCTAA AGAAAAAGTC CGTTTGGCCA CAGAGATGGA 
AGGACATCCG GATAATGTGG CCCCAGCTAT TTTAGGAGAT TTCGTGGTGG CTAGCCATGT 
AGAAAACCAA GTCTACCATG TCAAACATCA . CTTTcCAATG tGTGATGtCA TTGcATTTAT 
TCCAGAGGAA CCATTATTTA CCGAAAAAAG TCGCGCTGTT TTGCCCGAAA AATTGGCTTA 
TAAAGAAGCG GTCGCGGCCA GCTCAATTGC CAATGTTATG ATTGCCGCAA TTTTAAACGG 
GGATTTACCT TTAGCAGGCA AAATGATGGA ACAAGATAAA TGGCATGAAA CGTATCGTCG 
TTCATTGGTT CCTCACTTAA AAGAAATTCG TCGTTTAACT CAGCAAAAAG GTGCTTATGG 
CTCTTTCTTA AGCGGTGCTG GTCCAACTGT GCTAATCTTG TCGCCAGAAG AACGAACCAA 
CGAAATTGTT CAAAGCTrGG AGAAATTATC TACCAAAGCG AGCATCCAAA TTTTCAATAT 
TGATCAAGAA GGTGTTCAAG TCTTTTAAAA AAAACGAGAA AAAATCACTA GTGATTTTTT 
CTCGTTTTTT TTAATCCTCG GATGTTTCCG TGGAATAATA ACTATGATCT TCTAAGTTCA 
AAGCATTAAG AATCATTGAG GCAGTTTCTT CAATTGAAAG AGAAGCGACA TCAATAACGA 
TAC/yiCCTAA TTTTTGATAA AGATCATTAG CAAAAGCCAG TTCTCGCCGA ATTTTTTCAA 
TATCTGAATA AGAAGTATTT TCAGGCAATC CATATGCGCG CATTCGTTCT TTACGAATCC 
CGTTTAAAAC ATTGGGATCA TTGGTTAAGC CAACAATTTT TTTAGGGTCC ATTTCGAATA 
ATTGTTTCGG TAAATGTGCT TCAGGAATTA GTGGCAAATT AGCAACTTTT AAGTTCTTGT 
TGGCTAAAAA GAGACTTAGC GGTGTTTTAG AAGTTCGGGA AACGCCAAGA AGTAACACAT 
CCGCTTCTAA AAAGCCTCGT GGGTCCTTGC CATCGTCATA TTTGACAGCA AATTCCATTG 
CTTCAATTCG TTTAAAATAA TTTTCATTCA GATGATGTTU^ GGCACCTGGT TCCCGCATGG 
GCGCGACACC TGTTAATCGT TCGATTTCCG CAACTGGTGG TGTCAAAATG TCAAAATGGA 
ACAATTGATG CTCATTGAAA AATTGATTAG CAATCGCCAC GAGATGATCA TTAATGATms 
tATGCAGTAC CATGGCCTCT GCTTTTTTGG CATCTTCTAA CGCTTGCACC AGTTTATCTT 
CGTCTTTAAC GAATGTTCGT CTAATCAAAG TAAAATCAAC AGTGGGGTAC TGGGCCATTG 
AAGCAGCCGC CAATTTTGAA GCGGTCTCGC CAGCAGAATC TGAGAT 
(2) INFORMATION FOR SEQ ID NO: 51: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ^ ID NO: 51: 



GATTTnAAAn nAATTAATTG AAAATACCCG CTGsCAtGaC ACGGGAAGGG 


aTTACGAAAG 


60 


AAATTCCAAT GGACGAAATT GTTCCAGGAG ACATTGTAAC TTTAGCTACA 


GGAGaCATGA 


120 


TTCCTGCAGA TGCCGTGTTG ATTTGGACCa AAGATTTATT CGTTAATCAA TCgTCGCTAA 


180 


CAGGCGAATC CATGcCyswC GAAAAATTTG TGGATGCAGG TGTGGATCGT 


CAACAAACAG 


240 


AGGTTTCTGC ATTAGATATG CAAGACCTAG TGTTTATGGG CACAGATGTC 


TTGAGTGGAC 


300 


AAGGAAAAGC GATTATTTTA AAAACTGGGC AACATACATT TTTTGGTGAT 


ATTGCTAAAA 


360 


ATGCTACAAC ACAACGAGGA AAAACTAGTT TTGACTTAGG CTTAGCAAAA 


GTCAGTAAGT 


420 


TTTTATTACG AATGGTAATG ATTTTGTTTC CAATCGTTTT CTTGATTAAT 


GGTTTAACTA 


480 


AAGGTGCTTG GGGCGAAGCG TTTTTCTTCG CGATTGCCGT TGCAGTTGGT 


TTGACACCAG 


540 


AAATGTTGCC AATGATTGTT ACGAGTAACT TAGCCAAGGG TGCATTATCC 


TTATCGAAAC 


600 


ATAAAGTCAT TGTTAAAGAA TTAGCGGCTA TCCAAAATTT AGGTGGTATG 


GATGTGCTCT 


660 


GTACAGATAA GACTGGAACG ATTACCGAAG ATCGTGTAGT TTTGGTTCAA 


CATTTGAATC 


720 


CATTAGGTGA TTTGAGTGAC GAGGTTCTAA ATTTAGCCTA CTTAAATTCT 


TCTTATCAAA 


780 


CAGGTTGGAA AAATTTAATG GATATTGCCG TTATTAATTT TtATGAAGAG 


CATCAATGGA 


840 


AAACGCCGTT TAAAAATGTC ACTAAAATTG ATGAAATTCC TTTTGATTTT 


TCTAGACGCC 


900 


GCTTAACGGT TGTCGTGAAT GCAGATGATC ATCAATTGAT GATTACGAAA 


GGCGCCGTGG 


960 


AAGAAATGGA AGAAGTGTGC ACGCATGCGC AAATTAATGG CGAAATTGTA 


CCATTAAGCA 


1020 


CTGCTGTACG AGAAGAACTA CGCCGAGTGA ATGTTCAAAT GAACAAACAA GGGATGCGTG 


1080 


TGTTAGCGGT AGCTGTAAAA AAAGATGTCC ATAAAGAAGC AGTTTATTCA 


GTTGAAGATG 


1140 


AAAAAGAGAT GACATTGATT GGGTTTATGG GTTTTCTTGA TCCAGCAAAA 


GAGTCAGCCG 


1200 


TCAGCGCTAT CCGCTCCTTA CATGAACATG GTGTCAACGT GAAAGTGCTA 


ACGGGCGATA 


1260 


ATGACATTGT CGCCAAAAAA GTCTGTAAAG ATGTAGGCAT TGAAGTTTCC 


CATGTGTTAT 


1320 


TAGGTTCTCA AATTGAAGCA ATGACAsnTG AAGAGTTGCG GGCGCAAGTA GAAGAAACTA 


1380 


ATTTATTTGC CAAATTAAAT CCGATGCAAA ATCTAAAATT ATTGAATTGC 


TACAAGCAAA 


1440 


AGGTCATACG GTTGGTTTTA TGGGGGATGG CATTAATGAT GCACCGGCAT 


TACGAAAAGC 


1500 


TGATGTGGGC 'ATTTCAGTAG ATACTGCAGC TGATATTACT AAAGATGCTA 


GCTCGATTAT 


1560 


TTTACTGGAA AAAAGTTTAA ATGTACTGGA ATCTGGTGTG ATTGAAGGAC 


GAAAAGTATT 


1620 


TAGCAATATG ATGAAATATA TTAAAATTAC CATCAGTTCT AAT 




1663 
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(2) INFORMATION FOR SEQ ID NO: 52:- 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
CTTAATGGnG ATGGnAAGGG GATATTATTG GTTATTTTTT GAGnATATAT GATATCCTTT 60 

ATTATGTAGG AATCnTATTA TATGGsAGGm ACGAGCGATG AAAGTTATTG TTTTAGGATC 120 

ATCACATGGA GGTTATGAAG CGGTAGAGGA ATTACTAAAT CTTCATCCTG ATGCAGAAAT 180 

TCAATGGTAT GAGAAAGGTG ATTTTATCTC ATTCTTGTCT TGTGGCATGC AGTTGTACCT 240 

AGAAGGAAAA GT6AAAGATG TTAATTCTGT TCGCTATATG ACTGGCGAAA AAATGGAGAG 300 

CCGTGGTGTA AATGTCTTTT CTAATACTGA AATTACAGCG ATTCAACCAA AA6AACATCA 360 

AGTGACAGTG AAAGATTTA6 T6TCAG6TGA AGAACGTGTT GAAAATTATG ATAAATTAAT 420 

CATCAGTCCC GGAGCTGTCC CATTTGAATT AGATATTCCA GGTAAAGATT TGGATAATAT 480 

TTACTTGATG CGTGGTCGTC AATGGGCCAT TAAATTAAAA CAAAAAACAG TAGATCCAGA 540 

AGTCAATAAT GTGGTTGTGA TTGGTAGTGG TTATATTGGG ATTGAAGCTG CCGAAGCATT 600 

TGCAAAAGCC GGCAAAAAGG TTACTGTTAT TGACATTTTA GATCGTCCAT TAGGGGTATA 660 

TCTAGATAAA GAATTTACAG ATGTTTTAAC AGAAGAGATG GAAGCTAATA ATATTACCAT 720 

TGCAACTGGT GAAACAGTTG AACGTTACGA AGGCGACGGT CGTGTGCAAA AAATCGTTAC 780 
AGATAAAAAT GCGTACGATG CTGATTTGGT CGTTGTAGCG GTTGGTGTCC GTCCAAACAC 840 
TGCTTGGTTA AAAGGTACCT TGGAATTACA TCCGAATGGC CTAATCAAGA CGGATGAATA 900 
CATGCGGACA AGTGAGCCGG ATGTATTTGC AGTAGGGGAT GCTACGTTAA TTAAATACAA 960 

TCCTGCAGAC ACAGAAGTAA ATATTGCCTT AGCAACGAAT GCTCGTAAAC AAGGTCGCTT 1020 

TGCTGTGAAA AACCTAGAGG AACCAGTTAA ACCTTTCCCT GGTGTTCAAG GATCTTCTGG 1080 

CTTGGCC6TC TTTGATTATA AATTTGCTTC AACAGGGATT AACGAAGTCA TGGCTCAAAA 1140 

ATTAGGAAAA GAAACAAAAG CGGTGACAGT AGTAGAAGAC TACTTGATGG ACTTTAATCC 1200 

AGACAAACAA AAAGCTTGGT TTAAATTAGT GTATGATCCT GAAACAACAC AAATTTTAGG 1260 

CGCTCAATTA ATGTCGAAAG CAGATTTAAC TGCAAACATT AATGCTATTT CATTAGCGAT 1320 

TCAAGCCAAA ATGACGATTG AAGACTTAGC CTATGCGGAC TTCTTCTTCC AACCAGCGTT 1380 

TGACAAACCT TGGAATATTA TTAATACAGC GGCTTTAGAA GCGGT6AAAC aAGAACGTTA 1440 

ATAGAAGAGA mAATAAAACC CATGAGTATT TATAGTACTC GTGGGTTTtA TTTTTATTTT 1500 

GTATTCAATT GATTTTTCAA CTCTTGTTGG ACACGAGTTA ATTCTTGTTC ATCCACACGT 1560 

TGATAGGAAA CACCATCTTG CATAAAACCA GTACCTTGAA GTTGGTCTTG TTTCACTTTA 1620 

CCAAAGGCAC TGCGATAATC TAAGGCAATT TTTTTCATGT CATCAAAACT TAAATCTGTC 1680 
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TTCATATTAT CAGAAACAGC TGTTAAAATT TCTTGaTAGT TGCTTACGCT GTTAAGACTT 
AAGACTTTTT GGACGATGCC TTCAATCACT TTTCTTTGAC GTTCTTGGCG GCCGTAGTCA 
CCATTAGGGT CTTCGTAACG CATTCTTGAA TAGGAGAGTG CTTGTTCACC ATCCAATGAA 
ATTTTACCAA TCGTAAAATC ATATCCGTCT TGAGAAAAAG TCAGATTATT GTTCACTTCG 
ATTCCGCCAA CCGCGTTGAC TAATTCTTTT AAACCAGCCA TATTAATTGA AACATAATGA 
TTAATAGGTA TGTTTAAATA GTTTTCAACT GTGTCCATTG CTAAAGATGC GCCACCAAZA 
GCATAGGCGT GATTCAATTT ATCTTGTTTT CCTTGACCTG GAATATCAAC ATAGGTATCG 
CGAGCAAGAC TGACTAAGGT TGTTTGCTTG TCACGAGGAT TAACTGTTGC AACAATTGTT 
GTATCCGAAC GACCTTGCTC GACACGCCCA TCATCGCCTG TATCAATCCC TAATAATAAA 
ACAGAARAAG GCTCCTTATT GTTTAAATTG ACCTGACTTT TTTTAGATCG TTCTACTGTT 
TCATAGGTTT TATCCATTGA TTTAGAAACA TCCCAATAAA GTTTTGCTCC CATtCCAaCA 
ACTGCAaGAA aXAGAmCCAa GATT^TTCCA ArGATAATGG CAAAAATTTT CTTCCCTTTT 
GACATATTTT yCCTCCAATA ATTTATCACA GGTTGTGATT TTTTTCTTAT AGGTTAATCT 
TACTAATTTT TAGTGAAGAA GTATATAATA ATTGTTGAAA GACTATAAAA CTTTAAGGTA 
TTTTGCATTA TTTTTGCTTA ATCTTTTGAA ATATCTCAAT TCTATTAAAA TATGCGAATT 
AGTATATTAC ACGTTTCCAA TGTGTTTCAT TGTTGATTAA TTAGAATTTT AGTGGTATTT 
TTGTATTTGT AATTAGTACA GATTTTAATT GGAGGTATAA CAGTGAGTAA AAAAGGATCA 
GATATCATAG TAGAAACTTT AATCAATCAC GAAGTGCCCT ATATTTTTGG GATTCCCGGT 
GCAAAAATTG ATGGGGtTTT TGATGCATTA GTtGACAAGG TCCAGAATTA ATTTTAGCTC 
GACATGAroCA AAaCGCTGCC TtTATGGCAC nAGGGaTTGG TCGTTTGACG GGAGAGCCTG 
GCTTAGTTAT TGCAACTAGT GGACCTGGTG CCAGCAATTT AGCAACTGGG TTAGTAACTG 
CGACTGCAGA AGGGGACCCT GTTTTAGCAT TGGCTGGACA GGTGAAACGG TCCGATTTGT 
CAAAATTAAC TCACCAAAGC ATGAATAATG CTGCATTATT TAGTCCAATA ACAAAATTCA 
GTACAGAAAT TCAAGATCCA GAAACATTAT CTGAAAATAT TGCCAATGCT TACCGTATCG 
CCAAAACAGC GAAAAAAGGC GCAACATTCT TATCGATCCC ACAAGATGTC ACAGATAGTC 
CGGTGACAGG CGAAgcAATT AAGCCTTTGT CGGCTCCGAA ACTGGGCCAT GCATCAGCAG 
GAACAATCGC AGCTTTAGCT GAACGAATCA AGCAAGCGAA ATTACCTGTT TTACTTTTAG 
GAATGCGGGC CTCTGCACGA AAAGTTACCG CAGCCATTCG TGAATTGGTT GCTATAACTG 
GTTTGCCAGT CGTGGAAACG TTCCAAGGAG CAGGCATTAT TTCAAAAGAA TTAGAAGAAC 
ATTTCTTTGG GCGTGTTGGT TTATTTCGAA ACCAACCTGG GGACCGTTTG TTAAAAAGAA 
GTGATTTAGT GATTGCGATT GGTTATGACC CAATTGAATA TGAGGCACGT AATTGGAACG 
CAGAAAAAGA TGCGCGCGTT ATTGTTATTG ATGAAGTTCA AATGGAAATT GATCAATATA 
TGCAACCGGA AGAAGAGCTA ATTGGTGATA TGTCTAAAAA TATTTTGAAA TTATCAGAAG 
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CATTTAGTGA 
AATTAACGAT 
TTGTTCAAAC 
GTCACTACAT 
GTAACGGCAT 
GTCCAAATAC 
AATTAGAAAC 
GTTATGATAT 
TTGGTCCAGT 
ATTCACAAGC 
TTATTGATGT 
ATCAACTTTA 
AGGCGGTTTA 
AGGTGATTTT 
TACATTGTAT 
GCCATATGCC 
CGATGAAAAT 
TAAAATTAGC 
GTATCCACCA 
GGGAACAGTT 
TCATTTGCAC 
TAAACAAGGG 
TGATGCTGAT 
AGCGGAATAA 
TCAACTTGAA 
AGTATCCTTT 
ATAGAAGTTG 
TTATTTTTAT 
GGAGACTGGT 
GTTTAATCTT 
ATTCTGAAAA 
CGAATGGCAT 
GGGCCGAACG 
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ACCAATATTA 
TAAAGAAGTA 
GCTTCAAGAA 
TTGGATGGCT 
GCAAACATTA 
ACAAATTATC 
AGCGGTTCGT 
GGTTAAATTC 
TGATTTTGTG 
AGAATTTGCC 
TCCTGTGGAT 
TTAGGAGATG 
ATGGAAAGTT 
GGAATTGGGA 
CATGCAAATC 
GCAGTTACTC 
ATTAAAGCAC 
GGTCTTTTTG 
TTTGTAGAAG 
GTAGGGTTCT 
TTTATCAGTG 
ACTGTTTCGT 
TATCGGAATA 
CTGAATAACA 
TGGGCTTATT 
TTATAATAAG 
GCAATTGTCT 
AGTAATTAGT 
TCCCTATGCG 
TTTCGGAATG 
AACAAAAGGC 
TAGCACCAGT 
AATGTGGnAG 



ACAGAAGATG 
AAAACAAGTA 
AAAACCACGA 
CGCCACTTTA 
GGTGTCGCTT 
TCTGTTTCTG 
TTAAAGCAAA 
CAAGAAGAAA 
AAGTATGCTG 
GCAGCTTTAG 
TATTCAGATA 
GAAAAATGAG 
TGATGGCAGG 
CATTAGAAGG 
AAACTGGCGA 
GTTTTCAAGA 
AAATTTTAGA 
CGAAGATGCA 
CAGCACGCAA 
TTACACCTAA 
AGGATCACCA 
GGATGGAAAC 
AAGAAATTGA 
AGGCACATTT 
GCTGTTCGTC 
CAAATAACAA 
GGTTGGCTCT 
GGTATTTTGG 
TTAATTTTAT 
TTTCGTTCAG 
TTAACTTCAA 
AACTCATTGT 
TAAACGTTTA 



CGCAAGATTA 
CAATTGAAAA 
ATGAAATGAC 
GAAGTTATGA 
TGCCTTGGGC 
GGGACGGCGG 
ATATCGTGCA 
TGAAATATAA 
AAGCTTTTGG 
AAGAAGGAAT 
ATCmAGAATT 
CGAACAATAT 
AACGGCAGAA 
CTCAAATGGC 
AATTACTATT 
AGATGGCGCA 
AAAAATTAGT 
TGTCCGTGTG 
TCAACCTGAA 
ATTATTTCAT 
ATTTGGTGGG 
AGCAGAATTG 
TATTGCAAAA 
CTCCACGscC 
GCTTTTTGTT 
TTGGAAAGGA 
ATCATGATAT 
TTATTTCCTT 
TGGTTTATCT 
TAACGATTGA 
TCGAGCGAAT 
TCCCAAGTGG 
AGTTGGCCGT 



TTTAGAAACG 
TCGATTACAT 
AGTTACTGTT 
ACCGCGTCAT 
AATCTCAGCT 
TTTCCTATTT 
TATTATTTGG 
TCGATCCTCT 
AGCCAAAGGG 
GCAAACAGAA 
AGGTAAAaCA 
GTTTATCAAC 
ATTGGTACGT 
GAAATTATTT 
CTAGAAGGCG 
TTCCCTGTAT 
CCTAATTTTT 
GCACCTAAAC 
TTTACAGCGG 
GGTGCCTCTG 
CACATTCTTG 
CGACAGCATT 
GCTTTGTCCG 
ATTCAAGTTG 
GTTATTTTTC 
TGGCGTAATG 
TTTTGTTATC 
AATAAGAAGG 
TGCAGTGGTA 
AGAGAAAAGT 
GATTATTCCA 
ATTTCnGTTA 
GATTAATTTA 



CTTCAAGAAA . 
CCATTAGAAA 
GACGTAGGGA 
TTATTATTTA 
GCACTAGTTC 
TCGGCGCAAG 
AATGATGGTA 
GGCGTAGATT 
GTTCGAGTTC 
GGCCCAGTGA 
TTGTTACCAG 
ATGGTACGCT 
TACTCACGCA 
TATTGGACGG 
AAGAATTAAC 
CAACGGAAAC 
TTGCGGCAAT 
AAGAAAAACC 
AGAATATTCA 
CCGCAGGgTT 
ATTTTGGCAT 
TTCCAGTTCA 
CAATTGAAGA 
ACTAACTAGG 
AAAAAAAAAT 
6ACACACTTT 
ATTGTAGCAA 
CGGAGTACAA 
CATTTTGCTG 
GCGACGTTTT 
AATGGACGAA 
ATAGTCAGAC 
ATGGnCCAAC 
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AGATCCATAG GGGGTTTTGA nAA 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
AAACAGCATA GACAGCAATT GAGTTATGGC CmCgGTTtGA AGcwTwAAGG rAACgGCCaT 
CAGArGAAaT ACGrATCGsA GcmCCmCCmT TAAACsCATC ATAATCAGCA GGTAAAGTAG 
AAACTTTTTG TAAtTCaGTG AAAGaACCAT CTGTTyCATC ATATGAAAGA ACTGAAACAG 
TGCTATCTAA TTCACCAAAT A/^TAAGCCA ATGATTTTGT TGGGTGGAAC ACTAAATGAC 
GTGGACCAGT CCCTGGCTCA GTCACGAATT GAGCGATTTC AGAAAGCTTT CCATCTAAGC 
TAACATTATA CGTATACACA CGATCGGTTC CTAGATCACA GACGACTAAG CGTTGATCAG 
GCGTCAAATC TGTGTAATGA ACATGGGCAT TGTTTTGGTT TTCATGAGGA CCAGTTGGCT 
CTGTATGAAT CACTTTGTCA GCATCTTCCA AAGAACCATC TTCTAAGATA CGATACACGT 
GAACGATACC TTGGTGATAA TTTGCTGCaT AGACAAGTTG TCgGTtyTCG TCaACAGCTA 
cATAAcATGG CGGCGCACCT TCCtCaGTTA CTTTAtTCaA TAAGTTAAAA tGGTCCATTT 
CTGGTTGATA AGCGGCAGTG CCACCTTCAC CGTCAACACT AGTGACGCTA AAAAGTTCGC 
CCTTTTGACT CAGAGCAAGA TAGGTAGGAC TCGTTTCTTC TAACAAAAGT GTCGCTTCTG 
TTAAACGACC TTGTGTCGTA TCTAATGCAA TTTTATAAAT GCCTTGGCTG GCTCTTCGAG 
TATAGGTACC TAGTAAAATT TGTTCTAACA TCACAATTCC TCCAATTTGT TTCAACACAA 
TCACTTTTTA TGAAAGTGCT TTTATATAAT GTAGTATACC aTAAGAAGTC TAAGAACAAT 
AAAACTTTCT ATCTGAGATG TTCgCaTTTT AAAgtGGTTT GtGCtATTAT GaTGaGaGAC 
tGTGaTGACa GTCaTTTTTT gaTTTTTAGA GGGGTGACGA ACATGTACGA CaATACgGAT 
AATAACGGgA CCGTTCGTTG TTCATTTTGT GGGAAAACaC AAGAAGAAGT AAAAAfiAATT 
GTTGCTGGTC CGGGTGTCTA TATTTGTAAT GAATGTATTG ATCTATGTAA AGAGATTATT 
GATGAAGAAT TTTATGATGA AGCAGTCCGA GAATTAACGG ATGTGCCAAA ACCCCAAGAA 
ATTTTAAATG TTTTAAATGA ATATGTTATT GGACAAGAAC GTGCAAAACG CACACTTTCT 
GTTGCTGTTT ACAATCACTA TAAACGTGTG AATCAATCAG AAACAGCTGC AACTCAAGAT 
GACGTTGAAT TGCAAAAAAG TAACATTTGT TTAATTGGAC CAACAGGTTC AGGTAAAACT 
TTCTTGGCTC AAACGTTAGC AAAAACATTA AATGTACCTT TTGCAATCGC TGATGCGACC 
AGTTTAACAG AAGCAGGTTA TGTAGGGGAA GATGTTGAAA aCATCCTTTT AAAATTACTA 
CAATCTGCTG ACTATAATGT TGAACGTGCG GAAAAGGGCA TTATCTATAT TGATGAGATT 
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GATAAAATTG CCCGTAAGAG TGAAAATGTT TCGATTACTC GA€ATGTTTC aGGTGAGGGT 
GTTCAACAAG CCTTACTAAA AATTCTTGAA GGAACAGTTG cAAGTGTACC ACCACAAGGG 
GGCCGCAAAC ATCCACATCA AGAATTTATC CAAATTGACA CAACGAATGT TCtCTTCCAT 
CGTTGGTGGG AGCATTTTGA TGGGAATTGG AAACCATGGT TTAAAAACCC GCCTTAGGC 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11266 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1620 
1680 
1740 
1799 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
CGCCAAG¥TC GTCAATAATC GGTCAGATTG TTTTCAATCA GGAAAtcAcG 
TTCTTTCAGg TgAAAATTTT CGCCCAGCGT TCATAGCCTG CGCCnTTTtC 
TTTCGCCTGA ATGTCAATCA GGAGATTGAC CTTTCTGCCG GTTCGCTTCG 
TCCTGACGTC AAACGTCTGC GCTGGCTGGT AATTCTGGCG CGCAAGGCGG 
ATCGTCGCCC AGCCGGTTCA CCCTTGTAGG GCGATCCCAT CCGGGCAGAC 
ATTTTTGCCG TCCACAACCT CATATCCCTC AGCCCGCATC TTTGCCAGAA 
ATGATTGCAC TCTGGCAGCA GCCGGTCAAT GGCTTGCTGT AACTGCTTTT 
GCCGTACTGC GCCGCCGCCA TTTCCTTTTG GGGCCTGGCT TTTTTCTTCG 
CACCGAAAGG CCGTGGGCGC GGCACAGTTC ATCATTCATG CGTCGCAGGA 
GTTTTTCACG TTCTGGAhTT TTGCGTCACA GTTCAGGTTG GTGCTGTTAA 
GGTATGGATA TGCCGCTTAT CCGTGTGGGT GGCCACCACA AACTGATGCT 
AAATTTCATC GCCAATTCAC AGCCGATTTT GTGCGCCTCC TCCGGGCTGA 
TTTGAATGAC TGGATAATGC GGTACATGAT AATGTCATGC TTTTCTGAmC 
GGTAAGCTGG GCGTACAGTT TTTTGCTGAT TTCAAATTCT TCGGCGGCAG 
GCACATATAG GAGGAAACCA GTTCGCCGCC GTCTGTCTTT TCTTCTTTCT 
GTGGCGTTCT GCCATTGACT GCTGACGGGT GCGGTTTTTC ATAATCTCTC 
ATTAGAGATA GCCCAATGGT TCCTCCTTTC AGAAAAAAGG CTATCCATAG 
AGCATGGGTA GCCTACTTTT GCAAGCCATA TTTTGTATGC TTGATACnAA 
GATACAGCTA TGGATACGAT TTCAGCGATT AAGACCCGGA TACTCCGTTT 
CGCAACATTA CCATCAACAA ACTGGCTACG ATGTCGGCGC TGCCGCCATC 
AACATCTTGT ATGGAAAGAG TACCGATCCT AAAATTTCAA CCATCAAGAA 
GGTTTGGATA TGACGTTGAT AGATTTTTTC GATGCGCCCG AATTTGAAAA 
GAAATTAAAT AGCGTTACCG CGCCGCGTCC CGCTGCCGCT GTGCCGTCCG 
ACGTGGGTGC AGAGAAACTC GTTATAGTCC TTGCCCTGCT TCGGCGGGTT 



TTTTcGCCGC 
CGGCGGCAAG 
GCGCTGTTTT 
TCTTGGTATA 
GGAATCCGGG 
ATTCCTCATA 
TGTGgCTGAC 
GATTCTCAAT 
TAAAAGCGGA 
ATTCAATGTG 
GCCCGCCGGT 
CTTCATCGGG 
TGCTGCGCCC 
TTTCCGGGCT 
GGTCATAGTC 
GTGGTAAAAT 
CCTAAATAGT 
AGGGGGGCAG 
ATGTGCGGAG 
CTCTGTCAAA 
AATATGTGAC 
TTTAGArCCG 
TACCGCTTTA 
GGGGTAAACC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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AGCTTCACCC 
CCCTCCT^GC 
TCGTGAAGAA 
TCCCGTGGTT 
CTTTCCGCCA 
CGCTTGTCGC 
GCGCGGGCGT 
TCGTATAAAA 
TGCAAATAGC 
GGCTGTTTTT 
GTTTCCACCG 
CGCCCACCGA 
TGGGTTCTGG 
TGCAAATAAG 
GCCAAGTAAT 
GCCGCAGTTG 
TCAGAGTAGC 
CACsGGCGGG 
AGCACCGCGC 
GCCCGCAGgT 
GGGGAAACTT 
CCTTTCGCGG 
TTGCCTGACG 
CGGGTTTTCC 
TGTCCGGCCT 
CTTCTTCTTT 
GTGTGGCAAG 
TAATACGGGT 
GCTTCATGGT 
ACTCGCCCAA 
CTTCGCCCGC 
CGTCCTTGAC 
GCTTGGTTTC 
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GCCTGGAAAG 
CGGCCTTGTC 
ACTGTATCAG 
CGCCGCCGGA 
CCGCCAGCCG 
TTCCCTCCAC 
TCCCTGCTGT 
TCCCGGCTTC 
CCACAACAGC 
CTTTTATGGG 
CGCCTACAAA 
TGCCCCGGCT 
TGCAGTATTC 
AGAGCAGATC 
CACCTCCAAA 
TTTACAGTGc 
CCTGCCGCAA 
CGATCTGGTT 
CCATGTCAAG 
TGGAGGTTCC 
TGGTGAACAG 
GGTGCGCTCC 
GATAGAGGGT 
GGCGTCCTCC 
GGACAGTTCC 
TGGGAATGGC 
CCGTGCTTTT 
GACGTTGCCC 
GACGCGGAAT 
ATCCACGGTT 
CGCCTTGCGC 
CGGGTGGGCC 
CGCGATTTTC 



TTCCGCATCG 
ATTGTCGAGG 
GGCGCGGGGG 
CAGCTTTACT 
GGGGCAGTCT 
ATCCATGCGG 
GTCGCGCCCA 
CATGCACACG 
GGATGCACAT 
CGGCGCTTTT 
GTCCATGCCG 
GTTCCAGCAC 
GCGGGGGCCG 
CCATTCCTTT 
TATGAAAAAA 
TTCAAATTTC 
GTCCTCCACA 
GACGTTGCTG 
CTGCACGATG 
CAGCAATTCC 
CCCCACCGTG 
ATGCCGGAAT 
TTATAGCCAG 
AAATCAGGGC 
CGGTTAAGCT 
CGGTCGGCTT 
TGTGCGTCCA 
ACCGCATCGT 
TTGCTGCCGT 
TTTTCCGGGT 
TCCGTGTAGG 
TGCACGGTTG 
TGCGGGTAGT 



TTCTGGATTT 
CACAGACTGA 
CCGGTTCCGC 
AATGTGGCGA 
GCCTTGCCTG 
AAGCCGTCCC 
ACAAAGACGC 
CCAAGCAGTT 
CGGCTGGCCT 
TCGGGCGGTG 
CGCACCTTGA 
CATTTCCCGT 
CTTTTTTTCA 
GCTTTTGCAA 
GGGCGGCGGG 
TTCAGCAGGT 
TCCTGCTCAT 
GAAACGGAAC 
TAGCCGTCCA 
ATTTTGCGGT 
CGCCCCTCCT 
GTACCGGCGT 
CCGCCACAGC 
TTTCTTCAAC 
GGTTTAAGCG 
CTTCTTTGGC 
ACCGCTCCGT 
CAGACAGTTC 
CACACCTTAG 
CGTCCATCAG 
TCTGCCCCAT 
GCAAATCTGC 
ATTTCAGCGC 



CCTGGGTGAG 
CTTGGGTAAC 
CCAGGGATAA 
GGGAGAGCGC 
CCGGCAGACA 
GCGTCCCCCG 
AgTTCTGGTA 
CCGGGTCAAT 
GGGGCAGTCG 
GCGCACAGTA 
TCAGATAATC 
TGGAGATTTT 
GTTCCTGCGG 
TCTGCTCTTT 
AGCATCTTTC 
CGTTCACGCC 
AGAGGTTGTG 
GCAGCAATTT 
CCGCCATCTT 
CAATCGCCGC 
TTATCCTCAC 
GGGCGGTTCA 
GGGCATCGGC 
CTCCTGCTCA 
GGCGGATTTT 
GTTTTCCAGT 
CAGGCTTTCC 
GGCGGAATA6 
CTGCATGGGA 
CATACACGCT 
GACAGTCATG 
CGTCAGCGCT 
TTTGTCCTGC 



CCGTTCCATG 
GTGGGGGTGG 
ATAATGGCTG 
GTCAATAGGG 
GAAGCTGTAC 
CArGCTGGCG 
TTTGCAGCTC 
GCCGCGATCC 
GAAGGGCTTT 
GCCGCACAGG 
AAGGGCCGTG 
TAAGCTGTCA 
CTCATAGGCC 
GGTTACGCCT 
TTTTGCCCCC 
CTGCCAGATT 
GGTGCTGTTG 
TACCAGTTCC 
GCGTAGGTAA 
CTTTTCTTCC 
AGCACCGCCT 
TAGTTACGGA 
ACAACCGGCG 
TCCTCCTTTT 
TCCCGCAGTT 
TCGCTTTCCA 
AGCGCATTGT 
GTCAGGTTCT 
AAGCCGCGAA 
TTAATCACCG 
6AAAACGCAT 
TCGATTAACA 
ATTTCGTACT 



1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 
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GTTGGGAGGT GTGGTTGGCT TTGAGCATTT TCAGCTTGGA AACCTGCACA TCCAAATCCA 
^^•pT^ipTCGCG GATACGGTCG TCGCCGGTGG CAAGGGCCTT GACCTCAGCA TAGGAAAGGG 
CAGTGGCGTC CACATCCTCG GCAGAGCGGG CAGGGCTTTT AGAGGTCATG ATCTGACCGA 
TGAATTTCTG TTTGTTTTCC ACCAGCCCCC AGTTGTT^GC GTCGAATGTA CCCTTGGTGA 
CATACTTGAA CATTTTCACC GACGCATTCA TGTTGCCCTG GCGCAGAGAT CGGCCCGCCC 
GCTGTTCCAA ATCGGCAGGA CGCCAGGGGC AATCCAGGTC ATGGGACGCG ACAATGCGGT 
TCTGCACATT CGTGCCTGCG CCCATTTTTG CCGTGGAGCC GATTAAGACA CGCACCTGCC 
CCCGCCGCAC CTTGGCG7VAC AGTTCTGCCT TTTGGGTTTC CGTGTTGGCG TCGTGGATAA 
AGGCGATTTC CGCTTCGGGG ATGCCTTTGG CAATCAGCTT TTCCTTGATG TCATCATAGA 
CATTGAATGA GCCGTCCCCC TTGGGTGTGC TGAGGTCGCA AAACACTAAT TGTGTTCCTA 
AAATCTCCGT GCTGGCCTGC CATTCCTGCA CGATATTTTT CACGCAGGCG TTGACCTTGC 
TGTCCGGGTC GTCGGGCAGG AGCGGATTTT GCAGCCGCTG ATCCAGCGCC AgCTTGCGCC 
CGTCCGAAGT AATCCGCAGC ATATTGTCGA TATGGGGGTC AACCTGCCCG CCGCGCACAC 
TTTCCGCACG CTCGCCAAGT TCCTCCACCA TTTCTTTCTG GAATCCGCTG GGTTCCGTGG 
TGACGGTGAT ATATTCCGCC TTTGGCACGG GGAGATTCAG CATATCGGCG GTCTGGATGT 
CGGCGGCTTC TTTCCAAATA GAAATAAGTT CCGGCAGGTT ATAGAATTTC GCAAACCGGG 
TCTTGGAGCG GAAGCCGCTG CCCTCCGGCT TTAATTCCAT CGCCGTGACC TTTTCGCCAA 
AGTCTGCCGC CCATGCGTCG AAATGGCGGT GGCCGTTCTG CTCCAGGGTG TCAAACTGGA 
GGTAGCGCAT CATGGTATAC AATTCTACCA TTGAGTTACT CACCGGGGTT CCGGTAGCGA 
ACACGATGCC GCGCCCGCCG GTGATTTCGT CCATGTAGCG GCACTTGGCA AACATATCGC 
TGGATTTCTG CGCGTCCGTC TGGGCAATCC CCGCCACATT CCGCATTTTT GTGTGTAAAA 
ACAAGTTCTT AAAGCCGTGC GCCTCGTCCA CAAACAGACG GTCAACGCCC AATTCCTCAA 
AGGTAACAAC ACTGTCTTTT TTCTTTCCGT CCGCCAGCTT TTGCAGCTTG GCTTCCAGGT 
TTTTCTTTGT TTTCTCCATC TGCTTGATGG TGTAGCGCTC GCCGTCCTCC GCCTTTGCTT 
CGGCAATGGC GGTGACGATT TCATCAATCT GCCCCTTAAT GACGGCTTTC TGCCGTTCCG 
GGGAGAGGGG GATTTTTTCA AACTGTGAGT GTCCGATAAT TATCGCGTCA TAGTCGCCGG 
TGGCGATCCG GGCGCAGAAT TTCTTGCGGC GGGCAGGCTC AAAGTCCCGT TTCGTGGCAA 
CCAGCACCTT TGCGCCGGGG TACAGGCGCA AAAAATCGCC GCCCCATTGT TCCGTCAGGT 
GGTTGGGAAC GACGAATAGG GATTTCTGGT TCAGACCCAG CCGCTTGCCC TCCATTGCGG 
CGGCAATCAT TTCAAAGGTC TTGCCCGCGC CGACACAGTG GGCAAGCAGC GAGTTTTTCC 
CATAGAGCAT CCGCGCCACG GCGTTCTCTT GGTGGGTACG CAGGGAGATT TCCGGGTTCA 
TCCCAGCAAA ACGGATATGG GAACCGTCAT ATTCGCGTGG ACGGATGCAG TTGAATTTCT 
CGTTGTACCG CCTGCAAAGG GTTTCCCGAC GTTCCGGGTC TTTGAAAATC CAGTCCTTGA 
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ACGTTTCGCA CATGGCTTCC TGCTTCTGCT GGGCAATGGC GGTCTGCTTT TCGTTGAGGA 
CGCGCACCTC CTTTCCGTCC ACTTCTGTCT TATCAAATAT CCGTACATCC CGCTGGTTTA 
AGGTCTGCTC AAAGATTTCA TAAGCGTTGA TGCGCTTGGT GCCGTAAGTA GCATAGACGC 
GCACGTTATC CCGGTTGTCT GTGCTTTTTC CCTGCACGTT CCATTCGCCG GTGGTAGGGG 
AATATTGCAG CTTGATTTTT CTGTCCTGTA AATACACAGG TGTATGCAGC AGTTCAAACA 
TGAATTGCTG GTAATACTCC GGGTCAATCC AAGACGCGCC GATACGGACG CTGATTTCAG 
ATGCCGTCAG GTCTTTG3GC TGCACCTGTT CCAGCTTTTC GGCATTGATG GCAAACTGCG 
GGTTTTCCTC AGCGGCGGCG CGGGCAATCG CTAACTTGGC CCGGACGTTG CCGGAAAGGT 
ATTCGTCCGC CGTCTGCCAG CCCTGATGCC AGTGTGCGCC GCCGTCCTCC AGGTCAAAGG 
GGCCGGTGGC GGGGTTTTTG AAGATGATGC CTTTTAAGTC CTCCACAATC TGCGGGATTT 
TTTCAGGGCC GCCCATGAGG GATGCCATAT AGCCCAAATC CACGCAGGCT TTTTCACCGA 
TAGATACCGC CAGCGCCTCC ACAGCCGTGT CCACGCTGGT AACGGCGCGG TGGGTCTGGA 
TGGTGCGCCT GGTGAACATA TCCGCTTTAC GCTCCAGCTT CCCCTCCTCG TTGATGATTT 
CAAGAGAGCA GAGCAGCGGA TAGGAGGAAT CCTGCTCAAA GGCCAGCTTG TTGCCAGTGC 
TGTTAAGAAG CCCGTACTTT GCGCTGAACG CATCATAGAG CCGGTTTAAT TTTGCCTGCT 
CGGACTGGAT TTCTGTGTCG TCTGCGCCGG CAAGCTGTAA GTCGATCAGG CGGCGGGCGC 
TGTCCCGGAT GCCGATCATG CCCCTGACGC GCTCCGTGGG GGTTTTGCCA AGGACGGCCT 
GGGTCATGCG GGAGTTTTCC CGGAAGTACA GCTTATCATT GGATAGGGTG TAACTGAAAT 
TGCGTACAGT GGGGTCGGCG GGGATGCTTT CCAGTTCCTC GCCGTCTTGC CCGGAATCCA 
CTTCTAACAG TTCCCGGTCA GGCGGTGCGA TATGCCGGAT AGCTTCTGCA AGCTGCCCGG 
AAAGGTCTGC GCCGGGGATG GGCTGGCAGG CGGTTTCAGT TTCATTGCCG TACATACTGC 
GTCCCATTGT CATTGTCCCC AGCACCATTT CCGGGTGGTC TAAAAAGTAG CGGTTCAGGG 
GGATGCCGTC CTCTGTCTGC CCGACCTCCA CCCATTCCGG GAGTTTTTCC GGGATGCGGT 
CGCGCTTCTG GAAAAAGAGG ATGTCCGTTG TGACTTCGGT TCCGGCGTTG GCCTTGAACG 
CATTGCTGGG CAGGCGGACT GCACCCAATA AATCGGCTTT TTGTGCCAGC GCCTCCCGCG 
CCTTGCTGTT TTTCTTGTCC ATTGTGCCTT TGGTGGTGAT AAAGGCCACA ATGCCGCCGG 
GACGCACCTT GTCCAGCGTT TTGGTGATGA AATAGTCGTG AATCATCAGG TTTTGACGGT 
CGTACCGCTT GTCATGCACC TGATAGCTGC CAAAGGGGAC GTTGCCCACC GCAAGGTCAA 
AGAAATCGTC CGGGTGGTCG GTGCGCTCAA ATCCGTCTAC CGTTATATCC GCCTTTTGGT 
ATAGCTGTCT GGCGATGCGC CCGGTCAGGT CGTCCAGTTC CACACCATAT AACTTTGCTG 
CCGCCAGTTT TTCGGGCAGC ATCCCGAAAA AGTTCCCGAT GCCCATAGAC GGCTCCAAAA 
CCGTCCCCGG CGTAAAGTCC ATCTGTTCCA CCGCATTGTA 6ATTGCCTGG ATAACGGTGG 
GGCTGGTGTA GTGGGCGTTG AGAACCGTGC TTTGTGCAGA CTGGTACTCC CCAGGGGTGA 
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GCAGTTCTTT CAGTTCGGCG TATTCCTTTG CCCATTTTGG GTCGTTTGGC TCAAAGGCGT 
GGGAAATGCC GCCCCAGCCC ACATAACGGG ATAAAATGGC CTGTTCTTCT GGTGTGGCAA 
GGCGGTTTTC GGCTTCAATC TGTTTCAGGG TGCGGATGGC TTCCACGTTG TACTGGTACT 
TGGTCTTTTC GCCGCCTGCG CCTAAATTTT CATCGGTGAT GCGGAAGTTG TGGCGTTCCG 
GCTCCAGACG GAGCGGCTGT AAAATAATCT CGCTGGGGAA TACCCGGCTG TCCACTTCCT 
CCTGGGCGCG GCGGCGGGTC TGTCTTTGCG CCGGGGTTTC CGTAATCTGC CCGCCGTCAA 
TCTCCACCAG TTCCGGCGGT GCTTCCTGTT CTGGAGAAAC GTCCTTCTCT GGAAAAATTT 
CCGGCTCTGT TGGCTCCACG CCCTCTCTGG CACCTGCCAT TTCTGACCAC CTGATCCGCT 
GGTTCAGTTC GGCTTTTAAC CGTTCTTCAA TATCCGGTTC CACGGAAACC AGCATATCAT 
AGTCAAGGAA AGCAAGTTCC ATATCAATCA GTTCCCGCAG GCTGCCATAC TGCCGTGTTT 
CCACGCATTT CCCGTCTACG GACTGGCTGA TGGAGAAATC CAGCAGGTTT ACGTCCACCT 
GGACTTCTAA ACCGCCGTCC TCGGTCGTGC CATAGGCAAT CCCGATATGC TCCAAATCGG 
AAAAATCCAC ATCATCGCTG CCATATTCGG CGTAGGCAAA CTGTTCAATT AGCTGTTTTG 
CCGTATCTAA ATCGGAGGTT TCGGGGCGTT CCGGCTCCTG CATGGGTTCC GCTGTAAAAT 
GCTCCTGCTG GACTTCCTCT ACAAACTGCC GCACAAAGGG GATGCTTTCA CTGCGGAAAA 
TGGGATACCA GCCCCGCAGT TCCATATCTT GCAGGCTGAC GGTTCCGGCG TTATAGTCCA 
CGCTGTCAAT CTTCATCAGC CGCCCGTCAA TGGTCAGTTC CATGCCGACA GGGATATAAT 
CCGCCGCGCT GCGTTCCTTG ATGATTTCAT AGGGGGCATC GCCGCCGCGT TCCGTGTCAG 
GCCGCGCCAG CACCACGCTT TTTCCATGTT CCAGCAGCTT TTTTAAGACG GACTGCCAGC 
CATTGGCTCC GGTGATGGAC ACCATCCCCA GGTCGGGAAT ATCGCGGGTG ATGGTTTTCG 
TTCCCAGCGC AGGGGCGGCT TCCTCTGCGT CTGTGCCGTA AAACAGCATG GTTTTCCCTA 
CCCGGACGCC TACCAGTTTG TCAGGGTATT GGGCTTTCAG GTTCAGGTAT TCCTCCACGT 
TGGGCGTAAG GTTCAGACCG TCTGCGGCGG CTGTTTTTGG GGCTTCCTGG GTCTGCGGCG 
AACCTGTGCC GGTGGGTTCT GCGCCCGCCT TTTGCGCGGG GGATGGTACA ATATTGCTGT 
CCCCGGTTTC CGGCTCTGGA TTGACCGCAG CGGGCGCGTC CTGCACCTTT TCCGGCTCTG 
GCTCCGAGTT CCCaGGCTGC ACCCGTTCCG GCTCGTGGGG CGGGGTGGAA AATACTGGCG 
TGTCCTGGGT GGCAGAAATG GACATTTCAG TGGCAGCGGC AACCTTTTGA ATTTTTTCCC 
TGATTTCCTC CGGCATATCC GCGTCGTAAA AAGTCACGGT ACGGTCAGGG GCGATATGGG 
CAACTGTTTT GTAGTCGCCG TGTTCTTCCT CCAGACGGTT CCAGACGGTC AGGCCGTTCC 
CCATATGGCC ATAGCCTAAA TCATAGCGGT CAGCGGGGCT TTCGGCCATT ATGGGCGCAG 
CGCCTTCTGG TATGTTCGGC GTTTCCTGCG GTGGTTCCGG CTTTGATGCG ATAGGGACGG 
CCGTTTCAGA TGCGCCGGTT TCTGCCTGCG AAAGTTCCGG CTCCGGTTCA GGAGTTGCGG 
CTGGCTGTTC TGGTTCGGAC GAAAGCCCGA ACATATCTAA AGTAAGCTGC CCGTCTGCAT 
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CCTCCCGTGG GGCAGGCTGT 


TTCTTGCGGG 


AAAGGGCTTT 


TGCCGCAGCA GCAAGGCCGG 


9420 


TTGCAGGTTC AGCCGTTTGT 


CTGGCGGAAT 


CCTGCCTTTC 


CTTCTGTTTC TGTTTCTGCT 


9480 


TTTTCTGCCG TTCCTCCTGC 


AAGACGGCAT 


CAGGCAGAGG 


GGTGAATAAG GAGTATTCGC 


9540 


CCCGCAGgTA TGCGCCCATT 


TTTTGAAGCG 


TCGGCTCCAT 


CAGGGAATAG GCGCGCTCAT 


9600 


CCGGGGATAC CATTGCCAGG 


GCTTTCACCA 


TTTTTTCATA 


GTGCGCCGCG CATTGTTCCG 


9660 


GGTCATCTAA CAGCGGGCGG 


AATACCTTTT 


TTGCCGCGTC 


AACATCCCAG TCGTGGGGGA 


9720 


TTGTCTTGCT GGCATCGTTG 


GGGTCGTAGT 


ATTGGAAGGT 


ATATAGACTC CGAGCAAGCT 


9780 


GCATTTTTTC ATATTCCGGC 


AGATATGCCT 


TTTCCcTGCG 


GTTCAGATAC CGATCCGTTT 


9840 


TAATCAGTCC GGCAATCCGC 


TTTTGTACCT 


GCTTCCAGTT 


CAGCGTAACC GTATCATAAT 


9900 


TGCCACCGGA AAAATCGTCG 


GCGCGGGACA 


GCTTGATGCC 


TTTGCCGTCG TGCCATTCAT 


9960 


CATAGCCAAT GTAGCCATGC 


CCGCCAATGC 


CGTATTCGTT 


TTTCAGAAAT TCGATGCACT 


10020 


CTTTGGCATC ATGCCCCTGC 


ATAAAGTAGG 


CGTAAATACG 


GAATTTCCCC TCGGAAACGC 


10080 


CAGAGCCGCC AGTGAACATC 


CGGGTAATCT 


CGTCCTCGGT 


GACAAAAGCT CCACGGGCAG 


10140 


GCGCAAAGCC CTCCACCCCA 


TGAAATGCCG 


TCACCGGCTT 


ATCCATATCG GTCAGGCGGG 


10200 


TAAACAGTTC GGAGGGCGCG 


GGCCGCTGGC 


GGAAGCGCAG 


CAGATTCGGG TTCTCCTGAT 


10260 


GGGAGATGGC AAACTGCCGG ACTTCCTGTG CAATTTTGGC TCGTTCTGCC GGGTCTTTCA 


10320 


GGAGCGCGGC AATTTTAGCC 


GTGCTGTCCG 


GGAAACCACC 


GTACAGCGCG TCAATGGTGG 


10380 


GCAGCAAGCC CTTTTCCTTT 


GCTTCCTCGC 


TGAAATTCTG 


CCGCAGAAAC CACAGTTTTT 


10440 


CCGACAATTC CCGAAATTCA 


TTGTCACGGG 


CCGCGTCGAT 


TTTGTCCTGG CTGGCAAACT 


10500 


GGCCGCTGTT CAAAAGCTGC 


CGGATCTGTG 


CCGCCGCCTG 


CTTCCAGGTG ACAAGGGAGC 


10560 


CGCCCCCGGC -AGAGCGCCCC 


GGCGCGATGT 


GGATACCGCT 


ATGGCTGAAC CACAGCGAAT 


10620 


ACTCATGCCC GGCAATTTTA 


AGCCCCTTGC 


CGCCGCTGCC 


GTATTCCTTT TGCAGAAAAT 


10680 


CGGCGGCAGA CTGCTCCGTA 


GGGTCTTTCT 


GGAAAAAAGC 


GACGATACGC AGGATGCTGT 


10740 


CCGGCTCGTT GCCGCCGCTT 


GTGAGGGCGC 


GGCCAATGAC 


CGCATCCGTG AcCTTGCCGG 


10800 


AGGAAAcGGC GGrGCGCAGT 


TCCTGCTCGG 


TTTTCTGTGC 


CTGAGCAATA GTTTCAATCT 


10860 


GtGTTCCACA GTGGGGAATA AAGAAAACTG 


TGTAAATGCC 


GGCTTTTCAG AcGCAGAAAm 


10920 


GcGGACTGTT CGCCCGCCGC 


TTCCTGCTGG 


TTCTCCTGAT 


TTACTTGTAA ACGATCTCGT 


10980 


TCAGTACGGT TTCCTCCGCC 


TGGGCGAGAA ACAGGTTGAC 


TGCCTGCACC CATCCCATCT 


11040 


GGTCGCGGGC TTTCATCCCC TCGTmCACGC CGTTCTGTTT CTTTAATTCC GCCACGATCC 


11100 


GGTCTGCCTG TTCCTGCGCn TGCCGGTCGA TCTCCCGCAG GTGGGGTTCC AGCCGCCCGG 


11160 


TCAGCAGCAT CGAGGTGTAT 


GTCCCCTTGT 


GGTGTTCCTG 


AAGAAAGGTC TGCCGCAGCC 


11220 


TGCCGTACTT GCCCAGCGGC 


ATTtCCTCCA 


. TCTCGTCCTC 


CCCGTC 


11266 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 



AATTTTAGTG TAGTAAAGAA GTACGTGAAT 


TTGATTTTGG 


ATATTGTATT 


AGGGATTTTA 


60 


GGAGTTTTCT TATTAGTATT TATGGCTAGT 


TATTTAGTTG 


ACCTCGCTAC 


A7ATCTCATG 


120 


AAGCCAATGA CGCCGGATAA TTTTTCAGTT ATCATGCAAG 


AAATTATTTC 


ATTCTTTATG 


180 


TTATTTGAGT TTATTATGAT GATTCTTCGC 


TATATTCAAG 


AAGGACACCA 


TATTCCGATc 


240 


GTTATTgATT TTAATTgTTa ACCGCAATTT 


TACGTCAATT 


AATGGkTGCG 


CATGGCGATG 


300 


GCyTGCAGAC TTwATkGTtA wCTTTAwCAA 


TTCTTTTACT 


AGTGGTTGTG 


CTATTTATTC 


360 


TCGGGTTAAA TGGCAGCAAA TTTTAtACGC TAGGAAAAGC GAAAGTAGaA gAAGaTCACG 


420 


AGCaAGATTT TCayCATTAA TCgAACaTTVA 


TCaAAGACCG 


TCCAACGTGA 


GAATTTTTCT 


480 


CTAACGTGGG GCGGTCTTTT TAATGCATAG 


TTAAATAGTT 


AACTCAATCA 


GTTCAATTGT 


540 


TGGTAAGACA CCAAAGCGTG CGCTGATTTT 


AGTGGTTCCG 


ATTCCTGGGT 


GAACAATTAG 


600 


CGGCGTTTCA TCTTCTAAAT GATAAAGCCC 


ATGTGTATAT 


TTTTTAGCAA 


GCTGATTTTT 


660 


GTAAGTGAAA AAAGGCAAGC GAATCTGTCC 


GCCGTGACTA 


TGTCCTGCTA AAATGAGGTC 


720 


AATGTTTTTG TTGACTGCTG AGTCTGCGAC 


ATCTGGCTCA 


TGGGTTAGTA 


GCAGAGTGGG 


780 


CGTTCCAGTA GGGCGGTTGA CTAAAGCTGC 


AGCTAAGTCG 


GTTTGTCCAT 


ATAACGAATC 


840 


TTCTAAACCA GCGACGTACA AGGAGATACG 


GTGTGAGACA 


GGCAGTAATT 


TGCCCGTGtT 


900 


TTTTaGTACG GTAAAGTCGG CAGCTTCTAA 


GATTGtTTCG 


TAAAAATTGG 


TATCTAAGCC 


960 


GCTATACTCG TGGTTGCCAT AAATAGCGAA 


TTTGCCATAA 


GGGGCGTTAA 


TTTTACGGAA . 


1020 


AAGTTGACTC ACTTCATCTT GAATTTCGAT 


ACCGAAACGG 


GCATAATCAT 


CAAATAAGTC 


1080 


TCCTGTAAAA ACGACTATGT CTGGTTGTAA 


TGTATTTGTT 


TTTTCAACAA 


TCTGTTTTAA 




GGCCTTCACA GAATAATTCT TTTTTAAATG 


AATATCAGAG 


AGTTGAACAA 


TCCGTAAGTT 


1200 


TTCCCTGATT CTGTTACTTt AATTTGATTT 


TTGTGtATGA CTAAGCATTT 


AGGTTCAATA 


1260 


AAAAAACTAT AGCTTAAAAT GCCGATAATA 


ATGAAAAATA GTGCCAGAAA 


TAAATAAACG 


1320 


TTCATAATTT CCTCCTTGAA GTCGCTAAAG 


TAGTTTACTA 


CAAAAGTGTT 


AAGCAAAAAA 


1380 


TAAGGAAAGG TTAAATTTTC AGTAAACTTT 


CTCGATGAAG 


TTTTTTGGTG 


ATTCATGCTT 


1440 


TTCTGAAGTA AATTTGCTAT AATGAAAAAT 


GAGAAA^U^G 


AAAGAAGGTC 


GAACCAATGA 


1500 


TGGCAAGAGT CGAAAAGCTA CGGGAGTTAA 


TGAAAGAAAA 


TAATTTACAA 


GGATTTTTAG 


1560 


TGACAAGTCC GTATAATTTA CGTTATTTAA CCAATTTTAC TGGGACmACA GGCTTGGCAA 


1620 


TGATTACATT AGATAAAGCT TTTTTTGTAA CGGATTTTCG TTATaCGGAA CAAGCTGCGG 


1680 
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AACAAGCAAa CTGGTTTTAC TATcGTTAAA AACACAGGAC ATaTTTTTGa CGAAGTAGCA 
GATTTAGCTG AACGTTTGCA ATTAGATAAT TTAGCTTTTG AAGAAACGCA AGTTAGTTTC 
GCTGATTATA GCTTATTGGA AGAAATTTTA CCATGTGAAT TAGTTCCTGT AATGGGTTTG 
ATTGAAGAAT TAAGAGAAGT GAAAGATGAA GAAGAAGTAG CGATTATTGA AAAAGCGTGT 
GCCATCGCGG ATCAAGGATT TGCCTTTGTC TTAGAGATGA TTAAACCTGG AATGACAGAG 
ATTGAAGTGG CTAATCAATT AGATTTCTTC ATGCGTTCAA AAGGAGCCAG TGGTGTTTCT 
TTTGAGACAA TTGTGGCCAG CGGTCTACGT TCAGCGATGC CACATGGCGT TGCTAGCCaT 
AAAGTCATCG AAAAAGGTGA GTTAATCACT TTAGATTTCG GTTGTTATTA TGAAGGCTAT 
GTTTCTGATA TGACACGGAC ATTCGCTATT GGTAGTATTC AGCCAAAACT AAAAGAAATT 
TATGACATTG TTTTAGAAGC ACAATTAAAA GTGTTAGCAG AAGCTAAACC TGGGTTGACG 
GGCATTCAAT TAGATGCGAT TGCACGGGAT CATATTGCTT CCTATGGCTA TGGCGATGCG 
TTTGGTCACA GCACTGGTCA TGGCATTGGG TTAGAAATTC ATGAAGGACC AAATGTTTCA 
TTCCGAGCAG ATAAACAGTT TGTACCAGGG AATGTGATTA CGGACGAACC TGGAATTTAT 
TTGCCAGGAA TTGGTGGCGT GCGTATTGAG GACGATTTAT TAATTACTGC TGAAGGTAAT 
CGTGTATTAA CACATGCGCC AAAAGAATTA ATCATTTTAT AAAGGTTGGG AGAACATGGA 
TATCCAAGAG TATCAGCGGT TTATTAGTGC GTTTTATAAA AAACGTAACT GGTATCAATA 
TGATCCGTTT ATTCGTTCCA ACTTTTTAAC AGAAGAAGTC GGCGAAGTTT GTCGGGCCAT 
TCGGACAATT GAAATTGGCC GTGATCGATC GGATGAAACA GTTTTGGCGG ATGAAGAGAA 
TCTGGAAAAT TTAACGGAAG AGTTAGGCGA TGTGTTGGAT AATTTATTTA TTTTAGCCGA 
TAAATATGAC ATCTCTTTAG AAATGATTAT GCAAAAGCAT CGGACGAAAC TAATGGAACG 
TTATCCTGAA GAATAAAGAT GTAAAAATAA GAAAGCCAAA ATTACTTGGT CTTTCTTATT 
TTTACGTCTA AAGGCTTGAA TTAGAAATTT ATGACAAGAA ATTATCTGTT CTTTTAGTCA 
GAACTTAGCT TTTCTGGTAG CGAAATAGGC TAATTAATGA TAAACTAAGA ACGAGTATTG 
ATAAGCAAAA TTTGGACGTT CTTTCACAAC TAACTGAAGA AACCAGTGtT GTTTTAATTT 
TtAAGAATCA TCCGTTGCTT AAAAGAGAAT CCTT7VAGAGA CTTAGTGATT CATCAAATAA 
ATCGTTTATT TAACGATTTT GTGCGTAATA GATAAGGAGG ACAACCAAAT GTCAGAAGAA 
AAAAATCTTG TAATCAATGC AAACGATTCT TTAGGAGAAA TCGTGATCGC TCCTGAAGTC 
ATTGAAGTAA TTATCGGAAT CGCTGCTTCA AAAGTAGAGG GCGTTTATGG GATGCGCGGC 
ACGTTTGCCA ATAATGTGAA TGAATTATTA GGCCGTGCAG CACACGGCAA AGGTGTATAT 
TTACGAACAG AAGAAGAAGG CTTAAAAGTC GATATCTATT GTTACTTGAA CTACGGTGTT 
TCTGTACCAA AAGTTGCGTT AGAAATGCAA GACCGTGTGA AACAACAAGT CTTATTTATG 
ACAGATATTG ATTTAGTTGA AGTGAATATT CATGTTGTAG CGGTAGTACC TGAAAAAATC 
CCTCAACCAG ACTTAAATGA ATTATTCCCA GAAGATGAAG ATGGAGAAGA GAATGAGTAA 
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AACAGAATTA ACCCGTCACG AGATTCGTGA AAAAGCGCTC CAAGCCCTTT TTCCTTTAGA 3720 

TTTTAATGCT GATTTAACCA -AAGAAGATGC CATTGATTAT GCATTGGCTT ATGATAATCG 3780 

GGAAATCGTC AGTGAAGATG GCG7\AGACCT AGTCCCTACG TATTTAGATT TACTTGTCGG 3840 

CGGTGTGTGT AGCCGCAAAG CAGAACTAGA CGAAGTGATT ACGAACCATT TAGGTAATAA 3900 

CTGGTCTATG CAACGCTTAG CAAAAATAGA TATTGTTATT TTACGTTTAG CTATTTTTGA 3960 

AATGCTATAC GTTTCAGATG TGCCAAACAT TGTGGCATTA AATGAAGCCG TTGAGTTAAG 4020 

TAAAAAATAC AGCGACGACC GTTCACGTAA ATTTGTTAAT GGTGTGTTAT CAAATGTAAT 4080 

GAAAGAAATT GATTCAGAGG CGTAAGCTCT GAATCTTTTT TTACTATCTA ACTGGTGTTA 4140 

AGATACTGGG TTATATGCTA AAATAAATAC TATTAAAGAA GTAAAGAAAC AATAGAGGGA 4200 

GCGAAGGAAT GAGTACAGTA ATTAACGGTC GTGAATTAGC AGATCAAATG CAAGCAGAGA 4260 

TTCAAAAAGA TGTAGAAAAG ATGACACAAC AAGGCATCCA ACCAGGATTA GTTGTTTTAT 4320 

TAGTTGGGGA AAATCCTGCC AGCCAAACGT ATGTGAGAAA TAAAGAACGT GCAGCAGCCA 4380 

AAATTGGCAT TCTGTCAAAG GTCGAAAAAC TGCCAGAAAC TATTTCAGAA GAAGAATTAT 4 440 

TGGCTGAGAT TGACAAATAT AATCAAGATT CACGCTTTCA TGGCATTCTT GTACAACTAC 4500 

CTTTGCCAAA ACATATTGAT GAAGTU^GA TTTTATTAGC GATTGATCCC AAAAAAGATG 4560 

TAGATGGGTT CCATCCAATG AATTTAGGCC GTTTGTTTGT AGGCAAACCT GAAATGATTC 4 620 

CTTGCACGCC GTAT6GAATT ATGAAGATGT TTGAAGCTTA TGATATTGAT TTAACAGGTA 4680 

AACGTGCGGT GGTTATTGGT CGAAGCAATA TTGTTGGCAA GCCAATGGCA CAATTATTGT 4740 

TAATGAAAAA TGCGACCGTG ACGATTGCCC ATTCTAAAAC TGAACATTTA GCAGAAGTTG 4800 

CAAAAGAAGC GGATATTTTA GTTGTAGCAA TTGGACGCGG GCATTTTGTC ACCAAAGAGT 4860 

TCGTGAAACC AGGTGCGGTA GTGATTGATG TGGGCATGAA TCGCAATCAA GAAGGAAAAC 4920 

TGATTGGTGA TGTAGCATTC GATGAAGTTT CTGAAATTGC AAGCTATATC ACGCCGGTTC 4980 

CCAAAGGTGT TGGCCCAATG ACGATTACCA TGTTAATGTA TCAAACAGTC GAGGCAGCAA 5040 

AAAAACAAAA GTAAGGTGAA AAAATGACGC AACAGTATTT AACAGTAACA GCCTTAACGA 5100 

aGTATTTAAA ACGTAAATTT GATGCAGACC CTTACTTAGG ACGTGTCTAT TTAACAGGTG 5160 

AAATTTCGAA CTTTCGCTTC CGAGCGAACG CGCATCAGTA CTTCAGTTTA AAAGATGATC 5220 

ATGCGAAGAT TTCTGCAATT ATGTTTAAAT CAGCCTTTCA AAAATTGAAG TTTCAACCAA 5280 

AAG/^GGCAT GAAAGTCATG GTGGTGGGGC GTATTTCGCT TTATGAAAAC AGCGGCTCCT 534 0 

ATCAAATTTA TATTGAACAT ATGGAACCAG ATGGTGTAGG TGCACTTTAT CAAGCATTGG 5400 

CAGAATTAAG AGAAAAACTA GGAAAAGAAG GACTTTTTGA AGGGCCTAAA CAACAGTTGC 54 60 

CTCGTTATCC CAAACGAATC GCTGTGGTTA CAAGTCCTAG TGGTGCTGTT ATTCGGGATA 5520 

TCATTACGAC AGTCAAAAGA CGGTATCC6A TTGCACAACT GGTCCTTTTT CCAACCtTGG 5580 

TCCAAGGCGA GCAGGCGGCG GATGACATTG TGCGGAACAT TCAGCGAGCC GATGCGCAAA 5640 
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GCGATTTTGA TACGATGATC ATCGGCCGTG GCGGTGGTTC AATCGASGAT TTATGGC&.T 
TTAATGAAGA AAAAGTGGCT CGAGCCATTC A7GCTGCCAC CACACC3ATT M7TCATCGG 
TTGGTCACGA AACAGATGTG ftCAATTGCCG A7ATGGTAGC AGATGT7CGT GCAGCAACGC 
CAACGGCAGC AGCGGAAC7A GCGGTACCAG T7TTAAATGA AGAACTATTA AGAATTAGTG 
AACGGCGTAG TCGTTTAGAA CAAAGCTTT? TATATCTCTT ACAACAACGA ACAGAACGTT 
TCCAACGACT ACAAAATTCT TATGTGTTTA AGCAACCTGA GCSACT7TAT GAAGGTCAAA 
CGATTAAATT AGATCGAATG ACGCAACGTC T7TTCCAAGC AATGACAACA ATTCATCACC 
AAAAACAACG TCAAGCCCAA GGAATTATCG CTCAATTGCA ACAACAGACA CCAAAGGGAC 
AATTAAGAGA AAGTCAGraA CAATTGGCAr T?TTACAACG AAACTTGCAA ACACAAATGA 
CACAATTGTT CCTAAACAAA CAAAAGCAAT TCACGTCTGC TGTTCAACAA CTAGACTTAC 
TCAGCCCACT CAAAATTATG GGTCGAGGAT ATAGTTATAC TACAAAAGAG GACCGTGTCG 
TTAAAACTGT TACCGAGTTA CAGCCAGCAG ATCAATTAAC GATTCATTAT GCTGACGGTA 
CAGTTCAAGC AAACGTCGAA ACAATAACAG CGAAAAAAGA GGAGTT7TAA CATGCCAGCA 
AAAGAAAAAA CGTTTGAAGA ATCATTAAAT GCCTTAGAAG AAATCGTTCA GCGCTTAGAA 
CGAGGAGATG TTCCTTTGGA AGAAGCCTTA GCTGCCTTTC AAGAGG^T GGCATTAAGT 
AAACAATGTC AAGATACATT GGAAAAAGCG GAAAAAACTT TAACGAi^Jy^T GATGACAGAA 
AACAATGAAG AAATTGTTTT TGAAGAGAGC GAGGAAGCGT AATGACGAAT TTTAGTCAAC 
AGCATTTACC GTTGGTTGAA AAAGTCATGG TCGATTTTAT CGCAGAA?AT AC7GAAAATG 
AGCGTTTGAA AGAAGCAA7G CTCTACTCAA TACACGCAGG TGGCAAACGA TTACGCCCGC 
TATTAGT6TT AACAACAG7G GCCGCTTTTC AAAATW^^SAT GGAAACACAA GACTATCaAG 
TGGCTGCCTC TTTAGAGATG ATTCATACGT ATTCATTAAT TCATGATGAT TTACCAGCAA 
TGGACGATGA TGATTTACGT CGTGGCAAAC CAACCAATCA TAAAGTGTTT GGTGAAGCGA 
CTGCCATTTT AGCAGGGGAC GGCTTATTAA CAGGTGCATT TCAGTTGCTT TC7TTGAGTC 
AATTAGGCTT AAGTGAAAAA GrTTTACTa^V TGCAACAACT GGCAAAGGCA GCGGGGAATC 
AAGGCATGGT TTCCGGCCAA ATGGGTGATA TTGAAGGAGA AAAAGTCMC TTGACTTTAG - 
AAGAGTTAGC GGCTGTTCAT GAAAAGAAAA CCGGAGCGCT AATTGRATTC GCCTTAATTG 
CTGGGGGCGT ATTAGCCAAT CAAACAOU^B AAGTCATTGG TTTACT6ACA CAGTTTGCGC 
ATCACTATGG CTTGGCTTTT CAAATTCGTG A7GACTTRTT AGATGCCACA AGTACAGAAG 
CAGATTTAGG TAAAAAAGTA GGCCGAGATG AAGCGTTGAA TAAAAG7ACG TATCCAGCTC 
TTCTAGGGAT TGCTGGCGCG AAAGATGCGC TAACACATCA ATTAGCAGAC AAGCTGTGAC 
CGTCTCCGGG AGCTGCATGT GTCAGAGGTT T7CACCG7CA TCACCGAAAC GCGCGAGACG 
AAAGGGCCTC GT6ATACGCC TATTTTTA 
(2) INFORMATION FOR SEQ ID NO: 56: 
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(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 6682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

^^YTTTTGCA ACGTTTCCAT GAAAGAATGC GGGAAGATAT CCCTAAAAAT TGATCTTCTT 60 

ATTTCTCTAT TCA7VAATGAT TCTAGCCTAA CCTTTTTTAC TATTTCATGA TATGATGAAG 120 

CAAAGAATAT TTTAGGAGGT TTTTACATGG CTGCAAAAGA AGCAAGTTTT GATGTAnTTT 180 

CAGAGGTCAA TATGGAGGAA GTC7\AA7^CG CAATTCAAAT TGCCTTGAAA GAATTGAAAA 240 

ATCGTTTTGA CTTTAAAGGG TCAATTGCTG ATATTAAATT GGAAAACGAT AAATTAGTAG 300 

TCGTAGCTGA AGATGATTAT AAAGTCGAGC AAGTGAAGGA TATTTTATTT GGaAAATTAG 360 

TCAAACGCAA TGTTCCGATT AAAAATATTC ATTTTTCAGA AAGTGAAAAA GCCTTAGGCG 420 

GTACTGCTCG CCAATATGGT GATTTAATCA GTGGTATTGA TAAAGAAAAT GCAAAAAAAA 4 80 

TCAACACAGC TATCAAAAAC TCTGGGATTA AAGTTAAGTC ACAAATCCAA GAAGACAAAA 540 

TTCGGGTAAC TGGAAAAAGT CGGGATGATT TACAAAAAGT CATGGCTCTT TTACGAGAAT 600 

TAGATTTACC AATGGcTCTT GAGTTTAATA ATTATCGCTA AAAAAATCCT TCAACTGTTA 660 

ATGAACAGTt GAAGGaTTTT TTTGTTTtGG TACCGGCCAT TCTTGATTAT TCAATAAAaC 720 

GATTGGTCGG CGTTGGATTT TTCCTGAAAA AGCGGCAATT GCTTTGAAAA CCTTTGgCAT 780 

AAAATAAAAC GCATTtAATT CTTCTAAATC CAGTTCTTCT TCTGCCGTAA ACGAGACACA 840 

AACCTCTTGT TCTGAAAGCA TAATTTCAAT TTTCTTTAAA AATTGTVAATT GCCGCATTCC 900 

TGGCACGAAA TAACCAGCCT TTGCAACACG AGAGGTTGTA ATATCTAATT TATCACTAAT 960 

GATTAGCGCT AGTGTCATTA ACTCTTCCGA ATCAAAGCCG TTACTATGAT TTTCTATAGA 1020 

AGATAAAATT TCTTCTTGAT AAGGCAAACA AATCTTTTGc TTACGAAAAt ATGCTTCTGC 1080 

AAATtGTTTC CCACGAAGTG CATGTCCTGC TTTTCCTTGT AGCGCACCCA CATCATGGAG 1140 

CAGTGCCGCA ATTTTTGCCG CCTCTAAATA CTCCTCTGAA ACAGCAAGTT GTTTCAAAAT 1200 

CATTTCCGTC ATGGCTACGA CATTGGTTAC ATGAAACCAA TTATGGTATG CCCACCCAGC 1260 

TTCTTCATCT TCCATTTGAC CAATTGCATC ATAAATTTGT CGAATTTCAG TATCTTGATG 1320 

AATGCGTTCA AACAGCTTCA TTTATTTTCT CCTTAAAATT AGTTACTGTT CCAAAAATTC 1380 

TTTTAACTCT GCAGGATTTT TTTCGCTGTT TAAACCAACA ATTAAACGAA TCCGCGCTTT 1440 

CGGTCCGTTT AAACCACGGG CAAAAACGAC CCCCATTTTT TTCAAGCCAA CGCCACCACC 1500 

AGCATAATCA TAGATATCTT CAGCAATGCC ATTGGAACAA CGAGAAACTA AAACGACTGG 1560 

AATCCCGTTG TCTAGCATTT TTTGTAATGC TGGGAGCGTT TCTGGCGGTA AATTACCCGC 1620 

GCCTAGCGCT TCAATTACTA ACCCATCCGT TTCTGGGGTG TTTAATAATT CAAACATCCG 1680 

TTCACCCATC CCCGCATAGG CCtTCcTACG TGTACATTTC CTTTCACAGA aGAAACATCG 1740 
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CAAACTTCTT 


GCGGTAAGAC 


CTCTTTCGCA AAAAAAGCAC GTTCTTTCGC AATGGTTCCT 


1 Qon 
xouu 


ATTGGTCCAA 


AAGTTGGCGT 


CCGAAACGTC GCTACGTTCG TTGTATGTGT TTTTGTCACA 


1 Q£A 


TAACGCGCCG TATGAATTTC ATCATTCATC ACGACTAAAA CACCTTTATC GTACGATTCG 




TCTGAACAAG CAGTCCAAAT AGCACTGATG AAATTATACA AGCCATCACT ACCGATTTCG 


1980 


TTACTTGACC 


GCATTGCTCC 


TGTTAAAACA ATGGVsVsATix XiiiUK^IAA Ulai lAlAlL^X 


2040 


AAAAAATAAG 


CAGTTTCTTC 


TAATGTATCT GTTCCATGCG TAATAACQ^AU vs^UAiUAAXi 


2100 


GGTTCAGAAT 


AAGCTTTTTG 


AATCCGTTCT TTTAATTGAA ACATCCGTTv^ lAAGGlUAlG 


2160 


TGTGGTGAGG 


GAATATTAAA 


AATATCTTCA ACAACTAAAT GAACCTTGCC aGAAAATAAT 


2220 


GCTTCTTGTT 


CCATCAACGG 


ATTTTTTTCA TTCGGGGCAA CACTTCCTCC TGCTACTTTC 


2280 


GACATGGAAA 


TGGTCCCACC 


TGTATGAAGG ACTAAAATGG TCTTCATTTG TTTCCTCCTT 


2340 


TGTTTTCTTT 


ATTTTTTAAC 


CAAAATTGTT TCACTTGTTG AATGAGTTGT TCATTGGTGT 


2400 


TTGCCTTTCG 


ATGCGCCAAC 


CAATGCGGCG GAAAGAAACG CTGGACACTT GGTAATTCAA 


2460 


ATCGTTCGTC 


TAATAACAAC 


ACAAAGCCTT TATCTTGCGC ATCTCGAATG ACACGTCCAG 


2520 


CCGCTTGCAA 


GACTTTATTC 


ATGCCaGGAA TTTGATACGC AAATTGAAAC CCTTGCCCTC 


2580 


GCTCTTCTTG 


ATAAAAAGTG 


CGAATCAGTT CTTGCTCTGG GTTCATTTGT GGTAGCCCGA 


2640 


CACTAACAAT 


TATCGTACCA ATCAATCGGG TACCTTTCAA GTCAATTCCT TCTGAAAATA 


2700 


TACCACCTAA AACACAAAAA 


CCGATGAAAG TTTCTTGAGG ATTCTCTTTA AATTCAGCTA 


2760 


AAAATGCTTC 


TCGCTCrrTT 


TCGTTTAAGT CCGTTCCCTG AATTTGAGTT TTTACTTGCG 


2820 


GGTAACGTTG 


GCTAAATACT 


TGATAGACAT CATCCATATA GGCGTAGGAA GGAAAGAAAA 


2880 


TTAAATAATT ACCTGTTTTT GTTTCACTTA ACTTCCCTAA AAGTTCACTA ATCTGTGGAT 


2940 


AACTTTTTTC ACGGTTTTTA 


TAGGTTGTTT GAAGATGACT-ACCAATCATC AACAATTGAT 


3000 


TTTCTTCCGG 


AAAAGGATAC 


GGAATCCGAT AACGTAGGCT TTCTTCACCT CCACCAAGCA 


3060 


CTTCTTGATA 


ATAATTCAGA 


GGTGTTAAAC TTGCCGAAAA TAAGATACTC GCTTTCCCTT 


3120 


TATCTAATCG 


TTGTGACAAT 


AAATAAGCCG GATCGACACA AAATTGTTTC ACAACGCAGT 


JloU 


CATAGTTTCG 


CAAAGAAATA 


GACGTATAAA AGCCGTTGTC GTAATACTCG CTGATTCGAA 




GATAACTTAA 


ACAATCAAAA 


TAAAAACTGA GAATCGACGC CAATTCCTCA ATGTGTTGAT 


J JUU 


TTTCTGGCAA CCATTCTGCA ATTACTTCTG TTAATCGATG TATCTGTTTA ACTAATGAAT 


3360 


CAATTGGCGC GCGTTGGGCA 


AAAAAATCCG TGTCTTTTTC TTCACAGATT GCTTCAATAT 




CATCAAACGT 


AGTTAAGACT 


TTTTGACAAG CTCTTTTCAA TTGAACAGAT TCTTTTTGCA 


3480 


ATCTTTTTTG AATTgCAAAA CCGCATTCCT AGAAATAGCG GCCGAATACA TCTCCCGTGA 


3540 


ACGATTAACC AAATTATGGG 


TTTCATCCAC TAAAAAAATA TTCTCTTCTG cACTTTGGGG 


3600 


TTCTTCAAAG AAGCGCCGTA AATAAACCGT XGGATCAAAT AGATAATTGT AATCGCCAAT 


3660 


AATTAAATCA 


CACCAAAGAC 


TTGCATCTAG CGACAACTCG AATGGACACA ATTGGTGCTT 


3720 
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TTGTGCATAC 


CGTTCAATAA CCTCTCTCGT TAAATGATTT TCATGTTGCA ATAAATCCCA 


3780 


CAGTCCCTCA 


TTTAAACGAT CATAATAGCC ATCAGCAAAC GGGCAATGTT CTGGTGTACA 


3840 


ATTTCGTTCA 


GTTAAAAAAC AAATTTTATC TTTTGCTGTT AACGTAATAC TCTTAGTTTG 


3900 


CGCTTGTTTT 


TCGGACAGCG CTTTGACGGC ATCTTCAGCC ACTTGCCGCG TGATGGTCTT 


3960 


AGCTGTTAAA 


TAAAAAATCC GTTCACCTTC TTCTTCACCA ATTGCTTTCA TTGCTGGAAA 


4020 


AAGAGTGGAC 


ATTGTCTTGC CTGTTCCAGT TGGTGCTTCC ACGAAGAGCC GCTGTTTACT 


4080 


AAGAATTGTT 


TTATAGACGG CTACCGCCAA TTCTCGTTGC CCTTTGCGAT ACTCCCCGTA 


4140 


AGGAAAACTA AGATTTTTTA ATGATTTATT TCTCAAAATA CGCCACTCTT CTTTAAAAAT 


4200 


TACCCACTGT 


TCATAGCGCT TCATCAAATC AGTAAAAAAC TCCGCTAATG CAGCTTCTGT 


4260 


AAAATGGCGT 


TCTTGTCGTG TGATTTGTTT TGTGGTGGTC TGATAATAAG TGAGTTGAAG 


4320 


CGTTATTTCA 


GAAAGTTGTT CTTGTTGGCA ATACATGTGA CCATAACACA TCGCTTGATA 


4380 


CCAAAACAAT 


TCGACCTTTT CTGGCGGCAA TTCTTCAAAC GCTGGTTCGG ATGTTTTAAT 


4440 


TTCATCAATG 


ACCACTTGCC CAGTTTCTTT CACAAAAATG CCGTCTGCAC GACCTTCTAC 


4500 


ACAATAATTA 


TGGCCATTTA AAGGCACCTC AATTTTTAAA GATACTTCCT TTTGATAAAG 


4560 


GCTCCCCGCT 


TCCTTTTGCA ACATTCGATG AAGGCGACTA CCTTCTTGTG GTGTATGTTC 


4620 


ACTCACATGG 


CGAGAATCGA TACTTCCCCG CCTTAAAATA AATTCGACCA CTTGACGCAC 


4680 


CGCAATTTTT 


CTCGTTGATT TCACGTAATC CCACCTATCT GCCTCCCTAT TATAGAACAT 


4740 


ACATTCGCTA 


AACAAAAGCA AAAAATCCAA AACGAACACA ATCATGCGTT CATTCTGGAT 


4800 


TTCTCTTTAG 


ATTATCAACA ATTAAAAACA ACTTGTTTCT TCTAACTGTT TGGTTTCTTT 


4860 


TCCATTTGCT 


TTTGCCACGA CCAATTGCGC CATTTGTTTA AACGAATTAT TTAAACCAAT 


4920 


GCCGCCCTTT 


TTGGGCATCT TCACAATCGG TACTTGCTGA TAATAGTGAC TAACTGCTGG 


4980 


ATTTTCTTCT 


GCCATCGCTG CAATTGCAGG AATGCCTGCA GCATTAAACT TACAAGCTAG 


5040 


ACGAGCAGCC ATTTCTCCAA AATTAGGATA ATGCATGGCT GGTCCGCAAA GAACCGCATC 


5100 


GGCATGAAAC TTTTTTGCAA AGCCAATAAA TTTCTTTGCT ACCTCTTCCT CGTTTTCCAA 


5160 


ATAATATTGG 


TCACCACAAT AAAGTGTCGC AACAATTTCC GCACCTTGCT TCTGTAAAAA 


5220 


TGGTAGCAAT 


GTTACCCCTG GACCTAAGGG AGCTTTTTTA CCACTTGGTT TCAGCTGAGC 


5280 


TTGTTCATCA CTGCCAAATC CTGCTAAACT GTGGTTTAAG ACTAAAAGGA TTTTTTTCAT 


5340 


GCCTATTTCC TCCTTATTTA TAAATCATCG AAGTCAAAGG AATCAAAATC AATTTCTTCA 


5400 


CTCTCTtCTT 


CTTGTGCTTT CCATTCTTCA CGTAAATATT GGTCATCCAT AACTCGAATA 


5460 


AAAGGATTUVT 


AAATGAACAT TCCTAGTACC AATAAGAGAG CTTGCAGAAT TGATCCTTGC 


5520 


CAACCGCTGA 


CTAAAAATCC TGAGATAATT GGCGGGGTCG TCCACGGCAA TTGAATACCA 


5580 


TTCGTCAATG 


GAACTAAGCC CCAATCCATA GCAAAATAGG CAATAATAAT ATTAATTGCG 


5640 


GGTACTAAGA 


CAAATGGAAT TAAAAGGATT GGATTTAAAA CAATTGGTAA CCCAAAAATG 


5700 



wo 98/50555 PCT/US98/08985 

470 



ATTGGTTCGT 


TAATCCCAAA 


AACACCAGGA 


ATTAGTGAAA 


GTTGCGAAAG 


CTTTTT7VATT 


5760 


CGTTGaCTTT 


TACAGAATAA 


GACCATCACA 


ATGATTAATG 


AGAGTGTGGA 


aCCGGCGCCA 


5820 


CCAAATGTTG 


cAAcATATCT 


TGAAATTGTC 


CAGTGATGAT 


GTTGGGAATT 


TCATGACCGT 


5880 


CTTTAAATGC 


TTGTAAATTT 


TCAACAGACA 


ATGCCCGCAA AACAGGATTA AAGACTGCGC 


5940 


CAACCACACT 


ACCTCCGTTA 


ATGCCAAaGA 


ACCAAAAGAA 


GTGTAAAAAT 


AAATAAGCAA 


6000 


TGCTCATTGC 


ACCTAAGGTA 


TTACCAAGAC 


TCAATAAGGG 


CGTTTGTAGA 


AATTCAAAAA 


6060 


TAACATCTTG 


TAAATTACCA 


TAAGGAGTTA 


AAGCAGCCAG 


CCAATTGACT 


AAAAACATGA 


6120 


GAGTAACAAC 


TAAAAAACTC 


GGAATCAGGG 


CCTCAAAAGA 


CTTAACCACT 


GTTGGTGGCA 


6180 


CTCCTTCTGG 


CATTTTTATT 


GTCCACCCTT 


TACGAACAAT 


GGCCACAAAT 


AAACGAGTCG 


6240 


CAATTAATGC 


GACAATCATT 


CCTAAAAACA 


AGCCACTGGA ACCCATCCAT 


TTTAAAGGTA 


6300 


AACTAGTAAC 


TTCATAAACC 


TTTGAACTAT 


TTTCCGGTGT 


AAATAATGTT 


TGATAAGGCG 


6360 


TTACAATAAA 


AAATGAAACC 


AATGCGATAA 


TTCCTGCTTG 


CATTGCATCC 


ACATTGAATT 


6420 


GTTTTCCTAA 


ACTATAGGCw 


ATACCGACAA 


CGGCCAGAAT 


AGTCATTACA 


TCAAaACTAG 


6480 


CACTAGCTGG 


AACAGACATC 


ATTGTCGCCC 


AGTCTTTACC 


AAGCAAATGA 


GACATCCAAG 


6540 


TATCCCATTG 


TGGCAAAGGA 


AAATTAGCAA 


TTAATAAGAA 


GACCGAACCA ACAATCAACA 


6600 


AGGGCGTTGA 


TAAAAGGAAG 


CCATCACGAA 


TCGCTACTAA ATACTTATTJc TCTCCTATCA 


6660 


CTTTTGCCAA 


AGGGGTCAAT 


AC 








6682 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

AACTCCTAGA ACCAAGTAGG TACTGTTTAA CATCTATTTG CTTTGCAAAT AGTGTTTCTC 60 

AGCAATAAGA AGGGATTTCT GAAAATTGTT CTATAATTTT TGGAGACCAA GTAGGTATTG 120 

TTCATCATCG AATCACTACG TTCTTCGTGA TTTTCAACAA TTTCGGCTTA TTTCCGAAGA 180 

CCTTCAACTC TTAGAGGTTT AGCTCTTAGA AGTTGTTGAG AACGAAGCAA CGCGTAGTGT 240 

TGCTTCTGTT CCCGCCGTTT ATCAGTTTTT GAGCGTGAAG TAAAAATCCA AAGTGATTTT 300 

TATCCCACGC CCATTTTCAT TTATGACGTT GGCTTCGTGT AAATTGAGAA GTCTCGTCCT 360 

CTCTAGTGGT TGATTGCTGT GTTCGTTGTA CGGCTTGCTT TTCTGGCTCG CTCTTTTTTG 420 

CAACATCAGT CACAACGTTT GGCTTCTGAG AAGGGGGAAC AGTTGTTTTT TTGTTTAAAG 480 

TTGGCGTAGT TCTTTTTGTT TCTTTCTTTT GTTTCTTTTT ATTATTTTTA TCATTTCCAG 540 

TGACTGTCAA ATAGATAATC AGAAAAATCA TTAGGATCGT TGCAACAATT AATAGAATAA 600 
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ACATGTTAAA TGAAATGAAC ATGCATGGCC CTCGTTTCTC GTTTTTTCTt AGTTTACCAA 660 

AATTtCAAAA TTTCGTCACT CTTTTCTtCA GAAAAAACGA ATTTGAAAGG GGTTAAAAGA 720 

AACTTTTATA AAAAAGCCTC CTTTTTTGTG TGAAAATGCA CAAAATTTCT GGaATCGTTT 780 
GCTGTTAAAA CCAGATACTG GTAAAATAAT TATTGTATAA GaATCaTTTT AAAAAACGAT 840 
TTTTGTTACA AATAACGTAT TTGTTGAAAA GGcACTGTGA AGTAGTTAGA GACGACAGCC 900 
aAGTGACTTT TAATGACAGC CGCCCAATTA ATTTCATGGA GCCTTTTCGT AGAAAACGAG 960 

GAGGTTATTT TGCGTGA.WG TGAGTATATT TTCGACATGC GTGGTGGATT TACTTTTTCC 1020 

7UUVTGTAGGG CAAGCAATGG TAGAGGTTTT GGAACGTTAT GGCTGTGAAA CGGTTTTACC 1080 

AACAAGCCAA ACGTGTTGTG GTCAGCCAAC GTATAATAGT GGCTATGTGA CAGAGAGCAC 1140 

GACAACGCTT AAGAATCAAA TTGATAGTTT TGACGGTGCC GATTATGTGG TGGGACCTGC 1200 

GGGGTCTTGT GTCGGCATGA TGAAAGAATA CCACAAATTT TTAGCAGATG ATCCTATTTA 1260 

TGGACCAAAA GCACAACGTT TAGCTGAGCG AACTTACGAG TTTAGTCAGT TTCTTTATCG 1320 

TGTCTTAGGG GTCAAAGATG TTGGCGCAAC GTTACATGGG AAAGCCACTT ATCATCGTTC 1380 

TTGTCATATG ACGCGAATTT TAAATGAACG AGAATCACCC TTTGTTTTGT TGGATCATGT 1440 

GAAAGGATTA GAAATGATTC CTTTAGGACA TCTTGAAAAT TGTTGCGGAT TTGGCGGAAC 1500 

GTTTTCAGTG AAAATGCCAG CTATTTCTGA ACAAATGGTG ACGGAAAAAA TGAACGATGT 1560 

GATTGATACA GGCGCAGAAA TTTTAATTAG TGCGGATATG GGTTGCCTGA TGAATATCGG 1620 

GGGTAAATTC AATCGTGATG GGAAAAAGAT TAAGATTATG CATATTGCAG AAGTCCTTAA 1680 

TCACGAAGTG GATGAAGCAC GCATGGACCA ACCACAAATC ATATCGGTAG GGTAAGGGGG 1740 

AAGAACTGTG GGATTA^CA CGAGCAATAA GCCGTTAAGT GAACGAATTG AAGAAAGTAA 1800 

AAAAGATGTG TTTATGCAAA AGGCTGTCGC TAAAGCACAA GATGCCCAAT GGGAGAAACG 1860 

AGAAGGTGCG CGGGAAGCTT TAGGCAATTG GCCACAATGG CGTGAACTGG GAGAACAAAT 1920 

TCGCCAACAT ACCATTCAGT ATCTGCCTGA CTATTTAGAA GAGTTCAGTG ATAACGTTGC 1980 

CAAACGAGGT GGCAAAGTCT TTTTTGCTCA AACCGCCGAA GAGGCCAATG AATATGTGAA 2040 
ACAAGTGGTT CTAGAAAAAA AGGCGAAAAA AATTGTGAAA TCTAAATCAA TGGTGACGAC " 2100 

CGAAGTCGAT ATTGATCC3\A TGTTATTGGG ATTAGACGAT GTCAGTGTGA TGGAAACAGA 2160 

CTTGGCGGAA TTCATTTTAC AAATGGATGA CTGGGATGAG CCGTCACATA TTGTTTTCCC 2220 

AAGTATTCAT AAAAATCGCG AACAGATTCG TCAGGTTTTT GCGAAAAAAT TAGGGTATCA 2280 

AGGAGATAAT GACCCGGTAA ATTTGGCACG CTGTGCACGA GAAGTGATGC GCAAATTCTT 2340 
TTTGGAAGCG GAGATTGGCA TTACAGGTTG TAATTTCGCT ATTGCGGATA GCGGCTTAAT 2400 
TAACTTAAAT ACAAATGAAG GAAATGCTGA TTTGACCATC AGTATTCCTA AGACACAAAT 24 60 

CGTGTTAATG GGGATGGAAA GAATCGTACC AACAATGCGT GAAGCAGAGG TTTTAGATAA 2520 
CCTTTTAGCC AGAAGTGCCG TTGGTCAAAA TTTAACAACG TATGTGACGT TTGCCGGGCA 2580 
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AAAAAATGCG 
ACGTTCAAAA 
TTGTCTGAAT 
TCCAGGACCA 
ATTGCCTTAT 
ATTGCACGAA 
TGGATTTGAA 
TAAAGCAGCC 
TACTGTTGAA 
AGGTTGGACC 
TTGGTTCAAA 
AACCTTTTTT 
ATCCATTTGA 
AGCTATTAAC 
CACAAGAAAA 
TTTTATTGCC 
TAGAGGCTGT 
AAACAGCTGC 
GAACGATTGT 
ATTACATTTC 
TTTATACAGA 
GGCCTTCTAA 
AAGTTTGTTA 
TTTTTAGAAT 
AAAGAAATAC 
GTGTTAAGCA 
GTTTTTGGAG 
TGTTTAGCTT 
GGTCATTCAA 
GAAATTAATC 
ATTAAACGTG 
ATCAATGGAA 
GGTCAACCTT 



GATGAATCCG 
GCGTTAGGAA 
GTTTGTCCGG 
ATTGGTGCTG 
GCATCTAGTC 
TTATTAATTG 
GATTTCCAAA 
ATGAAAGTCG 
AACATGTACA 
GATGTTCGTG 
AAACATCAAG 
ACAAAATCTA 
GCCAGTTAAT 
GATTGTGAAA 
TTTACTGACG 
AACAGATGCA 
TTCTGTTAAG 
TCaGGCaAAT 
GGTGGAGTCC 
AATTATACCG 
AAAAATAGAA 
TTCAGGAGAT 
CGTGGTTGTG 
TGATTATGCT 
GTTTTTTTTC 
CTATAGGATT 
GAATGAGTTT 
TGGGTCTGTT 
TGGATCCAAC 
GTTTCGATAT 
TGATTGGTTT 
AAGAAGTTGA 
TGACAGGCGA 



ATGGTCCAGA 
CAGCCTTTCA 
TCTATCGCCA 
TGCTTTCTCC 
TTTGTGGGGC 
AACATCGTAA 
TGCGTATGGT 
ATCATGCTGC 
ATCaTGGAGG 
ATTTGCCAAG 
AAGGTGAAAA 
CGAGAAAAAT 
CATTTACCAG 
GAACGTGTGG 
ACGATTCAGC 
CGTTTTGAAG 
CAATGGCAAC 
ATTGCGATTG 
aATGCGGGAC 
TTTAGCAAAC 
AAAGGCGAAA 
ATTGAAATGC 
ATGGATCGCT 
TTTTTTATGA 
TGACCATTTT 
GATTTACTTT 
GAAATCAAAA 
TTTACTGAGA 
GTTAGCAGAT 
TATTACTTTC 
ACCTGGAGAT 
CGAACCCTAT 
TTTTTCATTA 



472 
APAATTTCAC 
GCCAGTCTTA 
TATTGGAGGG 
AATTTTAGGT 
ATGTACGGAA 
AGTGATGACC 
AGGTAAAGCC 
AGCGGGCATT 
CTACCTTGAT 
ACCACCAAAA 
AAATGACTAA 
TAGACGTAGA 
AAGAACAGTT 
AAACGATTCA 
AAATTGTTGC 
CCTATGGATT 
CTGGAAGTGA 
CCTTTGCTGa 
aAGGCAGAGC 
TCGTTCCACG 
AAATCGGCTC 
AACTGGTTGT 
AAAACGCTAG 
GCAGGATCAA 
GTAAAAAAAG 
GTGGTAGAAT 
GAATTAATTA 
CAATTTGTAT 
G6TGAACGGG 
CCAGCGCCAG 
ACAATTGCGT 
TTAGATGAAT 
AAAGAAAAAG 



GTpGTCATTT 
CAGTGTATTC 
CACGGTTATG 
GGATATAAAC 
ACGTGTCCTG 
GATGATCTGA 
ACTGGTTCAC 
TTAAGTAAAC 
AAAGGACCAG 
TCTTCAGAAA 
CGAAGCGATT 
AAAACAGCCT 
GGCAGATAAA 
CACGAACTTA 
TGAGTTTGGT 
AGCTGATTTA 
GCAACGAGAA 
GTTTTTAcTA 
CtTACATTTT 
TTCGACTCAG 
AGCGATTCAC 
CGGGTTGCAT 
AAGCATAATC 
AAAAGTAAGA 
CGTACATAAA 
ATGAGACATA 
AAACAGTCGT 
TTACGCCTGT 
TAATTACGTT 
ATGAACCAGA 
ACAAGGATGA 
TTAAAAAAGC 
TACCAGCAGA 



TGGATAACGG 
GTTGTGGGTC 
GCTCTATTTA 
AGTTTGGGGA 
TGAAAATTCC 
AAATGAAACA 
CAGCAATGTT 
AAAAAGACAT 
GGTTAGTAAA 
ACTTCAGAAG 
CAAAATCGAG 
GTTTCGGCCC 
ACACCAGCAG 
GTGGAAACCA 
GGGGACAATC 
GCTAAGTCAT 
GCGAATATTC 
GCTGT^TCaG 
TTACCGAAGC 
CCAGCAGCTT 
TTTATTTCAG 
GGTCCTTTAG 
AATTCAGCCC 
ATCAAATTTG 
AATTAATTGG 
AGATAATAAG 
CTTTTTTGCC 
CGTAGTGAGA 
AAAAAACACA 
TAAAAATTAT 
TACGTTGTAC 
CTTAACAGAT 
TAGCTACTTT 



2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 
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GTTTTAGGTG 
AAAGATATTT 
CCAGAAGTGT 
AAACAAACAC 
GATGTGAAAC 
CACTTATACA 
TAGTCCCGAA 
TACAGACAAC 
GTTTGGCTTG 
CAGGTGCTGC 
TTAGAGGTGA 
TGCGCGAAGG 
ACCCTAAAGG 
TCCAATTGCA 
AGTATTTACA 
TTACTGCCAG 
TAGCCCTTTT 
ATGAAGCGGG 
TTCTTTCAGA 
TAAACGATTA 
GGGGCGATGG 
CAACCGAAAT 
TGACTCGTGA 
TtCCTGTTTA 
MTCTTTATT 
GAACGGAATT 
GGTTAAACCA 
CATACGGCTA 
ATTTTGAAGC 
GTCAATCGCT 
GTTTGGAAGC 
TAAAAATGTG 
AACAGACTTG 



ATAATCGACG 
TGGGTGAAGT 
CAAAACAATA 
ATGTTTCCGT 
CATGAGTGTG 
GGCGACAAAA 
CCATATTAAG 
TACGTCTGAT 
GTACTATTTA 
GATGATTTTT 
AATTAACAAG 
CAATATTGAG 
ACAAGATTTA 
AATGAGTCAG 
AACGGTTGAT 
AGAATTAAAG 
ATTGGACAAA 
ACAAACCTAT 
TTATGTGGAA 
TTGGATTCAA 
CTTATTTCTG 
TCGCCGTTTA 
TTTAGATTGT 
TGTCGACCGA 
TGCTATTCAT 
ATTTATGCCA 
ACGGAACGCT 
TGAAGGTTAT 
GGAAGAGCAA 
TCAACGGCTG 
AGCGACCGTC 
GCGCTTACAA 
GGATGCATTG 



GAATTCAAAA 
GAAATTTGTG 
AAAGAGACAA 
CCTTCTTTTG 
CAGTTTATTA 
CAATGGCTAG 
TTTGAGCAAG 
TCAATAACCA 
CAACATACAC 
CGCAAAATTT 
CCTGGCTTTA 
ATTGCGGAGC 
CAACTTAAGT 
TATGGCTATG 
TTGTCGAATG 
TTGATTGAAG 
CAATATCCGC 
CATCAGCTTT 
AAGAAAGAGG 
TCGCAAGAAC 
TGGCGTGCTG 
GTGGTGGAAG 
TATGAAAATT 
GATATGGCTA 
TCGTACAATT 
ATGAATCAAC 
TGGCGCAGAA 
TATTGGTCAC 
GAAGATGTCG 
TTACCTAGTT 
TTCTATCACT 
GCTATCGAAG 
ATGTCATTAC 



473 
GACGGTCGTG 
ATGTGGCCAT 
TGAGTAAAAG 
TGGTAAAATA 
GAGGAACCGC 
AGGAAGACGC 
AAATTCAAGT 
GTACCCGTCT 
CTTTTTACTC 
TAGTAGAAGC 
TTCaGCAACt 
TTTATCCATT 
TTCi\AGATTT 
AATCAGCGGA 
TTCAATTCGT 
TGCTAATGGC 
ATGACTTACC 
ATCAACTAGC 
TATTAATTAC 
ACCTGCCTCC 
AAAACGTTAA 
AAGGGTATCG 
TATTAGAACC 
TGGATCGGCA 
ATCGTTATCG 
TCGCTACTTC 
AAGTTGATAT 
AAGAAAAAGA 
CTACGATGGA 
ACTTTCAAGC 
TTTTACTGCA 
CAGGTCAACT 
TAGATGAATA 



TCATTGGTTT 
TCTCACGGTT 
AGAAGGACGG 
AAGGCAGTGC 
CGTGGCAGAC 
ACAGCATGAA 
CTTGCAAAAG 
GCAGGTTTTT 
GGCAGATGTT 
AGAAGAAGAG 
ATTTCAGCTT 
TTTAGAAAAA 
AACGTTAATT 
GATTATTCAA 
TATTTCAGGC 
GCAAgcAGGT 
TGATCCGCGG 
AAGACAGAAG 
AAATCCAGAT 
ATTAAGCACA 
AGAAGAATTA 
GTACAAAGAA 
GATTTTTGCG 
TCCTTTAGTC 
CGATGTGCTC 
TGAAGAAAGC 
TACCGAAAAT 
TTGGGAATTT 
AGAAGAGTCA 
AATGATAAGT 
AAGTGGTGTG 
AGAAACAGCT 
CGTCACTGTT 



TATTCATAAA 
TGGTCCAATA 
AAATGTGTAA 
AAAGGAAAAG 
TTAGAAGCGC 
GTATTCTATT 
TTACGTCAAT 
AGTTTTTATC 
TTATCTGATG 
TTACAAATTT 
TATCaAGAAA 
CAAACGGAGA 
TTTACGCGAT 
CATTTAAGCG 
TATCAGCAGT 
TCTGTGAAAG 
TCGTTATTTT 
CAAATCCCGA 
TTGCAAGGAT 
GCTGACTGGA 
ACACATGTTG 
ATTCAGGTGT 
GAACACGAGA 
GAATGGATTG 
CGTTTTTTGA 
TTGACTGATT 
GTAGTCCTGG 
ATTCGGTATG 
AATGCGATTC 
GCCAAAACTG 
GCAACACAAT 
CGTAACCATG 
TACGGAGAAA 



4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 
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GTTCGTTTGA TTTTACAACG TTTCAAGAAA TATTTGTTAG TGGTTTGGAA GGTTTGCACT 
ATAGTAAAGT GCCGACAGCG ATTGACCAAG TACAAGTTCG CGCAATGGAT TTAACTCGAC 
CTGGCGCTGC CAAGGTTACT TTTGCTATTG GGATGACGGA AGAAATTTTT CCACAAAAGA 
TTGAAAATAA AACACTATTG TCTGATGAAG AACGGCAGAC AATCAATGAT ACTTTAACGG 
AAAACCAATA TTTACGTGGG ACAACCGGAC GCAAAATTGC GCAAGAACCT TATGTTGCGT 
ATCTTGTCTT TTCTTCAGCG CGAGAACGGT TATATTTAAC CTATCCAAGT GTTAAAGATA 
CCGCTCAAGA AGTCAAACCG TCTCCTTATT TTAAAAATAT CCAAAAAGAT TTAAACTTGC 
CCGTTTTTGA AAAAAAT'JAA ACAACCATTT TCGATGATGA AACGACCAGT TTAGCCCATA 
TTAGTACGTA TCGGACGTTA ATCGGTGAAT TAACGCGTTT GAAAAGACAA CGGAAAGAAA 
CACAAGAAGG ACTGTTGCCT TTCTGGTTAA ATATGGAAAA AGCGCTAATG AACCAATCAA 
TTGCGCCGCT AGCCAAGCAA GTTTTCGAAA GTTTAACGCA TCAAAATATT CCTGAAAAGA 
TTGATGAGGT CTTAGCAGAA CCTTTATATG GTAAAGATAT TTATACGTCC GTTTCTCGGA 
TGGAGAGTTT TTATCGCTGT CAGTATCAAT ACTTTTCTCG CTATGGCTTA CGTTTGAAAG 
AAAGAGATGT CTTTGGGCTT TCGCCAGCAG CTACAGGGGA ATTTTTTCAT GAAGCATTGG 
ATCAGTTCTT TAAATTATTG ATTATGAATC AACGGAACTT ATCTGAATTA ACCGATCAGG 
AAGTCAATTT ATTGGCGGAG GAAGTACTAA ATAGTATTTT AGGGGATGCG CGTTTTTCTG 
TATTAACGAC CTCTAGTCGC ATGAACTATA TTCGCTATCA ATTGAGTCAA ACAATTAAAA 
AAGTCAGTTG GGCGCTGAAA CGTCAAAGCC AACGAAGTGs CATGACGACT GTTCAAACGG 
AAGTGTTATT TGGCCAAATT GCCGCTAAAA AAGGAATTTC TGGTTTGGaA TTACCATTAA 
AAAATCAAGG GAAGATTcAT GTTCGGGGAA AAATCGACCG GATTGATC7A TTGGTTACCC 
CCGAATCAAC CTATTTAGGT GTGATTGATT ATAAATCGAG TCACCGTAAA TTTAATATGA 
CAGAAGCGTA TTACGGCTTA GCCATGCAAA TGCTGACGTA CTTGGATGTG GCTTTAATGG 
ATGCTGTGCA ATTAGTTGGT CAAGAAGCTA AACCTGCTGG CTCACTTTAT CTGCATGTAC 
ACAATCCTAC ATTATCTTAT GAAGGCAAAG ACGACATTGA AC7VACAAATG CTTAAAAAAT 
ATCAGTTTlSA TGGTCTTTTG ATGAAAGACC CAGATTTATT GGATCATTTA GATACGAGTC 
TGCAAGCTAA ACAAAGTTCG TTGCTGTTTC CGATTGAAGA ATCCGCCAAA GAACAAATTA 
AACCTGGTCG TCGCCAAGAA GATAAATTTG TGACAGAACC AGAATTAGGC GCGTTGCTTT 
CCCATAACCG AAATAAATTT ATTGAAGCGG GGAATCAAAT TATTGGCGGC GAAGTTCAAC 
TTAATCCCGC TTATCAAGGC AAGGAACGGA TTGCTTGTCG CTATTGTCCG TTTCGAAGTG 
TCTGTGATTT TGATGTGATG CTAAAAGAAA ATAATTATCA TCGTATTGAG AATTTATCCA 
AAGAGGAAAT TATGGCGCGC TTGTTAAACA AAGATGAGGA AGGAGCAACC GAAGATGAGT 
AAAACAATTC CACTACGTCC AGCCAATGAG CAATTCACCG ACAGTCAATG GCAAGCGGTG 
TTTGATGGCG ATGAAAATAT CTTAGTTTCT GCTTCGGCTG GCTCAGGAAA AACCACCGTT 



6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 
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TTAGTCCGCC GTGTCATTGA AAAAGTCAAA AGCGGCGTCG 
GTGACGTACA CTGAAGCCGa CGCCC6TJSAG-ATGAAAGAAC 
AAGGCAATGA ATGAAGAACA AGATCCAGAA AGAAGACGGC 
CTTTTGCCAA CAGCCAATAT CAGCACGTTA CATGCCTTTT 
TTTTATTATT TAATTGACAT AGATCCAGTT TTTCGGATGT 
TTGTTATTAA AAGAAGATGT TTGGGATGCA TTACGTGTU^C 
GAAGCGTTTT ACCAATTAAC AGCGAATTTT TCCAATGACC 
AACTTAATTT TTTCTTTTTA TGAATTTGCC AAAGCCAATC 
AATGGCTTAA CACAAGCCTA CGAAGTAGGG GaTCAGCTAG 
ACGTATTTAA AACCGCTAGC CGTTGAGACG TTACAACGAA 
ATGGTGACCT TAACAGAGGG CGAAGAGAAA CTTCAAAAGA 
GAAAAGGAAC AAACAAAACA ATTTTTACAA TTTTTAGAGA 
TATAATTTGA CTGAATTGTT GAGCTTTGAC CGCTACCCAA 
AAACCAACCG CGGAACAAGC CAAACAGTTA CGAGAACAAA 
TTGAAAAAGC AACTGTTTAC ACTTTCCCCA GATGCCATGA 
ACGCCCATTG TTCAAGAAAT GGCTCACGTG GGGAAACAAT 
GA7KAAACGGC TTAAAAATTT AGTGGATTTT AACGATTTAG 
TTAGCGAAAA ATCAGGCAGA TGGCTGGCAT GCATCAGAAG 
AAATTTGATG AAGTGTTAGT CGATGAATAC CAGGATATCA 
TTGTATTGGT TGCGCCGTCC GCTTAGCACA GAAGGAAACC 
AAACAGTCTA TTTATTCTTT CCGCTTGGCA GATCCTACAT 
CAGTATGGGC AAGGCAAAGA GGGCAAACGA ATTATTCTAG 
AAAGACGTTT TAGACTTTAC GAATTTAGTA TTTAGTCAAT 
CAAATTGCTT ACGATGAGTC AGCTGCTTTG GTTCATGGTT 
GCTGATTATT CAACTGAATT GCTAATTTAT GAAAAAAAAG 
CCAGAATTAC AATCCCCTGA GTTGTTGATT GAAGATAAGA 
ACTGCTTTGA AAATTCGTGA ATTAATTGAC CAAAATTTCT 
AAAACGGATC GACCGATTAC TTATCAAGAT ATTGTCTTAT 
AATTTAACTA TTTTAGATGT TTTTAAATCA TTAGAAATTC 
CAAAATTATT TCCAAGCAAC GGAAATTCGT ACAATGATTG 
AATCCTTATCAAGACATTCC TTTGGCAGCT GTGTTACGTT 
GAAAACGAAT TAGTGCTTAT TCGTTTGGCT AATAAGGAAA 
TTAACGTTTA ATCAAAAGAT GGAACCAACC ATGGAGGAAG 
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ATATCGATCG 
GTATCCAAGT 
ATTTTTCTCG 
GTTTAACTGT 
TAACAGATGA 
AATTTTATGC 
GTAGTGATGA 
CTGATCCAGA 
GCGAATCAAA 
CTCTTCAGCG 
TTTGGTATTT 
GAAATGATTT 
CGGTTCGTGC 
ATAAAAAAGC 
AACAAGTCTT 
TTATGGAGGC 
AACATTATAC 
CCTCGGTTTA 
ATCAATTACA 
TTTTTATGGT 
TGTTTATTGA 
CTGAAAATTT 
TAATGGATGA 
TTGATCAATT 
CAACAGAATC 
CAGAAGGGGA 
TAATTTATGA 
TAACGCCTAC 
CAGTTCAAGT 
CCTTGCTTCA 
CACCAATTGT 
CCAGTTACTA 
CAGTTGTTCA 



TCTCTTGATT 
GGCTTTACAA 
TCAAATTGCT 
TATTCGTCGT 
AACAGAAACA 
GGAAAATCAA 
CGGACTCACT 
GGCGTGGATT 
GTTATTTCAG 
TTACGAAGAA 
AGCTCmAAAT 
AGAGA6CGCC 
CGAAGAGTTG 
GTTAAATGAT 
GAAAGAGGCC 
CTATGGCGCA 
GTTAGCTATT 
CTATCGCGAA 
AGAATCTATT 
AGGAGATGTG 
AAAATACAAT 
TCGTTCTCGC 
GCGAGTGGGC 
TTCGGAGGCG 
AGTTGAATTT 
ACTTTACGTT 
TAAAAAGCTG 
GAAGAAAAAT 
CAATGATGCT 
ATTGATTGAT 
TGGATTAAAA 
CGAAGCATTT 
GGAAAAAACT 



8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
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10020 
10080 
10140 
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ATACGTTTTG CTGAATCATT AGAAAAATGG CGGGAACAGG CTCGACGCAA CCAGATTAGC 10560 

AACTTGCTGT GGACGATTTA TCGGGAAACG GCCTATCTTG ATTAT6TTGG .CGGGTTGCCT 10620 

GTTGGCAAAC AACGCCAAGC CAATTTATAT GCGTTGGTCG ATCGGGCCGC GGCGTATGAA 10680 

AAAACAACCT TCCGCGGATT ATTTCAATTT GTTCGCTTCA TCGAAAAAAT GCAAGAAAAA 10740 

GACAAAGATT TAGCAGAACC AGTCGTTTTA AGTGAAGAGA ATGCCGTACG CGTCATGACG 10800 

ATTCACGCTA GTAAAGGGTT AGAATTTCCG GTGGTATTCG TTTTAGATAT GACCAAAGAA 10860 

TTTAATGTCA CTGATTTGAA TGAACGCTAT ATCTTTGAAG AAAATTTAGG GGTAGGCATT 10920 

CGTTATTTAC AACCTGAAGA ACGAGTTATG TACGATACTT TACCATTTTT GGCGATTAAA 10980 

CAAGTACGAT TAAGAAAATT ACTATCCGAA GAAATGCGGA AGTTATATGT GGCGTTGACA 11040 

CGTGCAGAAC AGAAACTTTT TCTGGTAGGC TCCTATAAAG ACCAAGCGGC GATGTGGAAA 11100 

6AGTGGCTGA AAGTTGGCGA TGTAGAAACA CTGGTGCTAC CAGCTGAAAA TCGCTTGCAA 11160 

AGTAAAAGTA GTTTAATGAA TTGGGTTGGG ATGACATTGG TACGTCATCA AAAAGCCGAT 11220 

GAATATCAGC AAGAAGTAGT AGTAAGTAAC GTTCCGCAAG TCAAAAAGCA CCCAGCTAAT 11280 

TTTCATATTC AGTGGTTCAA CGAAGAACAA CTACGAGCAG CCATTCAGCA ATTACAATTG 11340 

CCAGAAAGAC AAGCAGAGGA TTTAGCCGAA AAAGCGCAAT TATCTGCTGA CAAAATCAAT 11400 

CGTGGCCTTG CACGGTTAAG TTTCAATTAT CCGTTTGAAG TGGCTACTCG GACCACCAGT 114 60 

TATCAGTCTG TTTCTGAAAT CAAGCGAGTA TTTGATGACC CCGATAACAA AGAAATTGGT 11520 

AAAATTGAGG TTCGGGAaIsA CAATACTATT CAAGCGCAAC CGTTGATTGT CAATCGAATG 11580 

ATTGAAGGAG ATTTAAGTAA GCCTAAATTT TTAGATACGG TTCAAGCGCC CTCAGCAGCT 11640 

GAAATTGGGA CAGCAACCCA TTATTTATTA CAACTAATTG ATTTAAGTAA GCAACCTTCT 11700 

TACGAGGAAG TGCGAGCTGT TCAAGAACGC TTAGTGGAAA ATAAATTGAT TTTACCAGCA 11760 

ATTGCTGAAA AAATGAATCT GGAACAAATC GTGGCCTTCT TTGACACCGC CTTAGGAAAA 11820 

CAATTGATTC AACACCATCA AACCGTTAGA AGGGAACAAC CTTTTTCTAT GCTGATTGAA 11880 

GCAGAAGAGC TCATTCAAAA TTATCCAGAA ACGACACAAG ATGATTTATT AATTCATGGG 11940 

ATTATTGATG GCTATATCGA ATTAGACAAT CAATGCATTT TGTATGACTA TAAAACCGAT "* 12000 

CATGTAAAAG GCACCTCTCC CCAAGCAATT TCTGAAATTG TCGAACGGTA TCGCGGACAG 12060 

ATGAATCTTT ATCGTAGAGC GCTACAAGAG GCAACGCATA AAGAGGTGTC GCACGTCTAT 12120 

TTAATTTTAT TAAATGGAGG AGTAATTATT GACATGCAGA CmGGAAATGT TGTAGATTTT 12180 

ATAAAGTAGA AAAAATTAGA AGCAGAAACA CTTACATTTT GTAAGGAATG TGCTATACTC 12240 

ATTGAAGAGG TGAGGAAGAT GGAAAAAGTA AAAACAGAAT CGTTTGTACT TTGTsCCgaA 12300 

ATTTGAAAAA TCATTTTCAA TACTTGGTAA AAAATGGAAT GGTTTAATTA TTGATGTTTT 12360 

ATTAGAACGA GGTCCTCAAC GTTTTGGCGA ATTAAAAGAA rAAATTCCAA TGTTAAGTGA 12420 

TCGTGTCTTA GTAGAGCGAT TGAAGGAACT AGAAGCTGAA GGAATTATCA CCAAAGCTGT 12480 
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TCGTTGCGGT GAAGGAAATC GCTTTGGAnT ATTTCCTGAC CGAAAAAGGC GAAGACTTGC 12540 

AATTAGCAAT GGAACAAATT CAACATTGGG CAGAAAAATG GGTAAAAGAC GAAGAATGCA 12600 

GTTGACCGCA AGCAAAACTT CTTGTAAACT AAACGTAATT AAATAACTAA AAAAACAGTG 12660 

ATGGAAACAA GTAGTTTGAA TTGACGTGTT TTAGGGAGAG TCAGCCAGAG ACTGGAAGCT 12720 

GATTTACACA GTCACCAAGc GAAATTCACT TCCTGAGTTT GTTTGTTTTC AAGAAGAAAG 12780 

CAAACCGGTA ATTGCCGTTA AATGAAATGA GTGCATGACA ACAATTTGTT ATGAATTAGG 12840 

ATGGTAACAC GATAGCTCGT TCCTTTGAAA GGAATGGGTT ATCGTTTTTT TTTTAGTTTA 12900 

AATAAAAGAT AGAGGGGAAC GCATAATGAC ATTACAAGCG CAATTAGAAG CTCTTAGAGA 12960 

CAATACGCTC AAAGAAATCG CACAAGTTGC TACTTTAAAA GAATTAAACC AAATTCGCGT 13020 

AGAAACATTA GGGAAAAAAG GGCCAATCAC CGAAGTTT7UV GAGGCATGAA AAACCTTTCA 13080 

CCAGAAGAAC GACCGGTGGT GGGGGGCTTT GCAAATGAAA TTCGTGATTT ATTAACAGAA 13140 

GCAATTG7VAG CGCGCAAAGT TGTTTTAGAA AACGAAGCGT TAAATGCGGC ACTAAAAGAA 13200 

GA7UVGTTTAG ATGTAACTTT ACCTGGGAAA CAAATGCCTC AAGGCACACG TCATATTTTG 13260 

ACACAAGTTA TGGAAGAGAT TGAAGATATT TTCTTAGGCA TGGGTTATCA AGTAGTGGAA 13320 

GGATACGAAt AGAaTCCGAT CATTATAATT TCGAACGCAT GAATCTACCA AAAGATCATC 13380 

CGGCACGCGA CATGCAAGAT ACGTTTTATA TTTCAGATGA GATGTTAATT CGTACGCATA 13440 

CTTCACCAGT TCAAGCTCGA ACAaTGGaAA AACACGATTT TTCmAAAGGk GCATTACGAA 13500 

TGATTTCTCC TGGGAAAGTA TTCCGTCGTG ATACAGATGA TGCGACACAT AGTCACCAAT 13560 

TCCATCAGAT TGAAGGCCTT GTTGTCGATA AAAATGTCAC AATGGGTGAT TTAAAAGGGA 13620 

CGTTAGAAGT CATGATGaAA AAAATGTTTG GtGaAGATCG TAAAATTCGT TTACGTCCTA 13680 

GCTATTTCCC yTTTACAGAA CCTTCAGTCG AAGTAGATGT TAGCTGTTTT AAATGTGGCG 13740 

GCGCAGGTTG TAACGTCTGC AAACATACAG GTTGGATTGA AATTTTAGGT GCGGGGATGG 13800 

TTCATCCAGA TGTGTTACAA ATGTCAGGGA TTGATCCAAC AGAGTACTCA GGCTTTGCCT 13860 

TTGGCTTAGG ACCAGATCGC GTTGCTATGT TACGTTACGG TGTAAATGAT ATCCGTAATT 13920 

TTTATCAAAA TGATTTACGT TTCTTAAATC AATTCAAGGT AAAGGAGTAG TTGAAAAATG 13980 

TTAGTTTCTT ATAAATGGTT AAATGAATAT 6TTAATCTTT CAAACGTTAC TCCGCAAGAA 14040 

TTAGCGGACA AAATGTCTGT GACAGGTATC GAAGTTGAGG GCGTTGCTGT ACCTGAAGAA 14100 

GGATTAAAGA AAATTGTTGT CGGCGAAGTA AAAGAATGTG TTCCTCATCC AAACTCTGAC 14160 

CACTTATCAA TTTGCCAAGT AGACATCGGT GAAGAAGAAT TGTCACAAAT TGTTTGTGGA 14220 

GCACCAAATG TAAAAGCCGG AATTAAAGTC ATCGTAGCAT TACCTGGTTC AAGAATTGCA 14280 

GGAAATCAAA-A7VATTAAAAA AGGTAAAATG CGTGGCGAAG TCTCTAACGG AATGATTTGT 14340 

TCGTTAGAAG AGCTAGGATA TTCAGATAAT GTCGTACCAA AAGCCTATGC TGAAGGGATT 14 400 

TATTATTTAC CTCAAGAAGC AGTGAATGGG ACACCTGTTT TCCCTTATTT AGACATGGAT 14460 
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GATGCGATTA TTGAATTATC AATTACACCA AACCGTGCAG ATpCACTAAG TATGAGAGGG 14520 

GTAGCCTATG AAGTTGGCGC AATTTATCGT CAAACACCTC AGTTTAATGA TCCCGAACTC 14580 

AAAGAAGATG CTTCAGATAA CGTGGAAAAT TACGTAACAG TGACTGTTGA AGATTCACAA 14640 

GATGCGCCAg cGTATCAAAT TCGTGTTATT AAAGATGTGA CGATTGCAGA AAGTCCTCAG 14700 

TGGTTGCAAA ACCGATTGAT GAATGAAGGA ATCCGTCCGA TTAACAATGT GGTGGACGTG 14760 

ACAAATTATA TTTTATTATT ATTTGGTCAA CCATTGCATG CATTTGATTA CCAAAAATTA 14820 

GATAGCAAAG AAATCTTGGT TCGTCGAGCA ACAGCAACAG AAGAACTAAT TACATTAGAT 14880 

GGTGAAACAC GTCAATTAAC AGAAGAAAAT ATCGTCATTA CGAATGGAAA AACgCCTGTC 14940 

GGCTTAGCCG GAGTAATGGG TGGGGCTAAT TCTGAAATCA GTCAAGAAAC AACAACTGTG 15000 

GCTTTAGAAG CGGCATTGTT CAATCCATTG TCTATTCGCA AAACGTCTAA ACAATTCAAT 15060 

TTACGGAGTG AATCTTCAAG TAGATTTGAA AAAGGCATCA ACCAAGCGAC AGTTGGGCTA 15120 

GCTTGTGATG TGGCAGCCGC GATGATTGCA GAGTTAGCCA ACGGAACAGT GGTCTCAGGC 15180 

ACTGCAGTTG GTTCTGAAGT GGCCGTTAAA GAAGCGCAGG TAGCTGTGAC TTTAGAGCGA 15240 

ATCAACCAAT ATTTAGGCAC TGCATTAGAT GAAGCAACGG TGAATGAAAT TTTTGAAGCA 15300 

CTCGGTTTTG CTTATGAAGT AAATCAAGGC GCATATGAAA TTACGATTCC ACCAAGACGT 15360 

TGGGATATTG CGATTGAAGC AGATATTATT GAAGAAGTGG CGCGCATTTA TGGATATGAT 15420 

CATTTACCTT CAACATTGCC AAGTGGAGAA ACAGTTGCTG GAAGTCTAAC CAAAGCACAA 15480 

CATGTTACGC GCCAATTAAA GAGCTTACTG GAAGGTCATG GTACGAGTGA AGCCATCAGC 15540 

TATGCGTTGA CAACAGAAGA AAAATCCCGT CAATTTATGA TGAAAGAAAG TCAAACAACA 15600 

CGTTTGCAAT GGCCAATGAG TGAAGAGCGT TCTGTGTTAC GTATGAACTT AATTTCTGGT 15660 

TTATTAGATG ATGTCGCATA CAATGTGGCA CGTAAAAATA ATAATATCGC CTTCTACGAA 15720 

GTAGGACGCG TTTTCTACCA AACAGAAGAT CCAACAAAAA ATTTACCTAC AGAAGAAAAT 15780 

CACTTAGCAC TTGCTTTAAC TGGTAATACA ATGGTTAAAG ATTGGCAAAC AAAAGCAACA 15840 

GCCGTTGATT TTTATACAGT AAAAGGCTTA GTGGAAAGTA TTGTAGCTGT TTTAGGATTA 15900 

ACAGAAAAAA TCAGTTATCA AGCGACAACG GCGATTCCAG AAATGCATCC AGGCCGGACA 15960 

GCATGGATTT ATTTAGAAGG TGAAGTGGTT GGTTTTGTCG GACAAGTTCA CCCAACGACA 16020 

GCGAAAGCGT ACGATATTCC AGAAACATAT GTTGCTGAAT TAAACTTGCa ACAATTAGTA 16080 

GCCATAGAAG CAGGCGGGGT TACTTATGAA GCAGTTTCTA AATTCCCAGC AGTTTCTCGA 16140 

GACATTGCTT TATTAGTCGA TGAAACAGTT ACCAATCAAG AACTAGTTAA AACTATTTCA 16200 

GATAACGCAG GTAAATATTT GAAAGAGATT CATTTATTTG ATGTATATCA AGGTGAAAAA 16260 

CTAGGCGCTG GCAAAAAATC AATGGCGTAT AGTTTAACTT TCGTTAATGC AGAAGCGACA 16320 

TTAGTGGATG AAGAAATTAA CCGTTCAATG GAAAAAGTTG AAAAAGCGCT AATTGAAAAA 16380 

CATCAAGTAG AAGTAAGATA AAATAAAAAC AGGAAAGTGG GGCATAAGTC GAAATGACTT 16440 
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ATGCCCCACT TTCTAAATCT TAAGATAATC AGCTGAACAA AAPCAGCTCC AAGAAATAAG 
CCGAAATTCT CCAAAAATTA AAGAACAATT TTCGGAAATT CCTTCTTATT TCTTGGAGCT 
AAACACTTTT GTCTCAGCCT CCTATTTTTA ACTACATTAA TAACTTTTCG CCCAATTTCG 
TGAAAGCCAA CGTGAAAATT GTGAGATACT AAAGTTAATA ATGAAATAAA TTAGGGCGAC 
GATTCCGTAA ATAGTGAATA CTTGTCCTGT TTGTGCGTAg CGCCCCATTA AGATATAAGA 
TTTGCCAAAG AGTTCTT J./J\ AAGCAATCAC GGAATACAAG AAACTTGTAT CCTTGATGAC 
AGTAACAAAC TGAGAAACAA TTGCTGGTAA GACATTGCGA ATGGCTTGAG GCAAAACAAT 
ATGATAAAGA ATTTGCCAAT TTGAAAAACC TTGAGAACGT GCCGCTTCAA TTTGGCCTTT 
GTCGACCCCA TTTAAACCAC CACGAATAAT TTCCGCTAAA GCAGCGGTGG TAAACACAGT 
AAAACTCACA ATTCCAGCAG GTGTGGATTT GATTTTGAAA ATCAAAAAAA TAACATAAAT 
CCACAATAAG TTTGGAATAT TACGCACCAC TTCTACATAA ATACTGGCAA GTAACTTCAG 
TGGTCCATGG GTTTGATTTC GCAAAACGGC TAGAATGGTG CCAAAAATCG TACTCAAAAT 
AATTGAGACG AAGGAAATAT AGAGCGTCAA TTTTAAGCCG TTAAATAGAA ATTGAAAATT 
ATTTGCTGTC AGTAATTGAC GCATTTCTTC TGAAAAAGGC ATCTGTTTTC CTCCTATCTG 
CTATATGCTT TCTTATTTTT TTCTTCCATT TTTCTAGCCA AATTCGCTAA TGGGAAGCAA 
AGAATGAAAT ATAAGAAACC AGCAAGCGCA AAGGCGGGAA TATAGTTCAA GTTGACAGAC 
GACCAACTAT TGGCAGCAAA CATTACGTCA GCGCCAGAAA TAATGGCAAC TGTTGACGTA 
TTTTTAATTA GGTTCACTAC TTGATTGGTT AAAGGCGGTA ACATAATCCG CCATGCTTGT 
GGTAAAATCA CATACCGCAT CGTTTGACCG TACGTTAATC CTTGGGAGTA AGCCGCTTCA 
AATTGCCCTC TTGGGACAGC TCCGATTCCT GAGCGAACCA CTTCAGAAAT GTAGGCGCCA 
TGATAGAGGC CTACACACAG AATGGCAATC GTAGTAATAG AAAATGAAAG CAGCGGGCTT 
AGCAGAGGAA AGCCATAATA GACTACGATA AATTGAATCA ACAAAGGTGT GTTTTGAAAA 
AATTCTACAT AGACACGACT AATTGCGTGT AAAAGTTTAC TTTTGGTCGC TGATAAACTG 
CCAAAGAAAA TGCCTAAAGC AAAGGCAACT AAGATCGCTC CTAAAGAAAG CAGAATAGTA 
TAGAGAAATG CCTGACCAAA AATTCGCCAA TCTTGAAATA AAGCCTCCCA ACGATATAAG 
GCAAATGGCC CTTCTTCAAG CCAGGAAATA TTGGCTAACA AAAACATAAA AATCCCTCCT 
AAAATTTTTG ATTCTTCAAG TCTTAGGAAT GATT7WVTTTT TTAAGCCCCA GTCTTGATAA 
ATTTTTTCTA GTGTGCCGTC TTTTTCCCAT TTTTCAATTA AACCATTTAG ATAATCATTT 
AATTCAGTAT TTGCTTTCTT TGTGGTAATT CCATATTCTT GTGGCGAAAA ACCATCTTTT 
AAAATTTCTG TCTTGCCATC CACATAGCCA GATAGGATTG ATTTATCCAC TGAAAATGCA 
TCAATTCGTTTAGAAGTTAA CGCTGTTTTT AATTCTGGAT AGGAACCTAA TTCTTGATAA 
GTAAATTTGA TTCCTTCCTC TTGCGCTTTT TTCTCTAAGT TTTCTTTTGT CGTAGCAGAT 
TGTGCAACGC CGATTGTTTT TCCGTCTAAA TCTTTCATGC TGGAGAAGTT ATCCTCTTTG 
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16560 
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16680 

16740 

16800 

16860 

16920 

16980 
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17160 

17220 

17280 

17340 
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17760 
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CGCACAAGGA AACCAATCTC ATCTGTATAA TAAGGAGTTG TAA^TTATA AGTCTTTTTA 
CGTTCATCAG TAATTGTAAA . TG.TAQC^TT .ACCATATCTA GTTCACCATT GTCTAAAAGG 
GGACCGCGCG TTTTCGCTGT CACGCCCACA AATTCCACAT TCTCTTCGCT GCCGGTAATT 
TCTTTCGCAA TCAAGCGTGC AATATCTgGT TCCATTCCTT CAtTTTTGTn TGATCAGGAT 
TCATATAGCC AAAATTAGGA ATGCTTCTTT GACACCTACT TnAATnTnCC CTGCTTTTTT 
TAAT 

(2) INFORMATION FOR SEQ ID NO: 58: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7254 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



18480 
18540 
18600 
18660 
18720 
18724 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
AAAAATTACG TGATTCGCTT AGCCTGACCA GACTTTTGTC TAAGTCTTAA AACTTGATAA 
GTAATAAGAG GTCGGGACAG AAGTGCTTAA CTCCGAGAAC AAAGTAGGTA CTGTTCAACA 
TCTATTTGCT TTGCAAATAG TGTTTCTCAG CAATAAGAAG AAATTTCCGA AAATTGTCAT 
AAAACACAAC GAGGAACGAG TTGATGTTGC ATGATACACT ACACTCCGTT TCGTTCGCAT 
CAATTTCTAA GCATGGAACA TGCTAGAATT GAAGGACCCG GTGTTCTTTA ATTTTTGGAG 
ACCAAGTCCT TCAATTCTTA GTGATTTATC ACTTAGAGAT TGATGAGATC GGAGCTCTGC 
GAAGTAGGTA TTGTTCATCA TCGAATCACT ACGTTCTTCG TGATTCTCAA CAATTTCGGC 
TTATTTCCGA AGGACCTTCA CCAAGTAGGT ACGATGATTT CATCTGTTCA TGCTTCACAG 
TGAAACATCG CAACTCTTAG AGCTGTAGCT CTTTTGAAGG CTGTACTGTG ACGAGACGCA 
TGTCGAGAAA ACACCTTAGA AGTTGTTGAG ATCGGAGCAA CGCGTAGTGT TGCTTCTGTC 
CCCGCCGTTT ATCAATTTTT AAGCCTGGAA CAAAAATCCA AAGTGATTTT TGTTCCAGGC 
tCTTACTTTC ATACTAATCT TAGGTCAGGT TGTCATCAAG TTTGCCAATA ATAGACTTTT 
ACAAGCGCGC CTTTTTTGAG TAGTACACCA GCTGGTTGTT CTAGCAGGCA ATTTGTTTCT 
AAAGTATTTG ATAAATTGGA TGAAAGATGA CTTTTTGTCA CTAAATTTAC TGTGCCATCT 
ATTTCTTTCG CTCGGTAAAA TCGACGTAGT CTACTAGGGG TTAATTCATG CGCTAAAATT 
ACTGTTCGTT CCTTGAGATT TAACTCGACA CAATTAAAAA AAGTAGCTAA TAATGTCCAG 
TAAAAAAGAT GTAAATTAAC CATTGCCGCA AACGGATTGC CTGAAACGCT TAAAATGAGT 
GTTTCTTGAA ATTGACTTCC CATAACTGGG GTTCCMGTT TCATATTTAC AAAATGAAAC 
AACTCTGTTC CTCCGATACG TTTAATTGCT AAAGGTAAAT TGTCCTTTTC ACCTACCGAA 
ACACCACCAC AAGTAATTAA TACATCAAGT TTTGGTGAAA TCTCAATCAG CCTCTTCGCT 
AATTCTTCAG GGTCATCTGA ACAATGGTCT AGGTGGATTA CTTCATTACC ACTCGCTTTG 
ACAAAAGCTG CTAACATGTA GAGATTACTA TTATAAATTT TCCCAACTGA CAGTGGTTCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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TCCAAAGAAA CTAACTCACT ACCGGTCGCT AAAATCCCCA CCTTCATGCG CTTCAAAACA 
ACAACTTTAT CAATCCCTTG CATTGCCAAC AAACCAATCA TGCGACTATT AATCTGTTGA 
TGTTTTGCTA AAATTTTTTG CTGAAAAGCA ATATCTTCTC CAATTTCCGC ATAGTTTTGG 
CCCGCAGTGA CAGAGCGAAA AATTTTAACA TGGTCTTGCC CAAAATCCGT CCATTCTTGT 
GGCACTACAG CATCAAAATC GGTTGGAATA AAAGCACCTG TTGTTATTTT AATCGCACAG 
TTATTCGTCT GATTACAATA AAACATGGAA TCCCCTGCCA CTAAAGAACC GACGACTTTT 
AAACAACTTG GGCTTTCAGC AGTCGCACCT CGTGTTTCTA CTGCACGAAC GGCATACCCA 
TCCATCCCGG CACGAGGAAA ATGAGGAATC GCTGTCTCTG CAAAAATGGT TTCTCCGCTA 
ATCCGTCCCA AGGAATCAAG CAGCGCCACT TCTTCTGTTT CACATTGACT CTTCATATGT 
TGGGTCATTT TCTCTCGTGC TTCTTCCAAC TCGATCATGT TTTAACCATC CTTTTTTATT 
TATTCACTGG TTTAATCGCA ACGGCACAAA CTTTATATTC AGGAATTGTG GCGATATCAT 
CAAATACAGC ATTGGTGATT TCGTTAACAT TCCCATCTGG AAAGTGGAAT GTCATAAAGA 
CTTCTTGCGG AAATACTCGA TTTCCGACAG CCGCATAGGT TTCGATTTTG CCACGTCGAG 
AATGAACTTC AACTTTGTCA CCTTCTTGGA TTCCCAGAGC TTGTGCATCG ACTGCATTTA 
TCTCAATATA AGAGTGATTC GCTATTTGGT TAATCCCTTC CGTTCGTCCT GTCATTGCTC 
GGGTATTGTA ATGGTAAAGC ATTCGGCCTG TCGACATTAA ATACGGGTAC TCTTCATCTG 
GTAATTCTTG TGCTTGCTTA TACGGAATTG CTTTAAAT7VA ACCTTTCCCA CGAGCAAAAC 
TACCCACATG CATAATCX3GC GTACCTGGAT CTGTCAAGCT CCGACAAGGC CATTGTAAGC 
TCTCCTTCTC TAATCGTTCA TAATTAATTC CACCAAACGA AGGCGTCaCA GCTGAAATTT 
CTTCCATAAT TTCTTTTGCA 6ATTCATACG CACAAGGATA GCCCATTCGT GTCATGACTT 
CACAAAAAAT TTCATAGTCT TCTCGCGCTT GTCCTCTTGG TTCAACAGCT TTACGGACTC 
TTTGCACGCG CCGTTCTGTA TTTGTAAATG TCCCGTCTTT TTCTGCATAA CTGATGCCTG 
GTAAAACGAC ATCTGcATAA GCAGCCGTTT CGGTCATAAA TAATTCTTGA ACAACTAAAA 
AGTCnTAAAC TTTCTAAAGC TTGACGAACA TGTCCTGTGT CTGGATCCGT AACAATTGGA 
TCTTCTCCAA AAATATAAAG GCCTTTGACG CTTCCTGCCG TTGCTGCTGG TAAGACTTTA 
GTAGAAGTTA AACCTGTGTT TCGATTTAAC GGCACATGCC AAGCTTTTTC AAATTTGTCG 
ATCACTTCTG GATTATTGAC TTTTTGATAG CCTGGGAAAT CATAAGGCAT ACAGCCCATA 
TCGCAAGCAC CTTGAACATT ATTTTGACCT CGCAAAGGAT TGACACCACA GCCTGGTTTT 
CCAACTTTAC CAACCAACAT TGCCAAATTA GACATACTCA TAACACCCTC AGTTCCTGTC 
G7ATGTTCCG TTACACCTAA ACAATAAATG ATTGGTGCTT TCTCTGCTTT TGCATACATG 
CGTGCTGCTT GAATTAAATC TTCTGGATGA ATATGACAAA TTTCTGCGAC TTTTTCCGGC 
GTATAATCAG CCACCATTTT TTCTAAATCT AAAAATCCTT CTGTCCGTTC TTCAATAAAA 
TGGCGATCTG CTAAACCTTC TTTCAAAATC ACATGCATCA TGCCATTAGC AAATGCGACA 



1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 
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2520 
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3000 
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3180 
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TTCGTTCCCG CCTGAACTTG TAAATGCAGC GCACTATCCT TCACTAAGTT AATTTTCCGC 3360 

GGATCAACAA CAACAACTTG CGTTCCTCTT TGGATGGCTT GGCGAATTTG CGCACCAATA 3420 

ACTGGATGAG CTTCTTCTGG ATTGGAACCT ACTAATAAAA TCATGTCGAC ATCTTCAGTG 3480 

ATATCAGCAA TAGGATTAGT CATCGCACCA GAGCCTAAGG TTTGGGCTAA ACGATGGACC 3540 

GAAGCCGAAT GACAAACACG GGCACAGTTA TCAACATTAT TGGTACCAAA AGCAGCACGC 3600 

ACCATTTTTT GAAAGACaTA ATTATCTTCA TTTGTCGCTC TAGAGCACGA AAAACCAGCA 3660 

AGAGCATCCG GACCGT'^T^C TGCTTTAATT TCGTTAAATT TTGACGAAAC AAGCGTTAAG 3720 

GCTcTTCCCA AGAAGCAGGT TCAAAAATGC CATTCCGTTa ATTAATGGCT CCGTTAAGCG 3780 

ATCTCCTGAC CCAACAAATT TATATGAAGC AAATTTTCCT TTCACACAAA GTAAATTTTT 3840 

ATTAGCTGGT CCATCCAGGG GTTCAACACC CACCAAACGA TTATTTTTAA CGAGTAAATT 3900 

CATCTGACAG CCTGTCCCAC AATGCGGACA AGTTGTAGGA ATTTTCTGGG TTTCCCATTT 3960 

GCGGTACTCT kTCGTATCTT TTGCTGTTAA CGCACCTGTA GGACAGGTUVG AGACACAATT 4020 

GCCACATGAT TCACAGATAG ATTGATCAAA CGCCTGCCCA TAACTCGGCA TCATTTTTGT 4080 

TTCAAAGCCA CGATTAGCAA TACTAAGTAC ATCTCTTCCC TGCCGCAATT GACAGACACG 4140 

GGCACAACGA CGACACATGA TACACTTTTC TGGATCATAA GAGAAAAATG GATTACTGGT 4200 

ATCTTCTTGA TGGCAAGGCA TTCTTTTTCC CTCGGTAAAG CTAGTGACAT CAATCCCATA 4260 

ATCTAGCGCA TACTGTTGTA ATTGACAATC CCCGTTTTTT CCACAAGAAA AACATTCTAA 4320 

TTTATGATTA CTCAATAATA AATCAAGAAT AA/VACGTCTG GAATCAGCAA CTTTTTCAGA 4380 

ATGTGTTGCC ACAACCATGT CCTCTTGACA ATGTGCCGTA CAAGCAGTTG TTAAGCCACC 4440 

ACGTCGGCCA CCTTCCACCT CAACCACGCA CATCCGACAA GAACCATCAG GTGCTAATTC 4500 

CTTTAAATGA CAGAGCGTTG GGATTTCT^C ACCTAGCCCT TTCGCTGCTT CGAGAATGGT 4560 

AGTCCCTTTA GGCACAGTTA CTTCTTGGTT ATCAATAGAC AACGTAACTG TTTGCGTATG 4620 

CAACTTTGTT TTCATTTTTA CCAACCTCCT TTGCTGGTGG CATAGATTTC TGTTTCAGCA 4680 

GTCGTTTCAA ATTCTTCTGG AAACAAAGCC AATGCACTCT TCATTGGATT tGCGACGGAT 4740 

TGGCCCAAAC CACAAGCTGA TAATCGCGTG ACATGTgTGA GCATCTTTTc cAaTCGCGGC 4800 

AAATCACTAG CTGGTTGCCT CGTTCCGATk GAATTTACTC AATAATTCCA AAATCCTAGT 4860 

TGTTCCCAAT CGACAAGGGG TGCATTTACC ACAGGACTCA TGAGCAAAAA AGGCCGCCAC 4920 

GTGTACTAAA TAGTCCACAA CATTAACTGA ATCATCCATT ACTACAATTG CACCAGAGCC 4980 

GACAGATAAG TCATGGGCCC ATAAATCTTC GTAGGAATAC AAACAATCCC TTAACGTTGC 5040 

GACAGCACCA ATTGGACCGG ATTGTCCACC GAAGTGAATA AATTTCAATG CTCGACCAGT 5100 

TGCAGAGCCT CCGCCGTATT GCTCGCCATA AATTATTTCT TCTAATGGTG TTCCAAGGTT 5160 

GACCTCGTAT AAGCCCCGAT TTTTAATATG TCCAGACAAA CAAATTAGCT TTGTTCCGCC 5220 

ACCATCGGCA GTACCCATCT CTAAAAAGGC TTGTCCACCT TCTCTTAAAA TTACAGGAAC 5280 
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ACTTGCAAAT GATTCAACAT TATTTACTAA TGTTGGTTGC AAATATAAAC 
TAAGTGTGGT GGTTTGACAC GCGGCCGCCC CGTTTTTCCT TCAATTGAAT 
TGAATTTTCC CCACAAATAT AGGCGCCAGc ACCAGAAATA ATCGTAATAT 
TCCTTCAATT CCTAAAATGT TTTCCCCTAA AAAGCCAGCT TGTCGTGCAT 
TTCTTGAAAT GTTTTTTGGA TTCTGCGATA TTCTCCTCGC ATATAAATAT 
TGCTGAAAAT AAATAGCCAG C7VATAATCAT TCCTTCAATC ACACTCAATG 
TAAAAGTACT TTATCCTTAA AGGTCCCTGG TTCGCCTTCA TCAGCATTAC 
CTTTGTGGtT CCTTTTGcAT GATAAAGATG GcGCCATTTT TTTCCTAAAG 
TCCACCGsGa CCTCTTAAGT GcGcAATATC CAGTTCGTTT AAAATTTCTT 
TGAAATTGCT CTTTTTAAAC CATCAAATCC ATCGTATTTA CAATATTCAG 
CGCAGATTGC ATTTTCCCCA TTCTCTCTAA TAACATTGGT TGGTTTCTCT 
CTCACCTCyT ACAGTCCTTC ATAGCAATCA TTTTGTAAAT GTTGAAwTAA 
TTCTCTTCTG TTAGTTGACT AAAAACGGTG TCCTTAATTT TAATTACTGG 
CAAGCCCCGA TACAAGGAAT ACTATGATAC ATAAATAAGC CATCTGGTGT 
TCTGGCACTT CTAAAATGGT TTCTAAAACT TCGGCAACCA TCGCCCCACC 
CACGGTGTGC TGTTACAAAT CTTCAAGACA TACTTCGCTT GCGGTTCCGT 
GCATAAAAAC TGACGATTTC ATAGACTCGC GCTTCCGTTA AATGTAGATG 
AATTGAGCTG TCTCCTGATC GATATACCCT TCCTCTGAAG CAAATTGTAA 
ATATTTAAAA TTCGTTGTGG ATCAGCATCG TTTTCTAGAA TAATCGCTTC 
AAGCTTAATG CACACATTCC TTTTTATTCC CCCTTTTTAG TGAAACTTGA 
CTTGATAATA GATACGCAAT TTTTTTGGTA TTTCAGCAGA TAGTAAACAA 
TATCTATGAC TTTTTTTGCT GATTCTTTTT GGCGATTCTG GTTATTCATC 
GTCCTGGCTG CACAACATTA CGT/^TGAAT TTGATTCACA AATGATTAAT 
GCACAGCTTT CAGAAATTGC TGAATGGCTT CCGCCAAATT TTCTTGAAAA 
GAAAAACTTT TTGACAACCT GCAGCTAATA ATTTCATGGT ATCTTTATTG 
CTTGCTCTTC TACT7UVTTCA TAACCAGCTG TAATACTGGT ACAAATTCCG 
TGCCTCGTTG ACACACCCCT CGTTGACCAC TAATCGTAAT AATCTTCAAG 
GATAGACTTC TTGGTACTGT TTAATAAGGG CTTCGGCTAA TGTCGTTTTG 
GACCAGTAGA ACCAATTTGA ATCATGTTTG GTATCTCAAT TGCCATAGAG 
TGAATCGTTT TCTCAACTCT TGGTTTGACA GTTGCTGGTT TTGTCATAAA 
ACGGCCATAA AAAGACTGCC ACCTACTGCA TTGCCTAAAA ATACCCATAA 
AAATAATCCC ACCAACTGAT TGTGGTACTT TGTGAAAAAA TTGCCGCTGG 
GCATTAGCGA CCACGTGTTG AAACCCACTT GGGGATTACA ATTCAAAATG 



CAACATCTGC 
TTAAAAGTGC 
CATAATTGAA 
TATCGAGTGC 
AACCTGCTTT 
GATCTTCAGA 
AAACAATGTA 
GaTAGGcTGc 
CATCCGATAG 
CAACATCgGT 
TAAGCATTGA 
TTGATAAATT 
ACCTAAATCA 
CGGCTGATTC 
AGTATAATGA 
CTTTAA7UVTG 
CTCTGCAACT 
TTCAATTAAA 
TTTTTCAATT 
AGTTGTTCTC 

aaatcggctt 
ataagaaata 
gcttgggccg 
actttcaata 
ccaatcgtct 
catccttttg 
ccataaattg 
ccactattcc 
ctcctccgtc 
'gatcaatggt 
tgtattctgc 
aataataaat 

AGAT 



5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7254 
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(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7296 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

GGGGCCAATT TaGaAAATJcG ACmATTGAAT AGAAAAGGaT GTGAACaAAT GGCaAAAGAa 60 

ACAAATGTTA AgTTAGTCaC GGTGAGTGTT TTTGTGGCAA CATTTATGAC AGCCATTGAA 120 

GGGACCATTG TGTCTACTGC GATGCCAACG ATTGTCGGCT CGTTACATGG CATGGAAATT 180 

ATGAACTGGG TATTTTCAAT TTATTTATTA ACGAATGCGA TGTTAACACC GATTTATGGG 240 

AAACTTGCGG ATAAAATTGG TCGTAAACCT GTCTTCATGA TTGGCATTAT CATTTTTATT 300 

TTGGGCTCCT CGTTGTGTGG CTTTGCTCAA GATATGTTGA CTTTAATTAT TGCCCGCGCA 360 

ATTCAAGGTG TGGGGGCAGG CGCAATTTTA CCAGTTGCGT TAACGATTAT TGCCGATATG 420 

TATACATTGG ACAAGCGAGC GAAAATTTTA GGTTtAAACA GTGCCGCCTG GGGTWVTTGCT 480 

AGTATTTTTG GTCCGTTAGC AGGTGGTTTT ATTGTAGATA CAGTCGGTTG GCATTGGATT 540 

TTCTTCATTA ATGTTCCTAT TGGACTTGTT TTATTGGGCT TGATTAGTAT TTTCTTAGTT 600 

GAACCAAAGC GGGAACGGAC CAAGATGCCA ATGGATATTT TGGGCAGCGT TACTTTGATG 660 

GTAGTGCTGC TAACGTTATt GCTAGGTTTT CAAATGATTA GCGATAATGG TTTTACATTA 720 

GTAACATTTG GTTGTTTAAG TTTGAGTGTG CTCTTTTTTG TAGCATTTGT GATGATAGAA 780 

AAACGCGCGC AAGACCCAGT GATTGATTTG CATTTATTTA ATCAACCAAC GTTTGTTTTA 840 

GTTW^TCTTA TTGCAGCGCT TATTAGCGGT TTCTTAATGG GGATTGATGT CTACATTCCG 900 

ATGTGGATGC AAGGTGTCTT AGGAAAAAGT GCAGGAATTG GTGGCTTAGT TTTAGCGCCC 960 

TATGTCGTTA CTTTGGATGG CTGGATCATT TATAGCAAGT AGTTTTATGG AAAAATATGC 1020 

TATGAAAAAA GTCTTAACGA TTGGGTTATC GATTCTATTA GTCGGGGCCA TCTTTTTAGT 1080 

GGTAATGCCA ATGGCCGTTC CGTTTTGGCT TTTCTTTGTA GTGTCTTCTG TCTTAGGAGT 1140 

TGGTTTTGGG ATTACTATCA CAACGACTAC GGTGACAGCA CAAAGTACAG TGGAGCCTGA 1200 

AAAAATGGGG GTTGCAACAT CGTTTAATAC GTTGGTGCGT ACAATTGGGC aGACTGTGAT 1260 

GGTGTCAATT TTTGGTGTGA TTTTAAATGc AGGAATGTTT GCGAAATTGG AAGCGAGCGC 1320 

GTTAAACGTC GATGCAGATG TCATGAATCA ATTAGTGAAT CCACATACTG CaAATTaATT 1380 

CCAGCTGCGT TGTTAAAACC ATTACGCGGT ATCCTCTATG CAGGTCTGCA TAATGTTTAC 1440 

TTAGTCGGTG ■ CGGGCTTAGT TGTTGTCGCT CyTTTGTTAA ATATTTTCGC AAAAGCGCAA 1500 

CGAGCGAAGG TTTAGTGGTA CACTAATAGA AAGAGTATTG GAGGGAAAAG TAGATGGCCT 1560 

TTTTACAAGC AAATATTTAT TCmAATGTAT TGGAAATGGA AGTGAATGTG AACGTAATTT 1620 
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TACCACAAGA 
CCGTGTTGTA 
CTATTGAGCG 
CTTGGTATAC 
CGAAAATTTG 
CTGGTTTGTC 
TTGCAGCCGT 
TATTAAAAGT 
AAGGTTCCAT 
TGCCCCAATT 
TGGCTCAAGC 
ATGACTGGGT 
AATAGAGAAA 
ATGACTGACG 
CATCTAAGTA 
ATTGAATGCT 
ACACTTGTTC 
CCGTTGCGTC 
TCATTTCTGG 
ATCGTTGGGG 
GATAGAGATC 
ATGCAGGTTC 
TATTTTTAAA 
GCaTAGAAAG 
TCGTGTCTTT 
AGTTTATCAA 
CGCTTGTATT 
ATTGGTGAAG 
ACTGAAGAAG 
ACGCCAAAAA 
TCTCAGCACT 
ATAGCGACAA 
TTGCAGGGGT 



AACGGTAAAA 
TTTATTACAT 
GTATGTCGCT 
GGACACACAA 
TCATCAACTC 
AATGGGTGGT 
TGCTTCTTTG 
CCGTAAGCGA 
TCACGATCCG 
TTACCTTTGT 
GTTGGAACAG 
GTTCTGGGAT 
GTAGTTTTCA 
GACCAAGGGc 
GGTTTTAACG 
TTCCTCCATG 
TTTTAAATAG 
CATAATCATC 
TGAAAAACCT 
ATAGGCAGGG 
ATAATAG7TT 
TTGAAAGTGC 
ATACCmATaA 
TTGtTCTAAA 
TGATAATTTC 
AAAGAAGGTA 
TTACCACTAA 
TCAATGAGCA 
CAGGGAACTA 
ATACAAAGGC 
TTGGCAACCA 
AATGAAAAAC 
TATTAATACA 



AAAGTGGGCA 
GGGATGGGAG- 
GATTTGGGGA 
TACGATATGA 
TTTCCACAGT 
TATGGCGCCT 
TCTGGGGCAG 
AgcTATTGGG 
CTGTATTTAT 
TGTGGGGAGC 
GCACAGGCAA 
GAATGGATTC 
ATATTTATTT 
TTTTTTTATT 
GCTTGAATAA 
TACGTTAACT 
TCATCTCCGT 
CCAAATAAAA 
GCTTCTCTTA 
ACGGTTTGGA 
TCCATGAAGG 
TCTTCAATCG 
ATTGcGGGCG 
GTGGGACTTT 
GGTTCCATGA 
CACTTTATGA 
AAATGGTAAA 
AATTGATTTT 
TTTTAAGAAA 
TCGGTAGTTA 
AAATTGTTAC 
TTAGGGATTC 
ATTTTTTCTG 



485 
CAAGTACCCA 
GGAATCACAG 
TTGCTGTCAT 
ACTATTGGAC 
TAAGTACAAA 
TGAAATTAGG 
TCAGTTTATC 
AAGGTATTTT 
TACAACAATT 
AAGATATGTT 
GTTACACTTT 
AAAAAGCCTT 
TAAAAGTTAA 
TAGATGAAGA 
AGGCGGATTT 
CATTGTCGGA 
TCAACACTTG 
GATGTTGTAA 
AAATGCCCAT 
TTTCGATAGC 
CTAGTAAGTC 
CTTCAGCCAT 
CCTGTACACC 
TTTCTAACAA 
TTTTAGCTCC 
TAGGAAAAAC 
TTATGAATCG 
TTTGTAATGG 
GTAGGGATAA 
TTGCCTTGTT 
AACAAGTGTT 
AATTAATTGC 
CCAAAGTTGC 



AGCAGCATTA 
TGTCTGGGCT 
TATGCCGTCC 
GTTTATCTCA 
ACGAGAAAAA 
ATTAGCGAAA 
ATCCACTAGT 
TGGCCCGTTA 
AGTAGAAAGC 
ACTTTCTGCT 
TGAAACTGGA 
GGCTTGGCTA 
ssCTTGGTCC 
TGTGTTGTCT 
TTGGTGGATG 
TACATATTGT 
CGAAACGAGC 
AGACGTTACT 
CATTTGATTC 
TACGGCACAG 
AGAATACCAT 
ACTTTGAAGT 
AAGCTGTTTG 
AGAAAATGCG 
TATTAAACGT 
TTGTGAAAAA 
GCCTTATTTT 
GGAAGAATTC 
GATGGACCTT 
GGCGGTGATC 
GGAAGCAATT 
CATTGCAGGA 
CCAAGGTGTC 



ACAGATATTC 
CGCAGAACCT 
ACGGATTTAG 
GAAGAATTAC 
ACCTTTGCCG 
CCAGAATCTT 
TTTGGGGAAT 
GATCAAATTG 
CAAACAGAAA 
AATCAACAGA 
CCAGGAGAAC 
CCAATACCAA 
GTCAGTCGAA 
GCTTGCAACC 
GAGTGTCGAA 
TTCATGTGCA 
TGTTTTTCCT 
GCTAAATGAG 
AAATGGCGTA 
GGAAATTGTT 
TCACCACATA 
AAGGCTTGCT 
GCGACTTTAC 
GCTGCAATAA 
TATCTTCGTA 
TTAACTTGTT 
ACAGTGTTAA 
GCAGACCAAC 
ATTATTCAAC 
GTAATGGTTG 
ATGAATGACG 
TTAGGACTCG 
AGTGCTGACA 



1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 
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TTCGTGAAGC AACTTTCCGC AAAATCCAAA CCTTTTCATT TGGCAATATC GAAAAATTTT 3660 

CAGCAGGTAA CTTAGTTGTT CGTTTAACAA ATGACGTTAC TCAAATTCAA AATGTCATTA 3720 

TGATTGCGTT GCAGACACTT TTTAGAATTC CTTTTTTATT TATCGGGAGT TTTATTTTAG 3780 

CGATGCTAAC CTTGCCACAA CTATGGTGGG TGATTGTTGC ACTTGTCATC GCAGTTATTT 3840 

TGATTTCTAT GCTGTCCTTT TCGCAAATGG GCAAGCATTt CATGATTATT CAAAACTTGA 3900 

TTGATAAGAT TAATGGTATT GCAAAAGAAA ATTTGTTAGG CATTCGTGTC GTCAAATCAT 3960 

TTGTTCAAGA AAAAAATCAA TTGTCACGTT TTACAAAAGT GaGTGAAGAA TTaAcCACCC 4020 

ATAACTTAAT CGTAGGTAGC TTATTTGCTG TGATGATTCC AGCGTTTATG TTGGTGGCGA 4080 

ACTTAGCTGT TGTAGGCTCA ATTTTCTTTG TGAGTAACTT AGTCAAGGAC GATCCAACTT 4140 

TAATCGGTGG TGTCGCTTCC TTTATGAACT ATTTAATGCA AATCATGATG GCGATTATTA 4200 

TCGGCGGAAT GATGATGATG ATGACTTCTC GTGCCGCTGT TTCCATTAAG CGGATTAAAG 4260 

AAGTGATGGA AACAGAACCA GATGTTACCT ACAAAAAAGT CCCAGAACAA GAGTTAATTG 4320 

GTAGTGTTGA GTTTGACCAC GTATCTTTCC GTTATCCTGG GGATGAAGAA GATACCTTGA 4380 

AAGACATCTC TTTCTCTATT CAGCCAGGTG AAATGATTGG GATTGTGGGC GCAACTGGCG 4440 

CTGGTAAATC TACATTGGCA CAATTAATTC CACGTTTGTT TGATCCGACA GAAGGAAAGA 4500 

TTGAAGTTGG TGGTGTCGAT TTACGAGAAG TGAACGAACA CAGTTTACGA AAAACAGTTT 4560 

CCTTTGTGTT GCAAAAAGCA ATCCTGTTCT CTGGAACGAT TGCGCAAAAT CTACGCCATG 4 620 
GTAAACGTGA CGCTTCTGAA GCGGACATGG AACGGGCTTC AGGCATCGCC CAAGCCAAGG 4 680 

AATTTATTGA AAAATTAGCA GAAGGTTATG ATGCACCCGT CGAAGAACGG AGTAATAACT 4740 
TTTCTGGTGG ACAAAAACAA CGGTTATCAA TTACAAGAGG TGTGATTGGT GAACCTAAAA 4800 
TTTTGATTTT AGATGATAGC ACCAGCGCGT TGGaTGCTCG TTCGGAGCGG TTAGTCCGAG 4860 
AAGCGTTGGA TAAAGAGTTA AAAGAGACTA CCACGATCGT TATTGCCCAA AAAATTTCTT 4920 
CTGTTGTCCA CGCAGATCGT ATTTTAGTAT TGGATAATGG ACGTTTAGTC GGGGAAGGCA 4980 
CGCATGAAGA ATTAGCAGCT ACGAACCCCG TGTATCAAGA AATTTACGAA ACTCAAAAAG 5040 
GAAAGGAGGA AGCTTAATGA CTGATTTAAT AAAAGCTAGT AAATTTTTCT ATCATTATTT 5100 
GAAACGTTAC AAAGTTTCGT TCCTCTTTAT TTTCTTAGCA ATTTTCGCAG CGACTTATTT 5160 
ACAAGTCAAA GCGCCGCAAT TCGTTGGGGA AGCTATTCAG GAATTAGCGA AATATGCGGT 5220 
TAATGTGATG CAAGGAAAAG ACGATAAAAG TGCGTTCGTT TCTGTCATTT GGAAACTACT 5280 
CATTTTTTAT GTCTTAACTA GTGCCGCTAG TTTCATTTAT AGTATTCTCT TTACACAAGT 5340 
CGTGGGGAAA TCGACGAACC GCATGCGGAT TGGTTTGTTT AACAAATTAG AAAAATTGAC 5400 
GATTCGTTTC TTTGATTCTC ATCAAGATGG TGAAATTTTA AGTCGTTTTA CTAGTGACTT 54 60 

AGACAACATC CAAAATAGCT TAAACCAAGC GTTGCTACAA GTATTAACCA ATATTGCCTT 5520 
ATTGGTTGGT GTCTTAATCA TGATGTTCCG TCAAAATGTG GAACTGGCAT GGGCCACAAT 5580 
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TGCTTCTACG CCGATTGCGA TTTTAATTGC GGTCTTTGTG ATTAGCAAGG 


CGCGCAAATA 


5640 


TGTCGATTTA CAGCAAGATG AAGTGGGTAA ATTAAATGGC TATATGGATG 


AAAAAATTAG 


5700 


TGGGCAACGT GTGATTATCA CTAATGGCTT ACAAGAAGAA ACCATTGACG 


GCTTTTTAGA 


5760 


f:rAAAATGAA AAAGTTCGTG CCGCTACGTA TAAAGGTCAA GTGTATTCAG 


GATTACTTTT 


5820 


rrr AATGATG CAAGGAT TGT CATTAGTCAA TACGGCGATT GTTATTTTCT 


TTGGTGGTTG 


5880 


PTi-AnrAATC AATGGCTCTG TTGATCGTGC CGCTGCGCTA GGTTTAGTCG 


TTATGTTTGT 


5940 


CCAATATTCT CAACAATATT ATCAACCATT AATGCAAATT TCTTCAGGTT 


ATAGCATGAT 


6000 


TCAATTAGCT GTTACAGGCG CTCGTCGTTT AAATGAGATG TTTGATGAAC 


CAGATGAAAT 


6060 


TCGTCCCGAA AATGGCGAAA AATTAGAAGa GATTAACAAA GCTGTGGCTT 


TAAATCAtGT 


6120 


TGTTTTTGGC TATAATCCAG AAACGCCTGT GTTGAAAGAT GTTTCTATTC 


ATGTGGATAA 


6180 


AGGTGAAATG GTCGCTTTAG TTGGACCGAC AGGCTCAGGA AAAACAACGA TTATGAATTT 


6240 


AATGAATCGC TTTTATGATG TGAATGAAGG CGCCGTGACG TTTGATGGTG 


TGGACATTCG 


6300 


TGAAATGGAT TTAGATAGCT TGCGTTCACA TGTAGGCATT GTTTTGCAAG 


AGTCCGTTCT 


6360 


ATTTTCAGGA AC7ATTCGTG AGAATATTGC TTTTGGTAAG CCAGAGGCAA 


CGGATGAAGA 


6420 


AATTGTTCAA GCAGCGAAAC AAGCGAATAT CCATGAATTT ATTGTAAATC 


TTGAGCAAGG 


6480 


CTACGATACT GAGATTACGG AAGAGAATAA CCTTTTCAGT ACGGGGCAAA 


AGCAATTAGT 


6540 


AAGTATTGCA CGAAC6ATTA TTACCAATCC AGAATTATTA ATTTTGGATG 


AAGCAACAAG 


6600 


TAATGTTGAT ACAGTAACAG AAGCTAAAAT CCAAAAAGCG ATGGACGAAG 


CAATTAAAGG 


6660 


GCGCACCAGT TTTGTGATTG CCCATCGTTT GAAAACGATT CTAAATGCGG 


ATCGGATTAT 


6720 


•TGTTTTACGA GACGGTGAAG TCATTGAAgA AGGCAACCAC CATGAACTAG 


TTGAACAAGA 


6780 


TGGATTCTAT GCGGAACTTT ATAAAAtCAA TTTGTTTTTG AATAAGTCGA 


AAAAGAAGCA 


6840 


AAATGAGTCA TTCTCATTTG CTTCTTTTTT TGCTATGCTA AATAGAAAAA 


GGAGAAATGT 


6900 


CAATGGAAAA AATGAGGAAT ATACGGCAAT TACGGCGCAC GTTTTTGTTG 


CT/^AATACTT 


6960 


TATTTGTACT GAGTGTATTC AGAGAGAATC GATGGACAAA ATCGGCACTG CTTTTTTTGG 


7020 


TTGTTGGCTA ATAGTTTTTT ATATGGACTG CAGTAGTTGA TTTATCTTGG GGGCGAAAAA 


/ UOv 


GGCTGGCTGA TACATTTGAA GCGGTGGTAT TATTGCGCGG TTCTTCTCCA AATAGCAGGC 


7140 


ACATTGATTG TATATTTTTC GTTTTTTGTA TAAAGGTCTC TCGTTTCTTG 


CCGGAGAATA 


7200 


TGCTAAAATA AAAAGCAGCA GGGGGCGAGA AAACCATGGA AAAGCCnAAT 


nAACGCCnAG 


7260 


CTGGATTTAG AACGTACACG GGCCAATTAA TTTTAA 




7296 



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9927 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOUXSY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



GATTAAATGT 


CCTTGCCTCT GTTTTTGAAA ATCCTAAATC AAGCCTTTCA CACCTATATC 


60 


TTTTCGGTAG 


TACATTCCAG 


GAAATGAGTA ATTATCCAAT 


GTTTGATAAA 


CTTTTTTGCG 


120 


GGCTTCTTCA AGAGTCGAAG CAGTTGCTTC TGCTAAAAAG ATTCGTCCGC CAGCCGAAAC 


180 


CAGAGCCTGT 


TCTGAACGAG 


TTACCCCTGC 


ATAATAGATG 


GTTGCACCAG 


ACATTTCTGG 


240 


CAATGGTAGA 


TTTGTCTTAT 


ATTGATTAGG 


ATAGCCTTCT 


GCTGCTACAA 


CAACACCAAC 


300 


ACTCGCTAAC 


TCATTCCATG 


CAATCGTTGG 


TTGCTCTTTC 


ACCAGCAATG 


AAGAAATTAT 


360 


TTCTGAAAGG 


GGACTCTCTA 


AACGTGCCAA 


CACAACTTGT 


GTTTCTGGAT 


CCCCAAATCG 


420 


AGCATTAAAT 


TCAATA/\CCT 


TGGGGCCAGT 


TGCTGTTTGG 


aTTAAACCAG 


CATATAAAAT 


480 


CCCTTGAAAC 


GGACGTTTTT 


CAGCCACCAT 


TCCTTTAGCA 


GTCG6TTTCA 


GGACTTGGCT 


540 


AATGGCTTGC 


TGAACCGCTT 


CTTCTGGGAC 


CTGAGGCACT 


GGTGTATAAG 


CGCCCATCCC 


600 


CCCTGTATTC 


GGACCAGCAT 


CCCCATCGAA 


AATCCGCTTA 


TGATCTTGAG 


CAATCACCAT 


660 


CGGATATACC 


TTCTCATTAG 


AAACAAAAGC 


CATTAAGGAA AATTCTTCTC 


CTTGCAAAAA 


720 


TTCTTCGATT 


ACAACCGTTT 


GACCGCTTGC 


TCCAAATTGT 


TGGTCCTCCA 


TCATCGCAAC 


780 


TAGCGCCCGT 


TTTGCGGCTG 


CTTTTGTTTC 


GGCAACTACT 


ACTCCTTTTC 


CAGCTGCCAA 


840 


ACCATCTGCC 


TTAATAACAA 


TTGGTACTCC 


TTGCTGTTCA ATATATTCTA AGGCAGGTGC 


900 


CAAACGTCGA AAGACCCGAT 


ATTCCGCCGT 


TGGAATCTGG 


TATTTTTGCA 


TAAAATCTTT 


960 


CGCAAAACTT TTTGAACCTT CAATTATTGC AGCTGCTTTC GTCGGGCCAA AAATCAGTAA 


1020 


TCCCGCTGCT 


TGAAAGGCAT 


CTACAATTCC 


TTCAATTAAT 


GGCGTTTCCG 


GTCCGACGAA 


1080 


TGTCCAATCA ATCTGATTAT 


CCTTCGCAAA 


TTGAATTAGT 


TCCGTTTGCT 


GCTCTTCACC 


1140 


GATAGGTACA 


GTCCGAATAT 


TATCACCACT 


CATACCAGGA 


TTACCTGGTG 


CACAATAAAC 


1200 


AGTCTTGACT 


TTGTCTTCTT 


GCTGTAATTT 


TTTGGCGATT 


GCATGTTCAC 


GACCACCACG 


1260 


CCCAATCACT 


AATATATTCA 


TCGGTTTCTC 


CTTTAAAAAT 


TAATGTTTGA 


AATGACGAGT 


1320 


CCCAGTAAAA 


AtCATCGCAA 


TACCATAATG 


ATTCGCTAGA 


TCGATTGATG 


CTTGGTCTTT 


1380 


AATGCTCCCG 


CCTGGCTGAA 


TAATTGCTTG 


AATGCCaTGT 


TGTGCAGCAT 


ATTCCACGCT 


1440 


ATCTGCCATG 


GGGAAAAAGG 


CATCACTTGC 


TAAGACGGCC 


CCTTGTAATT 


TTCCAGCTGC 


1500 


TTGGGCTTGA 


TCAACTGCAA 


TTTTTACTGA 


ACCAATTCGA 


TTCATTTGAC 


CAGCGCCAAT 


1560 


CCCCACCGTT 


TG6CTTTCAT 


TGGCTAAAAC AATAGCGTTT 


GATTTTACAT 


GCTTGACTGC 


1620 


TTTCCAAGCA AAGTTCAATG 


CGTCTCTTTC 


GGCTGGTGTG 


GGTTGGCGCT 


CTGTAACAAC 


1680 


TTGCCATGCT 


TGGTCATTTT 


CAACAACTGT 


ATCCTGTTCT 


TGAATTAATA ACCCGCCCAT 


1740 


AACAGAAACT 


AATTCCACCG 


CTTTTTTTTC 


TTGAGCATGA AAATCTAACG 


TTAACAAACG 


1800 


AAGATTCTTT 


TTCTGAGATA 


ACAGTGCCAA 


AGCTTCTTCC 


TCAAAACTAG 


GCGCAATAAT 


1860 


TATCTCTAAA 


AAGATTTGAT 


GCATTTCAGC 


CGCAGTCGCT 


TGGTCCACTG 


GACGATTCAA 


1920 
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AACAACAATG 


CCACCAAAAA 


TTGAAACTGG ATCTGCTTCA AATGCCTGAC GATAGGCTGC 


1980 


CAAAATCGTT 


TTTCCTGTTC 


CAATTCCACA TGGGTTCATA TGTTTAACAG CGACGACCGT 


2040 


AGGTTCTGTA AACTCGCTAG CAATCCGGAG TGCTGCATCA GCATCTCGAA TATTGTTATA 


2100 


TGACAGTTCT 


TTGCCATGAA 


GTiGTCGAGC oGAACjCAAXI? UAlAAllaAlA UAlaolAwAAl..^ 


2160 


GGATTGATAA AAGGTGGC7T 


GTTGATGGCT ATTTTCGCCA TAAOUt-AAru 1 i HaUi lAV.,v» 


2220 


TTCATAAGTA 


AGCGTCTGTT 


TCTCTGGTTC AGTTTCACCC ACTTGCGTCt> riAAAlAGlL. 


2280 


AGCGATAAGC 


GCGTCATAAG 


CAGCGGTATG ACGAAAGACT TTTGCTGCTA ATCTTTTTCT 


2340 


AGTGGCTAAC 


TGGCTAGTTC 


CAGTGCTGCT AATTTCACTT AATACCTGTT CATAATCACT 


2400 


AGGATCAACC 


ACTGCTGTTA 


CGTCTTGATA ATTTTTTGCC GCTGAACGTA ACATGCTAGG 


24 60 


ACCGCCAATA 


TCAATCATTT 


CAATCGCTTC TGCTTCAGAG ATCGCTGGTT TTAAAATTGT 


2520 


TTCTTTAAAT 


GGATAAAGAT 


TCACTACTAC AAGGTCaATA AAGTGAATGC CTTGTTCAGT 


2580 


CAGTGCTTTT 


TGATGCGCTT 


GGTTTCCCCT TTTTGCCAAA AGTCCGCCAT GGATATTCGG 


2640 


ATGTAACGTT 


TTGACACGGC 


CATCTAACAT TTCTGGAAAG CCAGTGATGT CGTCAATAGC 


2700 


AAGTGTCGGT 


ACCCCCGC7T 


CTTCCAAGAC AGTCCTTGTT CCCCCAGTGG AGATAATTTC 


2760 


AAAGCCATTT 


GCAACTAAGC 


CTGCTGCAAA TGTTGTCACT CCTGTTTTGT CTGAAACGCT 


2820 


AATCAATGCT 


CTTTTGGTCA TTTCATTGTC TCCTTTACAA TTTGTGAAAT TATTTTTGGG 


2880 


TACCATTCGT 


GCTCAAGAGC ATGAATTTTC TCTGCTAATG TGTCCACTGT ATCTTCGGTA 


2940 


TCAATCTTGG 


TGGTT3TTTG 


GAAAATAATC GGTCCTGTAT CAACCCCAGA GTCAACaTAA 


3000 


TGGATAGTGA 


TTCCT3TTAT 


TTTCACTCCA TAATGAAATG CTTCTTCAAT TCCgTGAAGA 


3060 


CCTGGAAAAC 


TAGGTAACAG AGAAGGATGA ATATTGACGA TTCTCTTTGG ATAAGCCTCC 


3120 


AGTAAAGTCT 


TGCCaATAAT 


TCTCAAATAA CCAGCTAACA CGATTAAATC GATCTGATGC 


3180 


TCTTTTA/iAT 


GTTTCAATAC 


TTGTTCCTCA TATTGCCCAC GAGATGAAAA ATCAGATGGC 


3240 


GAAAAGCAAA 


CAACGGGAAT 


CTTGCGTTTT TGGGCACGGG TCAATACATA GGCTTCAGGC 


3300 


TGGTCACAAA 


AGACTAAGCT 


CAAATGCCCT GCGATTTTTT TTTGGGAAAA AGCTGCAGCA 


3360 


ATCGCCTCAA AATTGCTACC 


ATTGCCAGAT GCAAAAACAG CTATCTTCAT AAGGCTTCCT 


3420 


TAAAAATAAC 


GGCTTTCGTT 


TTCCTAGGTA CCAAACGACC AATTTCAAAA GCTTCTGGCA 


3480 


AAAGCTCCTG 


AACTTTTTCA ACATGCTCTG GACTCACCGC TAAAATCATC CCTAAGCCCA 


3540 


TATTGAAAAT 


TTCATACATT 


TCTTCAGGTG GAATTTCACC GTATTTTTGA ATCAATTTAA 


3600 


AAATTGGTAG 


CACTGGCCAA 


CTATTTAATT GAATCTCTGC AGCTAATGCG GAAGAAAACA 


3660 


TTCTTGGTAG 


ATTTTCTAAA 


AACCCGCCGC CAGTAATATG CGCAGCCCCA TGCACGAGTC 


3720 


CTGCTTTTAA 


CAATGGCAAT 


AATTCCTTCA CATAAATTTT TGTTGGTGTT AATAATTCTT 


3780 


CCCCTAACGT 


TTTGGGTGCC 


AATTCTGGAA GCTTGTCCGT TGTTTTAAAC GAATGTTTCT 


3840 


TAAAAAAAAT 




AJUIGAGTAAC CATTA6AATG AAGACCACTT GAAGGAAGAC 


3900 
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CAATCAAGAA 
TACCCACGGC 
CTGTTTCGCC 
CAATTGCTTC 
ACGGTTCGGC 
TTGTCTCGTG 
CTGATACCAA 
CACCAAGCAT 
CTACTACTTC 
CAATTCCTCC 
GGCGTTGGAT 
TTTAATTGAA 
ATCTGCTCTT 
ATATCAATGC 
TCTTGCGCTT 
ATCGAATCAT 
TTCATACGGA 
TATTGATTTT 
GCTGCAGACA 
ATTGCCAAGT 
GAATCTGGTC 
GTGAAAACCT 
TCTTGAATAA 
GCCCCATTGT 
ATCATCATCG 
TCTTTTAATT 
TCAGTGTCTG 
TTGGCATTTG 
AAAATGGCTG 
TAGCCGCCGT 
AACCTAAATC 
CTTGGCCTCG 
GACCCCAAAT 
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ATCGCCTTCT 
AAAACCAGCC 
CCCAATT/KAG 
AACTTTAGCT 
GCCTTGTGCG 
TTTCTGTTCT 
AACTGGTTCT 
GCCCAACGTT 
GTATCCCGCC 
TAGTTTCTTT 
AGTCTCCATT 
TCCCATCAAT 
TGATTTCTTC 
CATAAAAACA 
CTGCTTCTTT 
CTACTA7VAAT 
CACCTTGTTC 
TTACTAA'\CC 
ATGACGAGTT 
TCTTCCCCAT 
GTGCAAAATA 
CTATTTTCAA 
AGCGAGCCCC 
TCATTTGTCC 
TTTCTGTTAA 
TATCGAGAAA 
AATTGGAATG 
TCAAATTGCC 
AATATTATCC 
TCCTTCTAAC 
ACGGTGGCCA 
ATGCTGCAAG 
ACCGAAAATA 



TTTACGTTAC 
ACGTCATAAG 
GCTGCATTCG 
GGATTAACGG 
ACGACATCAT 
TCAATTGCCA 
TTAAGCTTGT 
CCCGTCCGTT 
TCAACATTTA 
TTTCAAGGAT 
AAAGTAAGCC 
CAGCCCTGCT 
AATTGAATAG 
CGGATATTTT 
TAATAAATGA 
GACTTTTTTT 
TCTTAATTCT 
TAGTTCGTAA 
CGGTACACCA 
TCTTTTACGA 
AATATATTCC 
GCCTTCATCA 
AACTGTTTCT 
GATGGAGAGA 
TAAAAGATAA 
CGAGGCAGCA 
AAAAATAGCG 
ATTGTGTGCT 
ACGCTACCCG 
GCATTCAGTT 
TCTAATTTCC 
CTGTGTAAGC 
CCACACTCTT 



TTGGAGTCAA 
CATCTGCTTC 
CCTCACAACA 
TCCCTAACGC 
TGACACACAT 
AGAGTAACTT 
AACTACTTAA 
GTGTTTTTTG 
CACCAGCTTT 
GCTTGATATT 
ATACACAGCC 
TCACTCAAAA 
TTTGCCGCAA 
AATGGTGGTG 
ATAATGCGCC 
CCTTTGACCA 
TGTGTCGGCT 
GGTATCTGTG 
ATTACCATAT 
GCTGTATGCA 
ATTGCACAAA 
GAAATGATGA 
AATGCACACG 
GGGCGAAAGC 
GCAAAGCCAC 
GTGCTTTTTC 
CCTTCTCTTT 
AaCCCAaTtG 
TACCAGCGGT 
GTTCTTGCTT 
CAGAATTGTT 
CAAAATAGGT 
CATTTAAGCT 



GAGTTGACTT 
ATACATATCA 
GCCAGCCGCA 
TAAATAATCT 
TGCTACACAA 
TGTGCCTACG 
ATCAAAACAA 
GCTATGCTTT 
TGAATAGGCA 
GTTCTTCATA 
CAGAATAAGG 
AAGCCAGCGA 
TTAGTTCGTC 
AAGCAATTCT 
GAATAGTGGT 
CGCCACGAAC 
GGATGAATGT 
CTTCTTCTGC 
CAGCTGCGAC 
CATTAACTCC 
TTGCCGGTTG 
CTACTTCACC 
TTTCGGATGC 
CGTTGGGATC 
CCTTAACCAA 
GAATCAAGTG 
CTAGTTGTTT 
ACTATCATAA 
TGCATAGCGT 
TTGGAATACT 
CGCCACAATT 
CACTTGTGCC 
TTTTGCTTCA 



TTCTCAGCAA 
GGCATTTCAG 
ACACCCGCAA 
AAAAAATATA 
TCAATTCCGA 
CCATCCGTTC 
CCACCAAAAC 
TGAATTCGTT 
TTTTCCATGT 
ATCATACAAA 
AGCATCATAA 
ATCTGCGCCA 
CCGCGTCTGT 
GACATGAACT 
CCCGCGAACA 
TGCCGATAGT 
TCGAGCGACA 
ATAGCCGCTA 
TGGACTTTCG 
GGCAATATTT 
GACTTCCGTA 
AGGCGCCACA 
CACAACGTAC 
CAGTGCAGCA 
ATTCAAGCTC 
CATTAAAATT 
TCGTAAACGT 
AATTTAAATA 
ACATGACCAA 
TCTGCTAATA 
CCCGCGCCTT 
GCATCAGGAT 
TAAGACATGG 



3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4 620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 
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AATCGCATCC 
CGCATGAATG 
AAGTGTTTCA 
CTCCGCAAAT 
AAAAGCCGAC 
AACTAGGCCT 
ATCAAATGAG 
GTTAAAATCT 
TTGTGTGGTA 
ATAAATGGCT 
TCCTAGCTGA 
TTCTGGTTTA 
ACTCGCTACA 
TAAATACAAA 
CACTCTTAAA 
ATCATAGGTT 
TAATAATTCC 
CATCCGCTCT 
TACTGGTACA 
TACGGAGCCA 
TTCTGCTCCT 
ATAGGGGCTC 
AcCTGCTTTT 
GATGCCGATT 
TTTTTCCATA 
AAATTCTTCT 
GTAAAGAATG 
CAATCCCACA 
TTCGCCGCCA 
TTCTTGAAAC 
TAAAGCAATC 
AGCTGCTCCT 
AACAGCTAAT 



TCCCATAAGG 
ACCAAACCAG 
AATGCTTCTT 
AGATATTCTT 
TCGGCT7\AAG 
ACTTGAATTG 
CGTAATTGCC 
GCGTGGGTTT 
ATTTGTGAAA 
TGACCATCTG 
CGGCAGGCGG 
TCTGGGGAAC 
ATATTACGGG 
TAACGGCTAT 
ACAGCTGCAT 
TCGTAAACTG 
GTACTATTCG 
GGTTCTTGAT 
TCTGCGACTA 
ATCGCTACCG 
TTTTCCACAC 
ATCTGCGTCT 
GATGCCATTT 
AAAATATCTT 
AACGGATCAC 
GAAGCAAAGG 
CTATTTCCGA 
CACATAGCAT 
ACGGTTGGAA 
AAATAACGTG 
GGCCGCGCCC 
TCGTAAGGTT 
CCGTCACCAA 



CTTTGGCTTC 
TTTCAGTCAC 
GATGCTGGGG 
TTTTTAATGG 
CAACGGCAAC 
CTTTTAAAAC 
CTTCAATTCG 
CACCGATGAG 
CATCTTCAAT 
TTTCATTGTA 
CGGCGATTCC 
CATAATTCAA 
CTGCTTCTGC 
TACAATCTGT 
CACTTCCTGG 
ATTTTTTCGA 
TAATTTCTGG 
AGGCTTTCTG 
TTTTGCCACC 
CTTCTAAGTC 
AGATTAACAT 
CTCTCTGCGG 
CAGCACTTGA 
GATGATTTTT 
CTACTTGTAC 
TCGCACCATG 
CGCCTTTTGC 
TAACAAGTGG 
TACCAATACA 
TTCGTGCATC 
CCATACTGAA 
CTACTGCAGA 
TATCGACAAT 



TTTTGTTGAG 
TTTCCCAATA 
ACTAACTGAA 
CAGTGTCACC 
GCCACCTTCA 
T7\ATTCTTGG 
ACCCAGTTGT 
ATAAATTAAG 
CACTCCCACC 
AAGAGAGACG 
ATCCGCCGAC 
GCAATCAGTG 
AACCGCAATT 
CGTCATCGCT 
TTGCACCACT 
TGCAATCGTT 
AATGAAGGGC 
GTAGACAGGG 
GTGAAAGACA 
ATACGTTTGA 
CCGTTCTTGG 
AACTTTATCT 
AGAAACCAAG 
GATTAAATCA 
CGCAGATCGT 
AATTCCATCT 
TTGGCCTTTT 
GTTTCCTTGA 
ATTTCCATAA 
C6TCAGTTCG 
AATATCGCGA 
AGGATGATTA 
CCCAGCTCCT 



CAATTGATCA 
TGTTGAGCTT 
AGAATAAAAC 
TGTAACCCCA 
GCGCAATCAT 
TTTGCTTTTT 
ATTTTTTGAA 
TCATCAACTT 
ATGCCAATCA 
TTCCCTGAAA 
GTCCATAATT 
ATAGCCAAAG 
TGCCCACCAA 
AAGGCTTTAT 
GTATTCGTTT 
GGCTGTTGCA 
GCTAATTTTT 
GCTTCTTCTG 
CGATACTGTC 
AACAATTCAC 
GACTCAGACA 
AGCTCTAGAA 
CCAGCCGCCC 
AGACAAGCTT 
TGCTGTTCTT 
CGTCCTGTTT 
TGAATCTCAT 
TAACATGGCT 
CCGCTAATCC 
CCAAATCTTA 
ATAATTCCAC 
TGGCTCTCT6 
TCACCAGGGC 



CATCATCTAA 
TTCTTCCGAT 
GGGATTGCGT 
GTTCGTTAGC 
GCGCGCTAGC 
CCTCTTTTAA 
TTTCTGAGCC 
GTTTAAACGC 
TTGGCGTTGG 
TCACGGGTGT 
CCCAAAAACT 
GTTGTGCTCC 
TTTCAGGATT 
TGGTGCCCCG 
GTACTTGACT 
ATAGGCGTTT 
TAAAAGCACG 
CCAAAGCATC 
CATCATCTGT 
AAACTTGCTG 
AGAGCATTTC 
TCAAGCCTGA 
CCATATCTTG 
CCAATAATAA 
CACCTTCACT 
TCGCACCTAC 
CATGACGAAT 
CAAAAGCAAT 
CCGCCACAAC 
GCGAATCCAG 
CAACGCCTGT 
CTTTAAAAAC 
CTTGAAGCAC 



5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 
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TTGTGGCCCT TCCGTCGGAA ACTTTTTTAA AACGGGCTTG GAATTTTTGT AAGAGCAATG 7920 

TTCACTCCAC ATCACGGAGA ATAAGCCAGT TTCTGTATAA TTAGGTtCTC TTtGCAAAAT 7980 

GGTCTCAATC TGATGATATT CTTCATCTGT TAGTCCCCAT TCTGCATACA ACCGTGTTTC 8040 

CTTGATTTCT TTTGGTGTTG GTTCATTCAT CATTGGCTTA ATTCCTTTCC AAAGTTTTTA 8100 

ATGATTGAAG CAAAAAAGCT GCGTCCATCT GTTGAACCTA ACAATTCTTC CATCGCTCGT 8160 

TCCGGATGAG GCATCATTCC TAGAACATTG CCCTGTTCAT TACAAATACC AGCAATATTC 8220 

GCAAGACTAC CGTTCACCTT TTCTTGATAC GTAAAAACAA TTTGGTTCTT TTCTTTCAGC 8280 

TCCTTCAACG TTTTTTCATC ACAATAGTAA TTCCCTTCGC CATGCGCTAC TGGTAAATGT 8340 

ATCGAAACAT TTTCAGGATA CTCGCTTGTA AACTTTGTGT GCGTATTCAC TTTCAATGGG 8400 

ACACTTTTAC AAATAAAACG AAGCGAATCA TTTCTTAACA ACGTGCCAGG AAGTAAGCCA 8460 

ATTTCTGTTA AAATTTGAAA GCCATTACAA GTACCAAAAA CCATTTTGCC TTCTGTAGCA 8520 

AAGCGAATCA CTTCATTGAT AATTGGCGAA AACCGAGCAA TTGCACCACA ACGTAAGTAA 8580 

TCGCCATAAG AAAAACCACC AGGCAACAAT ACGCCATCAA ATTTTGCTAA AGAAGTGGCA 8640 

TCGTGTCGGA CAAATTCTGC ATCGATGCCC ATCACyTCTT TGACAGCCCA CAATAAGTCT 8700 

GCATCACAAT TGGATCCTGG AAAAACAATC ACTGCCAATT TCATTTAAAT TTCCTCCAAA 8760 

GATAAAATTt CAAAACGATA GGATTCCATA TTTACATTGG CTAATAGTTG ATCACAAATT 8820 

TGTTCAATTT TCTCTTCAAT TGCTTCTGTC TGCTGGACGT GCATTTCAAA ATACTTACCG 8880 

ACACGGATTT CGGCTACTTC TGTAAAACCA AGCCGATGGA CTGCTTCCTT AACCGCTTCA 8940 

CCTTGGGGAT CCAAAATTGA TTCTTTATAC GTAACATAGA CTTTGACAAA ATACaTACGC 9000 

GCTTCCTCCT CAATTAGCTG TTAGTAAACG TTGTAAAACT TCTTCATATA CAGGAACTAT 9060 

TTCTCCTAAT TCACGGCGAT AAACATCTTT ATCCAGATGT TCGTTTGTGT TTAAATCCCA 9120 

TAGACGACAG GTGTCTGGAG AAATTTCATC TGCCAGTAAC AATTGATTGG CCTTCGTCCG 9180 

ACCAATCTCG ATTTTAAAAT CGATCAAACA AATATTGAGG CGTTGAAAGA GTTGACTAAG 9240 

CGCCTGATTA ATTTGTAAAG CCAATGCTTT AATCTCCACA ATTTCAGCAG GAGTGGCTAC 9300 

TTTTAGAAAT TGAATATGGT CTTCGTTAAT AAAAGGATCA TCCAGACGAT CTTCTTTGTA 9360 

ATAAAATTCA ATAATGGGCG TCACTAATTT AGTGCCCTCT TCAATCGCTA AACGCTTTGA 9420 

AAAACTTCCT GCCGCATAAT TACGTACGAC AACTTCCAAA GGAATCaTTT CTACGACCTG 9480 

AATTAATTGT TCGTGTTCAG ATACTTTTTT GATAAAATGA TTGGGAATCT TCTGTTGCTG 9540 

TAAATGTTCA AAAATTAGAC TAGTGATTTG ATTATTTAAA GCGCCTTTCC CAAGAACTTT 9600 
GTCTTTTTTT TGACCGTTTA AAGCTGTTGC TTGATCAAGA TATTCCACAA AAAGAACCGC 9660 
AGCATTTTCT GTTTGATACA ACTTCTTCGC TTTTCCTGTA TACACAAGTG CTTTCTTTTC 9720 
CATTCtTTCC CTCTTCTCGT TAGCTTCATG AACCCTATTC TTTCTAACTc AGCTGATTCA 9780 
CATCaGACaA AAACTTACAT TAGTTACAAA AAAACaATAA TCGTTCGTGT TTGTCGAATG 9840 
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TAAAAATAAA AAACTTGCTT GGAGCAAGCA TTTATTAAGT TACGTTCCAA ATACCAACAG 9900 
TCTCAATTTC CTTTAACGTT TCTGATA 9927 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7678 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPO'JOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



TGTCTGCmrT 


rCAGTAAGGG 


CGTTTTTGGC 


TGAAGCAAGA 


TATGAGGTGG 


ATGCCTGCAC 


60 


AGATGGAAAC 


GAAGCACACA 


CCAAGTTCTA 


TGAAAACACC 


TATCAACTGG 


TTATTCTTGA 


120 


TATTATGCTG 


CCCGGTATGA 


ATGGGCATGA 


ACTTCTACGT 


GAATTTCGGG 


CGCAAAATGA 


180 


TACCCCCATT 


CTGATGATGA 


CAGCCCTGTC 


GGATGACGAA 


AACCAAATCC 


GGGCGTTTGA 


240 


TGCAGAGGCA 


GACGACTATG 


TAACAAAGCC 


ATTCAAGATG 


CGGATTTTAC 


TAAAGCGGGT 


300 


GGAAGCCCTG 


TTACGGCGCA 


CGGTGCGCTG 


GCAAAGGAAT 


TTCGTGTGGG 


CAGGCTGACA 


360 


CTTCTGCCGG 


AGGATTTTAG 


GGTACTTTGT 


GACGGTACGG 


AGCTGCCCCT 


GACACGAAAA 


420 


GAATTTGAAA 


TCCTTTTGCT 


GCTGGTGCAG 


AACAAAgGCA 


GAACCTTAAC 


CCATGAAATC 


480 


ATTTTGTCCC 


GCATATGGGG 


ATATGACTTT 


GACGGTGATG 


GCAGCACAGT 


CCACACTCAT 


540 


ATCAAAAATC 


TGCGGGCGAA 


CTGCCGGAAA 


ATATCATCAA 


AACCATCCGC 


GGTGTAGGTT 


600 


ACCGATTGGA 


GGAATCATTA 


TAATGGAAAG 


AAAAGGGATT 


TTCATTAAGG 


TTTTTTCCTA 


660 


TACGATCATT 


GTCCTGTTAC 


TGCTTGTCGG 


TGTAACGGCA 


ACACTGTTTG 


CACAGCAATT 


720 


TGTGTCT TAT 


TTCAGAGCGA 




GCAAACAGTA 


AAATCCTATC 


AGCCATTGGT 


780 


GGAACTGATT 


CAGAATAGCG 


ATAGGCTTGA 


TATGCAAGAG 


GTGGCAGGGC 


TGTTTCACTA 


840 


CAATAACCAA 


TCCTTTGAGT 


TTTATATTGA 


AGATAAAGAG 


GGAAGCGTAC 


TCTATGCCAC 


900 


ACCGAATGCC 


GATACATCAA 


ATAGTGTTAG 


GCCCGACTTT 


CTTTATGTGG 


TACATAGAGA 


960 


TGATAATATT 


TCGATTGTTG 


CTCAAAGCAA 


GGCAGGTGTG 


GGATTGCTTT 


ATCAAGGGCT 


1020 


GACAATTCGG 


GGAATTGTTA 


TGATTGCGAT 


AATGGTTGTA 


TTCAGCCTTT 


TATGCGCGTA 


1080 


TATCTTTGCG 


CGGCAAATGA 


CAACGCCGAT 


CAAAGCCTTA 


GCGGACAGTG 


CGAATAAAAT 


1140 


GGCAAACCTG 


AAAGAAGTAC 


CGCCGCCGCT 


GGAGCGAAAG 


GATGAGCTTG 


GCGCACTGGC 


1200 


TCACGACATG 


CATTCCATGT 


ATATCAGGCT 


GAAAGAAACC 


ATCGCAAGGC 


TGGAGGATGA 


1260 


AATCGCAAGG 


GAACATGAGT 


TGGAGGAAAC 


ACAGCGATAT 


TTCTTTGCGG CAgCCTcTCA 


1320 


TGAGTTAAAA- 


ACGCCCATCG 


CGGCTGTAAg 


CGTTcTGTTG 


GAGGGAATGC 


TTGAAAATAT 


1380 


CGGTGACTAC 


AAAGACCATT 


CTAAGTATCT 


GCGCGAATGC 


ATcAAAATGA 


TGGACAGGCA 


1440 


GGGCAAAACC 


ATTTCCGAAA 


TACTGGAGCT 


TGTCAGCCTG 


AACGATGGGA 


GAATCGTACC 


1500 
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CATAGCCGAA CCGCTGGACA TAGGGCGCAC GGTTGCCGAG CCGCTACCCG ATTTTCAAAC 


1560 


CT-TGGCAGAG. GCAAACAACC AGCGGTTCGT CACAGATATT 


CCAGCCGGAC 


AAATTGTCCT 


1620 


GTCCGATCCG AAGCTGATCC AAAAGGCGCT ATCCAATGTC 


ATATTGAATG 


CGGTTCAGAA 


1680 


CACGCCCCAG GGAGGTGAGG TACGGATATG GAGTGAGCCT 


GGGGCTGAAA 


AATACCGTCT 


1740 


TTCCGTTTTG AACATGGGCG TTCACATTGA TGATACTGCA 


CTTTCAAAGC 


TGTTCATCCC 


1800 


ATTCTATCGC ATTGATCAGG CGCGAAgcAG caAAAAGTGG GCGAAgcGGT 


TTGGGGCTTG 


1860 


CCATCGTACA AAAAACGCTG GATGCCATGA GCCTCCAATA TGCGCTGGAA AACACCTCAG 


1920 


ATGGCGTTTT GTTCTGGCTG GATTTACCGC CCACATCAAC 


ACTATAAATA 


TTTAAAACTT 


1980 


AAATGATTTT GACCGACAGG TATAACCCTG CCGGTCTTTT 


TGTTTTTCGC 


CGCTACAGGA 


2040 


AAACTACAGA TTGACTACAG GGAAAGTACA GATACGCTTG 


CCATAAT/^C 


AATCGTACCA 


2100 


GCCACAAATC GTAGTTTTAT TGCAAAGGAG GCATTCAATC 


AAATGGAAAA 


AAGCAACTAT 


2160 


CATTCCAATG TGAATCATCA CAAACGGCAT ATGAAACAAT 


CTGGGGAAAA 


ACGGGCTTTT 


2220 


CTATGGGCGT TCATTATCTC GTTCACAGTC TGCACGCTGT 


TTTTGGGGTG 


GAGATTGGTT 


2280 


TCCGTATTGG AGGCAACACA GCTACCGCCC ATCCCTGCAA 


CTCATACAGG 


CAGCGGGACT 


2340 


GGTGTAGCGG AGAATCCAGA GGAAAACACT CTTGCCACCG 


CCAAAGAACA 


GGGAGATGAA 


2400 


CAGGAATGGA GCCTGATTTT AGTGAACAGG CAGAACCCCA 


TCCCCGCCCA 


GTACGATGTG 


2460 


GAACTTGAGC AGCTGTCAAA TGGTGAGCGG ATAGACATTC 


GGATTTCTCC 


CTACCTCCAG 


2520 


GATTTGTTTG ATGCCGCAAG AGCTGATGGA GTTTACCCGA 


TTGTCGCATC 


CGGATACCGG 


2580 


ACAACAGAAA AACAGCAAGA AATCATGGAT GAAAAAGTCG 


CCGAATACAA 


GGCGAAAGGC 


2640 


TACACCTCTG CACAGGCTAA AGCGGAAGCA GAAACTTGGG 


TGGCCGTGCC 


GGGAACAAGC 


2700 


GAGCATCAGC TTGGTCTTGC TGTGGATATC AATGCGGATG 


GAATTCATTC 


AACCGGCAAC 


2760 


GAGGTTTACA GATGGCTGGA TGAAAACAgC TATCGCTTTG GTTTTATTCG 


CCGCTACCCG 


2820 


CCAGACAAGA CAGAGATAAC CGGTGTGAGC AACGAGCCGT GGCATTACCG ATATGtCGgC 


2880 


ATCGAAGCTG CCACAAAGAT ATACCACCAA GGGCTTTGCC 


TTGAGGAATA 


TTTAAACACA 


2940 


GAAAAATGAG AAAAGGATAT AATGCTATGA ACAGAAAAAG 


ATTGACACAG 


CGCTTCCCGT 


3000 


TCCTGCTTCC AATGAGACAA GCGCAGAGAA AAATATGCTT 


TTATGCGGGA ATGAGATTTG 


3060 


ACGGCTGTTG CTATGCACAG ACGATAGGAG AAAAAACGCT 


TCCCTATTTG 


CTCTTTGAAA 


3120 


CGGATTGTGC GTTATACAAC CACAATACCG GATTTGACAT 


GATATACCAA 


GAAAACAAGG 


3180 


TGTTCAACTT AAAGCTGGCG GCAAAGACCT TAAACGGCCT 


ATTGATAAAA 


CCGGGGGAAA 


3240 


CCTTTTCTTT CTGGCGGCTG GTACGCCATG CGGACAAAGA 


TACCCCCTAT 


AAAGACGGCC 


3300 


TTACGGTGGC CAATGGTAAG CTCACCACCA TGTCGGGCGG 


CGGTATGTGC 


CAGATGAGCA 


3360 


ATTTACTATT TTGGGTGTTC CTGCATACGC CATTGACAAT TATCCAGCGC AgcGGTcACG 


3420 


TAGTAAAGGA GTTTCCAGAG CCAAACAGTG ACGAGATCAA AGGGGTGGAT GCAACCATCT 


3480 
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CAGAGGGCTG 


GATTGATTTA 


AAAGTGCGAA 


ACGATACCGA 


CTGCACCTAC 


CAAATATGGG 


3540 


TGACCCTAGA 


TGATGAGAAA 


ATCATCGGTC 


AGGTGTTCGC 


CGACAAACAG 


CCTCAAGCAT 


3600 


TATACAAAAT 


TGCAAACGGC 


AGTATTCAGT 


ATGTCCGTGA 


AAGTGGCGGG 


ATTTATGAAT 


3660 


ATGCCAAGGT 


TGAACGGATG 


CAAGTTGCCT 


TAGGTACCGG 


GGAAATAATA 


GATTGCAAGC 


3720 


TGCTTTATAC 


AAACAAATGC 


AAAATCTGCT 


ATCCCCTCCC 


GGAAAGTGTG 


GATATTCAGG 


3780 


AGGCGAACCA 


ATGAGAAAAA 


GTATGGGCAT 


TACTGTTTTT 


GGATGCGAGC 


AGGATGAGGC 


3840 


AAATGCTTTC 


CGCACCTTAT 


CACCAGATTT 


TCATATTATC 


CCTACGCTGA 


TCAGTGATGC 


3900 


GATATCGGCA 


GAC7VACGCAA 


AATTGGCCGC 


TGGCAATCAA 


TGCATTAGCG 


TAGGCCATAA 


3960 


GTCCGAGGTT 


TCCGAGGCGA 


CAATTCTTGC 


GCTGAG7VAAG 


GTCGGGGTAA 


AATACATTTC 


4020 


TACCCGCAGC 


ATCGGCTGCA 


ATCACATTGA 


TACGACTGCC 


GCCGAGAGAA 


TGGGGATCTC 


4080 


GGTTGGCACA 


GTTGCGTATT 


CGCCGGACAG 


CGTTGCGGAT 


TATGCTTTGA 


TGCTGATGCT 


4140 


GATGGCCATA 


CG6GGTGCAA 


AGTCCACCAT 


ACACGCCGTG 


GCGCAACAAA 


ATTTCAGACT 


4200 


GGATTGTGTC 


CGGGGGAAAG 


AGCTGCGGGA 


TATGACTGTG 


GGAGTTATTG 


GAACCGGCCA 


4260 


TATAGGGCAA 


GCGGTCGTCA 


AAAGGCTGCG 


GGGATTTGGA 


TGCCGTGTGC 


TAGCCTATGA 


4320 


TAACAGCCGA 


AAAATTGAGG 


CAGATTATGT 


CCAGCTTGAT 


GAGCTTCTAA 


AAAACAGCGA 


4380 


TATTGTTACG 


CTCCATGTGC 


CGCTTTGTGC 


GGATACCCGC 


CATCTGATCG 


GCCAGAGGCA 


4440 


AATCGGAGAG 


ATGAAGCAAG 


GCGCATTTTT 


AATCAACACT 


GG6CGCGGGG 


CGCTTGTCGA 


4500 


TACCGGGTCG 


CTGGTGGAGG 


CACTGGGAAG 


CGGAAAGCTG 


GGCGGTGCGG 


CACTGGATGT 


4560 


GTTGGAGGGC 


GAGGATCAGT 


TTGTTTATAC 


CGACTGCTCG 


CAGAAAGTGC 


TTGACCATCC 


4620 


CTTTTTGTCG 


CAGCTCCTAA 


GGATGCCAAA 


TGTGATCATC 


ACACCCCATA 


CGGCGTACTA 


4680 


CACCGAGCGT 


GTGCTGCGAG 


ATACCACAGA 


AAA7VACAATC 


AGGAATTGTC 


TTAACTTTGA 


4740 


AAGGAGTTTA 


CAGCATGAAT 


AAAATAAAAG 


TCGCAATTAT 


CTTCGGCGGT 


TGCTCGGAGG 


4800 


AACATGATGT 


GTCGGTAAAA 


TCCGCAATAG 


AAATTGCTGC 


GAACATTAAT 


ACTGAAAAAT 


4860 


TCGATCCGCA 


CTACATCGGA 


ATTACAAAAA 


ACGGCGTATG 


GAAGCTATGC 


AAGAAGCCAT 


4920 


GTACGGAATG 


GGAAGCCGAT 


AGTCTCCCCG 


CCATATTCTC 


CCCGGATAGG 


AAAACGCATG 


4980 


GTCTGCTTGT 


CATGAAAGAA 


AGAGAATACG 


7UWVCTCGGCG 


TATTGACGTG 


GCTTTCCCGG 


5040 


TTTTGCATGG 


CAAATGCGGG 


GAGGATGGTG 


CGATACAGGG 


TCTGTTTGAA 


TTGTCTGGTA 


5100 


TCCCCTATGT 


AGGCTGCGAT 


ATTCAAAGCT 


CCGCAGCTTG 


CATGGACAAA 


TCACTGGCCT 


5160 


ACATTCTTAC 


AAAAAATGCG 


GGCATCGCCG 


TCCCCGAATT 


TCAAATGATT 


GAAAAAGGTG 


5220 


ACAAACCGGA 


GGCGAGGACG 


CTTACCTACC 


CTGTCTTTGT 


GAAGCCGGCA 


CGgTCAGGTT 


5280 


CGTCCTTTGG- 


CGTAACCAAA 


GTAAACAGTA 


CGGAAGAACT 


AAAGGCTGCG 


ATAGAAGCAG 


5340 


CAGGACAATA 


TGATGGAAAA 


ATCTTAATTG 


AGCAAGCGAT 


TTCGGGCTGT 


GAGGTcGGCT 


5400 


GCGCGGTCAT 


GGGAAACGAG 


GATGATTTGA 


TTGTCGGCGA 


AGTGGATCAA 


ATCCGGTTGA 


5460 
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GCCACGGTAT CTTCCGCATC CATCAGGAAA ACGAGCCGGA AA|UVGGCTCA GAGAATGCGA 5520 

TGATTATCGT TCCAGCAGAC ATTCCGGTCG AGGAACGAAA TCGGGTGCAA GAAACGGCAA 5580 

AGAAAGTATA TCGGGTGCTT GGATGCAGAG GGCTTGCTCG TGTTGATCTT TTTTTGCAGG 5640 

AGGATGGCGG CATCGTTCTA AACGAGGTCA ATACCCTGCC CGGTTTTACA TCGTACAGCC 5700 

GCTATCCACG CATGGCGGCT GCCGCAGGAA TCACGCTTCC CGCACTAATT GACAGCCTGA 57 60 

TTACATTGGC GATA6AGAGG TGACCC6TAT GGAAAATGGT TTTTTGTTTT TAGAT6AAAT 5820 

GTTGCATGGT GTTCGTTGGG ATGCCAAGTA CGCTACATGG GATAACTTCA CGGGAAAACC 5880 

AGTGGATGGG TATGAGGTGA ATCGCATCAT CGGCACAAAG GCCGTGGCGC TTGCTCTGCG 5940 

CGAAgCACAA ATCCATGCGG CAGCCCTTGG CTACGGCTTG CTTTTATGGG ATGGATATCG 6000 

GCCAAAATCT GCGGTGGACT GTTTCCTGCG TTGGGCGGCG CAgcCGGAgG ACAACCTCAC 6060 

AAAAGAAAAA TATTACCCCA ATATTGAGC6 AGCCGAGTTG ATTACAAAGG GCTATGTGGC 6120 

CTCACAATCC AGCCATAGCC GTGGAAGCAC AATTGATCTT ACGCTCTACC ACTTGGATAC 6180 

AGGGGAACTT GTTTCAATGG GAAGCAACTT CGATTTTATG GACGAACGGT CGCACCATAC 6240 

AGCAAAAGGG ATAGGGAATG CAGAGGCACA AAATCGAAGA TGCTTGCGTA AAATCATGGA 6300 

AAGCAGCGGA TTTCAGTCCT ATCGCTTTGA ATGGTGGCAC TATAAGTTGA TTGATGAGCC 6360 

ATACCCCGAT ACCTATTTTA ATTTTGCTGT TTCATAATGA AAGTATTTGA TTTTCTAATT 6420 

ATGTATAAGT TGGCTACAAA TTACTTAGTA TTTCATCAGA CCAATTACTC TCTTGTTTAC 6480 

AGAAAAATTC TGCGCTGATG GAATCTGCTT TATTATGCGG GCGAAAAATG AAATTGACCA 6540 

TATTTTTTCa GAACTTTACT CTGTACCGaA TTGCCTGCAA AAGCCTTATT TTAAGCTGAA 6600 

AGTTCAGGAA TTGCTTTTGT TTTTGTGTAT GCCCCTCGTG ATTTGTACAC CTATCTTAAT 6660 

TGGCTTTGCA ATTCTCATTC CGTATCTCTG CTTTAAGAAT TTGGaAAAAc GAAGCATTGT 6720 

GAATCGGCTG CGGGCAGAGC AAAAAGAGAA CCAGCAGAAA CAAGTCGTTC TTGCTCTGCT 6780 

GATTCACTCG GAACTGTTTG ATTCGGGTTT TCGTT6AAGG TCAAGTAGCT GCTCTGTCAG 6840 

GAAGTCCAGT GTGTTCAGCA GAATCTGCTG ATTGTCACGG TTGCATGACT GAAATTTTCC 6900 

CATGAAACGC TGGAGTTCTT CATCCTCAAT AGAGTTTGAA GCTTGAAACA GGACATTCGG 6960 

AGAAATTCCC AGTCGTTCAA TTAACCGGGA AAGTGTTTCA TAAGACGGAT TTTTTCTTCC 7020 

CTTTTCGATG TCCTGAACC6 TTTTGACCGC CAAGCCAGAT TGATCCGcCA GTTCCTGTT6 7080 

GGTCAAGTTA CATCGAACCC TTTCGGCTCG GATGTGCCTA CCCAGAAATT TCAGGCTGTC 7140 

TTCCGGCATG ATtGTTCACC TCATATATAT TCTATCGTAC CAATTT6TAT TTTGGTACTC 7200 

CCTTAGAGTT GTGTTCCAAT CAGTATTATC ATGCCGGTTT TTGAGTTTAA TATGCAATAA 7260 

AGCCCTGTTGGAAGCAATAA AAAAAACAGC TTCTCACGGC aGGGCCTAAT TGCCaCGCCC 7320 

CCGTGGTTTG TCT6TAAAAT ■ CATGGGGGTA TTTTATGTTC AAAAGAATAA TGGGAGGTAA 7380 

TGCGTTAAAA AAAGTGTAGC TCCGCCGTGG GCATTATCCG CCCCGCTAAG CGCACTATCG 7440 
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TTCCTATACA ATACATATCA TCATTCCTAC TACCTTATTC GGTCGCATTG CGCCAGGAAG 
CCTAAGAAGC TTCCTGGCCy TTTTGTTTGT CGAATTTTGT TGAATGATAA CGAGAATTGT 
CGATGTCGCA GCCCGCCTTC ATCGGTTCGC AGCCCATGAA CTGATGAAAG GAGTAAACGG 
CTATGACCAA AGAATATTGG GTAGCGGTCT TAGAACAAGA AGATCGTnTG ATAAGTAA 
(2) INFORMATION EX)R SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4732 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
ATTTATTATA AGGGAGCGAn ATTnCATGGC ACGCGTAGAA AGTTtTGAAT TaGATCACAA 
CACAGTAAAA GCACCATATG TTCGCCTTGC TGGCACAGAA CAAAATGGTG ATGCGTTAGT 
CGAAAAATAT GACTTACGTT TCTTACAACC AAACAAAGAT GCCCTACCAA CAGGCGCATT 
ACACACGTTG GAACATTTAT TAGCAGTTAA CATGCGTGAT GAATTAAAAG GAATCATTGA 
CATTTCGCCA ATGGGTTGCC GCACTGGTTT TTATATGATT ATGTGGGATC AACATTCACC 
ACAAGAAATC CGTGATGCAT TAGTCAACGT TTTAAACAAA GTAATCAATA CAGAAGTTGT 
TCCAGCAGTC TCTGCAAAAG AGTGCGGAAA CTACAAAGAT CATTCTTTAT TTGCAGCGAA 
AGAATACGCA AAAATCGTCT TAGACCAAGG AATTAGTTTA GATCCATTTG AACGTATTCT 
GTAATCTTTA GCAATTAACG ACAGAGAAAC CCACTATTAA GCTGTGGGTT TTTCTGTTCT 
CTTTCTTAAT TTAAACGGAG GTTTTTTATG AATCAAACAA TCGAACAATT ACTAAGTCAC 
CGTTCCGTTC GTCATTTTAA AAAGCAAGCG TTAACGGACA AGCAGAAACA ACAGTTAATT 
ACTGCAGCTC AAGCAGGATC AAGCTCCAAT TTTTTACAAG CTTATACAAT TATAGAAATC 
AAAGATCCTG AGCTACGCAG AGAATTGGGA CGTTTAGCGA ATTGTGAAGA CTATGTCGTT 
AACACTGGTG TGTTCTATGT TTTTGTTGCT GATTTGTATC GTCATGCAAC GATTTTGTCA 
AAAGAGGGGC AATCATTGGA ACCCTTAAAG ACACCAGAAT CATTATTAGT AGCTGCCGTT 
GATACAACGA TTGCTGCGCA AAACATGGCC ATTGCGGCTG AATCTATGGA TTTAGGGATT 
TGTTATATTG GCGGCATTCG AAATGACTTA GACACAGTAG CCAAACGCTT ATCTTTGCCT 
GAATTAACTG TACCACTTTT CGGATTAACA ATCGGTGTAC CAGAAACGTT AAATGGTGTA 
AAACCACGGA TGCCTTTTGA AAATATTTTA AGTGAAAATC ACTACCAATC AGACAAATTG 
ACAGATATGC ACACATATGA TGAATTATTA AAAGATTATT ATGCCAGCCG CTCAAGTAAC 
GCCCAAACAG CGGATTGGTC ACAGAAATCT TTGTCCTATT TTTCTTATAA TCGTCGCCCA 
GAAGTAAAAA TTTTCTTGCA AAAACAAGGA TTtGATGTCT AAACAAAAGA GCGTGGGCTT 
ACGTTATTTC AACGTAAGCC CACGsTsTTT TGTTAATTTT TATTTATCTT TACTAAATTT 
GTCTTTTAAG TCTTCAAAGC CTTCTTTAAC TTTATCAGTT AATTTTTCAG CGCCTTCTTT 



7500 
7560 
7620 
7678 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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AAGGTCGTCG CCGGCATCAC GTGCTTTGTC TTTTACGTCA 
TCCTTCAAGT TCTTTGCCTT TATCATCCGT AACTTTGCCT 
TTCTACTTTA TCTTTTGCAT CGTCAAAACG TCCTTTTAAA 
TAAATAATTA ATTACATAAT AAGAGTATCA TTCTTAGCCA 
TGAACTACGA ATAAATGGTT ATTTTTGAGC CATTAAAGCT 
ACCGATTGTT CCTGGATTAT CTTCTAAAAG AGGCGTCACC 
TGTTTCCACA TATCCATTGA CTAATTCAGC AAATTTTTCA 
ACGTTGTTTC ATGACGCCGC CACCTAAAAT GATCACTTCT 
ATTGTACGnC GtTGCGCTAA ATAATAAGCT TCTAATTCCC 
AATAAATGTC CTTTTTTACC AGAACGACCT TCAACTGCTG 
AAACAATCTC CATGATAAGG ACAATTTCCT GCATACGTAT 
AACGCATGCC CCATTTCTGG GTGGCTAAAG CCTTCAATAA 
CCAGCGCCAA CACCAGTTCC AATTGTATAA TATACACAAC 
CCAGCAACAT ACTCACCATA TGCCGCAGCA TTCACATCCG 
TCA/U^ATGTT GTTTCATAGT TCCTAACAAG TCAAAGTTAC 
GAAGTGATAT AACCATACGT AGGAGAATCA ACGTGAATAT 
ATCCCAATCG CTTTTAAAGG ATATTGTTGG AAA7UVTTCTA 
TCTGGGGTTG TTGTGGGAAA ACTGACACGT TCTACGATGG 
CCACATACAA ATTTTGTGCC ACCGGCTTCG ATACTTCCTA 
ATATGCTCCT TCGTTTCCAT TTTACAAGTA AGCGGATACA 
AACATACACG TTTAGTTTAC CACTTTTTTG TCGTTTCGAC 
GTCAGGTTGC CCAATGAAAT CTGGCAAGTT GCTAAAGAGC 
CGAGCATTCA CGGTTATCTA ACAATGAATG CTCGTTTTTG 
AAATATAACA ATAGTCCGAT TGCAATCCAG CCACCACTAA 
AAAAGCGTTT CTTTTAGAAT ACTTCTGGGC TCTTTTGCAT 
GTCACCAAA6 AAAAGAGGAA AAACATAAAA CAAATGACCG 
AACCAACTAG CCATTATTCT CACTCTCCTT TTCGCTTGTT 
AATGACACTA TAAATTGCCC CAATTGTGAT TACTAACAAC 
CCATGTGGGT AGGACCCATT TGACCGTAGC TTCTTGATTC 
GTTAATCGTA AATTCTTGTT GAAAAGGCCA AGACTGTTGT 
ACGTCCTGTA AATATATAAG TCCCAGCATC TAGCGGTTGT 
ATTCATGGAA CTATTTGGGG CAAAGCGAAC ATTTTTTAGT 
AGAGGATCCT TTTTTTATCA CTTGACCTTC TAAACGAACT 



GCAAAGGTTG ATTGAGCTTT 1500 

TGCGCTTCTT TTGCAGTACC 1560 

TCTGCCATGC TAATCCCTCC 1620 

TTTCCTTCAA ATATAATGCA 1680 

TTTTTTGCCA AGGCAAAGCA 1740 

AAGTATTTTT CTAAAGGCGG 1800 

CGAACTTTCG GCATCAAATG 1860 

GGCGCTAATA ATAAAGTCGT 1920 

AAGTTTTATG ATCCTCTTCC 1980 

GTCCTGCTGC AATCCCTTCT 2040 

CTTCAGGATG ACGACGAACT 2100 

ACTCACCGTT TTGAATCGCT 2160 

TAGATGTATG TTGCCCATTT 2220 

TTGTCCAAGC CATTGGCACA 2280 

GCCAAGCTAA TTTTGGTGTA 2340 

CAATCGGACC AAATGAACCA 2400 

TTACTTTTTT CATTGTTTCT 2460 

TCAAATCATC TGTCCCAACG 2520 

AAAGTTTTTC TGTCATTGCA 2580 

TTACTTTTGA TTAAATAACT 2640 

AGAACAATTT AACAATTTCA 2700 

ATTTTCCTAA AACAAAAAAA 2760 

TTTATTGATT CTTTAGCGAA 2820 

GTAAAAGCAT TAACCAACTT 2880 

GCTGAATAAA ACTCATACTA 2940 

CTAAACCGCC AGACAACCAA 3000 

TTTTGATAAT TATTGCGCCA 3060 

GTGTAGAAAG TCGGTAACCA 3120 

AATTTTTTTG CTTCACGAGT 3180 

TCATCTTGTT GTAAAATAGC 3240 

TTACCCAAAT CAAGATGAAA 3300 

GTCCGTTGTG CAATTTTTTT 3360 

TCTGGAAAAA TAGCAGCATA 3420 
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TGGATTTTGA ATGACCGCCT TTAGCGATTT GGCTCCTAAA GCAATTGTAG GTGTTACTTT 3480 

CGCTiUVTTTCr 'aGTTGGTTCA CCTCTGTTAA AGGAACATTC GCAGCACTCC TCATGGCAAT 3540 

CGCAACATTA AACCCGTAGC GACTGGTTAA TCCCGTTGTT TGGTTCTCTT CCTCTGTTTT 3600 

TTCATCTTTG AGATGCGCAT 7VAACCCCGCC CAAAACAACT CCTTTGATAG GCTGATCAAG 3660 

TCCCTTCACC TTCATCTTTG CTGATACTGT TTGATGAGGC GGCACACTCA CTTCCTCATT 3720 

TTCTAAAGTA ACCAAGTCTG TCAGCGGAAC CTTTAACGAT GAATCAAGTG GTTTATCAAA 3780 

ATCATTGTAC ACAATCAAAC CATTATCATT TGTTGaTGCA TTTGCTAGAG TCAATACGAT 3840 

GGaCAATTTT TCGTCTGTCT TATTTTTTAT GTCTAAGACA ATAGTGGAAG ATTGCTTTGG 3900 

TGCCAGAATT AAATCATAAT AACTAGATGT TCCTGACACT TGATTTTCTG GTAAAACAGG 3960 

CGTTACTGTA AATTTAGTCT TCATTTCTTC TGTAGCTTCT ACTTTTGGAA CAATAATAAA 4020 

AAATGTGAAA AACAGAAATA GACAGAwAAC ACCGACTTTT TTTCTACTCA TTTCTTCAAG 4080 

ATCTCCTTTA TAAAGCTGAA TGTACCTACT AGTCTAACAA ATCAGCGAAA CAATGCTGAT 4140 

CAATCTTTGA CTTTTGGGAA GGATAATCTT GAAAAAAGTC TCCTTGAAAA ATACCAAGGA 4200 

GACTTTTTAT TaTTCACTAT AATATAATTC TTTTTTTATC TGTGTGGTTG ATACTTCTGG 4260 

CGTTCGTGGC AAATAAAGAA CCTTTGCGGT TGTTTCAACT TCTAGAAAAT CAAAAGCCCC 4320 

TTGCCAATCA TCGCCCATGA CTAAAGTATC AATTTTAAAT TCAGCAATAT CGGAAACTTT 4380 

TTGTTTCCAA CTAGTTTCAG GAATAACTAA ATCAACATAA CGAATTGCCT CTAATAATTG 4440 

TTTACGTTTT TCATATGAAA AATTUVCTTTG TTTCTTTTTT TCTAAATTAA AAGCATCTGT 4500 

TGAAAGTCCA ACAATCAGGT AGTCCCCTTG TTGTTTCGCT TTTTTCAACA AATTAATATG 4560 

ACCATAGTGT AGGAGGTCAA ATGTGCCGTA AGTAAGTATT TTTTTCATAA AAACCTCCAT 4620 

TTGTACTTTG ATATTTCTTC GTTAATTATc AATTTTTaTG AATAGAAAAA AGGCTACTCT 4680 

TGaTTGaCTT TGaTGGTTTT TCGATGCAAA TTTTTTATTT TnCGTAGCCG Tn 4732 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:" 7294 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

GGGaaAATwt TTTTAtgkGc tATGGcGTAt TACtAGTTAC gTTGTTGCCC TAGTGTCCcT 60 

TATTTAAAAt ATTTGGtGAA AAATCaGCGT TTATGGACcA TTGgATACGG CCcATTTCcA 120 

GTGGGGAATG TTTTkGcCTG TGCTTTTTGG CTTTTTCATG CTGATTGGCT TTGATCAATT 180 

TTTTATTGCG TTTCATGGAT TATTTTTTAA CAACGATGCT TGGTTATTTA ATCCGCTGAC 240 

TGATCCGATT ATCAATGCCT TACCTGAAGC CTACTTTATG CATTGTTTTA TCTTGTTTTT 300 
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TATTTTATTA 


GAAGGCCT'JT 


TTTTGATAGG 


TATTTTAATC 


GG^AAACGTT 


CTTTAAAAAA 


360 


ATAAAGAGAG 


AAAAGTGAAC 


AATTCCAAAA 


TTGCTCGCTT 


TyCTTTTTTA TTCAaTCtAG 


420 


TTTTAAAAAC 


TAGATTGAAT 


AATTGTCAAA AGTATGATAT 


TCTTTTTTAG 


AAATGGTTAG 


480 


GAGGGTTTTT 


ATGGACGAGA 


TAGAAAAATC 


ACTACAAAAT 


TGGGGCCAAC 


AATTGGAAAA 


540 


CGTTCATTTG 


CCAAGATGGC 


ATGAGTTACC 


AGATATTGAA 


TTATATATGG 


ATCAAGTGAT 


600 


TACTCTGATA 


GAAAAATACT 


TATCCCCATT 


GATTACACTG 


GAAAAACATA 


CTTTGTTAAC 


DOU 


GTCCTCAATG 


GT T AAT AATT 

XJ* X X mMtm X X X 


ATGTCAAACA CGGGTTAATC 


CCGGCCCCAG 


TGAAAAAGCG 


790 


CTACAATCAA 


AAGCATTTAG 


CGTTTTTAAT 


TGCGATTACG 


TTGTTGAAAC 


AAGTGTTAAC 


f O V 


ATTGCCAGAA 


ATTAAGCAAG 


GCATTTTATT 


CCAAGGAGCT 


ACGGTTGGTA 


TTCGTG7\AGC 




gTACAATTTA 


TTTTGTCAAG 

X X X X %7 X 


AACAGGAACG 


AGCGATTTAT 


GTGATTGCTG 


CTCAAGCGCA 


900 


AGAAAAA6AA 


GTTCAGGCCA 


AATCACAAGA ACCTATGGGG 


ATTGAATATT 


TGGCTGTAAA 




AGCAGCGACA 


ATGTCTTTTG 

«V X XJ X X X X X \3 


CTACTAAAAT 


GTTTAGTGAA 


AAAGTGATTG 


AATTGGAACA 




AGAATACTTA 


AAAGAAATGG 

****** VJ-*V*V#* X VJVJ 


ATGAGATGAC 


ACATGAATAA 


AGAAAAAATC 


GCACTACTAG 


x.\JO\I 


TTGATTCTGG 


GAC AGATf; T T 


CCAGAGGCAT 


TAGTCAAACA 


ATACGGCATG 


TATGTCTTGC 


114 0 


CCTTACAGAT 


TATTTATPPT 

xx^x X xr\XK^\^x 


GAAAAAACCT 


ATACAGATAA 


GGTAGATATT 


ACACCAGAAG 


X ^ VJ u 


AAGTGTATCA 


ACGTTTAGAG 


AAAGAAATGG 


CTAGTACTTC 


ATTACCTGaT 


GGTGCAACAA 


x.£Du 


TCCAAGCTAT 


X X X A umwo^v^ 


ATTAAAGAAG 


CGGGTTACGA 


AAAAGTCTTA GCGGTGACCA 




TT TCTAGTGG 

XXX N* X X WW 


TTTAAGTGGA 

X X X JV1«9 X 


ACGTATAATG 


TAGTTCGCTT 


GCTTGGAGAA 


CAAACCGAAG 




GACTAGATGT 


ATTTGTTTTA 

• X* X X X X X X Mm 


GATACAAAGA ACATTGGTAT 


CGGGGCTGGC 


ATTCAGGCGA 


1440 


TACGAGCAGC 


AGAATTAATT 


GAAACAGGAT 


TAGGATGGCA 


AGAATTACAA 


CAAAAATTAA 


1500 


CAGAAGAAGT 


AGCCAACGCA 


AAAGTATTTT 


TTAATGTAGC 


CACTTTAGAA 


TATCTACAAA 


1560 


AAGGTGGACG 


AATTGGTTTA 


GTCACTTCTA 


TTTTAGGGAA 


TGCGCTAAAA 


TTAAACCCCA 


1620 


TTATTTCTTG 


TAATGAAGAA 


GGCATTTACT 


ATACGGTGGC 


TAAGTCTCGC 


GGCCGTAAAA 


1680 


AAAGTCTTGA 


TAAAACGTTT 


GAGTTGGTGA 


CGAACTTTAT 


AGGTGAGGCA 


CCCCGTTTCC 


1740 


GTTTGGCAGT 


CGCTCACGGT 


GCGGCAGAGG AAGAAGCmAA 


AGCmATGATG 


GAACGCTTGA 


1800 


AAGCAGCTTT 


TCCACAAGCA 


GAAGAAATTT 


ACTTTGGAAC 


AATTAGTCCG 


GCATTAGTTG 


1860 


TTCATACAGG 


ACCAGGACTT 


TTAGGTGTTG 


GCATACAGTT 


ATTAAATGtT 


AtTAAATGAA 


1920 


AAATCCCCTC 


AAATTTTTTT 


GaGGGGrTTT 


TTCaTTTAAG 


ATTyCyCTTT 


TyCATAGrAA 


1980 


TATCaTTtAG 


CyGGaAAAAC 


aACGrAAATT 


wTwTTGaAAA 


TyTCACGAAT 


GTGTTACAAT 


2040 


CCATCTACAA 


AGGGGGAtTT 


TTTGTGaAAA AAGCAATAGC 


AGAATGTTTA 


GGAACATTTA 


2100 


TTCTTGTCTT 


TTTTGGGACA 


GGAACAGCCG 


TTCTAGGAAA 


TGGTATGGAA GGGATTGGAA 


2160 


CAACGGGGAT 


TGCCTTAGCA 


TTTGGGTTAA 


CAATCGTTGC 


TGCTGCCTAT 


AGTATTGGCA 


2220 


CAATTTCAGG 


TGCACATCTA 


AACCCAGCTG 


TTTCTATAGG 


AATGTGGCTT 


AATAAACGAA 


2280 
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nrJAAPTAATP 


1 1 M 1 o 1 Mo 


T/^nnaPAAAT 


1 VAX 1 V30 X V3V7V^ 


*PT/^ A TTPPr*T 

1 X \3t\. 1 X OV^V' 1 




l,^x\X X 1 V3V« 111 


Al IMnu^^Al 1 


\^ k AAAAuV? 1 \9 


PTPfZ T A A A A C 


TATTf^AnAAf 


TTAHf^TPAAA 
X i AooXl.^AAA 




Al VTvarOvy i n^aO 


rv^A ATTAACT 
OVaAAX XAAVai 




PATTCAPACT 
WlX XV9AV.rAl9X 


C^SAAATTATT 
^vxnAAX XAX X 


TTAAPATTTA 
X X AAUAX X X A 


OA CO 
^4 Oil 


TTTTTGTTTT 


A\a X X w X \*t\l V9 


af^a/^TTar'aa 
AUAo X 1 AViAA 


IST^^r"PAAfZAA 
u X ul.^ X AAoAA 


Aol9 v^AA X l9V« X 


TPATTAi^r'ar' 

X L.AX X AGGAG 




GAATCG xAAi 


T/VOTT A A^ A 


r"P A A/^ A A*P/^ A 

C i AALAAX liA 


i XGAGi lAGl 


k^GuOAX XGGX 


TT^aOArf^^ A A 

X XGAGAGGAA 




CTTCTGTCAA 




AbUAi Ububl^ 


UAGGGG 1 X 1 X 


X oColsG X GG X 


XGAGGGX XAX 




CTGAGTToTo 


uAX X X 1 XAXL» 


ox iliCuUUAl 


1 AAx XGGIGG 


ATTATTa^/^r* 
AX XAX XAGGG 


GG X G X X G X AG 


z /UU 


CAAAATx Ul> x 


C 1 1 AVaATAuVa 


r*A Ar'A ATAf A 

(9 AAiaAA X AGA 


1 iAGGGAAAG 


AAGAAIGX X 1 


AT A A^A Ar*/^ 

GAxAAGAAGG 




GTCAAxTbTT 


A^ A r* A *n*p 
lax uAViAlsATx 


CTTTTTTTGT 


T A T A /^T A A 

TA X AG 1 AGGA 


^A Ar»r"rA A Ar* 
GAAGG i AAAG 


GGGCATTTTT 


OQori 


AACCAAAuAT 


O TV TV TV TV /^TV rT»/^7V 

uAAAACATCA 


rn/^A TV A A T A A 

TCAAAATAAG 


GxATAATAGG 


A A AOnr AIV/^ A A 

AAAGTATCAA 


Tk m TV A/>AmfPA/^ 

ATAAGATTAG 


OOQ A 

ZooU 


AAGGAAAAAG 


AGGTGTCTCT 


1*fT»/^»I»/^TV IV ^ A TV 

T TGTGAAGAA 


TV Tk rT*f¥i mn* a /^rn tv 

AA T T TT AGT A 


GTTGATGACG 


TV TV Tk f^f^r^ Tk Tk m 

AGAAGCCAAT 


2940 


m rn tv tv iv f*/^ 

T T CACiAG ATC 




ATTTGGTTAA 


TV TV A /^/*» TV fT» TV T» 

AGAAGGATAT 


TV TV rn 7k rTtrrtrn Tk 

GAAGTATTTA 


CTGCTTATGA 


3000 


TG G AGAAGAA 


f^/^ T\ fry fry TV tv tv 

GCACTTGAAA 


TV Tk /*»ryt ^ r** fk jk /^Tv 

AAGTGGAAGA 


AG TGG AACCA 


7k ^^rnm 7k 7k Tifn 7k 

G ACTT AAT T A 


TTTTAGACTT 


3060 


AATGCTCCCT 


IV TV A TV n*/^^ A 

AAAATGGATG 


Mllfll Tk Tk TV /^rT» 

GCTTAGAAGT 


f f> ^ t\ TV Tk 

CGCGCGAGAA 


^^ff»^*^*/^^^ 7k 7k 7k 7k 

GTGCGCAAAA 


^^7k ^^7k 7k fYl 7k f¥l 

CACATGATAT 


3120 


GCCAATCATT 


ATGGTGaCTG 


CCAAAGATTC 


TGAAATTGAT 




GATTGGAATT 


3180 


AGGAGCCGAT 


GACTaTGTAA 


ih Ik Ik Ik fitm 

CGAAACCATT 


TTCAAATCGT 


^4k Tk mMI Tk 4 Hlklil^ 

GAATTAGTTG 


CTCGTGTAAA 


3240 


AGCCAATTTA 


CGGCGAGGTG 


^A A A A fWte*^ 

CAACCAATGC 


^Tk Tk A^lk A 

GAAAGAAGCC 


^A/^^m^A^A A 

GAGGTGACAA 


^Tk ^Wk Tk fn#^m^ vk 

CACAATCTGA 


3300 


ATTAACCaATT 


GGTGAT. 


CCATTCATCC 


m^A nt^^A nr A ^ 

TGATGCATAC 


ATGGTCTCAA 


AACGGGGTGA 


•a o A 
3360 


AAAAAX xVjAA 


TTAAuvJuACC 


r^nsr^ TV TV fnTft^ A 

GTGAATTTGA 


GTTACTTTAT 


fV*A^fn'PA/^/**A A 

TACTTAGCAA 


A A A rn A fT^r^f*^ 

AACATATCGG 




AW^Ava X 13aX o 


AU X x uAAL. 


ATTTATTACA 


AAGGGi 1 xGG 


GGXl,Ax.GAI 1 


AX X xxGGGGA 




X V X u^boAlJA 


l9 X (9VaAL>l3 X AA 


KMa i AL.GTL.G 


TT^n* A A A ^ A A 

X xTAAGAGAA 


A A A An*'Pr*A A^ 

AAAATTGAAG 


A T A r^HT*^ A A 

ATAGTCGAAG 




X v«A 1 ^LfAMv^la 


fp A qippq* A /-»f»m A 

lAX X lAui lA 


C 1 Uia X Ulj 1 GG 


GGx xGGJl XAI 


TA'P/^'PAA/^A A 

XAIGIAAGAA 


A Tr^^fp^A Ar« A 
AX GGTGAAGA 


'3£AA 


oiaAu X AA X L><A 


/^r^ Afi»r» A Ar* A 
lau 1 Ax (sAAtaA 


A A A A ar^TVp/^ A 

AAAAAGxxGA 


CTTTTTTCAA 


TCTGTTAATT 


■pip A A ^ A *P>PO/^ 

TTAAGATTGG 


JobU 


X X XAXUAX X X 


AX iUi\3CXA9X 


1 AL. xGAl X GG 


GAx X GAAAi G 


ATTGGTGGCT 


ATI TCATTGG 


no A 


XloAAl XuVsAA 


UUuAL. X ALAA 


TTTCCGATTT 


n>AAAAAAAAn> 

TAAAAAAAAT 


ATGGATTGGG 


AAGTTGTGCA 




AX XoXWAAAU 


A^laX XAAoXA 


L>wyw\oA 1 oAo 


^A A/^ A A Ar* A T 

GAAGAAAGAi 


r^rnr^f^ TV A/^^T A 

G i GGAAGG 1 A 


G X GAGGT 1 GA 


Q O >l A 

Jo4ll 


iov»/vv\l 1 l/\ 


A A A AA APrT'T 


T A Tr'Tr* A TT T 
X AXL^loAl 1 i 


1 1 GAAAl GGA 


A T A T 'P T 1^ 

GAiAl X XGiG 


A Af^^^A^A AT 

AAGGGAGAA 1 


"5 Q A A 


TGTCGATC5AT 


AAACT^nATTA 


TTrTy2/2pa a r* 

1 1 \^o\9V3V.«nn^ 


paaTr'aTTTA 


a ATr^aanaaa 

AA X L. AAV-*AAA 


AlAX XAI luo 




GAAAAAGAAT 


vsm i/^iv«oiV9 


ATTTAAATHA 

t\ 111 nr\t\ 1 \3t\ 


Vrf> 1 1 1 n^OMo i 


AA A A A ATATr* 
AAAAAAX Al ^ 


aapr*TTTafza 

AAol^ X X X AoA 


S LI 


TAATGATAAA 


CGCGTGTATG 


TGAATGTCCA 


GCCGATTCAA 


TCGCCTACTG 


GAGAAACAGT 


4080 


GATTGGCGTC - 


CTTTATGTGA 


AAAGTAATTT 


AGAAAATAAA 


TACCAAGAAA 


TTACCAACAC 


4140 


AGCAAGTATC 


TTTTTCACTG 


CTTCTATTAT 


TGCCGCAGCA 


ATCTCGATTA 


TTGTGACTTT 


4200 


ACTGATTGCA 


CGATCAATCA 


CGAAGCCGAT 


TGGTGAAATG 


CGCGAGCAAG 


CCATTCGAAT 


4260 
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CGCTCGTGGT 


GATTACGCTG 


GAAAAGTAGA 


AGTCCATGGA 


AAAGATGAAT 


TAGGCCAATT 


4320 




TTTAATCAAT 


TATCAGAACG 


GATTGAAGAA 


GCACAAGAAA 


CAATGGAAGC 


4380 


AGAAAGAAAT 


CGTTTAGATA 


GTGTCTTAAC 


GCATATGACA 


GATGGTGTCA 


TTGCGACGGA 


4440 


TPGrrGTGGA 


AAGGTGATTA 


CGATTAATGA 


AATGGCCCTT 


TCATTATTAA 

X W«*X A'aX AaS** 


ATGTAAAAAA 


4500 




ATTGGGACCT 


CGTTATTAGA 

Vi« w X X4^x x«M3n 


GTTGTTAGAT 

V3 X X \9 X X rnSfX X 


ATTGAAGAAG 


ATTACACATT 

«^ X X «*VM^»WMa X X 


4560 




TTAGAAGAGC 


CAGATGAACT 


GCTGATTGAT 


CGCTCAACGT 


CTGATCGTGA 


4620 




ATGATTATCC 


GGGTAGACTT 


TACAATGATT 


CGTCGGGAAT 


CAGGATTTAT 


4680 


T a PTCriPTT A 


GTTTCCGTAC 

U X X X w X 


TTCATGACGT 

X X X X 


CACAGAACAG 


GAAAAAAACG 


AACGGGAAAG 


4740 




(W TTPP A ATG 
UX X X^V^nnXO 


TTTPTPATHA 
X X X ^ X ^Ax vm 


GTTGPGAAPG 


PPTTTAAPAA 

XXX AAv> AA 


GTATGPGTAG 

O X A X X 


4800 




XXX O L o 


AArinACPTT*^ 


PPAAAAPPPT 


GAGATTGPGP 

V3AvlAX XV9\i«V3^^ 


PPAATTTPTT 

\i^\3AA X X X V« X X 


4860 


a n a a r"pr» a r^n 


TTanaanaaa 


p pr: a ppcn a T 


l^aTTPflTATn 


ATT A ATP ATT 
AX X AA1V3AX X 


TPTTaaaTTT 

X O 1 1 AAAX X X 


4920 


a Tr* T r'l^r* a Tr* 
AiUxL.lioAiU 


oAL. X U X oVsoo 


aTaf^ar'aTr^T 

A X AL-Aw* X X 


Tr*aaTTaf:ap 

Xi^AAX X/Wj/io 


TaTPTPAATT 
X A 1 u X o AA X X 


TTaappaATT 

X i AA^oAAX X 




ViAX XAAX XXI 


old XuWaAXk^ 


r^l^TTTP aTaT 
IjL..I X iwiAXAI 


paTPaTTfiaa 

uAXuAX X LxH/\ 


aaTPappaaa 

AA X oAu^ AAA 


aaaaTTapAA 

AAAAX XA^^AA 




AnX X IVtCv^oX 


naaTTTaf^Ta 

IjAA XXX A^ X A 


AAk«uL>oAX X X 


a Ti^ pr:T a A a 


TTAPaTAPAP 
X X A\:y A X AV,MrVV7 


APaaaPTAAT 

A\,« AAAV? X AA X 


SI on 

^ X\J\J 


lUiilTuX X X XA 


!2APAAPATTT 
l3A!L#AAV#AX X X 


TfTAAPaATPP 
X oAAViiAAX \a\* 


naTTAACTAT 
urVX X/inoXAX 


TPPPPAPATG 

X ^V9V^^«AO A X V3 


PPPPPPTPAT 

\3\.«OV3^0 X ^«AX 




■p a f^c^Tf^r^f^c A 


PTAPTTfZAAA 
X Al9 X X laAAA 


PAPATAATAA 


'P^'PP^ 'PPT T T* 
XoX\.i>\9X^X A X 


APTATPTPPP 

AU X AX Vnr X 


APPAAGPTTT 

A^WAA\9\3X X X 


5220 


/2/2/2r*ATr*r*r^ 

0\aOV«nX^^\«X 


AAAAAAf^ATP 


TPn/2f2 A A T 
X ^oouvvinu X 


PTTPnAHPPT 


TTTTATPPTP 

XXX XAXWV9X\3 


TGGATAAAGP 

X UwAX AAAU^ 


5280 




CPAPA Af2/^T<^ 
\9V«nv.*nnu\3 X V3 


laVanV.* X XXX 


aCf^TTAfSPA 


ATTTPTAAAP 

AX X X^XAAAV7 


AAGTAATTPG 

AAOXAAX XV«\3 


5340 


CGPPPATAAP 


UOWWJXnX X X 


nnf^Tf^AAAG 


TAPAGAAGGT 


GAAGGATPGA 

wAAwUAX ^UA 


PTTTPTATAT 

^X X X^Xf&X^&X 


5400 


TTPAPTAPPA 


TATGAAPPTT 
xnxi3nr\^V#X X 


ATGAAHAn^^A 


TTGGTGGGAA 

X X W X Uww/V% 


TGATGAAATT 

X\3AXwAAAX X 


ATPAGAATGG 

fix ^XlwXlAX \7w 


54 60 


ATTAf!AAGAA 


X X X X Un X 


TTTGATGGTP 

X X XUAX\9V3XV^ 


ATTTTAAGTA 

X X X X rirVS X X% 


TATATTTTTP 

x<^xnx XXX Xw 


AGTCAATATC 

X N^XV% X aX X w 


5520 


TGGCTGAATT 


PTGCCAAAAA 


AATAPPAGAA 


ATGAAGTCGG 


GAAGCCAAGT 


CACAACAGCT 

Ntft#^%MrA«XN«*X^J K# X 


5580 


GTCAATGAAA 


AAGCPATTGG 


CGATGTPTAT 

^U!n X O X W X A X 


TTACCTTTGC 

X X flN^Xtf XXX 


AATTGATTCG 

wvrx X X w«x X X 


AATAACTGAT 


5640 


GGAAAAGCGA 


TGCAAAGTAA 


TPGTGAAAPA 


TTAATTAGTA 

X X wmr\ X X «X V X 


ATGTTCAAAA 


TGATATTAAA 


5700 


ATGGCTACGT 


TTGGTAAATT 


GACACAAGTT 


GTGACAAAAA 


ATGCAGAGCA 


ACTTAAGCGC 


5760 


TACAACCAAA 


TGGAACAAGG 


P ATTG AACT T 

\,» A X X O/VW^ X X 


CTTTATCAAG 


GtCCCTTTTT 

^7 ^ ^^^^^^ X X X X X 


AATCtCaGAC 


5820 


TATGCTTCGA 


TTTATAATCT 


ATCCATTAAT 


TTTACTAACT 


TtAATGAGTT 


GACGGACCAG 


5880 


TATTTTACGA 


AAATTCAATT 


GGoAtTTTAA 


CGAAAATAAG 


ATACGTTTTT 


TAGATTATGA 


5940 


TCAAGCCAAC 


GTCTATGAAG 


CGCCCATGAC 


TGTCAATAAG 


GCGCGCTTAA 


TGGGAATTAT 


6000 


CAATAAAGAG 


GGATTACAAT 


ATCAAGAC6T 


TTCCGAAAAT 


ACGCTAACCA 


AACAAGGGCA 


6060 


ATGTTATTTA 


ACCAATGATA 


TGAAGTTGAA 


AAAGTACAGT 


TATATCTTAG 


CTTCGCAACC 


6120 


AGTTACTCGT 


TTTAGGAATG 


CTTTTTTCAA 


TGAAACGGAA 


GATATCCAAA 


CCAATGAAGA 


6180 


CAGTCAAGAC 


TTAACCTACA 


CGAGTAAAGA 


AGAACGATTG 


TTTGCAGAAG 


AAAAACTGGG 


6240 
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AAAAATTGAT 


TTTAAAGGGA 


CCT TGCC AGA 


AGAGAATAAA 


CGGGACTCAA 


TCTATAATCA 


6300 


AACCTTTTCT 


TATGTAAAAP 


GTTTACGAAC 
\3 XXX nwwm^ 


CAATATGGGG 


AATTTGCGTT 

XanX X X x^ . 


APTTTGATPG 


6360 




AGTCTHAATT 


n 1 XXX 


TGTGGAAGGT 

X \J X \«\9W^w\3 X 


TTCPCAGTGT 

X X V^w^X^w X w X 


TPAGTAATGA 
X X nn x uin 








TTP^^PATPAf 
X 1 Vi«OV«AX ^'A^i' 


GAAPAAPGAT 


GGTGPTGPAP 


PAAGPGTAAP 


04 DU 


v., AX i AAUAUA 


n/^TCTr* a a T A 


nci A Tfr* A A^T 


nPPG AT TfTT 


TPAGAAGAAG 

i. V>f/UjnnV9AA\7 


A A CTr* A nf*c"r 

AAu X wiCuLr X 


OD4U 




^AAAli&TTf^A 
uAnnAA x 1 o A 


X X Ano^u XXX 


AGAAAPf^nPT 
nOAAAV\9v9^>« X 


GHT^^PT A AAA 


APP A & 11 21 A &T 
AljVjAAnAnAX 


Ctif\f\ 

oouu 


TCAAl X 


vX XAXv^^oX X 


AXAdjX 


papaatt^aa 
uA^mAAX XuAA 


HAAf^TPAAAP 
\3AAl3 X OAAAVrf 


H H TPP TTP H 
AAVji X X X GA 


£itiCf\ 

ot>oU 


T TT AAG TCCi» 


laAA X X A 1 u 


XuUXTlAXAA 


p IV a r* ail TTP r" 


X AVJAU X AA 


UGGAV.X XGGX 


0 I^U 


AAAACAATTA 


CuAAbCTTv^Cv 


ACsGl GGGCTA 


AxitGAX J. X X.A 


AA^^CsAA ri. IsA 


GrGGATTx 


D /OU 


TTCTTAGCAT 


Tx. TTAGGATT 


AAATATCTTT 


T TATTTGGAA 


TTTATCGGGA 


AGGTGAAuAA 


bo4U 


G AGG AGAG CA 


ACGTCTCTTC 


TTCCAGCCAA 


ACAGAATCGA 


mm/^lk TV TV rn r^f^ 

T TGAAAG T CG 


/^m rn/~»/^ Ik Ik Ik Ik 7\ 

GTTGGAAAAA 


C a A A 


GAT AAC AT CT 


CGTATAAAGG 


GACACT TTCT 


m ^ TV TV iv t^r* TV m 

T C AG AACG AT 


TGGAAGG T T A 


timikmmmik tv m 

TTATT TAAGT 




GGC G AACAAA 


/^f^ TV TV mmmmm/^ 

CC AATTTT TC 


m/*>/^m/^/^mmm n 

TGCTGCTTTA 


AAAATCCAAC 


/^m/^ Ik Ik Ik Ik Tk Ik 

GTGAAAAGAA 


TAAAAATTTT 


T AO 

/ UzU 


TTGAGAAATG 


GGCTGCAAAT 


TGCGGATAAT 


ACTTTAACGA 


GTGTACCTAG 


TAAAAACTAT 


7080 


TTTATTGATC 


CTAAGAAAAT 


TGATAAAGAT 


TTAAGTACCT 


TTTTAAATGA 


AAAAAATGCT 


7140 


TTATTATTCG 


GAGACGAATA 


TCAATACTTA 


CCAGAATTTT 


CTCATTTAAA 


AGAGCCGACG 


7200 


GCAGAAATTG 


TGGCTGCACA 


ATCGTATAAA 


GGAATTCCTT 


TTAGAGACGA 


CACGGCAAAA 


7260 


TTAAGTATTT 


TAGCAGATAC 


GTCAGGTGAA 


TTAT 






7294 



(2) INEX)RMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

nATTTCATCT AACTTTTTGA CGCCnTAACA CCCTATTATC ACTTTTTACA TCTTATTTCG 60 

ATTTAAAAAT AACGATAAAA AGCCCGTTCC TAACAGAATT ACTGCTGAAA CATAATAAGG 120 

GAAAAAGTGA TTGATATCAA ATAAAGCTCC TGCTGCCATT GGTCCTAAAA TATTACCAAA 180 

ACTTGTAAAT GTCGAATTTA GTCCGTTGaT GGTTCCTTGT TGATCTCCAG CATGTTTGGA 240 

TAAATAAGTA GTTACTGCTG GTCTAAACAA GTCAAACGCT AAAAAGACAA TAAACGTTGa 300 

AAATACAACA ACTAAATTAT TTTTTGTAAA GGCAATCACG GCAATAAAAA TGGCACTTGC 360 

AAAAAAGGTT AATTGGATTA AACCCATTTC ACCTATTTTT TGGACGATTG CATCAAAGAA 420 

AAACAACTGA CAAATCAACG CTAAAATACC ACTAACCGTA ATCACGATTG CTATTTCACT 480 

TGTGGTAAAG CCAAAATTAA TAGTCGCCAT AATACTATAA ATAGATTCGA ACGCTTGCAG 540 

GCCAAAAGAG GAAATTAAGA TAATCACAAA TAATGAGGTA AACATTGGAT TTCTTAAAAT 600 
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ATCCATAAAT 


GAACCTTTTT 


TCGCTTCAAC 


AGCGGCTAAA ATTCGTTTCT 


CTGGCTCCTT 


660 


CAAAACAGTT 


AATGTCAAAA 


TAAAACCAAT 


AAACGCTAAA AATGCGGCTG 


CAAAAAACGG 


720 


GGCGCGAATA 


CCTAAATAAG 


CaATAAAACC 


ACCAACTCCT 


GGTCCGATAA 


TAAAACCACC 


780 


ACTAATTGCG 


GCTGACACAA 


GCCCCATCGC 


TTTCGGACGT 


TCAGAAATCG 


TGGTCATATC 


840 


TGCCACAAAG 


GCTGTCACTG 


ACGGCATTAA 


TAAAGCGGCG 


GCAATCCCGC 


CTAAACCACG 


900 


AGAAATATAA 


AAACCGCTTT 


TCGCTTGGGC 


TAAACCAAAT 


AATAACTCAG 


AAATTGAAAA 


960 


CACCAACATG 


CCCGTTGCAA 


TCATTTTTTT 


CCGACCAATT 


TTATCCGAAA 


GGACACCTGC 


1020 


GATTGGCGAT 


GTGATTAATT 


GGGCAATCGC 


AAAAATAGAA ATCATCATTC 


CCATTGTGGT 


1080 


ACCAGAAAAA 


TGCATTTCTT 


CTTTTAATTG 


GGGAATTACG 


GGAATGACTA 


GGCCTACGCC 


1140 


TAGAAAAACA 


A6AAATAAGT 


TAGAAATTGC 


TAAGTACATC 


ATACTATTTT 


TTTTCGTCAT 


1200 


AATTTGCCTC 


CTCCATAAAA 


AAACAGCCAT 


CAAAAATGGC 


TATTTCTGTT 


TCTTTGTATG 


1260 


TACTTTCTTT 


GACAGTGTTT 


CGGCTGAAAC 


ACTCGTTGAG 


TCATTGAGAT 


ACATTTTACA 


1320 


CTT^TTCACT 


GAAAATTTCA 


CTAGTTTTTA 


CTTATTTTTT 


TAAAGGAAAT 


TGTTATTTTT 


1380 


TAAAGACATA 


ACTTAAGTAT 


CCAAGTATTT 


AATGAATCGC 


TTTTGTTTTT 


TATCAGAAAC 


1440 


ATAGCCAATG 


TGTTCATAAA 


ATTGATGTGC 


TGCTGTACGC 


TGTTCCCCTG 


AATTT7\AGCG 


1500 


AATCGCAGAC 


AAGTTACACT 


TTTTAgCTcC 


GTTcTAGTTT 


CGTCATTAAt 


TGtTTACCAA 


1560 


TTCCACGGCC 


TTCTGCTTCT 


TTTGCCACTG 


CTAATCCCaG 


AACGTtCAAT 


AACGTTTGGG 


1620 


AATAAATTGT 


TTCATAC%CT 


TGCGCATGCA 


CATAGCCTAA 


AATCTTGCCt 


GTCTTCGTTT 


1680 


CTTCATAAAC 


ATATAAAAAC 


TCTCTGTCAC 


TGGCTAGTAG 


TAATTTCTCA 


ATCTGTTTAC 


1740 


AGGTCACGTC 


AACGGGAACT 


TCATAGCCCA 


ATTGTTCTTT 


ATTAATGATA 


GTTAACGCCG 


1800 


CTGCATCTGT 


TGGCTTCATT 


TTTCTAATCA 


AGGGTAACAC 


CCTCCTTTAC 


ATTCTTACTT 


1860 


AAATAATAAC 


GAAAGCATAA 


TATCCGCAAA 


ATCATTTAAT TCTGAATCAT 


TGTATTGCTC 


1920 


TAACAAAACT 


GTTTCGCCTC 


GATACTCTTT 


TAAAGCTAGT 


TGTATCAGCG 


AATGATGATT 


1980 


AGTAGGAAAA 


TGTACTAGGC 


CCCATTCCCC 


GCCAGCCCTT 


TTTGATAAAA 


TTTTATTCTC 


2040 


TTGCTTAAAT 


GCCATTACTC 


TACACATATT 


TAAAATAACG 


TACATAGGTT 


GTTGAACAAT 


2100 


TTCTTTTTTT 


GCCTCTTTCA 


CATCAGCTAC 


AATACTTTCA AAATATTTCT 


TTTGATCTAT 


2160 


CCAGCCAAAA 


ACGGAATCAA 


TCGGTTTTCC 


TACGAGTACT 


TTTCCATATA 


ATTTTGTAAT 


2220 


CATAAGATGT 


GCCACTAGAT 


CCTCATCGTA 


TCCCATCATA 


TCTTCAATAT 


ATTTATTTTT 


2280 


ATTAGTCAGA 


TATCTGGAAC 


GATGTATCGG 


AGAAAAATGT 


AAATCGAAGG 


GAATCGGATA 


2340 


ACTCAAATTT 


ATTGTATCTT 


tTAATTGtAG 


CACAkGGAAT 


TCTAtCCCTT 


TTTTTGGkGc 


2400 


CCACTTATCT 


AAAATATTAA 


TTGTATAATC 


TAATAATTGT 


TTCTTCTCAT 


CAATAGTAAG 


2460 


CTTTTCATAA 


ACCACAATCA 


GTACATCCAA 


ATCACTTACT 


GAATGACAAT 


AGGTTCCAAA 


2520 


AGCATAAGAA 


CCATGAAGAT 


ATATTCCAAC 


TAAATTTTCT 


TTTATTATTT 


TCATATATCC 


2580 
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ATTTTCCATT 


TGAGCTAATA 


ATTGATGAAC 


CACGATTATT 


CCTCCATTAT 


TGATTGACAA 


2640 


TTGCTATAGT 


TTTTTCGATA 


AATAAAGAAC 


TAACTCATCA 


TCATTAACAC 


ACCGTTCGTT 


2700 


TGGTTTCAGC 


TGCTTATTTT 


GAAACCAGAC 


TCCTGAACCA 


TCTGGCACAT 


ATCCTTGTTT 


2760 


GATATATAAT 


CTTTGCGCTG 


CACCATAACC 


AGGATGCAGG 


CCTACTCCCA AAGAAACAGT 


2820 


ATCTGCATAA 


CTTTTGGCAA 


CATTTTCTGC 


ACGATTTAAT 


AATAACCGCC 


CCACTCCTTG 


2880 


TTTTTGAAAT 


GGTAAGAATA 


CATTGAAATC 


AGCTATTTCA 


GGATATAAAT 


TTTTAAAAGG 


2940 


TCCTTCCTTA 


GCTAATGGCA 


GTAACGTCAC 


ATAACCTAAC 


AGCGTTGCCT 


TTTTCTCCGC 


3000 


TACAAAAACC 


GTTCGTTGCT 


TGACTAATTG 


TTCTTCTAAA 


TAGTTCATTA 


AAATCTCTTG 


3060 


GCGACTTGGC 


CAACCTTGTT 


GGrcAAATTG 


AGTATCCAGC 


GCTAAGATAT 


CTTTCTCTTG 


3120 


CATTTCTtTA 


ATAACCACTA 


GCTCAGACAT 


CAGCATTTCC 


TCCCGAAAAG 


CGTTTTCATT 


3180 


TTATAAAACG 


AAGCGAATAA 


TTCTTCGCTC 


CCTTATTCAT 


TTTCAA/^CC 


ATCGAGTTGA 


3240 


TAATATTTGG 


GCAAAACAGC 


AACACATGTT 


TCTTCTGAAT 


TAACAGCAAT 


AACGAACCCT 


3300 


TGCCATAAGG 


nCCACGTCAC 


CGCTTCACCT 


AATTTCTCTT 


TCACTCGTTG 


nATCATTTCA 


3360 


GCTTGCGTTA 


AGTCTCGTGC 


AGGrAAATGT 


TGAACTTTAT 


CATTACTGAT 


TAAAAAAATA 


3420 


AA 












3422 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



CCGGACCATC 


AAAATCACTT 


TTTTCCTGCT 


CTTTTnAAGA 


AACGAAnGGA 


CCCATGTCTT 


60 


TTCAGAAAAG 


TTTGTTACAA 


TnACAAGCGT 


AGAGAGGATG 


TTAGGGTATG 


GATAAAATGA 


120 


AAGCTATTAC 


TTTTTTTGAT 


TTnGATGGAA 


CTTTGTTAGA 


TGGAACATCG 


CAAATAACTC 


180 


CTGAGATTAC 


CGCAGCAGTG 


GCGGCCTTGA 


AAGACAATCA 


AATTTTACCT 


TTAATTGCGA 


240 


CAGGACGGAC 


CTTGTGTGAG 


ATTCAGCCAA 


TTATGAAAGC 


CAGTGGCATT 


GATTCTGCGA 


300 


TTGTAATGAA 


CGGGCAATTT 


ATTCATTATG 


AAGGAAAAAC 


GATTTATTCA 


GATGAATTTA 


360 


CAACAGAGGA 


ATGTGTTTCA 


CTGCATGAAC 


ATGTCAAACA 


GCGTGGTCAT 


GAATTGGCTT 


420 


TTTACAATGA 


GCGCCGTATT 


TTTTGTACAG 


GGCATACTGG 


CACAGTGAAA 


CAAGCCTATG 


480 


ATTATATTCA 


TTCAGCTGTT 


CCAGAAATT6 


ATCCCACAGG 


TTATGAGAAT 


GATGCCGTCA 


540 


ACATGATGTT' 


GGTCTTAAGT 


CAACATGGCG 


ATGATGATGA ATATTACTAT 


GAGCGGTTTC 


600 


CTGAATTGAC 


CTTTTATCGC 


AATGGTCCTT 


TTTCAATAGA 


TATTGTTCGT 


AAAGGGGTCT 


660 


CTAAAGGCTC 


TGGTGTCAAA 


AATCTTTTTA 


ACACCTTGGG 


ATTAAACGGC 


ATTCCGACTT 


720 
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ATGCCTTTGG 


TGATGGCATC 


AATGATTTGG 


CGCTGTTTGA 


GGtCTGTGAT TATGGAATTG 


780 


CCATCGGTAA TGCGCGAGAA 


GAACTAAAAG 


AAAAAGCCAC 


CTTTATTTCA ACTAAAAATA 


840 




CATTGTTAAT 


GGCTTGAAAA 


AATTTGATTT 


ACTATAAGAG AAAACTACCT 


900 




ATCTAPTTTA 


GAAGAAGGTA 


GTTCTTTTGT 


ATGT TCAT TT 


TTCTTTTTCT 


960 




VariVanu^ X X 


CCCACTACTT 


GAATCAGCAA 


AATCGCTCCC 


TV TV O TV Tt Tl H. 

AACAAACAAC 


1020 




\9l X XL^^XIjAI 


AGGATTGTtC 


TTCTGATTAA 


TV TV rpTV TV Vprn TV It m 

AATAAT TAAT 


ACAACACTGC 


1080 




i^Atj X A 1 


TGGTTGTCCA 


TTTTTATTGA 


CCCCTTTGTT 


TTTCTATGTT 


1140 


T TV cf^ TV T a t^r^ 2V 


i O 1 X Uk^Lr i A^ 


TAAAATGCAA 


GGnAAGATCC 


TAGATTCTTA 


CAACTAACCC 


1200 


ATAATTTTTA 


CTAATTCCTT 


ATTTGTTTTC 


ATTTATTCTT 


AAGATTCATT 


. TCATTTTTGg 


1260 


TAAAAAATCA 


CACTTTTTTT 


CTATCTCTGT 


TTTAAATTGC 


GTACAGTTCT 


TAAAAAAAAk 


1320 


GATACACTAA 


AaGTGaACTC 


GaAAkGAArG 


GrGCCATtCC 


TTTGGtTAAT 


AGCGAAAAAA 


1380 


GAAAACAATT 


AATTATTGAT 


ACATGCATTT 


TAGCTGGAAA 


AATTATGCTG 


GAAAGCGGAT 


1440 


CTGAAGTATA 


TAGAACCGAA 


GATACAATTA 


CTCGGATTGC 


AGCAAATGCT 


GGCGAGCCAG 


1500 


AGAGCGTTTG 


TTACACGACG 


GCTACAGGGA 


TTTTCGTGGG 


CTTTCGTTCC 


AGTAATTATA 


1560 


CGCAATTAGA AAACATCCCC 


CAGCGCTCGA 


TTAATTTGGA 


AAAAGTTAGC 


CTTGTCAATC 


1620 


AGTTATCAAG 


AGAATTTGCG 


CAAAAAGAGA 


TTACTTTACC 


AGAGCTTTAT 


CAGCGCTTAA 


1680 


CTCTATTAGA AACAGATACC 


CCCACTTTTT 


CAATTTCGTT 


ACGCTTGTTA 


GCAGCCGGGA 


1740 


TTGTTAGTTG 


TACGTTAATG 


TACATTTTTG 


GTGGCACTTG 


^ ^ Tt Ik ^ millf Hill 

GcAAGATTTT 


ATTGCAACTT 


1800 


TTTTTGTTGG 


CGTGATTGGG 


TATGCCAGTT 


ATCTTTTTAC 


f^f^l^ Tft It M /''II II II It 

CCAAAAGTTA 


TTCCAAGTTC 


1860 


CTTACCTCGA 


TTCATTCGCA 


GCCGCTTTTG 


TCATTGGTTT 


GTTGGCCTAT 


TTGGCTGTGC 


1920 


ATTTTCACTT 


AGCGGTCAAC 


ATCGATAATA 


TCATTATTGG 


TGCGGTAATG 


CCGTTAGTTC 


1980 


CAGGTGTGGC 


GATTACGAAC 


TCTTTTCGCG 


ACATTTTAGC 


GGGCCACCTC 


ATCAGCGGGA 


2040 


CGGCGCGAGG 


GACAGAAGCT 


ATTTTTATTG 


CTGGGTCTGT 


TGGCCTGGGA 


ATTGCCTTGA 


2100 


TTTTTAAATT 


ATTTATGTAA 


GGAGTGTTCA 


AATGGAACTA 


ATTTTTCATC 


TTCTCTTTAG 


2160 


TTTTTTAGCG 


ACTGTGACGT 


TTGGTATTAT 


TACGAACATT 


CCTCGTAAAG 


CACTGGTGGC 


2220 


TTGTGGCATT 


ACTGGCATGA 


CAGGCTGGAT 


GATTTATTAT 


GTCTTAACGC 


AAACTTTTGA 


2280 


TGCTAGTCAA ACCTTTGCAA 


ACTTTTTAGG 


AACGGTCGGG 


TTAGGAATCG 


CTAGTATCTT 


2340 


TTTTTCCCGT 


TACAAAAAAA 


TGCCGATGAT 


TATTTTTTAT 


ATTCCGAGTT 


TGGTTCCTTT 


2400 


AGTGCCAGGA GGCCCTGCCT 


ATCAAGCAGT 


TCGGAGCATT 


TTGTTAGGAA ATATTGATGA 


2460 


CGGTTTGCAG 


TTGATTTTAA 


AAGTTGTCTT 


TACTGCTGCA 


GCCATCGCAG 


CGGGCTTTAT 


2520 


GGTGACCTCG' 


CTACTGGAAC 


GAATTGTCAA 


ACGTTTTTTC 


CCCAAGCGAC 


TCTTCCGATT 


2580 


GGTCAAAAAA TAAAAAAACA 


AGCTCATTTT 


CTGAGCTTGT 


TTTTTTATTC 


TACTTCTTCA 


2640 


AAGAACAGCA ACAAATCTTG 


TAGCGTTAAT 


TGTGCTTTTT 


CAGCAGCGTT 


TAAATCTTGA 


2700 
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ACAATTTTAC 


CTTTTTGTAA 


AACAATTAAA 


CGATTGCCAT 


ATQGTAAAGC 


ATCTTCCATT 


2760 


CGATGGGTAA 


TCATTAAACA 


AGTTAAGTTC 


CCTTCTTCAA 


TCCGTTGTTC 


CGTTAAGTGC 


2820 


ATCAACTGCT 


TGGATGTTTT 


GGGATCTAAG 


GCAGCAGTAT 


GCTCATCTAA 


AAGTAATAAT 


2880 


TCTGGTTTTG 


TGATAGTGGC 


CATTAATAAA 


CTTAAGGCTT 


GGCGTTGACC 


GCCAGAAAGG 


2940 


TTACCTGTCG 


GTGTATCCAG 


ATGATTTTCC 


AAGCCATTGC 


CAACTTCTTG 


GCAGATTTTT 


3000 


GTAAACAAGG 


CGCGTTGCTC 


ATTTATCTTC 


CGTAATGTTA 


AGGGACGTTT 


TTGTCCGCGT 


3060 


TTCATGGCAA 


GCAATAAT^TT 


TTCTGCCACA 


GTCATTCTTG 


GTGCTGTCCC 


CATTTTAGGA 


3120 


TCTTGAAAAA 


CACGCGCTAG 


AGATTTGGCC 


CGTTTGACTT 


CAGAAAGATT 


GGTGCGATTG 


3180 


ACTCCTTCAA 


TTAAAAATTG 


TCCTTCTGTT 


AGTGTCAAAG 


TTCCTGCTAA 


ACTATTGAAT 


3240 


AGCGTACTCT 


TGCCTGCGCC 


ATTTCCACCT 


AAAACAGTAA 


TGAAGTCCCC 


AGGATAAATG 


3300 


GCTAAGTTCA 


CATGATCCAA 


AATCACTTTT 


TTTTCATTCA 


TCCCATTATC 


AATGATTTTT 


3360 


GTCGCATTTT 


TTAATTCAAC 


AACTGGTTTC 


ATTTCGTTTC 


CCCCTTATCA 


GAAAAGCCGT 


3420 


GTAGTCGTAA 


TGCTTTTTTC 


AATTGTGGGA 


TCATTAAACA 


GATTGCTAAA 


ATCACTGCTG 


3480 


AGAAGATTTT 


CAAATACGTC 


GTATCAAAGC 


CTAATTTAAT 


CACTAATAAA 


ATCAATAGTT 


3540 


GATAAATAAT 


ACTCCCTACA 


ACAATCGCCA 


TCAGCCGTTC 


ACCAAACGTT 


AATTCTCCAA 


3600 


ACAGTACTTC 


ACCAATAAtG 


AGTGAAGCCA 


AGCCAATCAC 


AATGACACCG 


GTTCCTTTAC 


3660 


TTACATCCGC 


ATATCCATCA 


tTTTGGGCAA 


TCAAAGCGCC 


aGATaAGCCA 


TCAAGCCATT 


3720 


CGaTaAAACC 


AaTCCTAAAA 


TTTTCATGCG 


ATCCGTTTGA 


ATGCCAATCG 


AGCGAGCCAT 


3780 


TTCTTCATTG 


TCACCTGTCG 


CAATATAGGC 


TTGTCCTAAA 


CTAGTGTTAA 


AGAAAAATAA 


3840 


CAGTAAAATC 


ATTACCACAG 


CCAAAACACT 


AATCCCTAAA 


AAAATAGTAT 


CATAATAATC 


3900 


GGGTAAGGCC 


ATTTTTTGAA 


AAACATCTTG 


AATCTTCGGT 


TGGTTTAATA 


AAGACTTATT 


3960 


TGGCGACTTC 


ATCACAAATA 


AAATCACGGA 


ATTTAAACCA 


GACATCACTA 


AAATTCCTGC 


4020 


TAAAATAATT 


GGAATTTTTC 


CTTTTGTAAA 


CAGTAAACcA 


GTCACTAAAC 


CGGCAAGCAT 


4080 


TCCTGCTCCG 


ACCCCTAAAA 


GAGTCGCTAT 


AAATGGTGAA 


ACCCCTGTTG 


TAATGGCAGT 


4140 


TACACAAACC 


GCCCCCCCTA 


ACGGAAATGA 


ACCTTCTGTG 


GTCATATCTG 


GAAAATTTAA 


4200 


AATACGATAA 


GTCATAAAAA 


TACCTAGTCC 


TAACAAGGCC 


CATAACATTC 


CTTGTCCAAT 


4260 


TGCTGAAACA 


ATCATCGCTT 


CTCCTACTTT 


CTTTGATAAT 


TAGTTAATCA 


ATTTCGATTT 


4320 


TTTTGCTAAC 


TCTGCGGGAA 


TCGTAATTCC 


TAACTTTTGA 


GCTTGTTTTT 


CATTGATAAT 


4380 


1 (3 1 U 1 UviCC 1 


L»T AT TAAAGC* 


TATAAATGGG 


TGTAGTTGCT 


GGTTGACl r 1 


1 rUCACsACAb 


A A A f\ 


AACCTCTGCT 


GCCATTTTGC 


CGGCTTGTAC 


CCCCAACGCA 


TGTTGATTAA 


TCCCTACAGT 


4500 


TGCTAAACCG 


CCTTGTTCTA 


CCATTGTATC 


CACCGAAGGA 


ATAATCGGCG 


TCTTGGTTCG 


4560 


ATTTGCTTCT 


CCGACTACCG 


TTTGCATGGC 


ATTAGCAATC 


GTATTATCAG 


TTGGTATATA 


4620 


AATCACATCT 


GTTTCCTTAG 


TCATCACTTG 


AACCGTTTGT 


GCAATTTCAT 


TACTTGATGG 


4680 
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pa f Af^r* AT AT 


fSTT TTP APTR 


XWlAXw^X 1 X 


GGCTTCAGCC 


GCTTTCTGCG 


C«T X UAGCTAC 


4740 


TTfZftTATPCA 


f^AATTTTPTT 
VmvIX X X Xlf X X 


\* X uAuuAnviV.* 


ATAGAGAATA CCTATTITTT 


TbGCAGCTGG 


il 0 A A 

4800 


TaTPAaTTPT 


TTTACTAAAT 
X X XM^XMnAX 


V« W\ i L> X laisuU 


TGACAAAGGA GATTGATCAC 


TAACACCCGT 


4860 


^ n 1* ji T T n ^ 


LrL> i \7V9l,« X 1 1 1 




CACCAATCCT 


GCACTAACAG 


GGTCCGTAAT 


4920 


GGCoCL'TAAA 


AxGAx IkTOiiA 


TTTCTTGGGT 


AGTGTTGGCT 


AAcGCTTGCG 


CAcTGGCGTG 


4980 


GCGATGCCAA 


rprn Ik 7k 7k Tk^^vpA* 

xTAAAACGTL 


TGCCTTTTTT 


TGTACTAATT 


GTTGACTCAT 


GATTGCTAAC 


5040 


TTaCTCTGGT 


CGGCTTGACC 


ATTTTGAAAA 


ATAATTTCTA 


AGTTTTTGCC * 


TTTGTCATAG 


5100 


CCATTTTCCT 


TTAATCCTTC 


TTCGACACCT 


TTATAAATTT 


CGTCCAATGC 


GGGGTGACTG 


5160 


ACAAaCTGCA 


7\AACCCCTAC 


AGTAACTGTT 


TCTTTCTTTG 


AATTCGTrAC 


TTTTTGATTC 


5220 


ACAAAAAAAG 


CACCGATTAA 


ATAAATAGCT 


AATAAAGACA 


CGACAAACGT 


TAAACGTTTA 


5280 


T T T TTC ATTT 


TGCATTCTCC 


TAACTGCCCT 


TTTGTTTGGC 


AACTTTTTCT 


TTTTAATAAT 


5340 


aaaaccatta 


TCTATAGGGC 


GTTTCTTCCG 


TTGAAAGTGA 


AACTGAGATT 


CGTAATCCTC 


5400 


TGCT tagtca 


AAATCCAAGG 


AACACTTTTT 


TACAAATGTC 


AGGATTTAGC 


TGCGACGAGG 


5460 


ATTGCAACTT 


TGGTGAGAAT 


CGCAGGAGTA 


CTTTTTCTAC 


GAATGGGACA 


ACGTTCTCTT 


5520 


CGTTCACCTT 


GTACTACTCA 


TCACTCTTGA 


AGGCCAAACT 


GCGGCGAGTA 


TACGCAGCCT 


5580 


GCATCTACTC 


TGTCnCTTCG 


TGCCAATCGT 


AGTGCTTGTC 


AGGATTTAAC 


TGCGACGAGG 


5640 


ATCGCAACTT 


TGGTGAGAAT 


CGCAGGAGCA 


CTTTTCTACG AATGGGACAA 


CGTTCTCTTC 


5700 


GCTCACCTTG 


TACTACTCAT 


CATCTACTCT 


GTCACTTCGT 


GCCAATCGTA 


GTGATTCGTA 


5760 


GCAAAAAAAC 


AACGACTTAT 


CCTAAGTGTT 


TAGGAGTGGT 


CGTTGTTTTT 


ATGCAAAACA 


5820 


7k 71 71 7k 7k 0 7k 

AAAACCACAT 


7k ^7k fik^^^m^^ 

AGATGGGTCC 


TCTATGTGGT 


CGTTTTTTCT 


TCTTGAATAT 


AACCAGCATA 


5880 


GATACACTTT 


TTAGAATAAG 


TCTTCTAAAG 


TTGTATCTAT 


GAAAGAATCA 


TACATACAAC 


5940 


TATCTACGCC 


AGTTTTGCCA 


TGATTGGTGC 


ATTTCTTTTA 


CGTTAGATAA 


TTGTGTCATG 


6000 


TATTTTCCCT 


CCTAGTTATG 


TTGTTAAAGG 


TAGTCTACAC 


TATATTGATT 


TTGTTCGTCA 


6060 


AGAAAAAGTT 


TTTATTTTCT 


TGCATTTCTT 


TTACTTCTAA AAAACGAGCC 


ACTCTTATGA 


6120 


GAGGCTCGTT 


TTTCTTTATT 


1 UAlaUA X ATT 


TACCACTTAA 


TTGTTTCGGT 


TTCTTTCAAT 


6180 


TTTCACGGGA ACTTTAACTG 


cx xlaX rx X xc? 


CGATTGAATC 


AGCCATAAAC 


TntT> 7k on 7k 7k Ik 7k 

TTACTAAAAT 


oZ40 


CGTCATTGAA ATTACTAATG 




TCCCCTCTTT 


CTAATTCCTT 


AAAACT 1 rCx 


conn 


ACAATCTAAG 


TTTAACATAG 


^^•f*7k^*^*W7ink*pik 

\«Xiv.90XAX XA 


TTCACCAGTT 


TCTGGTACTG 


TATuAbTud 


OJOU 


TTCTGTCGTC 


GTTTCAGTAG 


ATTGAGCAAC 


GCCAGTCGTG 


CTGGTCGTCG 


TCTGTGTTTC 


6420 


CGCGGTCTTT 


6CTGCTTCTT 


TTTCAGCTTC 


TTGCGCCGCT 


TTTTGCTGGT 


CTTTCAAAAC 


6480 


GCCAGCCATT 


GGTAATCGAT 


TTAATTCTGG 


TATTAATTGT GTTTTTAAAT 


CCATAGATTT 


6540 


TTCGACAACC 


GTCCGTGTCT 


TTTCTTTGCC 


GTTTTCAAAG AAAACAAGCG 


TAATCTCATC 


6600 


GCCGCTAACA 


TTGTATTGCT 


CTTGTGCTTT 


GCCAGCATTA TAAATTAACG 


GATAGATATA 


6660 
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TrSAAATTPAT 


•PP a •PI* T'P A TP 
1 UAX X X X AXV« 


TP A ATT A A AP 
XuAAX XAAALr 


A PPTTP A PP a 
X X UAUUA 


TTT^a fP^^fP9k 

X TTGATCGTA 


d720 




laAXoisX i XAu 


pa A AP ATP AP 


TP A P A T^^PT 


X l3 X X X U xT 


GGATTTTCTT 


6780 




1 iV9iAAii./XA 


AGAGACTGAC^ 


TTT^TPTPaT 


TT^ aT^^r^fP^ 

X XGATGGGTC 


CTGCGACTAA 


6840 


CGATTuAAC 1 


xbAbUAbCTC 


GAAT AT TTTT 


TTr*0 a TO/'TT 

i xbUAiCGx 1 


^f^lK li.T^'Tk M 9V Ik 9k 

GGAATAAAAA 


TGATTGAGCC 


6900 


AATTATTAAT 


Ik ^'M iiik Ik m^^m ik 

ACTAATGCTA 


^ik Ik mik ^^^v/'»tit 

CAATAGCGGT 


Ik 9k fP^Tk ^PVPli Ik O 

AATCATTAAC 


AGTTTTCTTT 


GTCTTAATTC 


6960 


ATAAAGTGCC 


TCTTTTAGTT 


CTTCCATAAA 


ATTATTCAAC 


TTCTAATCGC 


TCCTCGCTTT 


7020 


TAGTTCTTAT 


CTATTATACT 


AATCATTTTC 


TTTCTTGCCT 


ACCCTTGCGC 


TTTAATTACT 


7080 


AAAAAAACCA 


GCAAACTTCC 


CCAGCTTTCT 


AAGGAAAACC 


CAACTCCTTT 


TTAACAAGCT 


7140 


TCTTTCTTAA 


ATTCAAAAAC 


TCCGCTATAA 


TAAAGCTATT 


AGTCGGAAAG 


AAACGAGGGA 


7200 


TTTATTATGT 


CATTTGaTGG 


CGTATTTACT 


CATAGCATGG 


tTCATGAGTT 


GaCAGAAACA 


7260 


CTCGTAAGTG 


GCCGGATTTC 


AAAAATCCAT 


CAACCTTATG 


AAAACGAAGt 


CGTTTTAGTC 


7320 


ATTCGTGCGA 


AAGGTAAAAA 


CCATAAATTA 


TTATTATCAG 


CACATCCTAG 


TTATGCGCGC 


7380 


ATACAACTAT 


CTACAATTAC 


CTATAGCAAT 


CCGGAAACAC 


CTCCTAATTT 


TGTGATGATG 


7440 


TTACGAAAAT 


TTCTAGACGG 


TGCCATTTTA 


GAAACGATTC 


AACAAATAGA 


AAATGATCGC 


7500 


GTGATTCACT 


TCACCTTTTC 


AAAAAGAGAT 


GAACTAGGTG 


ACTTACAAAA 


TATTGTTTTG 


7560 


ATTGTTGAAT 


TAATGGGACG 


TCACAGTACT 


ATCGTTTTAG 


TTAACCAATC 


TTCTGGAAAA 


7620 


ATCTTAGATG 


CGATAAAACA 


CATTGGAATG 


AGCCAAAATA 


GTTATCGTTC 


GTTGCTTCCT 


7680 


GGAGCAACTT 


ATATTGAACC 


GCCAAAACCA 


ATGGGACTTA 


ATCCGCTCAC 


GGCCTCAAAA 


7740 


GAAGAAGTCT 


TTGCGCTTCT 


TTCTACAGCT 


CCTGAATTAA 


ACGGACGCTA 


TTTACAACAA 


7800 


CATTTCCAAG 


GTCTAGGCAA 


AGATACAGCA 


GACGAATTAA 


GTGCCCGTTT 


ACTTGCACAA 


7860 


CCAAATGAAA 


AAATGGCCAT 


CTGGACGGAA 


TTTTGGTCCT 


CAGTCACTGA 


AGCTGTTGTA 


7920 


CCGACATTAA 


CAGTTACTGA 


AAAAAAAGAA 


TATTTTACGC 


CCATTCCTTA 


TCAATCATTA 


7980 


GTTGGTGAAC 


AAACAACGTA 


TGCGACACTC 


AGTGAATTAC 


TAGATGCCTT 


TTATTCTGGT 


8040 


A/uUaCAGAixA 


AAGACCGTGT 


CAAACAACAA 


^^^^^^^^^^^^^^^ ^ 

GGCGGCGAAC 


TAATTCGCAA 


AATTGAAAAT 


8100 


(3AAUIAAAAA 


'n K n T*/^ 9k A Ik Ik 

CaAAATCAAAA 


GAAACTTAAA 


AAATTACAGC 


AAACCTTAGC 


CGATACAGAA 


8160 




jk rprpik r^r^r^^f^f* 

Al lACCGTCG 


f^r* Ik ^»^^1k Ik * 

CGACGGAGAA 


CTATTAACAA 


CCTTCATGGC 


ACAAGTCCCA 


8220 




^M.k^l^Al:>l7 1 GGA 


a'PT*7^^^Yk TV )k rwi 

ATTACCAAAT 


1* 9k IVT* 9k fT*r* Ik TV ^ 

TATTATGAAG 


a 7k 9v a T^/^ a f>t^ 
AAAATGCACC 


GTTGCGTATC 


o o o rk 


1 X MAnX ^ 




Ik ^/■^rn^ 3k li^Yk 9k 

ACCTAATCAA 


Jk 9k(I*^^%^9k Ik 9k 

AATGCACAAA 


9k 9k ni9k n'ii'm'HHi/^9k 

AATATTTTCA 


Ik 9k 9k 9k mik ^V^9k 9k 

AAAAT ACCAA . 


8340 


AAATTGAAAA 


ATGCTGTCCG 


AGTTGTTAAA 


ACACAAATTC 


AGCAAACACA 


ACAAGAAATT 


8400 


TCTTATTTAG 


AGTCAGTTGT 


GGCACAACTT 


GAAATTGCGA 


CACCAATGGA 


CATTGAAGTG 


8460 


ATTCGTGAAG 


AACTGATTGA 


ACAAGGCTAT 


TTAAAACGTA 


AAAAAAATAA 


AAAACAAAAA 


8520 


CAGCCGAAAA 


AAAGCCAACC 


CGATCTTTTT 


TATGCCACAG 


ATGGCACACC 


TATTCTCGTT 


8580 


GGAAAAAACA 


ATTTGCAAAA 


TGATCAATTG 


ACTTTACGAA 


CCGCTAAAAA 


AACTGATTAT 


8640 
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TGGCTCCATG CCAAAGATAT TCCTGGATCT CATGTAATTA TTGGTGACGC ACATCCTAGC 8700 

G/^GAAACCC TTACTGAAGC CGCACTTCTA GCAGCTTATT TTTCAAAATA TCGACTTTCT 8760 

TCACAAGTGC CTGTTGACTA TGTCCAAGTA AAACATGTGC ATAAACCCAA TGGAGCAAAA 8820 

CCAGGTTATG TGATTTATGA AAACCAACGA ACGTTATATG TGACGCCAAC AGAAGAATCG 8880 

ATCAAAAAAA TCCAGcAAAA TAAGGCTTCT GCTTCCTAAT TGTATTAAAC TAAAAAACAA 8940 

CGTACTGcCG ATTGATCGTA CGTTGTTTTT TTGCATTGGC AAAGACATTT ATCAAGTGAT 9000 

AAATAACACA AGTAAGCTCA ATTAGTTTAC AAACTAGCTA TTTCTCtCTA TACTGACTGT 9060 

TGGAGGTGAA ATTCATGGTC TTTTTACACA TTATTTTTGG GATGTTTGCC TTTATTGGCG 9120 

CTTTAGGTTT AGGTGTCGCC TTCCGCGATA GCTTTACAGA ACTTTCAACA CC 9172 



(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 788 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 



ATTATTGATC 


GAGAAAATGA 


AATTCGAGAA 


TTTAAACCTG 


AAGAATATTG 


GAGCATTGAT 


60 


GGGAATTTCC 


AAAAAGGTCG 


CAAGAAATTC 


AAAGCAAATT 


TCTGGGGCGT 


AGATGGTAAA 


120 


AAGAAAAAAT 


TACCAAATGC 


AGAAGCGGTG 


AAAGAAATCA 


CTTCTCGGAT 


TGACGGTAAA 


180 


GATTACGATG 


TTACAAAAGT 


TGAGAAAAAA 


GAACGGAAAC 


GTAATCCTGC 


ATTACCTTTT 


240 


ACAACCAGTA 


GTTTACAACA 


AGAAGCGGCA 


CGTAAATTAA 


ATTTCCGTAC 


CCGTAAAACC 


300 


ATGATGGTTG 


CGCAACAATT 


GTATGAGGGA 


ATTGCTTTAG 


GGA/UVCAAGG 


CACTGTGGGG 


360 


CTGATTACCT 


ACATGCGTAC 


AGACTCAACC 


AGAATTGCTG 


"ATTCCGCCAA 


AGCAGAAGCT 


420 


GCTGAATTCA 


TTGAAAAGAC 


CTACGGAGAT 


GAGTTTTCAG 


CACATGGCGG 


CCGCAAAGCA 


480 


AAAAATACGC 


AAGGTGCACA 


AGATGCCCAT 


GAAGCGGTTC 


GTCCGTCAAG 


CGTTTTAAGA 


540 


ACACCTGATG 


AAATGAAGAA 


GTATTTAGAT 


AAAGATCAAT 


TAAAATTATA 


TACTCTTATT 


600 


TGGTCGCGCT 


TTGTTGCTAG 


TCAAATGACA 


CCAGCGATTT 


TAGATACCAT 


GAaGGTCACG 


660 


CTTCAACAAA 


ACGGCGTAAC 


GTTTATTGCC 


AATGGCTCCA 


AAGTGAAATT 


TAAAGGGTTT 


720 


ATGCAAGTAT 


ATGTTGAAGG 


ACGCGATGAT 


GGTAAAGAnG 


ACAAGGAAAA 


ATATTTTACC 


780 


TGAAnTAG 












788 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4313 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 67: 



TTATGATTAA 


GGCAACAACG 


CCCCAAGCAC 


GATATGGAAA 


TCCCACGCCA 


CGTGTAGCTG 


60 


AAACACCTAG 


TGGTATGCTA AATGCCaTTG 


GATTGCAGAA 


TCCAGGTTTA 


GAAGTGGTTA 


120 


TGCAAGAAAA ATTACCGAAA 


TTAGAAAAAT 


ACCCTAACTT 


ACCGATTATC 


GCAAATGTTG 


180 


CAGGTGCTTG 


CGAAGAGGAC 


TATGTGGCTG 


TTTGCGCAAA 


AATTGGGCAA 


GCGCCTAACG 


240 


TCAAAGCAAT 


TGAACTGAAT 


ATTTCCTGTC 


CCAATGTGAA ACATGGCGGC 


ATAGCTTTTG 


300 


GTACAGATCC AGAGGTTGCT 


TTTCAATTAA 


CACAAGCAGT 


CAAAAAAGTC 


GCCTCTGTTC 


360 


PGATTTATGT 


TAAACTATPA 

XnnrW^ XtXX 


CCAAACGTAA 


CGGACATTGT 


GCCGATCGCC 


CAAGCAATTG 


420 


AAGCAGGGGC 


CGCAGACGGT 


TTTTCTATCA 

X X X X ^ X n X 


TTAATACATT 


GCTAGGGATG 


CGCATTGATT 


480 


T AAA AArn Af; 


AAAACCTATT 


TTAGCCAATr 


AAACTGGCGG ACTATCTgGC 


CCAGCCATTA 


540 




TATTCGCTTA 


ATrrGAGZUXG 


TCGCCAGTGT 


TTCTCAGTTG 


CCAATTATTG 


600 


GAATGCGCGG 


TGTTCAAACA 


GTGGATGATG 


TCTTGGAAAT 


GTTCATGGCA 


GGTGCTAGCG 


660 


CTGTCGGTGT 


TGGTACGGCC 


AACTTTACAG 


ATCCATATAT 


TTGTCCGAAG 


TTGATTGACG 


720 


GATTACCTAA 


GCGAATGGAA 


GAGTTAGGTA 


TTGAATCGTT 


GGAACAATTA 


ATTAAAGAAG 


780 


TGAGAGAGGG 


GCAGCAAAAT 


GCACGATAGA 


CCAATTATTG 


CGTTAGATTT 


TCCCACGCAA 


840 


AAAGAAGTGG 


CTGTTTTTTT 


AGAAAAATTT 


CCTAAGGAAG 


AAGCCTTGTT 


TGTTAAAGTG 


900 


GGGATGGAAC 


TTTTTTACGC 


AGAAGGACCA 


GCGATTGTTC 


GCTGGCTTAA 


AGAACAAGGA 


960 


CACGATGTTT 


TCTTGGATTT 


AAAATTACAT 


GATATTCCGA ATACTGTGGA 


AAAAGCGATG 


1020 


ACAAATTTAG 


CAAAATTAGG 


CGTTGCCATT 


ACTAATGTCC 


ATGCAGCAGG 


GGGCGTTC6A 


1080 


ATGATGCAAG 


CAGCTAAGGA AGGTCTAATC 


AAAGGGACTC 


AGCCAGGCGC 


TAAGGTTCCT 


1140 


GAGTTAATTG 


CTGTAACCCA 


GTTGACTTCT 


ACTAGTGAAG AAGAGATGCA 


CCAT6ATCAA 


1200 


TTAATCAACG 


TTCCTTTAGA 


GACAAGTGTC 


ATTCATTATG 


CTAAATGTGC 


GGAAAAAGCG 


1260 


GGTCTGGATG 


GTGTGGTTTG 


TTCAGCATTA 


GAAGCACGTG 


GTATTCAAGA 


AGCAACCAAG 


1320 


CAGACGTTTA 


TTTGCTTAAC 


ACCTGGTATT 


CGTCCAGCAG 


GAAGTGCAGT 


AGGTGACCAA 


1380 


CAACGGGTCG 


TTACGCCGCA 


ACATGCTCGC 


GAAATTGGGT 


CCACATATAT 


CGTTGTTGGC 


1440 


CGACCTATTA 


CCCAAGCAGA 


AAATCCTTAT 


GAAGCCTATC 


AAGAAATCAA 


AAAAGACTGG 


1500 


AGCGAAAAGT 


AACATGACAA 


AAGTAGCAAA 


AAAGATTGCC 


AAAGATTTAT 


TAGATATTGA 


1560 


AGCGGTATTT 


TTAAACCCTA 


ATGAGCCGTT 


TACCTGGGCA AGTGGCATAA 


AAAGCCCGAT 


1620 


TTATTGTGAT AACCGTATCA CGATGAGTTA 


TCCTGCTGTG 


CGTAAAGAAA 


TTGCAGAAGG 


1680 


CTTAGCGGCA AAAATTAAAG AGACTTTCCC 


AGAAGTCGAA 


GTGATTGCTG 


GGACAGCTAC 


1740 


CGCAGGGATT 


CCACATGCGG 


CATGGGTAGC 


CGATATTTTA 


GGATTACCAA 


TGGTTTATAT 


1800 


CCGCAGTAAA 


GCCAAAGATC 


ACGGCAAAGG 


GAATCAAATT 


GAAGGCCGAA 


TTTCTGAAGG 


1860 


ACAAAAAATG 


GTAGTTATCG 


AGGATTTAAT 


TTCAACAGGC 


GGTAGCGTGT 


TAGAAGCGGC 


1920 
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uAbCAACGGT 


TT lAGGCGTT 


GCGGCAATCT 


TCACTTATGA 


1 oon 
1980 


GTTACCCAAA 


GoAAWf&GLAA 


ATTTTGCAGA 


tftTk Tk 7k ^7k ^7k m/^ 

TAAACAGATG 


ACGT TACTAA 


OOWPn 7k/^/^7k 7k 

CGTTAACCAA 


2U40 


TTATTCAACT 


TT AAT TGATG 


CTGCATTAGA 


3k <*^9k 7k 7k mm 7k m 

AGCAAATTAT 


7km^^*7k 7k ^» 7k 7k 7k 

ATCGAAGAAA 


7k 7k /^7k m^mmik 

AAGATGTTAC 


2100 


CTTATTACAA 


GAATGGAAAA 


AAGATCCTGA 


AAaCTGGGGC 


7k 7k rvi Ik 7k ^t^^^m 

AAaTAACCGT 


TGAACTAAaG 


2160 


AAGGTTCTTG 


TCGTCTGATT 


TTTAGTGTCA 


TTTTCAGTAA 


CCAGATTTAA 


CGGTAGGAGA 


2220 


ACTTAGGTCA 


TTTGCCGAGG 


AGCGTAATGA 


TCTAAGTTCn 


TTTGTTTGAC 


GTAGAAAAGA 


2280 


TGTTAGGATA 


GAGTTTTAAC 


ATGAGTTTTA 


CCTGATTACT 


ATGTGGAAAT 


GrAGGaAGAG 


2340 


TGTATGAAAA 


AAATAAAAAA 


TATGGACGTk 


GAATGGAGCT 


ATACAGGCCA 


ACGATGGACC 


2400 


CAGAACATTG 


GCATACGCTA 


TGTGACTGGT 


TTGCAGAAGG 


TGCTAAGTTT 


GCTTATCAAT 


2460 


CGCCAaTTGC 


TTTAGAAAAA 


GAAaGTGCTG 


AGACAGTAAA 


TTCTCAGATT 


ACGTTTCATT 


2520 


ATAAAAAAGA 


AGAATTCACT 


GAAAAAGAAT 


TCAAAAATAC 


GTTTCATTTT 


GTACCACCAA 


2580 


ATACGGA7VAG 


TTACGTAATG 


TTTGAAAATG 


TTGCGTATCA 


TTTAACCGAT 


ATCCATTTTC 


2640 


ATATGCCTAG 


CGAACATTTG 


CTATCGGGAA 


AGCAGTATCC 


GCTAGAATTC 


CATTTGGTTC 


2700 


atatgAacga 


TGCTGGCGAA 


AATTTAGTTG 


TCGGTTGCCT 


GTTTACCATC 


ACA6AGGAAG 


2760 


AAAATCGTTT 


TTCAGAAGCG 


AATCATCCGA 


TGGACTGGGA 


AAACGGCACG 


CACCAACAAT 


2820 


ggtttaatcc 


GTCAATCTTT 


TTACCAGAGG 


AGCGTCTACA 


CTACCATTAT 


GTAGGCTCGT 


2880 


TAACAACACC 


GCCAACTAAG 


GGTCCAGTGA 


AATGGTTTGT 


ATTTGATACC 


ATTCAAAAAA 


2940 


tggatcaagc 


CTTTTTAAAT 


AAGATTAAGG 


AAGGTATGTT 


GGCCTTTAAC 


AATCGGCCCC 


3000 


ttcagccgtt 


GAATGGGCGT 


AAGATTTATT 


TTTCAAATGA 


TTAGTGTGTG 


GAGGAAAAAG 


3060 


ggttgaatat 


TCAACAAATG 


AAGTATGTTG 


CTGCCATAGC 


TAACAATGGG 


AGTTTTCGGG 


3120 


AAGCAGCCAA 


AAAACTATTT 


ATTACACAAC 


CTAGCTTGTC 


AAATAGTATT 


CGAGAATTAG 


3180 


AAGAAGAACT 


AGGGATTTCC 


CTCTTTTTAC 


GTACCAATAA 


GGGCGCTTTT 


TTAACAGAAG 


3240 


AAGGAATGGT 


TTTTTTAGAA 


CAAGCAGAGA 


AAGTTTTAGT 


TCAAATGGAG 


TTATTGGAAA 


3300 


ATCGCTATCG 


CGAGACAGTG 


ACCAGCGAAC 


GATTCTCCAT 


CTCTTCGCAA 


CATTACGATT 


3360 


T TT T AGGTGA 


GGTAATTGCC 


AAAGTTCTTA 


AAAAGTATGG 


GGATCAATAT 


AAAGATTTTC 


3420 


GTGTCTTTGA 


AACCACGACC 


mmvk M Ik ik/^mik ik 

TTAAAAGTAA 


TTGAAGATGT 


CAAAGGATTT 


CACAGTGAGC 


3480 




1 TATTTAAAT 


7k Ik /^fk Ik 7k 7k m 7k 

GAACAAAATA 


GTGTCAGTAT 


m^Tk 7k f^^^ft* 7k rn 

TGAACGGTAT 


TTAGAACTVAG 


3540 




u i ACCaAGGTA 


ATTAGTACTT 


• luitTk 7k rrt 7\ ^7k ^7k 

TTAATACACA 


CATTTTTTXA 


7k 7k 7k m^Tk 

GGAAATCATC 


3600 


ATCCTTTAGC 


CAAGCAAAAA 


GAAATTCATT 


TAGAGGAATT 


AGTTCCCTAT 


CCACAAGTTC 


3660 


GCTTTAATCA 


AGAAGGAAGC 


AATTTTTCCT 


ATTTTTCTGA 


AGATTTAGTT 


GAAATTCCTG 


3720 


AACAGGAATC 


GGTCATTCAT 


ACTACGGATC 


GTGGAACCTT 


GATGAATTTG 


TTGGTAGAAA 


3780 


CCAATGCTTA 


TGCTTCTGGT 


TCAGGAGTCG 


TAACGGGTTT 


TACGAAGAAA 


GAAATTCGGT 


3840 


TAGTGCCATT 


GGCGCCAcTT 


TGGAAAATCG 


TATTTGTTTA 


CTCTTTCCCA 


AAAATCGGGA 


3900 
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AATCAGTCCA ATTGGTCGCT ACTTTATCAA AGAnTTAAAA GcGTTGTTtA AAAAAgAGGT 3960 

AGATACCAAA ACATTTmGCa CTAAGGaTaA AAAGAAGAGg TCGGGACAGA. AGTGTTTAAC 4020 

TCCGAGAAAT AAGAAGGAAT TTCCGAAAAT TGTTCTTTAA TTTTCGGAGA ATTTCGGCTT 4080 

ATTTCCGAAG GATTTGCTTC TGTTCCCGCC GTTTATTAGT TTTTGAGCGT GGAGCAAAAA 4140 

TCCAAAGTGA TTTTTGTCCC ACGCTCTTCT TTTTTGGCTC ATTTTGAAAC AGTAGAAATG 4200 

GTTkCTCTTT CAATTAATGT GACAGGCACT TGAATGTCTT CTATGATTTG AAGGTCTTGT 4260 

TGCATCGTTT GTAATAACAC TTCACCAGCT AGTTTTCCCA TTTCTTTAAA GTC 4313 



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8921 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 



• ATTnTTnCAT 


GCGATAAATG 


GTATTCCTGG 


TTTATCAGCC 


GTGAAGCCTA 


AAGCAGCTTT 


60 


CTATATTTTT 


CCGAAATTGA 


TACGGCCAAA 


TTTGACTTTT 


ATGATGATGA 


AAAATTTGTT 


120 


TTaGATTTCT 


TACACAAACA 


TCATATTTTA 


TTAGTCCATG 


GTGGTGGGTT 


CAACTGGCAA 


180 


CAACCCGATC 


ACTTCCGTAT 


TGTTTATTTA 


CCGAAGATGG 


AAGACTTAAA 


AACTACAGCA 


240 


GATAAAATGC 


GTGAATTTTT 


AAGTACCTAC 


AAACAAAAAT 


AAAAGAACTA 


CCAAAAAATA 


300 


GCCTCTAACG 


AAGCTATTTT 


TTGGTAGTTC 


TTGTTTTATC 


AACAATACTA 


TTCTGCGGTT 


360 


TCACATGCTT 


TTTTACTAAT 


AATCATTCGG 


TTGGTATATT 


TTTCTTCCAT 


ATCAGGACTA 


420 


GGATTTGCTA 


CTACGATGAG 


ATAAGAATTT 


TCATACTCCT 


TCTCCACAAA 


GCCTGAATAA 


480 


TTTGTTTTTT 


CCCAAATGAA 


GTGAACCTTT 


TTCTCTTCGG 


CCATACAATA 


ACACTCCTTT 


540 


TTAATAAAAA 


TGAACCTATT 


TATTTTCTAA 


GAAAACCTTA 


GATAATGTTC 


TTGCAGTATA 


600 


TTATGCAATA 


AAGTACAATA 


AAAGCATAAT 


GCGAATTCGC 


ATATTTTGAC 


AAAATCCCCA 


660 


TTTTGCAAGG 


AAAATG6CCG 


ATATAAATAG 


GAGCTATTGT 


GATGAATTCT 


TTAGGTGCAG 


720 


TTATTAAAGA 


AATTAGAAAA 


AATAGAAAAT 


TGACACAGAA 


AATGTTATCA 


GAAGACATCT 


780 


GTTCGCAAAG 


CGTTTTAAGT 


CGCATTGAAA 


ACAATGAAGA 


GTTGCCAAAC 


GTTCTCGTAA 


840 


TGCAACAACT 


TTGTGATCGT 


TTAGGTGTGA 


CTGTTGATCA 


GATTATGCGT 


TATAAATCTG 


900 


GTGATGTGCA 


CGTTGTTACG 


TATTCATTTG 


AAAAAATGGC 


TGAATATTTT 


CGCCATAAAA 


960 


AATATCAATT 


ATTATTAAAT 


TATTTAAAGG 


AAAATCGTAT 


TGAAGAACAA 


TTGTATTTAG 


1020 


ATACGGACTG 


GCAAAAATAT 


TATTATTATT 


TAGGtAGTTG 


TGAATTGTTT 


GTTCTAAATG 


1080 


ATTATGAAAA 


AGCTATTGCT 


AGCTTGCGTA 


AAGGGTTATC 


TTTCACCTAT 


AAAGCCGATA 


1140 


AGTTAAATGT 


TTCTGATTTA 


GAGATTCAAT 


TAATTAGTTG 


TCTAGGTGCC 


ACTTATGGTT 


1200 


ACATGGGCAA 


TCGAGTAGAA 


GCAGAACGAT 


TTTTAAGTTT 


GAGTATTCAT 


TATTTTAATC 


1260 
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AATTGCCAAA 


TGAACGCAGT 


AATGCAGAGT 


TAACGAAAAT 


ATTTTTTAAT 


TATGCTGATT 


1320 


TCTTATTTAA 


AAATTATACA 


GAAAAAGATG 


CCGAAATTTA 


TGTGGATCAA 


GGAATTACTT 


1380 


GGGCCCGCAA 


AAAAAATAGT 


TATTATTATT 


TAAGTGAATT 


ATTAAATCTA 


AAATATTTAG 


1440 


TTTTAATGAG 


A7VAGAATAAA 


CACGAAGAAG 


CAGAACGTTA 


TCTGAACTTA 


GCACAACAAA 


1500 


TGAAAAATGT 


GGAAAGTGGC 


AATCTTTAAA 


AAATCCTCAG 


AATTGCTTAA 


AAGGTCTACA 


1560 


CCATTTCTTG 


ACTTTTGTAC 


TGATTAAGAG 


TACTTTTGAA 


GTAAGTAAAA 


TAAAGAAAGA 


1620 


AGGATGAAGA 


TGAAAACATT 


TATCTTTGCA 


GACAAGTTCT 


TTTTAAAAAG 


CGATGTTAAA 


1680 


GGACCTGGTT 


ATTTAGT^T 


CACGGATGGT 


ATCTTTGGTA 


ATTATACAAA 


AGATGAACCG 


1740 


CAAGGGgACG 


TAAAAATTAT 


TCGTGAGGAA 


GGCAAATGGA 


TTGCCCCAGG 


TTTAGTCGAT 


1800 


ACTCATATTG 


ATGGTTACAT 


GAATCATGAC 


GTAATGGATA 


ACGATGCAGA 


AGGCATTAAA 


1860 


GTGATGTCTG 


AAGGCTTATT 


ATCTTGTGGT 


GTAACTTCAT 


TTTTACCAAC 


AACGTTAACT 


1920 


TCAAGTAAAG 


AACGTTTAAC 


AGACGTTGCA 


CGTACGATTG 


GTCAAGTCTA 


CCAAGAAGTA 


1980 


CCTGGTGCAA 


AAATTCAAGG 


GATTTACTTT 


GAAGGACCTT 


TCTTTACTGA 


AGAACATAAA 


2040 


GGCGCACAAA 


ATCCAAGTTA 


TTTTGGAGAT 


CCAGACCTTG 


ATACATTCCA 


CGAATGGCAA 


2100 


GAAGCTTCAG 


GCGGAATTAT 


TAAGAAAATT 


GCGTTGGCAC 


CAGAACGTAA 


TGGT6TGAAA 


2160 


GAATTTGTTG 


AAACCGTTAC 


AGATGAAGGT 


GTCGTTGTTG 


CTTTAGGCCA 


TAGTAATGCG 


2220 


ACCTTAGAAG 


AAGCTGATGT 


GGCAGTTGAA 


GCAGGCGCTA 


gCGTTTTTGT 


TCATGCATAT 


2280 


AATGGGATGC 


GCGGCTTAAA 


TCACCGTGAA 


CCAGGGATGG 


TTGGTGCATT 


GTTAACGTTA 


2340 


CAACACGTTT 


TCTCTGAATT 


AATTTGTGAT 


GGACACCATG 


TACATCCGCA 


AGCGGCTGAA 


2400 


GTATTGATGG 


AAAAAGCGGG 


CCATGATCAC 


GTTGCGTTAA 


TTACAGACTG 


CATGATGGCT 


2460 


GGCGGTATGC 


CAGATGGCAA 


CTATAATTTA 


GGTGAATTTC 


CTGTTGTGGT 


AGCAGAAGGA 


2520 


ACGGCTCGTT 


TGGATACTGG 


GAACTTAGCA 


GGTAGTATTT 


TAAAACTAAA 


aGAaGCAaTT 


2580 


AAAAATGTAG 


TTGATTGGGG 


TATTGCAACA 


CCTGCACAAG 


CAATTATGAT 


GGCTTCATTA 


2640 


GTTCCtGCGA 


TTAGCTGTAA 


AATTGATGAT 


CAATGTGGCA 


TGATTGCAAA 


TGGTCGCGAT 


2700 


GCTGACTTTA 


TCGTATTAGA 


ACCAACCATG 


GAATTA6CAG 


CCACTTATTT 


AGAT6GCGTA 


2760 


GAACGTTATC 


GAGCATAAGA 


CACAAGAAAA 


AGCTTGCGGA 


AATTTATCCG 


CAAGCTTTTT 


2820 


TTATAATTCT 


TTTGGTAAAA 


TCGCAGAAGT 


GATTTCTTTG 


TTCTCGTAGG 


CAATAAAAAA 


2880 


ATCGATTAAT 


TGCTTGTCAA 


ATTGCTCAAT 


ATCCTCTTTG 


TAAAGGGAGC 


TAGGAATCAC 


2940 


AAAGGTTTGC 


TCTAGGGTGC 


CTGCTAGATA 


ATCGCCATCT 


AAACTTTGGG 


CGACAACTAA 


3000 


A/U^TTCTTGT 


TCGCGCCAAT 


GTTCCACATC 


TGTTTCAGAA 


ATGGTTTCTG 


GAGAGGCTAA 


3060 


AAATTTACAA 


TTTTCGACAA 


ACTGTTCATC 


AATTTCTTCA 


CTGAAAGTTA 


ATTCAGCATC 


3120 


CTCTTCTTGA 


GGTTGA6AGA 


GGATTGCTTG 


CAAAAATGTT 


TGTGGTTTTG 


CATCGGGTAA 


3180 


TGTTTCTTTT 


AGCTGTTCAA 


AATCCAATGA 


AAGTTCCTCC 


TTATTGTTTG 


TTTAAAATAG 


3240 



wo 98/50555 PCTAJS98/0898S 

515 



TTACTGAAAA ATCAATGGGT 


GAATTTGTTT 


TAAGGCTTTT 


GGCTTCTAAG 


ATGATTTTTT 


3300 


CAGCCAACAA ATCGTTTCCG 


TTATATACAG 


GTGTCACGCG 


ATAGCGAACG 


GTTTCGCCGC 


3360 


GATTAAGAGC TGCCCGAATC 


TGATTTTCAT 


ATTTCGTCAT 


ATAAGGAGTG 


TTGACAGGAT 


3420 


TCTGATACAA GGTGGTTAAG 


TTACGTGGAT 


CATCACCACT 


GCCTCCCATT 


TGACGGCCGA 


3480 


TTAGGTGGCC ACGCGAATGA 


TTAGCTTTTC 


CTGAAATGAA 


ACCAGCTGGT 


CGAATATCTT 


3540 


TATTAGCGGA AGTTCCTGTA 


TTGACCATTG 


CGGGTTTTAA 


TAGCGCATCA 


GCGCCTGTCG 


3600 


CTCGTTTTAG ACGATCTAGA 


GAATGATAAG 


TAATCCAGCC 


GTTTGAGTTA 


TCCTTTAATT 


3660 


CCTCTGCTGA AAATGTGGCT 


TTACCATTTT 


CGATGGGTnC 


GGGATTATCT 


CCTTTTGGTG 


3720 


CTGGTGTGCT TGAAaGGGgT 


GGTTGTTTGT 


GGTTGAACAT 


TAAATAAATC 


TTGTAAAGCA 


3780 


TCAGGAAtTTT TAACGCCAAC 


CGCGCCTAAA 


ATTAAAATGA 


TTAATGCGCC 


AATCAGCATG 


3840 


ATGGTTGAAT TAAATGGTGG 


TTTCTTCTTT 


CTGTTTGCCA 


TAACGTGCTC 


CTCTAATTTT 


3900 


ATACTTGCAA TAGTCCAAAT 


TTAAGTGGAC 


CATCTTTTCT 


ATTATACGTA 


AATTTGCTAA 


3960 


AAAATCAAAA AAGGCTTTTC 


TTTGAAAGAC 


TTGTTTTCTC 


AGATATGTTA 


CTATTTTAAT 


4020 


GTAGAAGGGa AgGGATAGTG 


GATGGCCTAT 


ATTGAAGTCA 


AAAATGAATA 


CAAGCGTTAC 


4080 


CAAATGGGAG AGACAACTAT 


TACTGCGAAT 


GACGGCATTT 


CTTTTGAAGT 


GGAAAAaGGA 


4140 


GAAGTAGCGG TTATTTTAGG 


CCCCAGCGGT 


GCtGGAAAAT 


CAACTGTTTT 


AAATATTCTT 


4200 


GGTGGAATGG ACAGCTGTGA 


TGAAGGGGAA 


ATTATTATTG 


ATGGCACAGA 


TATCGCTCAG 


4260 


TTTTCAGAGA AACAATTAAC 


CACTTATCGA 


CGAAATGATG 


TCGGGTTTGT 


TTTCCAATTT 


4320 


TACAATTTAG TCCCAAATTT 


AACGGCAAAA 


GAAAATGTCG 


AATTGGCTTC 


GCAAATCGTT 


4380 


GCGGATGCGT TGGATTCGAC 


CAATGTTTTA 


CAATCGGTGG 


GTTTAGGGGA 


ACGTTTAGAT 


4440 


AATTTTCCAG CGCAACTTTC 


TGGTGGTGAA 


CAACAACGAG 


TAACCATCGC 


CCGTGCGATT 


4500 


GCTAAAAAGC CCAA6TTACT 


TTTATGT6AT 


GAACCGACAG 


GAGCGATGGA 


TTATGAAACA 


4560 


GGGAAACAAA TTTTAACCAT 


TTTGCAAAAC 


ACTGCTCGTG 


AAACTGGAAC 


GACCGTCTTA 


4620 


ATTATTACCC ACAATTCAGC 


CATTGCAGAA 


ATGGCCGATC 


GGGTAATTCG 


AATTAATGAT 


4680 


GCGAAAGTTC GCGAAATGAC 


TGTGAATSVT 


CAGCCTAAAT 


TGGTTGCTGA 


AATTGAATGG 


4740 


TAGGTGCCTA AAATGAAGAA 


AACAGCGCTA 


ATTAAAACAA 


GCTTACGTGA 


AATTAGACAA 


4800 


TCTACTACAC GATTTTTATC 


GATTATGGGG 


ATTATTTTCC 


TTGGTGTAAT 


GGTCTTTGTT 


4860 


GGTTTGAAAG CTACTGGACC 


AGATATGATT 


AAAACCGCGA 


ATAATTATTA 


CCAGAAAGAG 


4 920 


CAGCTTCCAG ATGCGCGAAT 


AATTTCTACA 


ATGGGACTGG 


AAAAGAAAGA 


TTTAACAACA 


4980 


CTACAGTCTT TAAAAGATGT 


CGAAACAGTT 


GTGCCGCGGT 


ATACAATAGA 


TGCCACGATT 


5040 


GGGCCGCAAA ATAATGCAGT 


AAAATTGTTT 


GGCTATCGAA 


AAAATCAAGC 


AGGATCGGTT 


5100 


AACTATCAAG TAGTTGACGG 


TCGGTTACCT 


AAACAAACAA 


ATGAAATTGC 


ATTAGATACA 


5160 


TTAGCGAAAA CTAGATATGA 


CTATAAAATG 


GGTGATAAAA 


TCACTTTAAA 


CGATGCAGCG 


5220 
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ATTAAAG7VAA AAGGGCTGAA ACAAACTCAT TTTACTGTTG TTGGATTTAT CAATTCACCA 5280 

GAATATATTG ATAATACATC CAGGGGAACC ACAACGGTCG GGTCTGGAAC GTTAAACTTT 5340 

TTTGGCGTGG TTTCGGAAAA AGCCTTTGAC AGTCAACGGT ATCCAGAATT GCTAATCTCA 5400 

TTTCGTTCAT TAAAACATCA AAATACGTAT TCCTCTGATT ATGAAAAGAA AAGAGAGCAG 54 60 

GCCTTAAATC AAGTAAAAGA GGCACTGAAA AATCGGCCAG AAGAACAGGT TGCGGCTCTG 5520 

CGTGATTCGG CTCAGCCGGA CATTAACCAA GGTCGGCAAG CGCTGGAAAC GGGAAAACAA 5580 

GCACTTGCGC AGTTGGaGCA ACAACCAGGA ATTCCAGCTG AGATGCTTGA AAAGCAAAAA 5640 

GACGAATTGG CTAAACAAGA ACAACTGTTG GCTGAAAAAG AAACAGAGCT AGCCAATTTG 5700 

AAAGCCCCaA CGTATTATTA TTTTACTAGG GAAGACAATC CAGGATTTTC TGAATATCaA 5760 

GATAACGCTG ATCGGATTTC TTCTTTAGCC ACGCTGTTCC CGTTGTTCTT TTTCCTAATT 5820 

GCGGCATTAG TCAGCTTAAC TACCATGACA CGTATGGTTG AAGAAAAAAG AATGGAAATT 5880 

GGTAGTTTGA AAGCTTTAGG TTATCGTAAT CGTGAAATTG CTTCGATTTT TATTACCTAT 5940 

GCAACGGTTG CCAGCTTAAC AGGTGCATTA TTAGGATTGG CAGTTGGCTA TTATCTATTC 6000 

CCTAAAATAA TTTTTGATGc CTACGGTCAA ATGTACAATA TTCCTGATTT AGTTACACCG 6060 

TGGTACTTAA ATTACAGTTT ATGGGGCATC ATTGTAGCCT TAGCTTGTAC TGTGGGAGCG 6120 

GCCTTAGTGA CATTAAGAAT CGATTTATTA AGTACCCCAG CTACCTTGCT AAGACCCAAA 6180 

GCGCCGAAAG CAGGCCAACG GATTTTACTA GAACGAATCC GCCCTTTATG GCAACGCATG 6240 

AGCTTTATTC AAAAAGTGAC CGCTAGAAAT CTGTTCCGCT ATAAACGACG GATGCTGATG 6300 

ACCGTTATCG GAATAGCTGG TTGTATGGGG CTGTTAATTG TTGGTTTTGG TTTACGCGAT 6360 

TCAATTGTGG ATGTCGCTAC AATTCAATTT AATAAAATCT GGCATTACCA AGCAGTCGTG 6420 

ACTTTTAAAG AGCAAACAAC ACCTGAAGAG ACAAAAGAGT ATCAAGAAAC GTTGCGCAAA 6480 

ATGGATGGTT TGAATAAAAC AATTCCGTTA TATTCAGAAA TCTTTAAAAC GAAGGGCAAA 6540 

GGCGCACCAA CTCAAAATAT TACTCTTTAT GTGCCACAAG ATCCTTCGAA AATGGCTGAT 6600 

TTTCAATTAT TTAATGATCG AGTCACTGGA GAAAAATATT CCCTCAACGA TGACGGTGTA 6660 

ATCATTAATG AAAAATTAGC TAAATTATTT GGCTATAAAG TCGGAGATCA ATTAAATTTA 6720 

GAAAATAGTG ATAATCAAAC CTATCACGTG AAAATT6CCG CGATTGCAGA GAACTACACG 6780 

GGTCATTTTG TCTATATGAC GCCGAAACTC TACCAGACGA TGACGAAACA GAAGCCGGAG 6840 

TATAATACAG AATTCCTACT TTTTGACAAA AAGTTATCCT CAAAACAAGA AACCTCAATC 6900 

GGAGAGGCGT TGACAAAACA GCCGAAAGTG TTGAACATTA CGTTTTTAAC TGCGATGAAA 6960 

GGTTCGTTTG ATGATATGCT GAAAAGCCTT GATATTGTCA TTTGGGTACT AATTGCTGTT 7020 

TCTGGTTCTT TAGCGTTAAT TGTGTTATAT AATTTAACCA ACATTAATGT TTCTGAACGG 7080 

ATTCGAGAAC TTTCCACGAT TAAGGTTTTA GGGTTTTACG ATCGTGAAGT CACCACCTAT 714 0 

GTTTACCGTG AAAACATTAT TTTAACGTTC ATTGGTATTA TTGTTGGCTG CTTCTTTGGG 7200 
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AAAATTTTGC 


ATCAGTATAT 


TTTAGCAACC 


GTTGAAGTTG 


ATTTAATTAT 


GTTTTCGCCA 


726C 


ATTATTCATT 


GGCCAAGTTA 


TCTGTATTCC 


GCAGTCATCA 


CCATGTGTTT 


CACGTTGTTT 


732C 


GTGATGGTCA 


TTATGCATCG 


GAAATTGAAG 


AAAATCAATA 


TGATTGAAGC 


GTTAAAATCA 


738C 


AATGAATAGC 


ACATTTTAAA 


AAaCGAGGAG 


TCCGTAGAAG 


TTCGATAATT 


CTAGGGCTGC 


744G 


TCGTTTATTG 


CTGCTTCTTT 


TATTAAGAAA 


AAATAGGTCT 


TAAGATGTAT 


CTTTTGAGAT 


750C 


TTATTTGGAA 


ATTACTACTC 


TTTTTATTTA 


AAAATTGGTA 


CACTATTACT 


AGGTTATAGC 


756C 


ACAGAAAGGA 


GTGTGGGAAA 


AGTGTTAGAA 


GGTCTACCAT 


TGGAAACGGT 


TTATTTTTAT 


7620 


GCATTAGTGG 


GGAGCsCgGT 


CCTCGCTTTT 


TTACTAATTA 


TTTTTGGGGA 


TGTTTTTAAT 


7680 


TTTGATGGAC 


CGGTTGaCCC 


AATGCTGATT 


ATTCCATGGA 


TTGCTTTTAC 


CTCGCTGTTT 


7740 


GGGTACCTTG 


GGGAAGAATT 


AACAGCGGTG 


AATAGTTGGC 


TTATTTTAAT 


TGTCAGTGGT 


780C 


ATTCTTTCAA 


CCATTATTGT 


TTTCTTCCTA 


AATTTTTATG 


TGTTGGTCCC 


TTTGAAAAAT 


7860 


TCGGAGGCCA 


CAATTTCTAT 


TTCGGAAAAA 


GACATGGAGG 


GTCGTGTTGC 


TACTGTGATT 


7920 


ACCCCAATTC 


CTGTCCGGGG 


AATGGGCGAG 


ATTCAATTGA 


AGAGTGTCAC 


AGGTTCGTTA 


7980 


AGTCGACCAG 


CAGCATTTTA 


TGTGCCACAA 


GAAGTGGCTG 


CGCCAAGAGG 


TAGTGAGGTA 


8040 


TTGATTATTG 


AATTGAAAGA 


ACGGGTTTGT 


TATGTCATTC 


CTTACGAAGG 


CAGTCTAAAA 


8100 


ATATAGAGTA 


GGAGAGAAGA 


AAATGGGGAT 


TTTGTTACCg 


ATTATTATTG 


CaGTTTTAGT 


8160 


ATTATTAATG 


TTATTAATTG 


TGTTTGTTTC 


TAAATATCCA 


GACAGCCcAA 


ACCTGATGAA 


8220 


GCGTTAATTA 


TCcAGCGGGA 


GCTATCTAGG 


CTCTAAAAAT 


GTTCATGTAG 


ACGAAGGTGG 


8280 


CAACAAAATT 


AAAATCGTTC 


GTGGCGGTGG 


TGCGTTTGTC 


TTACCAGTGT 


TCCaACGTTC 


8340 


AAATCGAATT 


AGTTTGCTTT 


CAAGTAAATT 


AGATGTTTCC 


ACACCAGAAG 


TGTACACAGA 


8400 


ACAAGGGGTT 


CCTGTGATGT 


GTGATGGGAC 


ATCAATCATT 


AAAATTGGTT 


CGTCAGTTGA 


846C 


AGAAATTGCG 


ACAGCAGCGG 


AACAATTTTT 


AGGAAAAACA 


ACGGAAGAAT 


TAGAAAATGA 


8520 


AGCACGTGAA 


GTATTAGAAG 


GACATTTACG 


TTCGATTTTA 


GGTTCAATGA 


CAGTGGAAGA 


8580 


AATTTACCAA 


AATCGTGATT 


AATTTAGCCA 


AAGTGtACAA 


GAAGTTGCCa 


GTGTTGACTT 


8640 


AGCTAAAATG 


GGCTTAGTTA 


TTGTGTCGTT 


CACAATTAAA 


GAAGTTCGTG 


ATAAAAATGG 


8700 


ATACTTGGAT 


TCATTAGGGA 


AACCAAGAAT 


CGCTCAAGTT 


AAACGTGATG 


CAGATATTGC 


8760 


AGAAGCAGAA 


GCCTTGAAAG 


AAACTCGCAT 


CAAAAAAGCA 


GAAGCAGAAA 


AAGAATCACA 


8820 


ACAAGCGGAA 


TTGCAACGTC 


AAACAGAAAT 


TGCAGAAGCT 


TCAAAAGAAA 


AAGAATTGAA 


8880 


ATTAGCGTTA 


TACAAACAAG 


AACAAGATAT 


TGCCAAAGCC 


A 




8921 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



TGCAAATGCA CGAGGAGGAA CAAAAGAATG GAACArGGAA CAGTAAAATG 


GTTTAACGCa 


60 


GAAAAAGGTT TTGGGTTTAT CTCACGCGAA GACGGAAGCG ACGTATTCGT 


ACACTTCTCA 


120 


GCTATCCaAG GTGATGGTTT CAAAACTTTA GAAGAAGGTC AAGCAGTGAC 


TTTTGATGTT 


180 


GAAGATTCAG ACCGTGGCCC TCAAGCAGTT AACGTAGAAA AAAACTAATA AAAAATAATC 


240 


CAAAAACACC TAACATGCGT TAGGTGTTTT TTTGTTAGAA AAAATAGAGA AGAAAAATGG 


300 


ACAAGCATTT TTCTTCTCTA TTAAATGAGA GCTTTATTGA TCACTAACTG 




360 


AATCATTGGC AGAATCATTG GATGACGTTC TGTTTGTTCA AACAACAAAn 




420 


GGTCGTCATG GCTTCGCCTA ATTTTGCTTC GGTACAGTTT TTATCTTTCA 


TTGCTTCACG 


480 


TAGAGCATTG AATAAAAGAC GTTGGCCTTC ATGAATCATA TCACCAGACT 


CACGCATGTA 


540 


GACAAAACCA CGGGAAAGAA TGTCTGGACC AGCCATAATT TCTTTATTTT 


TAATATCAAC 


600 


TGTTGGTACC GCTAAAACTA AACCTTCTTC AGATAAGATA CGACGATCCC 


GTAAAACAAC 


660 


ATTCCCGATA TCACCGACAC CATTTCCATC AACGTACACG TCATTAGCGT 


TGAAGTGACC 


720 


AGCAGGGCGT GCGCTATCAG CCGTTAAAGC TAAGACATCG CCGTTGCCCA 


TGATAAAGCA 


780 


ATTTTCTTCA GGAACCCCTG TATCTTGAGC GAGCGAAcAT GGATTTTTAA 


CATTCTAAAT 


840 


TCACCATGAA CAGGCATGAA ATATTTAGGC TTCATCAAAC GAAGCATTAA 


TTTTTGCTCT 


900 


TCTTGTCCAC CGTGACCAGA AGTATGGATA TTATTGATTT TTCCGTGGAT 


AACTTCTGCA 


960 


CCGGCTTCAG ATAATAAGTT GATTAAACGG TTAAC6CTTG TCGTATTCCC 


AGGAATTGGT 


1020 


GAACTTGAGA AAACAACCGT ATCGCCTGGT TGAATACTAA TTTGGCGGTG 


TGTACCGTTA 


1080 


GCAATCCGAC TAAGCGCGGC CATAGGTTCT CCTTGAGAGC CAGTACATAA AATCATTGTT 


1140 


TCATTTGCTG GCAATTGATT CAGTTCCGCC GCATCGACAA ACGTTCCTTT 


TGGCACTTTG 


1200 


ATGTAGCCTA AGCGTTCGCC ATTGACAATG GCATTTTCCA TGCTACGACC AAAAACAGCA 


1260 


ATTTTGCGAC CTGTTTTCAC GGCTGCATCA GCAGCTTGTT GTAAACG6AA 


AATGTTTGAA 


1 j20 


GCAAAACTGG CAAAGATAAT CCGACCATCA ATTTTTTCAA AGATTTTTAA 


AATAGAGGTT 


1 o o r\ 


CCAATTGTTT TTTCGGATTT TGTAAAGGTG GGTATTTCAG CATTGGTACT 


GTCTGAAAGA 


^ A A r\ 
14 4 U 


AGACAAAGGA CGCCTTCTTC ACCTAATTTC GCCATTCTGT GTAAGTTGGC 


CGGCTCACCA 


1500 


ACGGGTGTAA AGTCAAACTT AAAGTCACCT GTTGCGACAA TGTTACCAGA AGGCGTTTTA 


1560 


ACCACAACGC CCAAGGCATC AGGAATACTG TGAGTCGTCC GGAAAAAGCT 


AATCGCTGTT 


1620 


TTTCTAAAAC GAATCACCGT ATCTTCATTA ATTTCGTGTA ATTCCGCTTC TCGCAATAAA 


1680 


CCATGTTCAT CTAGTTTGTT GGTAATTAAA GCCAAGGCTA GTGGTCCCGC ATAGATTGGG 


1740 


ATATTCGCTT GACGTAAAAG ATAAGGAACG CCACCAATAT GGTCTTCGT6 ACCGTGGGTA 


1800 


ATAACGAGTG CTTTGACTTT ATGCAAATTT TGAACGATGT AGCTATAATC 


TGGAATCACG 


1860 
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TAGTCAATCC 


CAAGTAAATC 


ATCTTCTGGG 


AATTTGATCC 


CAGCATCAAT 


AATGATAATT 


1920 


TCATCTTGGA 


ATTGTACCCC 


ATATGTGTTT 


TTTCCAATTT 


CGCCTAAACC 


ACCGATTGCA 


1980 


AAAACGCCAG 


TTTCGTTATT 


TTTTATGTTT 


ACTTTCATGT 


TAAAACTCCG 


TTAGAGTGAA 


' 2040 


GTCCGCATGT 


TCTTGCTCGT 


AAGCAAGATG 


TTTTTCGTCT 


AAT7VATTGAA 


CGTATTCAAT 


2100 


ATTGTATGGG 


GTATTTTCTT 


CAACTAATTG 


GCGCACGATT 


ACATCGCTAT 


CGGCTTCCAT 


2160 


GTAAAGAGAT 


TTTGTGTCTT 


CTCTTTTTGG 


GTTTCTGATT 


TTTGTTTCTT 


GATAGTATAC 


2220 


TTTGTAAATC 


ATCGATTCAT 


TTCTCCTTCA 


GCTACTAATA 


GTAGCAATTT 


ATAGTTTTTT 


2280 


TCATTTTAAG 


ACAGAAAATT 


AAACACAACA 


AAACGAGGTG 


CTGATCACCT 


AAAAAAGAAT 


2340 


GCTCATCAAA AACATCTGTT 


AATGATGAAA 


TGAAGCCTCG 


ACTTTGCTTC 


ATTCTGGGTA 


2400 


TTTGATTTTA 


TGTGTTCTTT 


TTCAACCGTC 


CCGTGTGTTG 


TGAATAACAA 


CAATTAAGCT 


24 60 


TATTTTATCA 


TAGAATGATA 


AGGAAGTATA 


GAAACACTCA 


CTGGACTTTT 


AAAAAAAACA 




AACTTCTTGT 


TTCGCCTTAT 


TCTTGATCAA 


GTTCGAGTGC 


CTTTTTTTTA 


GGACGCCAAT 


£. ^ \J \J 


ACCAACTTAA 


GCCAATCACG 


TGAGCTAAAT 


AAACAATCGC 


TAAAGTCGTA 


GTCATCAACG 


2640 


TATTTCGGAC 


AATTAAACAC 


GGAATGGCAA 


ATAAAAGAAA 


CATGGAAATA 


AATAGTTTCC 


2700 


ATTTACCTTT 


ACGAGTAATT 


TTTTTCTCAA 


AGAAGGCTTC 


TTGGATATAT 


TTTTGATAAT 




AAGAAGAAGC 


TAAAAATTTG 


TTATAAAGTT 


TATCTGAACT 


ACGCATCCAA 


AAAAAGGrPG 




TTAATAAATA 


AAAACCAGTG 


GTGGGTAAAA 


GCGGCAAAAA 


GATTCCTAAC 


GTGCCTAAAG 


2880 


CAAACGTACA 


GGAACCGAGA 


AAAATGAAGA 


ATAATTTTTT 


CATTCAAAGG 


GTCCTTTCAG 

w X \^ w XXX Wl^J 


2940 


TTTTTCCCAA 


TTATGGCACA 


ATCAAAAGGA 


AGTAGCAATT 


TTAATCGTAT 


AAGTAAATCA 


3000 


TTAAT^TTAT 


CTATTTTTTA 


GAAAATCTAA 


AAGAGAAAGG 


ATAGAAAGAG 


AAAGGCTGTT 


3060 


TTTTCTAGCC 


AGACAGCTTG 


TATTACCGAT 


TTCTTTTTGA 


TACAGTGGGA 


GTACTAGAAA 


3120 


ATTTAAAGTA AGATAAGAAG 


GAGATGATTA 


ATTGAAGTTA 


ATGTGGCGTT 


ACACAaTGCG 


3180 


TTATAAGAAA 


TTACTTTTCG 


CTGAtTTTAT 


TTGTGTGTTT 


GGTTTTATTT 


TGATTGAATT 


3240 


AGGCTTGCCG 


ACCATTTTAG 


CACGAATGAT 


TGACAAAGGA 


ATTATTCCTA 


GAGATATGGA 


3300 


CTATATTTAT 


CAGCAAGGGA 


TTTGGATGGT 


TGTAATTACG 


ATTAGTGGCG 


TTGCAATGAA 


3360 


TATTTTGCTA GGGTACTTTG 


GTGCCAGAAT 


TACAACAAAC 


ATTGTTCGTG 


ATATTCGTGA 


3420 


TGATTTATTT 


GAAAAGATTC 


AAACTTTCTC 


ACATAGTGAA 


TATGaAAGTA 


TTGGGGTTTC 


3480 


TTCCTTAATT 


ACGCGAACTA 


CCAATGATGC 


ATACCAAATT 


ATGCTCTTCA 


TGGGAAATAT 


3540 


TTTACGTCTT 


GGCTTTATGA 


CGCCAGTGAT 


GTTTATTGCC 


AGTCTTTACA 


TGGTGATGCG 


3600 


AACGAGTCCG 


TCGTTAGGAA 


TGTACGTTTT 


AGGTGCCTTG 


CCTTTTCTGC 


TGCTAGCAGT 


3660 


TGTCGGGATT- 


GCTCGTTTGT 


CAGAACCGTT 


ATCTAAAAAG 


CAACAAAAGA 


ACTTAGATGG 


3720 


AATCAATGGG ATTTTAAGAG 


AAAATCTTTC 


TGGATTGCGG 


GTAATACGCG 


CATTTGTTAA 


3780 


TGAAAAATTT 


GAAGAATCTC 


GTTTTAATAA 


AGTCAATGAA 


ACTTACACTA 


AAAGTTCAAA 


3840 
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AAGTCTGTTT CGTTTAATGG 


CAGCAGCCCA ACCAGGGTTC 


TTTTTPTTAT 

A X X X xv«x x^\x 


TTAATATTPT 
X XiviXnl Xux 




GATGGTCTTA ATCATTTGGA 


GCGGGACTGT TCAJ\ATCAGC 


CATGGGGATT 


T Af^ A APTT/^P 




GAATTTAATT GCTTTTATTG 


AATATATTTT TCATCrGPTG 

• MAXJnkXnX XXX X V>Jn X V3V«V9Var X \9 


T TPTPnT TT A 


l\3X xAl rXGC 


4020 


CAGTGTCTTT ATGATGTATC 


PGPGGGPTGP G/3TTTPGr;r'T 


TP A P^2P A TTP 


AAVjAAGCCTT 


4080 


AGACATGGAA CPAGPTATTP 


GTGAAGAAf^A AfZCr'CTA AfA 


GAAACAGCTA 


CTAAAGGCTA 


4140 


TCTAGAGTTT AAAAATGTAA 
X v« A A\wnw XXX nnrM)/% x o x nr\ 




GCGGAAAGCC 


CAGTTATTCG 


4200 


CAATGTAAGT TTTAAGGCGT 




TTTATTGGGA 


GCACAGGTAG 


4260 


TGGGAAATCA ACATTAATTP 


AATTfZATTPP AP^S^ITTTTAT 


GATGTATCAG 


AAGGTGAAAT 


4320 


TTTAATCGAT GGAGTGAAPG 


TAAAACAI^TA PA A APTTAPT 


GCATTACGCA ATAAGATTGG 


4380 


CTATATTCCA CAAAAAGPfTT 


TACTTTTTAC TGGTACAATT 


GCTGATAACC 


TTCGCTACGG 


4440 




AAGAAATGGA ACGAGCAATT 


GACATTGCTC 


AAGCCACAGA 


4500 


ATTTGTTTPn PAA AAArTTir* 
r*x X X V3 X X Xv^V3 ^nnAMM^v.^VTV' 


AAGGCTATGA TGAACCTCTT 


TCAGAAGGTG 


GCACGAACTT 


4560 


TTPTGGTGGT PAAAAAPAnr* 


GTTTaGCTAT CGCACGGGCG 


ATTATTCGCA 


ATCCGGAAAT 


4 620 


TTATATATTT f^ATflfiTaPTT 


TTTCTGCGTT AGATTATCAA ACAGATGCAA ATTTACGAGC 


4 680 


GPGTPTGAAA AAAr!&AAr*Aa 


CAGAATCTAC TGTTTTAATT 


GTGGCACAAC 


6TGTTGGAAC 


4740 


ne\x xnxov^AX \3\«l3V3nv»kirO\«A 


TTGTTGTTTT AAACGAAGGC 


GACGTGGTCG 


GAATTGGCAC 


4800 


ACATCGTGAA TTAPTTRARA 


CTTGTCCAAT TTACTATGAT 


ATTGCGGCTT 


CTCAATTGTC 


4860 


AGAGGAGGAA TTAGCATGAA 


ACACGCCTTT TCTTCTATGA AACGAATCGG 


TCGCTATATT 


4920 


AAACCGTACC GAGTGACGTT 


TTATTTAGTT ATTTTATTTA 


CAATATTAAC 


CGTTGCCTTT 


4980 


AATGCAGCGT TGCCTTATTT 


GACTGGATTA CCGACGACAG 


AAATTAGCCG 


TAATATTGCG 


5040 


GCCGGCGAAT CCATTAATTT 


TGATTATGTA ATCCAATGTT 


TAATTTGGAT 


TTTAGTTGTG 


5100 


GGAACAGGTT ATTGTGTGGC ACAATTTTTG TCAGGCTTTT TAATGACGAA TGTCGTTCAA 


5160 


CAATCCATGC GCGATTTGCG 


TCGCGATATT GAAGAAAAAA 


TCAATCGTTT 


GCCAGTTTCT 


5220 


TATTTTGATA AGAACCAACA AgGAAATATT TT6TCACGGG TGACGAACGA TGTGGATGCT 


5280 


GTCAGCAAT6 CGATGCAACA AAGTTTTATC AATATTGTTT 


CAGCAGTCTT 


AGGTATTGTG 


5340 


ATGGCGGTAG TGATGATGTT 


CTTAATCAAT CCGCTGATGG 


CGATTTTTTC 


AGTGATTATG 


5400 


ATTCCGTTGT CTCTGATTAT 


TTCCAGAACA ATTGTTAAAA 


TCTCCCAGAA 


ATATTTCCAA 


5460 


GGAATGCAAA ATTCTTTAGG AGAATTAAAT GTTATGTCCG 


GGGCCGAGnT 


CGAATTTTAT 


5520 


AAT 








5523 



(2) INFORMATION FOR SEQ ID NO: 70: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5794 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 70: 






ACAGTCGATT 


TCGAGCACCG GCAACTTTTG CTACATCAAT GATAGACGTC ATTTTTTACC 


60 


TCCAAAGTGG 


AATGTATTCT TATAAGAAGT ATAGCATACA TTTTTTTGAA AATAGTGCTT 


120 


GACATGGGAA 


CATTCCCATA ATTTATCCTA AAGATGTAAA CGATAACAAT 


TTAATCAAGG 


180 


AGGAAGAAGA 


CGGATGACAA GAGGAGCATT AAAAATTGCA ACAATTGGTG 


GCGGTTCTAG 


240 


TTATACACCT 


GAATTAATTG AAGGATACAT TAAAAGAAAA GATGAACTGC 


CAATTAAGGA 


300 


AATTTGGTTA 


GTCGATATTG AAGCTGGTAA AGAAAAATTG GAAATCGTTG 


GGGCAATGGC 


360 


GAAgcGCATG ATTAAAGCTG CTGGTTTAGA TTGGGAAGTT CATTTAACAT 


TGGACCGTGA 


420 


AGCAGCCTTA 


AAAGGGGCGG ATTTTGTTTC AACACAGTTT CGTGTAGGAT 


TATTAGATGC 


480 


TCGCATTAAA 


GATGAACGGA TTCCTTTGTC CCATGGAGTA TTAGGCCAAG 


AAACAAATGG 


540 


CGCGGGAGGA 


ATGTTTAAAG CATTTCGAAC AATTCCTGTC ATTTTAGCTA 


TTATTGACGA 


600 


TATGAAACGC 


CTTTGCCCAG ATGCTTGGTT AGTGAATTTT ACTAATCCAG 


CTGGAATGGT 


660 


TACGGAAGCT 


GCAATTAAAC ATGGTGGTTG GAAAAAAACA GTTGGCTTAT 


GTAATGTGCC 


720 


GATTGGTCAT 


AGAAAACAAG CGGCCGAAAT GCTGGGAATT CCAGAAGAAG 


ACTTATTCTT 


780 


TAAATTTGCT 


GGAATTAATC ATTTCCATTG GCACCGCGTT TGGGACAAAG AAGGACAAGA 


840 


GCGAACACAA GAACTCATTG ATTTAATTTA CGGaCCTAAG CAAGAGCAAG 


AAAGTCATTt 


900 


GAAAAATATT 


TTTGATt-JGC CGTTCCATTA TGAACAGTTA AAAGATTTAG 


GAATGTTACC 


960 


ATGTGGTTAT 


CATCGCTATT ATTATATTGA AGATGAGATG TTGAAGCATT 


CCATCGAAGA 


1020 


GTATGAACGG 


GGCGAAACAA GAGCACAAGT GGTTAAAGAA ACGGAAGGTC 


GTCTGTTTGA 


1080 


ATTGTATAAA 


GATCCCAAAC TAGATTAC7UV ACCAAAAGAA TTAGAAGAAC 


GGGGCGGCAC 


1140 


ACACTATAGC 


GATGCAGCTT GTGAAATGAT TGCATCAATT TATAATGATA AACGAACGGA 


1200 


TATGGTTGTT 


TCTACTGAAA ATAATGGGAC AATTACTGAT TTACCGTATG ATTGTGTTGT 


1260 


AGAAGTATCT 


GGACCTGTGA CAGCTCATGG CCATGAACCT TACAACTGGG 


GGGCCTTTCC 


1320 


GCCAGCAGCA 


CGAGGAATTA TTCAAAATAT GAAAGCTATG GAAGAAACGG 


TTATTCGAGC 


1380 


AGCCATCAAT 


GGTGATTACG GCGCAgCATT gCATGCTTTC ACTATTAATC CATTAGTTCC 


1440 


TGGCGGCGTC 


ATGGCGAAGA CCTTGTTGGA CGAATTATTA ATTGCCCACA AAGCGCATTT 


1500 


ACCGAATTTT 


GCAGATGCGA TTAATAAAAT TGAAGAAACA CAACCAGAAA CAGTCACATA 


1560 


TGTGGCCGAA TTAATGAAAA GTAATTAAAA AATATCGCAC GCTAAAGATT CGTTTAGTGT 


1620 


GCGATACTTT 


TTTTAGAATA GGAAGCAAAG GATTCCGTTT TTTTCTTGAC AGTAAAATGA 


1680 


TAGGTTTGTT 


GCCGGAACAG CCACTGAATA CCAGAGTAAA TTAAAATTTC AGCAATAATA 


1740 


AACGTCGAAA 


GTTTCTGTGC GGAAAGGAAT AATAGCGTTA AAATTAAAAA TAAGACGACA 


1800 


CAGGTATAAA AAACATGTTG CAATTTTTTA GAATTGATAA TCGTGATTTC 


TTTGGTATCT 


1860 


TCAGTAACTT 


CAATGCGCCG ATAAAATTGG AACCAGCCTT CATAGAACGT 


TAAAACATCT 


1920 



wo 98/50555 PCr/US98/08985 

522 

CCAGGATCAG CCTCAATTTC AATAGGTACA TTAGTAAAAA GTTTCCCTTG ATGCACACCA 1980 

TTTTTATAAA ATGtAAAATA GCGTAATGCG CCCCAAGCTG ATGTTTTTCG TGTCAATGTA 2040 

ATCTTCATGG TTTTCACCTC CGtCTTTATT TTGCCmTv#Gg TtTCGGTAAA ATTGgTATCC 2100 

TACTAAAGTC CGATTCTCTC ATTTTAGGGT CAGAATTGAC AAAAGAAAGC AGAGGTGTTA 2160 

ATGTGGGAGT AATTAAACGA GGAGGTATAA TTGTATGTTG GCATCTATCA GAATTTGGAT 2220 

TGACTTTTTG ACAAAATAAA TTGAATAAAT CTTATCAAAA AGTCTGAAAT TAAAATTCAT 2280 

GGGTGGCGTG TGTACTTTTA TACGAAACTA CAAGGATAGC GATTAGGCTG TCCTTTTTTG 2340 

CGTACCCAAG ATTCTTTCGT TTCAGAAATG AAAGAAGAGG GCAAAATGAA AAAATTAAGT 2400 

ATTCATCTGA AAGACGTAAG TATTCACTTT TCGGGAAAAC CAATTCTAGA AATTGATGAA 2460 

TTATTTGTAT ATGAAAATGA AAAAATTGGG ATTATTGGTA AAAACGGCGC AGGAAAATCA 2520 

ACACTGCTCA ATTTAATTAT GGGTAAGATT CAATCAGATA AAGGAAAAGT TCAAAGATTG 2580 

AACGACTTTC ATTACTTGGC ACAAGTAGCT GAAGAAATAA CGAATGAATC AGAGAAAACT 2640 

GACAAAAATT GTTTGCTAAA TCAGAAAAAT CAAAAACTAA GTGGTGGCGA AAAAGTCCAA 2700 

AAACGCTTAG CAACATTATT TTCAGAGTAT CCAACCGGGG TTATTTTAGA TGAACCAACC 2760 

ACACATTTAG ATAAAGAACA TCGTCAGTTG TTAGTGGCAG ATTTAACGTA TTATTATGGG 2820 

ACCGTTTTGT TTGTTAGTCA TGATCGCTTT TTCTTGAATC AATTAGCAGA GAAAATTTGG 2880 

GAAGTTTCGG ATGGACATGT CAAAGAATAT TTAGGGAACT ACGATGCGTA TTGTCGTCAA 2940 

AAAGAATTGG AACAGCAAAC ACAATATAAT GTCTATCATC AGTATCAAAA GGAAAAGAAA 3000 

AAATTACAGG AATCTTACGC AAAGAAACAA GCACAAGCGC AAAAATCTAG TCATGTTTCA 3060 

AAAAAACAAA AACAAAAGCA AATTAAACCT AGTCGTTTAG CTGGTTCTAA ACAAAAAGAT 3120 

ACCGTACAAA AAGCACTCCA AAAACAAGCG AAAGsGwTTA ATGCCAGAAT TGACAGATTA 3180 

CCCGATGTTG CGCAAGCAAA ACAAGAGCGA AAAATTATTT TTCCTACTAA TAATCAGTTC 3240 

TCTTTATACA ATCCATATCC AATTAGAATT GAAAATTTAA CTTTCGCTTA TGAGAATCGA 3300 

ACAATTCTTA ACCAAGTGAA CGTTCAAATT CCTTTAAACG AAAAAATAGC ACTTTGTGGa 3360 

AAAAATGGTG CTGGGAAGAG TACCTTCCTT CAACAAATTG AAGCCCGTCA CCCGGCCATT 3420 

TACTTTTCTC CCAAAGTCCG GTTAGGAACG TaTCATCAAT TAGATTATCG ATTAAAAAAT 3480 

GATGAACCAC TGTTGACTTA TTTATTGAAG CGGACAAATT ATTCTGAAAA GATTGTGCGT 3540 

TCACTTTTAT ATCGACTAGG TTTTCAACAA GAAAATTTAC AGACAAAGAT ATCTTCTCTA 3600 

AGTGGTGGCG AACATGAAAA ATTACTTTGG CGCAATTATT TATAGAACCA AATAATATCA 3660 

TCTTATTGGA TGAACCCACC AATTTTCTTG ATTTAGACAC AATCCAAGCA CTTGAAGAAT 3720 

TTATAAGTGC CTATCAAGGA ACAGTAATTT TCACTTCGCA TGATGAAACG TTTGTGGAAA 3780 

AAGTAGCCAC CCGCACTATT TATTTAGAAA ACGGGAAAAT TATTGATAAA TAaAAAAAGT 3840 

rAACAAAAAG TCGCTATTTT AGCGACTTTT TGTTTAGACT TCTTCAATTG GTAAATAGGT 3900 
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GTATTCTGCA ATTTCTGCGG CATGGGAGCC AGCAACATGT 


CCAGTGACAA ATGCAGCTGT 


3960 


AACATTGTAG 


CCTCCAGTAT AGCCATTAAT ATCTAAAAGT 


TCACCAGCAA AAAATAAACG 


4020 


ATTGACTAAT 


TTGCTCTCCA TTGTTTTAGG GGTCACTTCT 


TTGAGGGAAA TCCCACCGCC 


4080 


TGTGACAAAA 


GATTTTTCCA AAGGCAATGT TTTCGTAACA 


GTGAACTGAA AGTCTTTCAG 


4140 


TAGTTCCACA 


AAGGAAAGmC GTTGCTTTTC TGTTAGTTGT 


TTGGCAGGCA CTTCTTCAAA 


4200 


AGATTTTGTC 


GGAAACACAT CCAAGGCTAC CGTGACAGGT 


TGATTACCGT TGCGAGTTAA 


4260 


TTCTTGGTTA 


ATAAAACTAG AACAGCGGAG CGCGGCAGGT 


CCTGAAATGC CAAAATGTGT 


4320 


AAACAGCATA 


TCCaTTTGAT GATTAACTAA AGGTTTTCCT 


TTTTGGTTCA AAACAGTTAA 


4380 


ATTAACATCT 


TGTAAAGAGA GACCTTGCAA CGTTTTATCC 


AGGATAAAAG GTTCTTCAGA 


4440 


AATAATAGGT 


GATTCGGTAG GGTAGAGCGG GCTGATGGTA 


TGCCCCATTT TTTTGGCTAG 


4500 


TTTATAGCCA 


TCACCAGTTG CTCCTGTGGA AGGATAAGTG 


CGGCCGCCAG TTGTTAATAC 


4560 


AACACACGGT 


GCATAAATTT TTTCCAGTTC GGTTTCAACG 


CCAATTATTT GATCGTCTTT 


4620 


TCGTAGTAAT 


TTTGTGACCT GTGTTTTTGT AAAAACAGTG 


ACTCCTAATT CGTTAATGCG 


4680 


GTTAAATAGC 


GCATCAACAA TTGACTTCGA TTTATCTGTA 


ACAGGGAACA TGCGTCCGTG 


4740 


ATCTTCTTCT 


TTTAAGTGAA TACCATTGGA TTCAAAAAAG 


TTCATGATAT CATAGTTATC 


4800 


AAATTGTGAA AATGCGCTGT ATAAAAATTT TCCATTCCCA 


GGAATAAATG AAATGATTTC 


4860 


TTCTGCGGGC 


CGATTATTGG TTACATTACA GCGGCCGCCA 


CCAGTCATTA ATAATTTTTT 


4920 


CCCAACACGG 


CGATTTTTTT CAATCAATAA TACTTGAGCG 


CCTGCTTCGG CGGCCGCAAT 


4980 


CGTGGCCATC 


ATACCGCTCG TCCCAGCACC GACAATAATT 


ACATCAAATT TCTTCATAGT 


5040 


TCTCCTCCAA 


TTCAATAGGT AgTGTACCAT AATTACGGGA TTTTTTCTGG ATTTAAGGAA 


5100 


AAAATAAACG 


TTTGACTATG GTAATTTTTG AATTTTCAGT 


TAAAATAAAC CTATAACAAT 


5160 


GAGAGGATGA 


TTAAATGGAC GCAAAACAAT ATGTTAAAAA 


TATTCAAGAA AAAATTCACC 


5220 


AATTAGACCA AGGTCAAACA GAATATTTAC AAGCAGTCGA 


TGAGTTTTTA CCAACCGTAG 


5280 


AAGGTTTTTT 


AGAAAAAAAT CCTCAATACA TTGAAGCAAA 


TGTTTTAGGT GTCTTGATTG 


5340 


AACCTGAACG 


TATTTTTCAA TTTCGTGTGC CGTGGCAAGA 


TGACCAGGGC AATTGGCATG 


5400 


TGAACAGAGG 


CTATCGTGTC CAATATAATT CAGCGATTGG 


ACCATACAAA GGTGGTTTGC 


5460 


GGTTTCATCC 


AAGTGTGAAT TTAAGTGTGA TGAAGTTTTT 


GGCCTTTGAA CA7VATTTTTA 


5520 


AAAATAGTTT AACTGGTTTG CCAATCGGTG GCGGCAAAGG CGGTAGTGAT TTTGATCCAA 


5580 


AaGGTAAGTC AGATGCTGAA GTGATGCGCT TTTGTCAAAG TTTCATGACC GAACTTyAAA 


5640 


AACATATTGG TCCAAGCACA GATGTTCCAG CAGGCGATAT TGGCGTAGGC GCTCGTGAAA 


5700 


TTGGTTATTT 


GTTTGGCATG TATAAAyGTT GCGCAACTAT GATGCAGGAG TGTTGACTGG 


5760 


CAAACCTTTA GGATACTGGG GAAGCCAAGC AAGA 




5794 


(2) INEDRMATION FOR SEQ ID NO: 71: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4232 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

ATTCGTATGC CaATTA'«t.«T gGCtaTTTGg CgAAAAAgCc AnCnGGAGAA gTCaGrGnAA 60 

AATGTGGAAA ATTATAGGAA GCTACTGGGT TTTATGCTAT GATTTTTAAT AAaACAAAAT 120 

GAAGGGGCGG GAAAACATGT TATTTGGTAC AGCAAAAATG AATCGGGAAA ATCATTTAGA 180 

AATCGGTGGT TGTGACACCG TAAAGTTGGC GCAAAAATTC GGGACGCCGT TATTTGTCTA 240 

TGATGTCGCT CATATTCGCG CACAAGCACG TGGTTTTAAA CAAACGTTAA ATCAATTAGG 300 

TATCAAAAAT AAAGTGGTTT ATGCAAGTAA AGCATTTAGT TGTTTAGCTA TTTATCAAGT 360 

GTTAAAGGAA GAAGATATTG CTTGTGATGT TGTCTCTGGT GGAGAACTAT TCACAGCTTT 420 

AAAAGGGGGC ATGGAGCCAG CGGAAATTGA GTTCCATGGA AATAATAAAA CCCCTGAAGA 480 

ATTACGTTAT GCTTTGGATA ACAAAATAGG TACAATTGTC ATTGaTAATT TTTACGAAAT 540 

TGACTTGTTG GAAGAATTAT TAGCTACACG AAACCAAACA CAAAAAGTTC TTTTTCGAGT 600 

GAGTCCTGGC GTGGATGCAG AAACACATGA TTACATTTTG ACAGGCCAAG AAGATTCAAA 660 

ATTTGGTTTT GATGTTGCAA GTGGCCAGGC AACAGAAGCA CTCGTTCGTC TTTTATCAAA 720 

CCCCTCTTTT GATGTACAAG GGGTCCATTG CCATATTGGC TCGCAAATTT TTGCTGTGGA 780 

AGGCTTTTTA GCTGCTGTTG AAAAAATGTT CACTATTTTA GAAGACTGGC GGCAAGTCCA 840 

TCAATTTACA GCACGTGTTT TAAATATGGG TGGTGGTTTT GGTGTGCAGT ACACGCAGCA 900 

AGATGAGCCA TTAGCACCAG CAACATTTGT TGAAAAAATT GTTTATTCCT TGAAAGGTCA 960 

TTGTGAACAA tTAGGGTATC CaCTGCCTGA GCTTTGGATT GAACCAGGCC GCAGTTTAAT 1020 

TGCCGAAGCG GGCACAACGA TATATACAGT TGGCGCGCAA AAAGAGGTTC CTGGTGTTCG 1080 

TCATTTTGTT TCCGTAGATG GTGGCATGGG TGATAATATA CGGCCTGCTT TGTATCAAGC 1140 

AGTTTATGAT GGTTTTCTAG CGAATCGTGA TGGGCATGAT TCAGTCAAGG AAGTCACAGT 1200 

GGTTGGTAAA TATTGCGAAT CAGGAGATGT CTTACTTCGA GATATTTTAT TGCCAGAAGT 1260 

TAAAGCAGGG GATTTATTGG CCATTAGCAG TACGGGGGCT TACGGTTATT CAATGGCAAG 1320 

TAATTACAAC CGAAATCCTC GACCAGCGgT GGTTTTTGTC GAAGATGGAC AAGCAAAATT 1380 

AGTTGCTCGC CGTGAGACAT ATGAAGATAT GACAACGTTG GaTTGTTGAT AGATTaAAAG 1440 

TAGCCGCCTC TTTTTGAGAC gGCTGTTTTT TTTCcAAAAA ATAACmAAAT GTAAGGGTGA 1500 

TTTCTTTCAG AAaTGaAAAA TCCaTAGTAT ATTAAATGAA GAAGGAAATA CTTTAGAAGA 1560 

GGGTGTGATT TGCA6ATGAA ATGGCTCCTC GGGTTACTGA TCATTTTTGC AATAATTG6C 1620 

GTTGTTTTTT CTTTTTACAT TGCCAATTAT TTTTTGAAAA TTGCGTTGCT TAAAGACAAC 1680 
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CCTTGGTATC 
ACAAGATATA 
TCTTTTGCTG 
ATTCCTTATC 
AAACGCGATA 
CCCGATTTGC 
CGGTTAGATT 
ATTTTAGTGG 
CCAACGAATG 
AAATACGTCC 
CAATTAGCAT 
CACAAAAATA 
ACAATGTTTG 
AAAGCACCAC 
GAATTTTTGG 
TTTTCATGAT 
AAAGAAAAAT 
CTCTTGCTGT 
AATATAAATG 
AAGATAAGGA 
AATGAAAATT 
TCCTGCATTC 
AAAAGGCACA 
CCAAATCTTT 
CTGTCGCTCC 
CTTGAAAAAA 
TAAGTTGCCG 
CTAATTGCTG 
ATAAAAGAAT 
TAAGACAATT 
ACAGCAGCCA. 
CGAGGTTGTG 
TATGAACTCA 



ATAAAAAAGG 
CAAAAATAGA 
AAGATCGTTG 
CAGATAACCA 
TGGCATATAC 
GTGCGCACGG 
TATTAAGTTG 
GCGGTTCCAT 
TCCGCGGATT 
TACATGAATC 
TAAATAACTA 
CGTTGCCCAC 
AAGAAAATTT 
ATTTATCTTC 
AAGAAAACTG 
TCCATGTCTT 
AACGGCGCCA 
AAAAGAGATT 
TAAGCAAAAA 
AATGTTTTAT 
GACCATAAAA 
ATAAAAGCTT 
TCAAAAATGG 
TTCTCCATTT 
GTGTTTGAAA 
ATACACGCAA 
ATCGCATTTT 
ATTTGTTGAT 
ATAGAGAAAA 
TCAGAGAAAG 
GTTAAAAATA 
GAGGATGTTT 
TTTTGGACAA 



GCATCGGTTA 
AGAACAACAA 
GTTGAAAATT 
TCGTTGGGCG 
TGCATTACGT 
GAAAAGTTCA 
GCTTAGTGAA 
GGGAGCGGCT 
AATAGTGGAT 
TTTTCATTTA 
TGGCTTTCAA 
ATTCTTTATT 
AGCGGCAACT 
AAGTGTCTAC 
tCCTGCGGTA 
TTTTTTGTCT 
AACTACCAAG 
CTCGTATATA 
GAATCACTAA 
TTAATAATTT 
AACCAAAAAA 
TCGGGAAAAG 
AAATATAAAG 
CATTTAAAAA 
TAATCGTAGT 
CCAGCCTCGA 
TTCGCATGCT 
TAAAAAAGGG 
TCGTAGGATG 
ATGATTTAAA 
TTTTAAAAGC 
TTTTAAAGTT 
CGTATAAAAT 



525 
TTAAATCCTG 
AAACAAGAAG 
AAAGATGAAA 
ATTTGTGTGC 
TTTGCAGAAG 
GGGAATAAAA 
GTTCTTGCCA 
ACTGTCATGA 
TGTGGCTACA 
CCCGCTTTTC 
TTGA7\AACCG 
CACGGTACAG 
CAGGGGATAA 
GAACCTGAAA 
AAAACAATAT 
GACTCGTTTA 
GCAAAGAAAC 
GCTGAAACCT 
AGGCAAAATA 
GAGGCGTGTC 
ACTAATCAAA 
TAGTCTGCCG 
ACAAAAAAGT 
ATATCTAATT 
TTTGACTATA 
CTATACGAAA 
AGTGTTGCAC 
AAGGAGGGAG 
GGAGTTATTC 
GAAGTTGTTT 
TAGAACATCG 
AACTGGGAAG 
TATGTTACAA 



ATAATTTTCA 
GCGAAGCTTT 
CATTGTATGC 
ATGGTTATCG 
AAGGCTACAA 
TTGGGATGGG 
TTGATATGGA 
TGGCAAGTGG 
CTTCTGTTTA 
CTATTTTAAC 
CTTCTTCCGT 
GTGATCGCTT 
AAAAAGGCCT 
ATTATTACAG 
CCGACTAAAA 
GTTAAATTTA 
CATTTCATTG 
TCGCCATGGA 
CAGACTTCTA 
TTATTCAATA 
TAAAAAATCA 
ATGAATGTAT 
ACTAATGTTA 
ACTTGGTACA 
CCATAGCTGT 
CACACAGAGT 
TTCTGTTTCG 
CGTTTTTCTA 
AAAGTGGCGG 
TTGGGTCAAG 
AATGTCCATA 
GCGACCGCGC 
CTTGTTTATC 



AGAACGGGAA 
TTGGACAGAG 
ACGGTGCTTT 
TTC7VAACGGA 
TGTTTTAGTT 
ATGGCTGGAT 
AGCAGAAATT 
TGAAAAGTTA 
TGATGAATTT 
AATTGCTAAT 
GAGACAATTA 
TGTTCCAATG 
CATTGTTGCT 
TAGTATTTTT 
AAGACATGGA 
AAGTGCTTAA 
AATAAGTTAA 
CCCGCAAATA 
ATATTTCCAA 
GATAAGCAAA 
GATTCGCACT 
TTTTAATTGT 
ACAAGGCATG 
TTTCCATAAA 
TCTCAATTTT 
GTCTAAAAAA 
TTGGTGTTTT 
GTATAATAAT 
AAATTACAGT 
AGTATGACCA 
TTGAAAAGTA 
ACAAAGATAT 
CCGATTTCTT 



1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 
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CTTCTGAACA 
TATTTAACTT 
TATACATGGA 
CCACAaTTAA 
CTAGGATTTA 
CAACTTTCGA 
ATGCAACCAT 
AAAGACTCGT 
AATCAACGAA 



TAGACCAGCG GAGGTAATTG GGGAAAATCA AGAAAGCTAT TTAGAAAAAC 
GAATTTTCTT GCAGTAAATA GCAAGTATCC GCCTCATGAT TCCGGACCGA 
ATTTTGACCA AGTTTTTCCT GACTATTTAC CAAATCCTGT ACCAAAAAAA 
ATGAATTTCT TCTCGACAAT TTTACTAAaT TTGAGCGAGT TCATCmGGAC 
aGCGTTTTGC CgAACTTACC CaTTCAATTG GAAACaTTAC AGTAGTTCCA 
ACAGCGGAAG AAGTTTATCG TTTAAAGATT ATTGGGATTA TTCTTTAGAA 
TATTTTTAGC TTCATTTCAT TCATGGGAAA GCTACGTTCA TACGTATGAG 
TTTTAAATGA ACAGTATCAA CCAGTAGCGC TATGGAAAAA TCATTTGAAA 
TTATCTTACC TCAGAATATA GAGGAGATAA ATGAATATTT AGTTCAAGTA 
TAGAAAAAAG AGGGCAGCGA ATAGTGAATC GG 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
TGGAnTTATG GTTGGGTTTT nAGGTTGGCC GnAGGTTGGG GCCAAAGGTT ACCCGTTATT 
TTCCTTGGAC CAAnCCACTT TATTTCGGAC CAAAAACCCA TTTTAAGAAA CCATTTGTAC 
CCAAATGGTA AAGCAGAAGA TTTAGATGCT GCTTCCGCAG AATATGAAAA AATTATCGAT 
GCACATCCAG TAGATATCCA AATCTTAGGT ATTGGTCAAA ATGGACACAT TGGTTTCAAC 
GAACCAGGCA CACCTCTAGA TAGCTTAACA CATGTCGTTG AGTTGACTGA ATCAACAATC 
AACGCAAACA AACGTTATTT TGACAAAGTA GAAGATGTCC CAACACGTGC TGTTTCTATG 
GGGATTGGCT CAATCATGAA AGGTAAAAAA ATGATTTTAA TGGCTTATGG CGAAgCAAAA 
GCAGAGGCCA TTAAAGGAAT GATTGACGGA CCAGTAACAA CAGACATGCC AGCAAGTGCA 
TTACAAAACC ATCAAGATGT TGTTGTAATC ATTGATGATG CAGCTGCAAG TAAATTATAA 
TCGTTGATGA AAAAGAACTT GAAACATAAG AATTTCCTTA TGTTTCAAGT TCTTTTTTTG 
TTATACTGGA AGTAATAGAA GAAGAGGTGA AAAGGATGTT ACAGCTAACA ATTCCAGAAA 
TTATTTTACG GTTAGTCCTA GCGATGGTGA TGGGTGGCGC AATTGGTTTT GAAAGACAAT 
ATAAGAATCG CCCAGCAGGG ATGCGCACTC ACATTTTAGT TTGTATGGGC GCGGCTATTA 
TTGCGTTAAT TCAGTCGCAA ATTGCAATCA ATGCATTGAA AGATGC6TTA gcTAATCCGG 
CTTTGACGGG GGTCATTCGT TCAGATCAAG CAAGGTTGAT CGCACAAGTT GTAAGTGGAA 
TCGGCTTTTT AGGTGCTGGA ACCATCATTG TAACGAAACA AAGTGTCACC GGATTGACGA 
CAGCAGCGTC ACTATGGACC GTTGCTGGTT TAGGGATCGC GGTGGGCATG GGCTACTATA 
CGATTGCCTT AACAAGTTTT ATCGGTGTAT TATTTGCCTT AACCATTGTC AAACGAATTA 



3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4232 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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TTCACGTACC 
AATTTATTAA 
TTTCTTGGGT 
GCCTAACGTA 
GATgATTAAT 
TTATGGCGGT 
TTACAAAAGA 
AAAAAAATAA 
TGAAAGCTTA 
TAGAGAATGT 
GCAGTATGAC 
CAGGAGCAAA 
GTAATTTTGT 
AAACCTTAAT 
ATATACAAGC 
CAAAACTTTT 
TGCCGCAAGT 
AGGAAGGTTG 
GCATCAGCGG 
AATGGCTGCT 
AGAGAAGCAA 
AGCGTCAACC 
ATTAGTGACA 
TTCAGATAAT 
ACAATTGTTG 
TAGTGGGGAA 
TCGTGAAAAA 
GGAAGTTGGA 
TAATGAAGAA 
TATTGATCAA 
AAATATACAA 
TATTGCTGGG 
TGTTAAAAAA 



AACAATCAAA AAATTAGAAA 
TCAGTATTTT GAAGAAAaAC 
GGATGATACA AAAATCTATA 
TGCCGATGTT ATCGAAGATC 
GTGTAGAAAC ATCTATTaTA 
TTTCTTCTAG GACCTAGGAC 
GTTATGATTG TATTACGAGT 
ATTTCTTAGT ATTATTGTTG 
TAATCAATTG ATTACTTTAG 
AATGCAAAGA CGTGCTGATT 
TCAAGAAAAA GAAGTTTTAA 
ATCTTCTTCT GAAAAAGGGC 
TACAGTATTG AATGAAGATT 
GACTCAATTA GAAGGATCTG 
CGTTGAAACA TATAACCAAC 
AGGATTTGAG AAGAAAACTA 
AAAATTTAAA TAAAAATTTA 
GATAAAAAAG TCGAAAAAAT 
AATACA.\CGA TTGTTGTGGT 
TGGTATGAAG GAAAGGGAAT 
CATGCCTTAC A7VAAAAAGAG 
CAGACAACAA TAAATAACTT 
ATTACTGGAA CGAATGGGCA 
GCAGGTATTG TCTCAAATCA 
GAAAATGCGA ATGGCGCACA 
TCTGTTGAAT CTTACAGCAA 
AATAATGGTG TGGTTTTTCT 
TATGGCTTAG AGAGTATTTT 
GTAAAAGAGG CTTTTAGAGA 
GTATCGGCTG CAATCAATTC 
ACAGAGTTAA ACACTAACAC 
AT6CTCATTT 6TGCTGTTTA 
ATGTATCAAG ATTATTTGAA 



TTCGCTATCA 
ATATTGAAAT 
CGAATATCTA 
TATCTGTTTC 
ATTAATGACA 
TGATTTTTTA 
TAGAAAGAAA 
TTGTCATTGT 
ATAATAAAGT 
TAATTCCTAA 
AAGAAATAAC 
AAGCAAATGA 
ACCCAAAGTT 
AAAATCGAAT 
AGATTTCAAA 
ACTATGTTGC 
TAGAAAATTT 
TCAAGAAAAA 
AATTAGTTTA 
CTCTTATGAA 
AGACGATATC 
TTCTAAACAA 
GTTGAACATT 
ACTTAAACAG 
ATTTATGCTT 
TAAGATTGCT 
AATGGCAATT 
AACGGATAGC 
CGAGGATTAT 
AAAAACCGCA 
AACGAAAAGA 
TATTTTACAA 
TTGTTTACCA 



ACATAAACAA 
TTATGATGTG 
TACTATTGAT 
AAATAATGTA 
ACCGCTATAA 
AGTTTAATAT 
GGTGATAAGT 
GATTATTGGT 
CGAAGCAGAA 
TCTTGTCAGT 
GGAAGCACGT 
GCAAATTGAA 
AGCTTCTTCT 
TTCTGTTGAA 
ATTCCCTGAT 
GACAGAGCAG 
GGGTTAAAAA 
GTTCGTTTGT 
AGTATGATTT 
GCAAAACAAA 
TTAGCAGGGA 
GAAATAACTG 
AATGATGCTA 
AAAATGTTTC 
ATTACCGTTC 
AATCAATTAG 
GAGGATCGCG 
TATGCGGATG 
AATACAGGTC 
CAAGTGGATA 
AATGTTCTTT 
ATTTTAAGAA 
ACAAAGGCTG 



GAAACCAAAG 
AACTTCAATG 
TTGCCTAAAG 
ACAAAATTAC 
GTTAAAATGC 
TTCAAAAAGA 
AAGATGAATA 
GCATTTGTGA 
TGGTCACAAG 
AGTGTTCAAG 
AAAGCATACG 
AAAGGATTAG 
GAAAATGTCA 
CGCCGTAACT 
AAAATAATTG 
GGAAAAGAAG 
GAAAGAAGTG 
TTTATAAAAA 
TAATATTAGG 
CTGCACTGAG 
AAATGACTGA 
GTGAAAATGA 
ATATCTATGT 
AATTGAATCA 
CTGCGTTACC 
GTGTAGGGGA 
AGTCTCGCTT 
ATATTATTAA 
TCAATAAAAT 
ATGAATTAAC 
TAATGGCAAG 
CAAGAAAACT 
TCCTAAATAA 



1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 
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CTCAGAACAG ACAAAGAAAG TATTAAATAA GATGAAAAAG 
TCTTAATGGA GCAACGTTAT TTGCCACGAA AGGGAAAATA 
TTCTTTACTA AGTATATATC CAAATGCAAA AAAACAATCT 
AGATACTTTA TATAGTTATG ATGGTAGTGT TTTGACGTAT 
CAATCCGTCT AATCATAGTA GTTCTGGTAG CGGCAGTGGT 
ATTCGGCGGT GGCGGCGCCT CAGGAAGTTG GTAAAATAAA 
TTCTAGCTTT TTGCTATTTA TCTAAACCGT AAAGATAGTC 
CTGTTCCTAG TAAATAGCCA TTTCCTACTT GGGAGAAGAA 
TCGAAATACC ATTCATTACA ATCGGGTTCA CATCATTGGC 
CCATACCTAA GTTCATTAAA GCTTTGTTGG CGTTGTAACG 
CGGTCCAACC TAAGTCATCA TATAATTCTT CTGTATAACG 
CGTAAAGTAA ATTATACATC CAGTCTTTTA GTTTATCTTG 
TAAATCCTAA TTGGAATTTA TAGCCAATAT ACGTGCCGTG 
TCAATTTAAT AATTTCAGCG ACGTTCGCTA GTTTATTGTT 
TGTAAAACCC AGAATAGAAT AAAAACGTTT CTAAAAATAC 
ACGGAGTACC GTTTTTATAA ATTTCGTTAA TCCGTTCTGC 
TGGTATTTGT CCACTCAAAA ATATCATCAA TCTCTTTTTT 
TAGAAGAATA ACTTTTGGCA TGGACAGATT CCATGAATTG 
CTTCATGAGG CGTCCGCACA TCATTTCTCA GTTGATCCAT 
TATCTAATAA AGTTAAACCG CCGAAAACGT AGCCGACAGT 
GAGTCCGCCA ATCATCTAAG TCATTGGATA ACGGAATCCG 
CGGTTAATTT TTCCCAAGTG GATTTATCAA TGACGTCTTC 
CTTCATAATA CGTTGCCATG GTTATTCCTC CTGTATGCTG 
AGTTGTCTCG TTTCTTCGCT TAAAACGTTC TACAGAGTCC 
TCTTCTTTGT CCAGTATATC TGAAAATGTC CTGTTTAAAA 
AAGACGCTCA AGCAGAACGC CTTATTTAAG TGCTTTTCAT 
TTGATTGCTG CCAATTTCTT CAGCATCATC TGTAAATGTA 
AATCCCTTTG TTGAACGCAT AATGACGTAA AATGTTCAAG 
TGTTTCTTTC CATTCATACA AGCCTTCAGG GATTTCTGAA 
CATTCCTTGA TCAACGTGTT GTTGCGCTGT TGCGTAGACA 
CATGTCGTAA GCAGAAGTAT AGTAAGGGAT TGTATCATTA 
GTAAATCTTA CCAATTTTCT TCTCTTGACG TTCTTCAATC 
GCTGGCACTA GTATCATTAA TATATGAAAT TGAACCATTT 



ACTTCTTTTT ATTGTTTATA 3120 

AGACGTGCCA CTAAGCGTGG 3180 

TTTGGGCGTT TATTAGTAGG 3240 

GCTTATTTAA ACTCACAATA 3300 

GGTTCATTCG GTGGCGGCTC 3360 

AAGCTAGAAA CGTTGATCGT 3420 

ATCATCTTTC ATGGCTTCAA 3480 

GTCATGGTTG CTAGTTCCTG 3540 

TGTATCCGCA AATAAGGGAT 3600 

TAAAAAGGTT TTTACTTCTT 3660 

TTCTTCGTTT TCGTACAATT 3720 

CTCATCTTCT GGTAATTCAT 3780 

CACTGATTCA TCACGGATAA 3840 

TCCTAAATAG TAAAGAGGAG 3900 

ACTAGCAATT TTTTTCTCTA 3960 

TTTATATTGC AGGTGTTTGT 4020 

TGTATTTAAT GTACTGAAGA 4080 

GATGTTATTT AAGACGGCTT 4140 

GCCGCTTTCT GATTGAACAG 4200 

AGTTTTTTCT AAATCGGAAA 4260 

TGTATCTAAC CAAAATTGTT 4320 

AATGGCGTTC CAGTTAATTG 4380 

GGACAGCAGT CTGTTTGACT 4440 

AAATAGGAGT GTGCAGCCGT 4500 

AGAGAACATT CCTAAAAAAT 4560 

TAAATCACGC AAGACTCACA 4620 

CGAATATAGT AAATTGATTT 4680 

TCACGGGTTG TTTGTTTTGG 4740 

CGCATAAACA AGGTTAAACT 4800 

TCAATCACTT TGCGCATATC 4860 

GCTAAATAAG GCGCTGGATA 4 920 

ATACGAGTAA TTGGATGAAT 4 980 

GGCGCTACGG CTAAACGATT 5040 
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TTGATGATAA AGACCATCTT TTTTCACAGC ATCGCGTAAC GCAGCCCAAT CTTCAGCCGT 5100 

TGGAATCTGA ATGTCTTTGA AAATTTCTTT CACTTTATCA AATTTTGGTG TAAAGTTGCC 5160 

TTCAATATAT TTATCGAAGT AAGCACCAGA AGCATAATCT GATTTTTCAA AATTATGGAA 5220 

GACTTGGTTA CGTTCTTTCG CAATTTGGTT ACTTTCCATT AAAGTCCAGT AATTTAATAA 5280 

CATGAAATAA ATGTCTGTAA AGTCTAATGA TTCTTCAGAG CCGTATTCCA TATGATTTTT 5340 

AGCGAAGAAG GTGTGTAAGC CCATTGCGCC TAGTCCGATT GTATGGCTTA AACGATTTCC 5400 

ATTTTGGATA GTTGGCACGA CATCAATTTC AGAAGCATCT GTTACAAAAG TTAACGCACG 54 60 

CGTCATAGCG CGGACAGATT TTCCAAAGTC AGGACTTTCC ATTAAATTAA CAATGTTAGT 5520 

TGAACCTAAG TTACAACTAA TATCTGTTCC TAAAACTTCA TATTCTTGTT TCCCATTAAT 5580 

AACAGAAGGC- GTTTGAACTT GAAGAATTTC AGAACATAAG TTACTCATGA TAATCTTGCC 5640 

ATCAATCGGA TTGTTTTTGT TAGCTGTATC AATATTGATG ATATAAGGAT AGCCAGATTC 5700 

TTGTTGTAAT TTAGAAATTT CATTTTCTAA GTCACGGGCT TTAATTTTAC GTTTACGTAT 5760 

GTTTGGATTG GCCACTAGGT TATCATATTC TTTGGTAATA TCTACATAGG AATAAGGCAC 5820 

ACCATATTCA CGTTCCACAC TATAAGGACT AAATAAATAC ATATCTTCGT TGTTACGTGT 5880 

TAATTCGTAA AATTTATCTG GAACAATAAC ACCAAGAGAC AACGTTTTTA CACGGATTTT 5940 

TTCATCAGCG TTTTCTTTTT TAGCAGATAA GAACATTTCA ATATCTGGAT GGAAGACATT 6000 

TAGATAAACC ACACCAGCAC CTTGACGTTG ACCTAGTTGG TTAGAGTAGC TAAAGCTATC 6060 

TTCAAATAAT TTCATAACAG GGACAACACC ACTTGCTGCG CCATCATAAC CTTTGATTGG 6120 

TGCGCCCGCT TCACGTAAGT TGGA7VAGAGT AATTCCTACG CCGCCGCCGA TTCGTGATAA 6180 

TTGAAGGGCA GAATTAATTG AACGCCCAAT GCTGTTCATA TCATCTGTTA CTTGAACTAA 6240 

GAAACAAGAA ACCAATTCAC CTCGACGTTT ACGCCCAGCG TTTAAGAAAG AAGGTGTGGC 6300 

TGGTTGATAC CGTTGATGGA TCATTTCATC CGCCAATGTT AAAGCCAACT CTTCATCGCC 6360 

ATCCGCAAAA TACAAGGCAT TAAAGGCAAC CCGATCTTCA TATGTTTCTA AATACTCTGT 6420 

TCCGGCGTTG TTCTTTAAAG CGTATTGAGA ATAAAATTTA TAAGCCGCCA TGAAAGACTT 6480 

GAATTGGAAA TGTTGTTCAT CTAAGAAAGC ATATAATTTT TCAATAAAGT CCATTGAATA 6540 

TTTTTCGATG AATTCAGTTT CAATAAAATC TTCTTTCATT AAAAAGTTGA TTTTTTCTTC 6600 

AGCAGAGCTG AATTTACGTG TATTCGGTTC TACATTTTCT TTAAAGAAGG CTGCGAGTGC 6660 

TTCTTTATCT TTATTTAAAG GGATTTGACC ATCAACAGGA CGGTTAATCT CGTTATTCAG 6720 

TTTGAAATAG CTGACATCTT TAATTTCTTT TAGACTCAAT TTCTTCCACT ACTTTCTTAA 6780 

AGGTCTCCAC GTCTTCGTCT GTTCCGCTAA ACTCAAAGGA GTATAGAAGC GGAACGTGAT 6840 

AATCGCGGGC AATATCTTTG GCGGTATAGA CAAATAATTC AGCAAAATTA CGGTTGCCCC 6900 

CACCAGCTAC ACCAACTAAC AGTTCTTGGT TACTTTTATA ATCAAGAAAG TCATTCACTA 6960 

CCTCTGTAAT TTCTGCATCA TAGGTAGGAA TGACAAGAAT AAAAGGCTCA TTTATTTCAA 7020 
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AAAAAGGATT CGCTGGCTCA AGTTCATAAG CAGGCAAATC CAATTTTTTA ATAAAGCGCC 
TTGTTTGACC GGTGACTGAA AAATAAACAA GTTTCATCTT AGCTAGCCAA TTGTTTTAAT 
TGGTCTGGAC GGAAGCCAAC AACGGTTGTT GCTTCAGATG TGATTACCGG TACGCTTTGG 
AACCCTTGTT CTTTTAACCA GTCGATAGCA TCTGGTTGTG CGTCGATATT GATTTCTTCA 
AAAGCGATGT TATTTTCACT TAGGaAACGT TTCGCCATTT TACATTGGAT ACAGTTATTT 
TTTGAAAAGA TTTTTACGTT CATTGaAAGT TCCTCCTCTT TTTTTCTACA TAGACTAGTA 
TAAATCTATG TAGAAA^VAAG TCAACACTTT AACACTATAT ATTGTGGTGC GTTTTTCTCA 
TAATACTAGT TTTTGTGTTT TTGAAAGAAA AATGAAAACA CGTGAATCCT CGTTTTTAGG 
AGAAAAAACA CATTGGTCAG TGTTTGACAA ACAAAAAAAT TAAAATTTTG AAAGAGAACA 
TTTAGTGAAA AAACACTAAA TGTTGTGTTG ACATTTGAGT TAGGCACAAG AAGAAAAAAT 
AAGCGGATTT GTTATTGAAT GAGTGGGATT TTACAGTAGC TAGTAACATA AAAAACAAGT 
GCTGACTTTT TTTCAAAAAT AGTAACCTGA AAAAAGTGTC TCTATATAGA AGGAAAATAG 
AAGAAAACAG GAGATACATC CAGAAATTCA TTGACTTTCC GTGTAAGCAA GCGTATTCTT 
TTAATTGAGA ACGATTCTCA TTTTCGTTTG AAACTTTTTC ACATGGAGGA AAGTAAATGA 
TTACGTTAGC GCAGGCAGAA GTTGGGAAAG TCTATACAGT CGAAAGCGTC CAAGCAGATA 
TTCAGACAAA GAAACATCTG AACAATTTAG GTGTCGTTGC AGGCCAAGCA GTTGTCCTTG 
TCAATTATCA GAATCAAAAT GGTATTGTTT TATTACATAA TAGTCGAATT GCGCTGACAG 
ATACGATTTT ACAAGCGATT CATGTGGAGG AGCGCTCGGC GAATGAGAAA GTTTGGGTCT 
CCTTGGATAC CTTAAAAGTT GGAGAACGCG CTACAATAGT AGGGATCCAC GGACAAGGAG 
CGGTGAAACG TCGGTTUATG GACATGGGCT TAACAAAAGG AACAGTAATT TTTATTCGCA 
AAGTAGCTCC TTTAGCTGAT CCAATTGAAA TCAATGTACG "tGGcTACGAA CTAACGTTAC 
GAAAAAGTGA AGCGGaACTA ATCTTGGTTG AAAAGGAGGA AAGCGAATGA CTCAGCAAAT 
TGCACTAGCC GGCAACCCTA ATAGCGGTAA AACGACAACG TTCAATATGT TAACAGGAGC 
CAACCAATAT GTTGGAAACT GGCCAGGGGT GACTGTGGAA CGCAAAGAGG GGGTGGCAAA 
GAAAGATAAG ACACTCATTA TTCAAGATTT GCCAGGGATT TACTCTCTTT CTCCTTATAC 
GCCTGAAGAA ATTGTGGCAC GAGATTATCT TTTGGAGGAC CAACCTTCAG TGATTTTGAA 
TATTCTCGAT GTGACCAATT TGGAACGGAA TCTTTATTTA ACAACACAAT TAATTGAAAC 
TGGATTACCG GTTGTTTGTG CGTTAAATAT GATGGATTTG TTGGAAAAAA ACGGCCAAAC 
ATTGAATAGT GAAAAGTTAA GTTATGGCTT AGGCGTGCCT GTTGTTGAGA TTAGTGCTTT 
GAAGAATCGT GGGTTGGATC ATGCTTTAAA ACAAAGTAAG CAGCGAGCGA ATGCCATAGA 
GACAGAGGTT ATTTATCCTT CTTATGATAA TCGCTTAGAA GCAGCACTTG CTGAAATTGT 
TGATATTTTA GGGAATACTG TTCCTGAAAC GCAACAGCGT TGGTATAGTT taaaactttt 
TGAACGAGAT GTTCGAACCA AAGAGCAATT ATTATTGAGT TCTTTTCAAG AAAAAGAAAT 
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TGAAGAAGTC 
TAATGAACGC 
AATGACATTT 
GCCAATCTTT 
AATGGGGACA 
AAATTGGTTG 
CATTGCAGGT 
TTTAGGTTTt 
TTTCGGAAAT 
GGGGTCCCAG 
ATTATGGTCA 
GGTGCATTTT 
TCCATTGTGT 
CCATTCATTA 
GCCTATGAAC 
ATTCTCATTT 
AGTATTCTAG 
AATTGGCAAG 
ACCTTAGGTA 
GGCGCACTGC 
TTATGTGCAC 
TGGACATTAG 
TATCAATTAG 
ATGGGTGTTG 
AGTGTCCAAG 
TCCATTTTAA 
AGTTGTGAAG 
CAACTGAAAA 
TGAAAAGCCA 
CCAAATGCTC 
GAATTTTTTT 
AGAAACGATA 
TTCTAATGCT 



ATTCAAATTA 
TATGCCTTTA 
ACTCACAGTG 
GCGTTTGTGA 
GACTGGTTAA 
GCGCAATGgC 
GTTGGGGCGG 
TAGAAGATTG 
TCGGCCTATC 
GCGTGATGGC 
CGACTTTTAT 
TTCCTAACCA 
TATCAGGAAT 
TGGAACTTCC 
GATGTCGCTC 
GGTTCATGTC 
CATTTCTAGG 
GGGCAGTTGC 
TTTTATTTGG 
AACACACCTT 
CTTGTTTTGC 
GAGCCATTGG 
GCCATGTGTT 
TTTTAGCTGG 
CAATCACTTC 
TTTTTG6AAC 
ATTGTCAAAC 
AAATTCCCTC 
TTTTATACAA 
TGCTTGTGAA 
ATTGCCAACA 
CCACGAAGAT 
TCTGCTGCAC 



CTGAAAAAAT 
TTGCTCGATT 
ATAAGATTGA 
TGTGGTTGGT 
ATGATGTCTT 
AAGTAGCGCC 
TGTTAGGATT 
TGGCTATATG 
TGGTAAGTCG 
TAGTCGAACG 
GCCATGTTCA 
AAGTTGGGTC 
TGCTTTAAAA 
CGCCTATCAT 
ATTTGTAAAA 
ACATTATTCT 
TAAAGGGTTG 
CACAATTACA 
CAATGTGAAA 
TACGCCAGTC 
TGCAATTGGC 
TTATCAATGC 
AGTCGAAAAA 
ATTTTATTTC 
TTTAGAAAGA 
AGCAGGAATC 
AGCTTGTCCT 
TTGTATAAAT 
CGGGGGCATA 
AAAAGCTTTA 
TACTGGCAAC 
ATTGTTTTTT 
GCTCTTTTAC 
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CTTTCAAGAT 
AATTGCTTTG 
CCGAGTGGTA 
CTATTACTTA 
TTTTGGGGAA 
ATGGATGCAA 
TTTACCACAA 
GCCCGGATTG 
TTTATTCCAA 
ATTGAAAACG 
GCGAAGTTGC 
TCGCCCTCTG 
AAGACCAAGC 
TTGCCACAGC 
AAA6CAGGTA 
GTGACGTTTC 
GCGGTGCTAT 
GGATTGATAG 
GATGTTTCTG 
GCGGCCTATT 
GCAATTCGTC 
GGTTTGGCTT 
GGCACACTAA 
CTTATAAGAA 
GGGTAATAAC 
ATCACCTATC 
GTAAAAAAAG 
AATAGCGGAG 
GCTCAGTTGG 
TAAAACTTTA 
AAATTGCCAA 
TGTTTTTTAT 
TCCTGCAGTT 



GAAAGTGAAG 
TGTGCGACGA 
ACCAATCGGT 
TCTATTCAAA 
TGGGTTCCTC 
AGTTTAATTT 
CTGGCCGTCC 
CCTTTGkCAT 
TGTTAATTGC 
AACGCGATCG 
CAATTATCGC 
CTTATTTCTT 
ATTTTTCGGG 
TTAGAAGTGT 
CGATTATTTT 
AACCCGTTCC 
TTATCCCATT 
CGA7UVGAAAA 
AAAACGGGGT 
CGTTCTTAAC 
GTGAAATGGG 
ATATGGTCAG 
CTCTGGGGAC 
AACCAAAGCC 
GGATGGCAAC 
GACGAATCAA 
AACAATCTTC 
TGCTACAATA 
TTAGAGCATC 
TAAATAGGTT 
CGAAAAAAAT 
TCTTTAAACC 
AAATGTGTAT 



CGATTATCAT 
AAAAAACAGA 
GGTTGGcGTT 
CGGTTGGTAC 
AATTTGTGGG 
TAGATGGGAT 
TGTTTTTATG 
GGATCGCTTG 
GACAGGTTGT 
GCGAATGACT 
TCTGATTGCA 
AGGGGTTGCT 
AGATCCAGCA 
GGTTCGGCAT 
TGTTTCCAGT 
AGAAAGTCAA 
AGGCTGGGGT 
TATTATTGGT 
AGAAGTCTGG 
TTTTAACTTA 
AGACCTCAAA 
TTTTGTGATT 
TTTTTTAGCG 
AGACAAAGAA 
AATTATTTTA 
AAAAGGGGAA 
ACAaGAATAA 
TTGAGGTAGC 
TGACTCTTAA 
TTTAGTTTGT 
TCTAAAAACA 
CAAATGCGTT 
ATAAATTAGC 



9060 
9120 
9180 
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9300 
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AGTTGTCGAA 
ATCTAACATT 
ATTTTTCTTA 
ATCTTCCATA 
ACGATTATCT 
AGGCAACGTA 
ATTCTCTGGA 
AGCAGACCAT 
AACTACCAAA 
ATTATAAACT 
TGTACTTATC 
ATGAATGACC 
ATGATAAGGC 
ATTATGGACA 
TTCTAAATAA 
CATAGCTAAT 
TTTATTTCCT 
GTCTTTATAA 
TTTCAATTCA 
TAGCATGATT 
GTATTTTGTA 
AAGCTTTATT 
GATAACTTTC 
GATAATAAGT 
TGTTAGCTCA 
CACAGCCCAT 
AAAACGTATA 
TAAAAGCCCA 
AAATTGCTAA 
CTAAACATAC 
AGAAAAACTA 
AAAAAGTGTT 
CATAGGAATT 



TACGATGCAT 
ATTGAAGCAT 
ATCAACTTAC 
GTAAATACGT 
TGTTGCCATT 
CGATAGCTGG 
TTTTCTATGA 
TTTAATCCTA 
TAGAGCTTTT 
TGAGAAACAA 
7VATTCTGTAA 
GTATTAGCTG 
TTTAACCTAT 
TAATGCTTAT 
TTAAGCAAAT 
TGACTTTCCA 
TTCTTTTTTA 
GATACAATTA 
ATCTCTCCTC 
TTTACCTTCA 
CCTTGCTCCA 
TTTTCCAATA 
AAATATTTTT 
ATTGCACTTT 
GTTGGTAGAG 
TGGGTGCCAA 
TACTGTATTT 
CTGTAGTCAT 
TATAACAATA 
AGGATTAAAT 
AATTTACATG 
GAAATCAACA 
TCACCTCTCT 



GTCCCAGCCA 
TAGAATGACG 
TCAAGCGATG 
ACTCACTATC 
TAGCTTGATT 
AAGAGTTCTT 
TTACATGGTT 
AAACTTCACT 
CTTCTGACAA 
ATTGTTTCTT 
TAACAGCCTC 
TACAGTCATC 
TTAGTTTAAT 
ACCCAACATA 
AGTCATAAAA 
ATtCTAGTAA 
ATTTTGTATC 
CTTGTAATGT 
TTAAACTGAA 
TCTTGTATTT 
ATTTTTATAT 
TTTAACATTT 
TCATAATCTT 
TTTATTTTAG 
CAGATGACTC 
ACCCTTAAAT 
TTTTGTTTTA 
TGATATTTTT 
ATCCGAGAAA 
TTGAGTACTA 
ATGCTATTTG 
TATGGAAGTA 
CCAAGTTTAC 



532 

TTCTTGAATG 
CAAGTCATGA 
TGTTACATAT 
TTTTAAAATA 
TACTAATGCT 
TGTTTGATCT 
TATCATGATT 
TCTTCGCAAT 
AATATCAAGT 
TTTTGGCTTT 
TTGAAGAGCT 
ATTTAACATA 
TTTTTTAGGA 
TGTGTTAAAT 
CAATAAATCT 
AAATTCATCT 
TTTCCACTTT 
TCCTCTTTTC 
ATATCGGCAA 
TTAATATATA 
TTTCTATTGA 
TAGTTAAATC 
GAAAATCCAT 
GTGTTAGAAT 
TTAATCATCG 
ACAGTGTAAA 
AAAAAAGACT 
GATAAGAAAG 
CTTTACAAAA 
TAAAAAACTA 
TCTCACTGAA 
ATTCTTAATC 
GGAGAGCcTT 



GATTTCATAT 
AAGCGAATTT 
TGAGGCTTCA 
TAATCCTCTC 
CGTTCAATCG 
TTCTTTATTA 
TTTTTGTTTA 
CCATAAAATG 
AATTTCAAGA 
CTTACTTTAT 
TTGCGTATAA 
TATTGGTAAA 
TAGAAAAACT 
GCTACTTGTT 
GGATTAGCAC 
CTAATTTTTT 
TGTCTAATTT 
TCAATTATGT 
AAATTTTGAC 
CTCAATGACA 
TTTATCTTTC 
AGTAATTTTC 
AAAATTCACT 
AACAAAGTAT 
GGTCGCGGGT 
CGTTGGTGCT 
GTAGTACCCC 
GAACATAATT 
CTATACTGCA 
AAAAGTGGAG 
TTTGGTTTTA 
AGCTGCTTTT 
ACGGAAGTAT 



TCTGTCCACT 
TTTTCAATCC 
TCAACCGTCC 
CTAAAAGAAT 
TTTCAACCAA 
ACTGGGACAT 
AAAAATCAAT 
CTGTTAAAAG 
TCTCGTGTTG 
CTGCTACATT 
ATGCGTGATA 
AATCTTGTAA 
TGTAAATTCG 
TTTTTGATAC 
ATTCACATTC 
CtGCTCGTTT 
TTCCGTTAGA 
TTGCTCTTAA 
TATTCTATTA 
GCTTTTTTAG 
AGCAAATTAT 
AATATCGAGG 
CCTACATTCT 
TAATTTGGGC 
TCGAGCCCCT 
TAGCATTTAG 
GCTTTTTTAG 
ATGAGTAATG 
AATGAAATCT 
AACGTTTAAT 
AAAATAATAG 
GTCAAAAGCA 
CATTATCCCA 



11040 
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11340 

11400 

11460 

11520 

11580 

11640 

11700 
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12420 

12480 
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12720 

12780 

12840 

12900 

12960 
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AATTCAAATC ATCGCTTtTT ATgACTATAA AAATtaTTTT AGAATGTTtA TCGCTCCTTC 13020 

TTTACGATAA GAAATCGTGG CACAACTAAA ATATGCTCCT TGGAGTGTTA ATGTACTGAT 13080 

AGAATGTATT TAATTTTCAA TGATCAATAG GTTTTCTACA TAAAAGAATT TACTAATCTT 13140 

TTAAATATTA GAAGATTATT TTAAACAGAA AAAGTGGAAA GGGGCTTTTT TTAGTTAGTT 13200 

AATTTTAAAT TTTAAGATAG CTTCTATCAA TTTAATTTCT AGCTGTCTTA GCATATACTC 13260 

ATCTACATAG GTAATAAATT GCTCCGCCTC ATTAAATCCA TTTGTCAAAC ATAACTTAAC 13320 

TATATAACTA TGATAATGTT TGAGTATCAT CTTGATCGAA ATTGGATCTT CTTCTAGAGC 13380 

AGATTGAATA ACGGATAGCG GAACTAATGG ATACTGCCGT TTTTTTTTGT TGTCTACCAT 13440 

AGGTCATACT CCGTTTTTAA TTGTTCAACT GCCTTTTTTC GTTGATACCA GATACCACTT 13500 

GTACTCATGT TGAGAATTTT ACCGATTTCA GTATTATTAA AACCTGCAAA ATAATAAAGT 13560 

AGGATGACCT TTCTTCTTAT CTCTGATAAA TGATCAATGG CTGTTGATAA CTTTTGATCT 13620 

GAAATTAATA TTTCCATTCC AAACAAGAAA TAGACATCTT GTTCATCAAA ATGAGTTTGA 13680 

GAGTACTTGC CtTCAGTTGA ATCTGTCAAT GGTTCCTCAT TATACTTTAT GTTAGkGTTT 13740 

CGTTTTCGAA TAGACGTAAT TTCGTTTCGT AATACTTTTT TGCAGTAGGC ATCAAATTGG 13800 

TGTTGTATAG ATTTCTCATT ATAATTTTGT TGCATGATAA TATGCTCCTT TTTCCCCTTT 13860 

CCACTAATAC AACAATCAGA GAGCCCTTTT TTTTCGAAAA AACTAGAAAA AAACAAAAGA 13920 

CACGTAGTTT TTATCTACGT GTCTTCCATT TTTTTCAGAA CAACTGCAAA GAAAAGA7UVT 13980 

ACATATTATA CAAAACAAAA AAATATTCTA ATTTTTTTCA TATAGAAATA ACTCCTATTT 14040 

TAAAATTATA AATCGTAACG AATGCGATTT ATCCAATGTA ATTTTACATA CTTTATTTTA 14100 

TTTGTCAACA AAAAATGAAC GATCATAATA ACGTTTTCCA AACTTATTTC GCTAGATACT 14160 

ATTTTAATTA TTCTTCACTA TGACATCATA ATAAAGTTTT AATCAGAAAA TTTAGACGAT 14220 

ATGAAACGGA ATAATCAATT AAAAGCTAAA TATGCTTTGA GAAAAAATAA TTTGTTTTTA 14280 

GCGGAAGACA TATAGATAAA AACTATGTGT CTTTTATTTT TCTAGCTTTT TCGAAAAAAA 14340 

AGGGCTCTCT GATTGTTGTA TTAGTGGAAA . GGGGAAAAAG GAGCATATTA TCATGCAACA 14400 

7UUVTTATAAT GAGAAATCTA TACAACACCA ATTTGATGCC TACTGCAAAA AAGTATTACG 14460 

AAACGmAATT ACGTCTATTC GAAAACGAAA CmCTAGAATA AAAGAAAATG AGGAACTATT 14520 

GAACGTCTCA AATGAAGCTA AGTACCCTTA CTCATTTTGA TGAACAAGAT ATCTATTTCT 14580 

TGTTTGGAAT GGAGATATTA ATTTCAAATA AAAAGTTATC AATAGCCAAA CCAGTTATTG 14640 

GACATAAAAA ATTTTTTATT TTGCAAGTTT AATGATACTG AAATCGATAA AATTCTCAAT 14700 

ATAAGTATGA ATGGATTTAA TGAAGTGGTC AATTTATTAC CTATGTAGAA GAGTATACAT 14760 

TGAGGGAGTT ATAAATTTTC AGAATCTATA TTGAAATTTA AAGTTAAATA ATTAAAAAGT 14820 

CCCCTTTCAA TTATTTTGGT AGAAAATTTA TAGaTCATTG AAAATTAAAT ACATTCTAGC 14880 

AGTACGTTAA CACTTCAAGG AGCATATTTT AGTTACGCCA TGATCTCTTA TCGTAAAGAA 14940 
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GGAGCGATAA 
GGATAATGAT 
CTTTTGACAA 
GCATCATGTA 
GTATATTTTT 
TCtTTTTTTT 
GAGAATATAA 
TAATAAACTA 
TATAAATCTT 
ATGAATCATC 
AAGGTGTAGA 
TACAGAATGT 
AAAGAGTGAA 
CAAAAAAAAG 
TTGCCTATAA 
AAAAAACAAT 
CTAAGTGTGT 
ACGCAACCAG 
ACACCTAAAA 
ACCAAAGTAG 
AATGATACCA 
CCAGTAGTAA 
GCTGAAAATG 
GATAAAGCAG 
GAAGACTTAG 
CTACAAAAAG 
GAAAAAATTG 
TAyGArAAAG 
GCGATTAGTA 
TCGATTAAAG 
TCAAAAGATA 
CAGAATTCCT 
GAAAAAAATA 



ATATTCTAAA 
ACTTCCGTAA 
AAGCAGCTGC 
AGTTTAGTTT 
TAGTTTCTTG 
AAATCtAAaT 
AATACGGTAT 
ACATAAGAGA 
TAAGTAATAG 
AGTCATTGGA 
CAAGAATAAG 
TGTTTAGAAA 
AAGAATATAT 
GGG6AGGGGA 
AATAmAAAAA 
ATAAGACATA 
TAGGAGCCGT 
AAACAACGAC 
CGGCAGTATC 
AAGAAGTAGC 
CAAACGCGCA 
GCCCTGAAAC 
AAGTGAATAA 
TTGATGAAGT 
GCAACGTTTC 
AACAAGCGAA 
CCAAAGAAAA 
AAGTCGCkGA 
AAAACCTAGT 
GCGGAAAATT 
TCTTTACAAA 
TTGTAAAAGA 
AACCAGTGAC 



ATAATTTTTA 
GGCTCGCCGT 
TTAAGAATTA 
TTAAGCACTT 
aATTTTAtTA 
GATTATAGTT 
ATACGTTTTT 
CTATAACCTT 
AAAAGATGTG 
TAGGAGGCTT 
TAAAAATAAA 
AAAGGACTAG 
GTTAAATTAC 
GAAGGATGTA 
TTTACGAGGA 
TAAAGCTAAG 
AGGATTAGCT 
GGTTCAACCC 
TGAAGAAGCA 
GCCAGAAAAT 
ACAACCAACA 
AACCAATGAA 
ATCAACGTCC 
AAAAAAAGAT 
TTCTAAAGAT 
AAAGATTGCC 
TGCAGAAATT 
ATACAACAAG 
GTTCGATCAA 
TATCAAAGCA 
ATTACGGAAA 
GGCAAATCTT 
AGTGACCTAT 



TAGTCAATAA 
AAACTTGGGG 
GTTTTAAGAG 
AATTTTGTAT 
TATTAACtAT 
TAGTAAAAAA 
TATGCTAAGC 
TGACCCGATA 
TTGAATGAAG 
TATTTTAAAA 
AAGCTAACTA 
TAACAAAAGC 
TAATTTTATA 
TTGTGCACGT 
GAATGATTCT 
AATCACTGGG 
ACTGATAATG 
AATAACCCCG 
ACAGTACAAA 
AAAGGTACTG 
GTAGGAGCTG 
CCTCTAGGGC 
ATTCCTAAAG 
CCAAACATTA 
TTAGCTGCAA 
CAACAAGCAG 
GCGGCAAAAA 
CATAAGAACG 
TCTGTCGTGA 
ACTGATTTTA 
GATATGGGyG 
GGGTCTAATG 
ACAGGACTAA 



AAAGTGATGA 
AGAGGTGAAA 
TAAATTAAGT 
ATTTAGAGAT 
TTCATTACTC 
GAGAGGATGG 
ACCAATGTTT 
ATTATGATTA 
TATACATTCT 
GTGTTAAACG 
TCTCTCGAAA 
TACCTACATT 
GCATAAAGTA 
GATCCATGCA 
TGAAGCAACA 
TAACTGTCCC 
TACAAGCCGC 
ACCTGCAGTC 
AAGACACTAC 
AACAAAGTTC 
AAAAATCAGC 
AGCCAACAGA 
AATTTGAAAC 
CCGTTGTTGA 
AA6AA7\AA6A 
CTGAATTAAA 
ACAAAGCrGA 
AAAACAGCTA 
CGAAAGACAC 
ATAAAGTAAA 
GGAAAGyTAC 
GTGGGTATGC 
ACGCTAGTTA 



TGTGAAGTTG 
TTCCTATGTG 
AAGACAAAAA 
TTCATTTGTA 
ATAATCATGT 
AGTCCCTTTT 
ACTCTGTATT 
TAGTTATTAG 
GTAATTAACA 
ATTCATCTTT 
TTAi\ATATTC 
AGGTAGCTTA 
AAATATTTGT 
GAAAGAATCA 
AATGAAAGCA 
TATTCTTTTT 
GGAATTAGAT 
AGAAAAGGAA 
TTCTCAACCG 
AGCTACCCCA 
ACAAGAACAA 
AGTTGCACCA 
ACCAGACGTT 
AAAACCAGCA 
AGTAGACCAA 
AGCCAAAAAT 
AAAAGAGCGy 
TGTCAATGAA 
TAAAATTTCG 
TGCAGGGGAT 
TGGCAACTTC 
GGTTCTTTTA 
TTTAGGACGT 



15000 

15060 

15120 

15180 

15240 

15300 

15360 

15420 

15480 

15540 

15600 

15660 

15720 

15780 

15840 

15900 
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16020 

16080 

16140 

16200 

16260 

16320 

16380 

16440 

16500 

16560 

16620 

16680 

16740 

16800 

16860 

16920 
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AAAATTACAA AAGCAGAATT TGTTTATGAA CTACAATCCT CACCAAGCCA AAGTGGAACG 16980 

TTAAATGCAG TATTTTCAAA CGATCCGATT ATCAGrGCTT TTATTGGTAC AAACAGAGTC 17040 

AATGGTAAGG ATGTTAAAAC ACGCTTAACG ATTAAGTTCT TTGATGCGTC AGGTAAAGAA 17100 

GTACTACCAG ATAAAGATAG TCCATTTGCG TATGCGCTGT CTTCTTTAAA TTCAAGTTTA 17160 

AGGAATAAAG GTGGCCATGC GGAATTTGTT TCTGATTTTG GGGCsAACAA TGCGTTCAAA 17220 

TACATTAATG GyTCrTATGT GAAAAAACAA GCGGATGGAA AATTTTACTC ACCGGAAGAT 17280 

ATTGACTATG GCACAGGACC TTCTGGATTG AAAAATAGTG ATTGGGACGC TGTAGGTCAC 17340 

AAGAATGCCT ACTTTGGTTC AGGTGTAGGT CTAGCyAATG GrCGTATTTC CTTTTCTTTT 17400 

GGTATGAC7VA CAAAAGGAAA AAGTAATGTG CCTGTATCTA GTGCGCAATG GTTTGCCTTT 174 60 

AGyACTAACT TAAATGCGCA ATCAGTGAAG CCTATTTTCA ATTATGGGAA TCCAAAAGAA 17520 

CCAGAAAAAG CAACGATTGA ATTCAATsGA TACAAAGCCA ATGTCGTTCC TGTwCTTGTG 17580 

CCrAAtAAAG AAGTCACTGA TGGyCAGAAA AATrTCAATG ATTTAAATGT GAArCGTGGC 1764Q 

GATTCTTTAC AATACATTGT GACAGGGGAT ACGACAGAAC TTGCCAAAGT AGATCCAAAA 17700 

ACAGTAACmA AACAAGGGAT TCGAGATACm TTTGATGCAG 7VAAAAGTGAC GATTGATTTA 17760 

TCCAAAGTGA AAGTTTATCA AGCAGACGCA AGTCTrAACG ArAAAGACTy AAAAGCTGTT 17820 

GCTGCAGCrA TTAATTCAGG AArAGCTAAA GACGTGACTG CTTCTTATGA yCTcAaTTTA 17880 

GATCAAAACA CCGTCACAGC AATGATGAAA ACCAACGCrG ACGGyTCyGT TGTTTTAGCA 17940 

ATGGGGTATA AATATTTACT TGTCTTGCCG TTTGTAGTGA AAAATGTAGA AGGCGATTTT 18000 

GAAAATACAG CTGTTCAGCT GACAAAyGAT GGnGAAACGG TAACAAATAC AGTGATTAAC 18060 

CATGTGCCaG GTaGTAATCC TTCCAAAGAT GTAAAAGCAG ATAAAAACGG TACAGTTGGC 18120 

AGTGTTTCTC TACATGATAA AGATATTCCG TTACAAACAA AAATTTATTA TGAAGTGAAA 18180 

TCTTCCGAAC GTCCAGCyAA CTATGGCGGA ATyACmGAAG AATGGGGCAT GAATGATGTC 18240 

TTGGACACGA CCCATGATCG TTTCACAGGk AAATGGCACG CTATTACrAA yTATGACCTT 18300 

AAAGTAGGGG AyAAAACGTT AAAAGCAGGA ACAGATATTT CTGCCTACAT TCTTTTAGAA 18360 

AACAAAGACA ATAAAGACTT GACGTTTACr ATGAATCAAG CATTATTGGC raGCsTTAAAT 18420 

GAAGGAAGCA ATAAAGTAGG CAAACAAGCT TGGTCTGTGT ATCTGGAAGT CGAACGGATy 18480 

AAAACAGGTG ACGTAGAAAA CACGCAAACA GAAAACTACA ACAAAGAGCT TGTkCGTTCT 18540 

AATACsGTGG TGACGCATAC rCCTGATGAT CCAAAACCAA CCAAAGCCGT TCATAACAAG 18600 

AAAGGGGAAG AyATTAAyCA TGGAAAAGTk GCTCGTGGTG ATGTTCTTTC TTATGAAATG 18660 

ACnTGGGACT TAAAAGGGTA CGATAAAGAC TTTGCCTTTG ATACAGTCGA TCTTGCGACA 18720 

GGCGTTTCTT TCTTCGATGA TTACGATGAA ACGAArGTGA CACCAATCAA AGACTTACTT 18780 

CGTGTCAAAG ATTCTAAAGG GGnAGACATT ACGAACCAGT TCACGATCTC wTGGGACGAT 18840 

GCCAAAGGCA CGGTGACrAT yTCTGCCAAA GACCCACAAG CCTTTATTCT AGCGyATGGT 18900 
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GGGCAAGAAT 


TGCGTGTAAC 


rCTCCCTACA 


AAAGTCAAAG 


CcgATGTTTC 


TGGkGATGTT 


.18960 


TATAATTCAG 


CGGAACAAAA 


TACATTTGGy 


CAACGAATTA 


AAACCAATAC 


yGTTGTCAAC 


19020 


CATATTCCAA 


AAGTGAAyCC 


TAAAAAAGAC 


GTGGTTATTA 


AAGTyGGTGA 


CAAACAAAGt 


19080 


CAAAATGGyG 


CCACAATCAA 


ATTAGGGGAG 


AT^TTCTTCT 


ATGAATTTAC 


AAGTAGTGAC 


19140 


ATTCCTGCAG 


T^TACGCTGG 


wGTTGTGGAA 


GAATGGTCGA 


TTAGCGATAA 


ACTAGACGTC 


19200 


AAACATGACA 


AATTTAGTGG 


cCAATGGTCT 


GTGTTTGCCA 


ATTCTAATTT 


TGTTTTAGCA 


19260 


GACGGAACCA 


AAGTGAATAA 


AGGGGACGAC 


ATTTCGAAAC 


TATTCACGAT 


GACCTTTGAa 


19320 


CAAgGGGTAG 


TGAAAATCAC 


GGCCAGTCAA 


GCCTTTTTrG 


ATGCGATGAA 


TCTAAAAGAA 


19380 


AACAAAAACG 


TTGCACACTC 


ATGGAAAGCG 


TTCATTGGTG 


TAGAACGAAT 


TGCGGCAGGA 


19440 


GACGTTTftCA 


ACACAATCGA 


AGAATCTTTC 


AACAATGAGA 


AGATTAAAAC 


kAATACGGTA 


19500 


GTGACrCATA 


CGCCAGAAT^ 


ACCACAAACr 


CCACCAGAAA 


AAACAGTGAT 


TGTACCACCA 


19560 


ACACCAAAAA 


CACCGCAAGC 


ACCAGTAGAG 


CCATTAGTGG 


TAGAAAAGGC 


AAGTGTrGtG 


19620 


CCAGAATTGC 


CGCAAACAGG 


CGAAAAACAA 


AATGTCTTAT 


TAACGGTAGC 


TGGTAGTTTA 


19680 


GCCGCAATGC 


TTGGCTTAGC 


AGGCTTAGGC 


TTTAAACGTA 


GAAAAGAAAC 


AAAATAATTG 


19740 


AAGACGAAAA 


GAAGGACGAA 


yTGmTTGTCC 


TTCTTTTTTT 


AGTTATGGAG 


GGGAAAmATT 


19800 


GGAAGCAGTA 


GTAGtAGAAA 


GAGAGGCAAA 


GGGGATGAAA 


GAAATTGCCA 


TCCAAGAAAA 


19860 


AGATTTGACC 


TTACAGTGGA 


GAGGAAACAC 


AGGTAAGTTA 


GTTAAAGTTC 


GATTrAArAA 


19920 


TACACGTGCA 


ATGGAAATGT 


GGTACAACAA 


ACAAATTACC 


GAAGAAAACA 


TTGAAGAGAT 


19980 


CACCACGTTG 


AATATyATTA 


AAAAyGGAAA 


ATCTyTGGCA 


TTAGAAGTAT 


ATCCAGAAAA 


20040 


AAGTATCTAT 


GTGAAACCAA 


ATTTAGGCAG 


AATCAATGTG 


CCTGTCTTTT 


TTATCAAAAC 


20100 


ACCTATTAAT 


AGAGGGATTT 


TTGAAGAGAT 


CTTCGGCGAA 


ACTTTAAAAT 


CATAAATAAA 


20160 


AAATTATTTG 


GAGGAAATTA 


CAATGAAAAA 


AATTATTTTA 


TCAAGCTTGT 


TTAGTGCAGT 


20220 


ACTAGTATTC 


GGTGGCGGAA 


GTATAACAGC 


ATTCGCTGAC 


GATTTAGGAC 


CAACAGATCC 


20280 


AGCAACTCCA 


CCAATTACCG 


AACCAACTGA 


TTCTAGTGAA 


CCTACGAATC 


CTACTGAGCC 


20340 


GGTGGATCCT 


GCAGAACCGC 


CAGTAATACC 


AACTGATCCA 


ACAGAACCAA 


GCAAGCCAAC 


20400 


CGAGCCTACA 


ACACCGAGTG 


AGCCAGAAAA 


GCCAACAGAA 


CCAACAACGC 


CAATTGATCC 


20460 


TGGAACGCCG 


GTTGAACCGA 


CTGAACCAAG 


CGAGCCAACA 


GAACCTAGTC 


AACCAACCGA 


20520 


GCCTACAACA 


CCAAGCGAAC 


CAGAAAAACC 


TGTTACTCCA 


GAACAACCGA 


AAGAACCAAC 


20580 


TCAACCAGTG 


ATTCCAGAAA 


AACCAGCAGA 


ACCAGAAACA 


CCAAAAACTC 


CTGAACAGCC 


20640 


CACTAAACCA 


ATAGAC6TAG 


TCGTTACACC 


TAGTGGAGAA 


ATTGATAAAA 


CGAATCAATC 


20700 


GGCAGGAACA 


CAACCAAGTA 


TTCCTATTGA 


AACAAGCAAC 


TTAGCGGAGG 


TAACACATGT 


20760 


ACCAAGTGAA 


ACTACTCCAA 


TTACAACAGA 


AGCTGGGGAA 


GAAATTGTAG 


CAGTAGATAA 


20820 


AGGTGTTCCG 


TTAACCAAAA 


CACCAGAAGG 


ATTAAAACCA 


ATTAGCAGCT 


CGTATAAGGT 


20880 
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TTTACCTAGC 


GGAAACGTTG 


AGGTAAAAGC AAGTGATGGA 


AAAATGAAAG 


TATTGCCACA 


20940 


TACAGGAGAG 


AAATTCACAC 


TCCTTTTCTC 


TGTATTGGGA 


AGCTTCTTTG 


TATTAATTTC 


21000 


AGGATTCTTT 


TTCTTTAAAA 


AGAATAAGAA AAAAGCTTAA 


TCAGAAAGAG 


TAATTTCGAT 


21060 


AAATTTCTGT 


GAGGAAAGGA 


GGTGCTTTGC 


TTGTTTATTT 


TATTTACTAC 


mAAGCnAAGT 


21120 


CATGTATTAC 


TAAATGAATT 


AGGGTTTAAT 


AGAAAAATTG 


ACCTGGTACA 


AGGTATTGAA 


21180 


CAGTTAACAA 


AAATCAATGC 


GGATTTAAAA AAATCGACGG 


ATCGAATTCT 


AATTACGTAT 


21240 


GAGATTTATT 


ACAACGTGAC 


AGACGAGAAA 


GCTTTTTTAA 


GAGAAACAAT 


GACTTTGCCG 


21300 


GTAGTTGAGG 


AAACTGTCCT 


CCAAAACATG 


GTACAGACTG 


TAGAAACCAA 


TGAAGAAGTA 


21360 


AGACAAGAAG 


ATTTAGAAGC 


GTTTCTTTAC 


CTGTTCAAAA 


GAAATGTGAG 


TAATAAGTCG 


21420 


AAGAGAATAG 


GCCGTTCAAC 


AGGTTCTTTA 


AAAAAGAAAG 


CAGAAAAGCA 


ACAAAAATCA 


21480 


ACTGTGTTTA 


CAACAATTTC 


TCGCTATAGC 


TATTTCTCAG 


TGGCTTTACT 


TATATTTTTA 


21540 


GCAGTAGGTG 


TTTTTACAGG 


TGCGTCATTT 


GGTTCTCAAA 


GTAAACCAAC 


GAGCCAAAAT 


21600 


GATTCTAAGT 


TAGAGGAACG 


ATTAACACAT 


TTACAAGAAC 


AAGTGCATGA 


GCAACCACAA 


21660 


GTTGATGTTG 


TTGCTCGGTT 


TTTCTTATCC 


aGTCTGTACT 


CTGGTGAAGA 


GAGTGACAAA 


21720 


GCCACGCAAA 


AGCAAGTTAA 


AAAGTATGTT 


TCTGAAGGTG 


TACTCAAAGA 


GATTAAGGGG 


21780 


TCTAAAGAAC 


AAATAAGAAC 


GTTATTTCTT 


TGGGAATCAA 


AAAAAGAAGG 


CAACCAATGG 


21840 


CTCCTTACGT 


ACGTTGTTAC 


CTTGAAAGTG 


GCTAAAGAAG 


AAACTGAAAT 


TCGTCAAATG 


21900 


AAAGTAGCTA 


TGGAAAAGAA 


AAACAATAAT 


TATCAAGTAA 


CAGAACTTCC 


AGAAGAAAAA 


21960 


GAATTTACTA 


TTAATGAATA 


ATAATTGAGG 


AGGCAATTCT 


ATGAAATATG 


AACGTCCATT 


22020 


AAAAAGAGAG 


TCACAAATCA 


AGGAGTTTGA 


GTTAGGCACG 


CACGCGGCAG 


TAATTGAAAA 


22080 


AGTTCAAAAG 


AAACGGTCTC 


AAAAAGGGAA 


TGATATGTTT 


TTACTCTCTC 


TTTTGGGAAA 


22140 


AAGTAACGAA 


AAAGGGGTTT 


ATTTTCTTAC 


cTTCGGGAAT 


GATTATACTG 


AAGATAATTT 


22200 


ACGCTATATT 


TTAGCCAGTA 


TCCAAGACAA 


CGGTGTAGAG 


ATTCCAGACG 


TTGACTTTGG 


22260 


CTATAATCGA 


GAAACTTTTG 


AATTTCTGAA 


AGGTAAAGAT 


GTCTATATTC 


AGGTTGAAGA 


22320 


mCAAGAGTAT 


AAAGGGAAAG 


TTAAACaTGC 


GGTAACaAAT 


TTTTTAACTC 


aGGATGAATT 


22380 


TGAAGAAAGC 


GAAGAAATGG 


AGTTTTCAGA AAGCAATACT 


GAAGAAGATT 


GGTAAAATAA 


22440 


GCCATTTCTT 


AGAATATAAA 


AACTGAAGGA GAGACAAAAC 


TATGGATATA 


GTCACAAAAA 


22500 


TTATTACACT 


AATTGGAGGA 


ATCATTGGAC 


TTGTTTCAGC 


AGTTAGTATT 


ATGTTTGGCG 


22560 


TGAAAGAAAT 


TCGTTCAGGT 


ATGTCGAATG 


ATGATCCACG 


TACGCTTGAT 


AAAGGAATAG 


22620 


AAAAAGTAGT 


TGTTGGTGGG 


GCTGTTATTT 


TAGCAATTGG 


TGGGGTAGTC 


GCCTATGTTA 


22680 


TTACACAGGT 


AGGaGCTATT 


CGTTTTTAAA AGGAGGAAAA 


AAATGAAGCT 


ATTAGTAGAA 


22740 


ATGATAGTCA 


ACGGTCAAAC 


TGAATGGGAA GTT6TAGAAG 


AAGAAAATGC 


GCCACAGGCA 


22800 


ATTATTCAGA 


GTCGAGGAGG 


CTTTTCATTT 


GATGAGAATG 


GAGAGCTCAT 


TGTCAATGAT 


22860 
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GATGAAATAA GTTATACAGG CGTGTTTGAG GTTTGCGAGA CTAACTTATT 
GTTAAGGAAG CAGAAATCCA TAGATTTTAT CACAAAAAAT TAGAAAAGCT 
CCTCTAACGT TTGAAAACTC GCinAGAAATA GCTAACTGAG CGGAGGGGaA 
ATCCAAAAGG AACGACCGAT CAGCTATTTA AAAGTTTAGC TGAATACmAT 
aTGrAGCCAT GTCTAAAAtA GcGAAAGTCC tAGTCCCtTT AGGAATCgCT 
TTCTTTTTAT GATTGAATTG GCGAATACCC AAAAGAAATT TCAATCAGAA 
TGACAATCGA GCTATTAACT AACATTGCAT TGAAGTATGT AATCGCGTAC 
TGGGTTCTGG CTACATTATT GATGGCATTG TTTGGTTTAC AATTCAAGCA 
TTAATTCCAT TGTAACAGCG ACAGGAACAA GTGAAGCGAT CCCACAAATG 
CTTGGTGGGC AAAACCAATT GTATTTTTCT TTCAAATATT TGCATATATT 
TATCTAGTAT TATTGCAAAT ATTCTTATTT TCTTACGAGG GATACAATTG 
AA6CTCTTGC ACCGTTGTTT GTGGCGTTTT TTGTTCATGA TGAATTACGT 
TGGGGTACAT GAAACAAATC ATGGCCTATG CGCTTCAAGG CGTTCTGTTG 
TTGGGTTAAT TCCGATATTA ACAGCAAACG ATTATCTATC CTTTGAATCT 
GCATTTTGgC AGGTGCTGGA GCGATCATCA TTAATGTTAT GACGTACTTT 
TAAAATATGT GGCAGTGATT ATTCTTCTAG TTGGTTCACA AGGATTTGCC 
TAGGAGCGAT GTAAAATGGC GATTAGTTCA GAATTCTATA rAGATTTGTC 
AAAAAAATCT GGGGAATTAC AGTCAGAGAA TTTAAAGCGT ATGTTTGCTT 
GGGATATTTC TTCTGTTAGA AGTATTTTTT TTGCCTGATC TTTTATTTAT 
TTAG 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2707 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doiible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
ATTTGAAACC TGTTGTTGAA TCCATGAATA CTACTGGTTG GTAATTTGGA 
GTTTCATACG TTTTTTCGCT CCTTTGCCCT GATCCATTTG CTGTATCAGA 
CGGTATACAA CAATGCTATC TTACCATGTT TTCAGTTGGT ATGCAATTAT 
TTCTTTTAAT TTGTTTGCCA AACGAAACAT CATGGAAAAC TTCAAATAAT 
TTTGGGTCCG - TTTCAACATT TTTATGAATT GATCCGTATA TTCCAGCGAG 
TATTATTTCT CAATTTCCAC GTTTCTTCTA GTTGTTCTGG TGTCATTAGC 
TTCGCGTACT TGATTTTTTA ATATCAATTG CTGGGAAAAT ACGGCGTTCT 



GGATTTCACT 
AGGGaTAGAT 
GCnATGAGAA 
CCmACmACgA 
ATTTTAGGCA 
GACGGTGGTT 
GTCTGTATTA 
GCCAAATGGA 
GAAAAAGTTT 
GCATTAATAT 
TATATGATTA 
TCGATTGCCA 
GTTCTATTAC 
CTTGAAGGTG 
AATTTAATTT 
AAACGATTAG 
AAAAGTAGAG 
TGTTTTTATA 
GGTTTGTGCG 



nnGATATCTT 
GTTATTTTGT 
CTTTTTGGAT 
TGTTGATTAT 
TCTCCTGACA 
AACTCTTCTT 
GCTAATTCAC 



22920 
22980 
23040 
23100 
23160 
23220 
23280 
23340 
23400 
23460 
23520 
23580 
23640 
23700 
23760 
23820 
23880 
23940 
24000 
24004 



60 
120 
180 
240 
300 
360 
420 
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GAGATAAATG 


GAGCTCCATG TTGCCTGTAC CTTTGAATTC TTfATAGATG ACATCATCCA 


480 


TTCGGCTACC 


TGTATCCACC 


AATGCCGTGG 


CT/^AAATAGT 


CAGACTGCCC 


CCTTCTTCGA 


540 


TATTTCTGGc 


AGCACCAAAA 


AATCGTTTAG 


GTTTGAAAAA 


CGCAGCTGGa 


TCTATTCCGC 


600 


CACTTAAGGT 


TCGACCACTA 


GGCGGGACCA 


CTAAATTATA 


GGCCCTTGCT 


AAACGAGTGA 


660 


TGCTGTCCAT 


TAAAATAACT 


ACGTCCCGTT 


TGTCTTCAAC 


TAAGCGCATT 


GCTCGATCCA 


720 


GAACCAATTC 


AGCAACTCGC 


GTATGATTTT 


GTGGTTGTTG 


ATCAAAAGTT 


GAAGAAACCA 


780 


CATCGCCTTT 


AACGCTTCGC 


TCTAAATCAG 


TCACTTCTTC 


AGGGCGCTCG 


TCAATTAACA 


840 


ACAGAATCAA 


TTCCACATCA 


GGATGATTTT 


CTGTAATCCC 


ATTAGCAATT 


TCCTTTAAAA 


900 


CACTTGTTTT 


TCCCGCTTTT 


GGCGGCGCTA 


CAATTAAACC 


ACGTTGCCCA AAACCAATTG 


960 


GGGCAAATAC 


ATCAATCATT 


CGTGTTGATA 


ATCTGCCAGC 


CGTTGTTTCT 


AACGTTAATT 


1020 


GTTTTTCAGG 


ATACAACGGT 


GTTAGCGCTG 


GAAAATGTGG 


CCGCTCTTTC 


GCTTCTTCAG 


1080 


GATCTTTCCC 


ATTGACACTT 


TCAACATGCA 


TGAGTCCATA 


GTAACGTTCT 


GATTCCTTTG 


1140 


GCGGACGCGC 


CTTGCCAGCT ACTTTGTCCC CATTTCTCAA TCCGAAACGA CGAATTTGAg 


1200 


AAGAAGAAAT 


ATAAAtGTCT 


TCTGCACTTG 


GTCCATAATT 


AATTGGTCGC AAAAAGCCAT 


1260 


AACCATCCTG 


AGAAACAATA 


TCTAAAATTC 


CTTCCATGAA 


GAAAAACCCT 


TGCTTCTCAG 


1320 


CTTGTGCACG 


AATCACAGCT 


AAAGACAATT 


CTTTTTTGGT 


CATTTGGCAT 


AATAAGGATT 


1380 


TTAAcTCTTt 


CGCGTAgCAT AAATATCTTt 


TAACGTACTG 


tTTTCCaATt 


CAGCCATgTa 


1440 


AATAGTCGCT 


CATTGATCCA 


CCTCAAnCGG 


ACCTTCTTGT 


TCTTCGGTTT 


CTAGCATTTG 


1500 


AATATCAGCG 


CCTAAAGCAG 


TTAACTTTTC 


GATAATGTGG 


TCGTACCCAC 


GTAAAATATA 


1560 


TTCTACATTA 


TAAATAGTGG 


TCGTACCTTC 


AGCCATCAAG 


CCGGCAGTGA 


CTAGGCAAGC 


1620 


GCCTGCCCGT 


AAATCAGAGG 


CGACTACTTC 


TGCACCGTGC 


AATTGATTTG 


GGCCATTTAA 


1680 


GATAATCATA 


TTGCCTTCAA 


CCGAAGCATC 


AGCCCCCATC 


CGCACCAACT 


CAGGGATATG 


1740 


CTTATTTCGT 


TGAGCATAAA 


TCGTATCAAT 


AATTTCACCA 


GTTCCTTGTG 


CTTTTAAAAG 


1800 


TAATGGCGTT 


AACGGTTGTT 


GCAAGTCTGT 


GGCAAAACCA 


GGATACGGAT AGGTTTTCAC 


1860 


CGTCGTCATT 


TTCAGATCAT 


6TGATGGATG 


AACTTCAATC 


ATGTCTTCTT 


CAATTGTCAT 


1920 


CTTAACGCCC 


ATTTCTTGCA 


ATTTCGCAAT 


GAAGCTTTCT 


AAATGCTCAT 


AAATAACATT 


1980 


TCGAACCTTG 


ATTCCTTCAC 


CCATAGCTGC 


TGCCATAGCT 


AAATAAGTCC 


CAGCTTCAAT 


2040 


TCGATCTGGA 


ATAATCGAGT 


GACGACAGCC 


ATGTAATTCT 


TCGACTCCTT 


CAATCCGAAT 


2100 


AACATCCGTT 


CCAGCTCCTC 


GAACTTTTGC 


ACCCATGTTA 


TTTAATAATG 


TAGCAATATC 


2160 


AATAATTTCT 


GGcTCACGCG 


CAGCATTTTC 


AATAATCGTT 


TTGCCTTTGG 


CTTTAACTGC 


2220 


GGCTAACATG 


ACATTGATTG 


TAGCGCCAAT 


AGATACCATA 


TCCATAAAAA 


TTCGGGTTCC 


2280 


TTGTAAGCCA 


GCTTCGTCTG 


TTCTCAAGTA 


CATCGCCCCA 


TGTTCGTTGG 


TTACATGAGC 


2340 


GCCTAATGCT 


TCAAATCCTT 


TGATATGTAA 


ATCAATTGGA 


CGAGGGCCTA 


AGTAACTU^CC 


2400 
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GCCAGGTAAG CCGACAACGC CTTCCCCAAA TCGACCAAGA AGTGAGCCCA TAAAATAATA 2460 

GGATGCACGT AAGCTATTGA TTTTTCCTTT TGGCATTGGC ACAGAAACAA CTTCTGTTGG 2520 

GTCAATAATT AATGTATTAT TTTCAAAAGT AATTTTAGCA CCCATAATTT CTAAAATTTC 2580 

AATTAATGAG TGAACATCCT GAATGTCAGG AACGCCATCT AAAGTCACGG GCGAATCTGC 2640 

TAAAATAGCA GCTGGAATTA AAGCGACCAC ACTATTTTTG GCGCCGCTAA TGGTTACTTC 2700 

GCCTTTT 2707 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 



TCACCTTATT 


GATAGCGCTT 


CCTTATATTG 


nAATAATGCG 


ATTTTCTAAA CAAAAAATTA 


60 


TAGGTGGTAT 


TGATAATGAA 


AAATCCGTAT 


GTCCAAATAA 


CCGACCAAGA ACAATATAAA 


120 


AAGTCAATTC 


GTCGAATTCC 


TCCAAAAGAT 


CCTAATATTT 


TTTTAATTGA GAGACCTACA 


180 


GCAG6TGAGA 


TTATTGCTCT 


ATGGTGGACA 


ACCTCTTCCC 


GTGGCAAATC AACTGATAAA 


240 


TATATACCAA 


AATATTTCAA 


GCGTATCTAT 


CGAGTAGATA 


TGATTAAACA ACGTAAACTA 


300 


TTCATCAAAG 


CTAACGTTAT 


CTCCGAAGAA 


AATAATACTT 


ATGTACTGAC TGAACTAGGA 


360 


GAAATGTTAG 


TACATAAATA 


CTTTTATATT 


ATTGAAGAGC 


ATAATAATTC TTCCAATGAT 


420 


TCTTTCAATA 


CTGAACGAAG 


AAAGTTAGCT 


GATAAAATTC 


TATATTCGCT AGGCGGTGAA 


480 


ATATAATGCA 


AACGGAAGTT 


AAAGAGTTTA 


TTGCTTTTGA 


TATTGAAACA ACAGGTTTAG 


540 


ACTTCCATCA 


GGATGCTGTT 


ATTCAGATAT 


CAGCTGCAAG 


ATACGTTGAT- GGTATTGAAG 


600 


TTGATTATTT 


TGATACATTG 


GTAAATAGTG 


ACTACATTCC 


AGATGAAATT ACCAAATTAA 


660 


CAGGAATCAC 


TAATGACCAA 


GTACTAAATG 


CGCCAACTTT 


AGATGAAGTT ATGCCTTATC 


720 


TTTTTGATTA 


TTTAGGAGAT 


ACGATTCTAG 


TTGGACATAA 


TATAAAAAGT TTTGACTTTC 


780 


CTTTTTTAAA 


AGCAAAAGGA 


TATAATATTG 


CAGAAGGACA 


TGA7UVTTTAT GATACTCGCT 


840 


ATTTTGCAGC 


CACACGAAAG 


CATGGGGCTG 


TAAACAATCA 


ATTAACTACA TTGAAATTAC 


900 


TTTTTGGTAT 


CGATGCGATT 


TCACATAATG 


CATTGAATGA 


TGTTAGGATA TCTGCAATTG 


960 


TATTTATGGA 


ATTATTAAAG 


ATAGAACCTC 


AAGAAAAAAG 


CTCTACTCTT AAGAAAAGTA 


1020 


GCTTGGACTC 


TTTTGAATTA 


GATGATGATA 


CTACACCTTT 


TTTTGAAGGA ATGTCTTTTG 


1080 


TAGTAACGGG 


GGCATTTGAT 


GATTCTAAAT 


ATAACCGTAA 


ACAAATTGAA ACACTGATAA 


1140 


AACAACATGG 


CGGTAGGGTA 


TCGTCAAGTT 


TATCTGCCAA 


GACTGATTAC TTTATCCAAG 


1200 


GTATTCAAAT 


TTCTTCCCAA 


CTTAAAGACG 


GAAAGCACAG 


CAGTAAAGAA CTAAAATATA 


1260 


TAGAGTTACG 


TGAAAAAGGG 


GTTGATATTT 


ATAAATTTAA 


TGGTAATCAG TTTTATGAAC 


1320 
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TAATTACAAA CTATAGAAAA TTGAGGAAAA ATTAATGACA GAACTAAAGA AAATAACTTC 


1380 


AAAAAATbCsA 




TCTTTAAAGA 


TATACAACAT 


AAACAACGAA ATGAGTTAGG 


1440 


CATAGCCTCT 


/»»/^ 7» /^IV TV nt/**iv ^ 

GGACAATCAG 


TACTGATTTC 


AATTTTTGAA 


CCGGATCCAT 


TCTATTTTAC 


1500 


AGAAGATATT 


CAACTTATTC 


ATTATAAAGA 


TATCGTTAAA ACCGAATCCG 


TTTCTTCAAA 


1560 


AGAAAATGAA 


TTTTTTGAAT 


ATGTAAAACA AAATTCACCA 


GTTGCTTATA 


AAAAGAAAAT 


1620 


CTCGGATAGT 


TATAGTATCT 


CTGAGTATAT 


TAATTATGCT 


CCACAACTTA AATTTAAGTA 


1680 


TATTATAAAA 


GATAGTATAC 


TAACTATTAC 


GGGGGAAATT 


7\AAGACAGAA 


ATGTTAACTT 


1740 


TCCAATTTCT 


CAGCCTTACG 


TATATGTCGA 


TAAACCAGAA 


AATGAAATTC 


CGTTAGATAA 


1800 


TAATTTAAAA 


TTCCAATATG 


TAGTCAACGA 


TTTCGATATA 


CATGCTCCCG 


AATACAAAAT 


1860 


ATCGTTTATT 


TTAGGTACTT 


CAAAAGCCGA 


GTATGGTCGT 


TCTTTCAATA 


TGAAAAATTT 


1920 


CGAAGGTGTT 


TTACGCACCC 


CATCTGAAAT 


TAGTTCTATG 


AAAGCAAAAA 


AAAAGAAATA 


1980 


GTTACTTAAA 


TAAAGCAAAA 


AAGTTCTACG 


CCTAAAATTC 


GAATTTAGAC 


GTAGAACTTA 


2040 


TTTATTTCAA 


TGTCCGAACC 


AAATAAACTT 


CTTCACAAAA GCCTTTAAGT 


CCATTATAAA 


2100 


TCCGTTGTGC 


TCGTGTTTTC 


TTTTCACATA AAGCAAAGAC 


TGTCGGTCCA 


CTGCCGCTCA 


2160 


TCAACGCTGC 


ATCAGCGCCG 


TACTTCAGCA 


TGCGATCCTT 


GATTTGTTGC 


ACGATTGGAT 


2220 


GTCTGGCAAT 


CGTGACACTT 


TCCAACGCAT 


TGCCAACCGT 


TTGGGTCATC 


CCAGTATAAT 


2280 


CCCCATTTTC 


AATAGCTATT 


CTTAGTCCAG 


CAATATCTGG 


ATGGTGCAAT 


TCATCTACCG 


2340 


CTAAATCATT 


AAAAAC6GTA 


CTAGTGGATA 


CACTAATTCT 


TGGTTTAACT 




2400 


AACATTGAGG 


CATGGTCGGT 


AATGCTTCAA 


TTTTCTCCCC 


ACGTCCGTTT 


GCAAAGGCTG 


2460 


TTCCGCCCCT 


TAAACAATAA GGAACATCCA 


TACCAATTTG 


GCTGCCAATT 


rnf^t\ TV ^ TV O ffP 

TCACAGAGTT 


2520 


CGTCCATCGT 


CAACCCCAGA 


TTCCACAGTT 


TGTTCAAGCC 


TCGTAATGCT 


GClaksCACACaJ. 


2580 


CACTGCTCCC 


CCCTGCTAAA 


CCAGCTGCCA 


CGGGGATACG 


TTTTTCAATA 


XAiAl 1 1 lAA 


2640 


TTCCTTTATG 


AATATTATAG 


GTACGCTTTA 


ATAAGTCAAC 


TGCTTGATAG 


ACATGATTCC 


2700 


GTCGATCCAC 


GGGTAAAAAA 


GAGCTGTCCG 


TTTCAATACG 


AATGATGTCT 


TCTTCTAGAT 


2760 


'l*'M'l>/*^ TV TV 7\ 7\ / 'III 

TTTCAAAAGT 


TAAATGATCG 


GCTAAATCTA 


CACTTGCAAA 


AATAGATTCA 


ACTTCATGAT 


2820 




ACGTTTATGC 


AGAACATCTA 


ACCCAAGATT 


AATTTTCGCA 


GGAGCTCGTT 


2880 




CATCGGTTCC 


CTGATTCACC 


TTCCTTAATC 


TTTTGTCTAC 


CAACATTTTA 


2940 


ATCTTTTAGA 


TTCTATCATA 


TGTCCATGAA 


GAAGTACACC 


GAAGATTGCT 


CTTTCTCTGA 


3000 


CGACAGATGG 


CTGAAATTAA 


AAAGAGCTTA 


GGAACGAAGG 


GCTCTTCCTT 


CACTCCTAAG 


3060 


CGGACTTTCT 


AAATTATACG 


GCTTGACCGA 


TGAATTCGAT 


TTCAACGGTG 


CGTGTTAAAA 


3120 


CATCTGAATA 


GCTATAAGAT 


ACACGTTCAA ATGAGTTCTC 


GTCTGGATCC 


AAGTCAACAA 


3180 


CAAAAACAGA 


TGGATAAGTC 


TCCGTTAAGA 


TTCCTTTACG 


TTCTGTTTGG 


CGCTTTCTTC 


3240 


CAGTCTGAGC AACTAACATA ATTTCACTGC 


CGATACGGCC 


TTCCAAATCT 


TTCTTAATTG 


3300 
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ACGATAACGT 


TGTTGGCATT 


GCATTCACCT 


CGAGAAACAT 


TATATCACTT 


TTTTTAAAAA 


3360 


ATTTCAAGTT 


TATCATACTA 


GAAGTCTGTT 


TGCAACTACT 


TTTGCTCGAC 


AAACCGGCTG 


3420 


TTTTGTCGAA TTTTTATTCA 


TTTTCTAATG 


TATATTTCAT 


TTTTGCTTTG 


GCACGATCAT 


3480 


AGGCACTTCG 


GACAGCTGAA AAAGTTCCCC 


CACGTACTTT 


AGCAATTTCT 


TCTAACGTGT 


3540 


AGCCTTTTAA GTATGCGTTT 


AAAATTTGTT 


GTTCCAACGA 


GGAAAGACGT 


GTCGGAAAAG 


3600 


CTGCCAATTT 


TTCTTGAATC 


ATTAATTGAC 


CTAATGGGTC . 


GGTAGCCGCT 


GTCCCTAAAT 


3660 


AGTCCACTAA 


CTCTTCACCA 


GCGCTTTCTA 


ATCGTTGTTC 


CAGAGAAATA 


GACAAGGTAT 


3720 


CTGTTCGCCG 


CTTATAGGCG 


CATTGATGGC 


GAATTAAGCT 


ATGTACGTGA 


TTTTCAAAAT 


3780 


TTCGTTTAAA 


AAACTTTCCT 


AATGATGCTT 


GTTTTTCTGA ATCAAATCGT 


TCTAACGTCC 


3840 


GAAAGAAGAC 


AATTCGCCCT 


TCTTGAAGCC 


AGTCCTCAGT 


ATCATAATCT 


TTTAAATAGT 


3900 


ATTTTTTCAT 


GCACTTATAA 


ATCACTGGCT 


GATATTGCCC 


ATATAAGGCT 


TGAAATTGTT 


3960 


CTTGGTCTCC 


TGCTAATGCT 


TGTTGAATCT 


GTTCTAACAT 


AAAAAGCCCT 


CCTTTTCGAT 


4020 


AAACGTATCG 


AATGATTTAG 


GAGAGCATAG 


CATGTTAGAA 


ATTTATGACT 


TAAGCGAACA 


4080 


TTTGTTTAAT 


CTTTTTTTTG 


AGACAATTCT 


TCTAGCTTCT 


CAGAAAGTTT 


TGATAACTGC 


4140 


TCCACGTTCC 


ATGGAGAATT 


TCGACGAAAT 


TGCtGGAAAT 


GGATATCCGT 


TGCATGAATG 


4200 


GCAATCGTCT 


TTTCTGTTTT 


CTGGACAGCT 


TTGTAAAGTT 


CATTGGCCGA AGTCCGCAAT 


4260 


GCACCTTGTG AAAAGACCAC 


CCATTGTTCA GCCAAATCAC 


TCGTTGCGAC 


TGTTACTTGC 


4320 


GTTAAACGAT 


CATTTTTTTC 


ACCAGCTAGA 


CGTTCAATAT 


AACTATCTGC 


TGTTTCATCT 


4380 


TCTTTGGTGA AAACCACGGT 


CAAAAGGTAC 


TTTTTGTACG 


TTTGCTGAAT 


ACCTGGGACT 


4440 


AATTGGGCAT 


CAAAGACCAC 


AATGATTTCC 


AACCCTTCAT 


ATTTGGCGTA ATTGGACAGA 


4500 


CGACTAAGAA 


GCAGTTCACG 


CGCATCCTCT 


AAACGCTCTT 


GCTTTTTCAA 


CTGGACCAAT 


4560 


TCTGGCCAAG 


CACCAATCAT 


ATTATAGCCA 


TCAACGATTA ACAATTGTTT 


TTTCATACAA 


4620 


TCGCCTTCTT 


CTATAATGGG 


TTGCGGCCAC 


GATAGACTTC 


ATACATTAGC 


AAGCCTGCAG 


4680 


CCACACCAGC 


GTTTAAGCTT 


TGAACGTGTC 


CAGTCATTGG 


AATCGTTAAT 


AATTCATCCA 


4740 


CTTCTTTATG 


AAGACCTTGA 


CTCATGcCAC 


GACCTTCATT 


TCCGATGATT 


AAGGCAATTG 


4800 


CGCCTTGTGT ATTCCAGCGA 


CGATAATCCG 


TCCCAGACAT 


ATCTGTACCA AAAATCCAAA 


4860 


ATTGATTCTC 


TTTTAGCGTT 


GCAATACTTT 


GCGCTAGATT 


CGTTACACGA GCCACTGGAA 


4920 


TATGTTCCAC 


GGCCCCTGTT 


GACGCTTTGG 


TTACGGTTGG 


TGTAATTCCC 


ACTGCACGAT 


4980 


GTTTCGGAAT 


GATAATACCG 


TCCACGCCTG 


TCGCATCAGC 


TGTTCGCAAA ATTGAGCCAA 


5040 


AGTTATGTGG 


GTCCTCTAGA 


CTGTCTAAAA 


TTAGGAAAAA 


TGGTGTTGCT 


GTCTTTTCTT 


5100 


TTGTTTGTTC 


CAATAATTCT 


TCTAAAGTTA AATATTGATA CGCCGTGATT 


GCTAAAACCA 


5160 


TGCCTTGATG 


CACGCCGTGA 


TCACTCATCG 


TATCTAATTT 


TGCTTTAGGC ACCCATTTAA 


5220 


CTGGAACAGC 


CTGTTCTTTC 


GCTGCTTGTT 


TTAATTGCTC AATTTTCTCG CCACGAGCAT 


5280 
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CTTCTTGTAA AAACAATTTA TTCCCGCGTC 


CTTGCTGCAA 


CGCCTCAACA 


GTGGCATGGA 


5340 


AACCAAAGAC AAAGTTATCT GCTAGCTCTT 


CTGAGGTTCC 


ATCAGCACCT 


GCTGTCGCTG 


5400 


GTCGTGGGTT 


CCGTTCTTTT GAACGCTTAT 


CTGAACGTTT 


CTTACCTTTT 


TTAAAAGGAC 


5460 


GAGATTTTTC ATTTTTCATT GGTTTCACCT 


ACTTTCTTGA 


TACACCAGTT 


AATTAATTCT 


5520 


TCTAAACGTT 


CTTTTTGTTG TGTTAAATGT 


AAATAACCCA 


TCAAGGATTC 


GAAACCGGTC 


5580 


GCTACCCGAT AGGTCGTCAC ATCAGCATTT 


TTGGCAAAGG 


TGTGACTTTT 


AGCATTCCGA 


5640 


CCGCGTTTGT 


ACATCACTTC TTCTTCCTCC 


GTTAGCATTT 


CTCCCGCTAA 


CATCGCTTGC 


5700 


ATTAAGCTTG 


CTTGGGCTTT TGCTGACACG 


TAATGTGTCG 


CCATGCGATG 


CAAAACATTC 


5760 


GGTTTCGTTT 


GTCCCAAGGA AACTAAATAA 


TCACGAATGT 


AAATTTCATA 


AATCGCATCG 


5820 


CCAACATAGG 


CCAAAGCTAA ACCGTTTAAT 


TGTTTGTAAT 


CTCTCATTCA 


CTTCTTCTCC 


5880 


ATCTCGTTCC 


TTGCGGAGTA TCTTCTAAAA 


CAATGCCTTG 


TTCTTTTAAT 


AAATCGCGGA 


5940 


TTTCATCACT 


TCGGGCAAAA TCACGGTCTT 


TCCGTGCTTG 


GTTCCGTTCT 


TCAATTAACG 


6000 


CATCGACTTG 


TGCATCCACT AATTCTTCGT 


TTTTAAAGAA 


TAAACCAAAA 


ATAGCTAACC 


6060 


AGCCACTGAA AAGCTTATCC ATCGCCACTA 


GAACTGGTTC 


CGAAACTGTC 


GCTTGTTCAC 


6120 


TGTACTGATT 


CATCATCTTC GCTAATTCAT 


AAACGACCGT 


AATCCCGTTA 


GCTGCATTGA 


6180 


AATCATCGTC 


CATTTCCGTA ATAAAACGTT 


GTTCTAATTC 


GTTTAATTGA 


GCCAAATAGT 


6240 


GTTCATCCTC 


TGGCAAACTT GCGACCGCAT 


CTGCTTGTCT 


AAAACGTAAA 


TTTTCAAACG 


6300 


CATTTTTCAA 


TTTTTGATAA TTGGCTGCTG 


CCTCTTTTAG 


AGTGCTTTCG 


CTATAACGAA 


6360 


TCGGGCGACG 


GTACTGGGTG GTGGACATAA 


AGAAACGTAA 


AATTTGCGGA 


TCCACTTTTT 


6420 


GAATCATTTC 


ATGAACAGTA ATGAAGTTAC 


CTAGGGATTT 


ACTCATTTTT 


TCATCGTCTT 


6480 


CACCAATTGT 


GACATAGCCG TTATGCATCC 


AGTAATTGGC 


AAAGGTATGG 


CCTGTTTTGG 


6540 


cTTCGCTCTG 


TGCAATTTCA TTCTCATGAT 


GTGGAAATTC 


TAAATCTTGT 


CCACCACCGT 


6600 


GAATATCAAT 


CGTCTCTTCC AAGTGTTTTG 


TCGCCATGAC 


AGAGCATTCA 


ATATGCCAGC 


6660 


CGGGACGTCC 


TTTCCCCCAA GGAGAATCCC 


AAGAAATTTC 


ATCTTCTTTG 


GCACTTTTCC 


6720 


AAAGAGCAAA ATCCAATGGG TCTTCTTTTA 


ATTGTTGTTC 


AACGCCAGTT 


CGTTGACTTG 


6780 


CCCCCACTTC 


TAATTCATCA ATCGATTGAT 


GGCTTAATTT 


ACCATAATTA 


GGGAATTTAC 


6840 


GGGTGCGATA 


ATACACATCA CCTGCGACCT 


CATAAGCAAA 


ACCTTTTTCA 


ATTAAAACTT 


6900 


CGATGAAAGC 


TAAGATATCA GGCATGTGGT 


CCATCACCCG 


AGGATGTAAC 


GTTGCTGGTT 


6960 


GGACATTCAA 


TGCTTGCGT6 TCTTCTTCAA 


AGGCTTTGAT 


AAAACGCTCC 


GCCACTTCCG 


7020 


GCGCAGTAAT 


TTTCAATTCT TTAGCTGCCT 


TGATAATTTT 


ATCATCGACA 


TCTGTAAAGT 


7080 


TCGAGACATA ATTCACTTCA TACCCACGAT 


ATTCAAAATA 


ACGACGAATC 


GTATCAAAGG 


7140 


CGATCGCACT 


GCGCGCATTA CCGATATGGA 


TATAGTTATA 


CACGGTTGGT 


CCGCAGACAT 


7200 


ACATCCGAAC 


TTTACGCGCC TCAATTGGCG 


TAAATACTTC 


TTTTTCTCTG 


GTCAATGTAT 


7260 
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TATAAATTTT 


AATCATGCCC 


TTTTTCCACC 


TTTCTTCCAT 


TTAGACGAAC 


GACTTTCGCA 


7320 


GGAACACCCA CAGCTGTGGC ATCTGCTGGG ATGTCTGAAA TAACAACAGC ACCCGCACCA 


7380 


ATTTTTGAAC 


GTTCACCAAT 


CACCACTGGC 


cct;u\aattt 


GAGCTCGTGC 


CGAAATCATC 


7440 


GCTCCTTTTT 


TCACTGTTGG 


ATGGCGTTTA 


CCCGTATCAC 


GACCGGTCCC 


ACGTAACGTC 


7500 


ACTCCGTGAA ACAAAACAAC ATCTTCTTCT ATTTCCGCGG TTTCCCCAAT CACAATGCCC 


7560 


ATGCCGTGAT 


CGATAAAAAC 


TCCCGCTCCA ATTGTGGCTC 


CGGGATGAAT 


MtfTt^^m ih ^^^^^^^^^^ 

TTCTACGCCC 


7620 


GTTATGAAAC 


GCCAAAATTG 


TGCATGAATC 


TTCGCCAACA 


AAAACAACTG 


ATGCCTATAT 


7680 


AAGAAATGAG 


ACAAGCGATG 


CCAAAACAGG 


GCGTGCAAGC 


CAGGATACGT 


TAAAAGAACT 


7740 


TCTAGTGTGG 


TCCGCGCTGC 


AGGATCGTTT 


CTTTTAACAG 


CTGCGACGCT 


ACGTTTCAAC 


7800 


CAACTCATAT 


GGTGAACTCC 


TTTCTATTAT 


ACAAAAATAA 


AAAACGTCTT 


TTAAGCAACA 


7860 


AAAAAACGTT 


GCTTAAAAGA 


CnCCCaTGCG 


TGGTTCCACT 


TTTCTTCACA 


AGAACTAGCT 


7920 


CTTGCCTCGA 


TACAGTTAAC 


GCCTGCCGAC 


GTCTTCGGTT 


ACTTTTTTCA 


CCAAAGCCAC 


7980 


TCATAGATGC 


ATTTCGACAA 


AAGATGCTCG 


CTGTTCTCAC 


CACCCACAGC 


GTCTCTGAAA 


8040 


TCCATCTGAT 


TGTCTACTTC 


TTCTATTCTT 


TGTGTTTTTA 


TAACACTTTA 


TTTAAATGAT 


8100 


CTAATGCTTT 


TTCTTTGCCT 


AATAATTCGA 


TGGTTTCGCC 


TAATTCAGGG 


CCATGCATTT 


8160 


GTCCAGAAAC 


AGCAATTCGA ATTGGCATAA ACAAGTTTTT 


GCCTTTAACG 


CCGGTTTCTT 


8220 


TTTGGACGGC 


TTTAATCGCT 


GCTTTAATAC 


TTGGGACATC 


GAAAACGTCC 


AAGGCTTCTA 


8280 


ATTGTTCTTT 


AAACGCATGT 


AACACAGTTG 


GCACTGTTTC 


TCCTTGCAAG 


AATTCTTTCG 


8340 


CAGCATCATC 


TAAAACTGGA 


TGTTCATTAA AGAACAAGTT 


TGATAGTTCC 


ACAATTTCAG 


8400 


CTGCATAACT 


CATTTGTGGT 


TGATAC7UVGC 


TAACTAATTG 


i|ii|ii|if|'i 9i TV f*f* IV rn 

TTTT AACCA 1 


i L>(3 A i i 1 1 1 i 


8460 


CAGGAGATGG 


ATTTTCTTCA 


ACACGACCGT 


CAGCGACTAA 


TV rp TV TV IV I\ T J\ 


LiAv^AlAll^l u 


8520 


TTAGTTCACT 


TAAGTCCATT 


TGTTTCATGT 


ATTGATTGTT 




/vil 1X11 Iv^o 


8580 


CATCAAAGGC 


CGCTGGTGAC 


TTGCTTAAGC 


GTTGTTCATC 


AAAUAAi 1 1 A 


M 1 VM^nrlO X v« X X 


8640 


CTTGTGAGAA 


GATTTCTTCT 


TCGCCAACAG 


GAGACCATCC 


TAGTAAAGCA 


ATAAAGTTGA 


8700 


ACATGGCTTC 


TGGTAAGTAA 


CCTAATTCAC 


GGTATTGTTC 


AATAAATTGT 


AAAATTGTTT 


8760 


CATCACGTTT 


GCTCAATTTT 


TTGCCAGTTT 


CTGAATTAAT 


GATTAAGGTC 


ATATGGCCAA 


8820 


AGACAGGTGG 


TGTCCATTCA AACGCTTCGT 


AAATCATTAA 


TTGTTTTGGT 


GTGTTCGCAA 


8880 


TGTGATCATC 


GCCACGTAAA ACATGAGAAA 


TTTTCATTAA 


GTGATCATCT 


ACTGCAACGG 


8940 


CAAAGTTATA 


CGTTGGCATC 


CCATCGCGTT 


TTTGGATGAC 


AAAATCGCCA 


CCAATGTTAT 


9000 


CTGACTCAAA AGTAATTTCA 


CCTTTCACGA 


TGTCATCAAA 


TTTGTATTCT 


GTATTACGTG 


9060 


GCACACGGAA ACGAATCACT 


GGTTCTAGAC 


CAGCCGCTTC 


TTTGGCTGCT 


TGTTCAGAAG 


9120 


GTGTTAAATT 


CGCACAAGTG 


CCTGCGTAGT 


GAGGCATTTC 


CCCACGAGCT 


TGTTGTGCTT 


9180 


CACGTTCTGC 


TTCTAATTCT 


TCTGGCGTAC 


AATAACACTT 


ATACGCACGA 


TTGCTACTTA 


9240 
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ATAATTGATC AATCAATGGT TGGTAAATTT CTTTACGTTC TGATTGACGA TATGGACCAT 9300 

ATTCTCCCGG ATGAGCAGGT GATTCATCCC ATTCCATCCC TAACCATGCT AAGTTTTCCA 9360 

ACTGACTTTT CTCGCCGTCT TCGATGTTCC GTTTTTGGTC GGTATCCTCG ATACGAATGA 9420 

TAAATTCTCC GTCATTGTGG CGGGCAAATA AGTAGTTAAA TAATGCTGTT CTTGCATTCC 9480 

CAATGTGCAA GTGTCCAGTT GGACTTGGTG CGT7\ACGCAC ACGAACTTTC GTCATGTTCA 9540 

TTCCTCTTTC CATTTTTTAA AAACAGACGG TTTAACGTCG TTTTCAGTTA TTTTTCCAAT 9600 

AAAGCAACGG CAATTGCGCC GATTCCTTCT TTTTTACCAA CAAAGCCCAT CGTTTCCATG 9660 

GTCGTTGCCT TAATGTTCAC TTGTGTGACC GCTAAGTGGC AACTCGCGGC TAGATTCTTT 9720 

TTCATCTCTG CCAAGTATGG GCTCATCTTT GGCTCTTCCG CTAAAATTGT ACAATCAATA 9780 

TTGCCAATAG TAAAGCCCGA CCGCCCAACT TTTTCATTTA CTTTTTTTAA TAAGTTTACA 9840 

GAATTCGCAT TTTTGAACTG CGGATCGGTT TCGGGAAACA GTTGCCCAAT ATCACCCAGA 9900 

CCAGCGGCAC CAAGGATTGC ATCAATAATT GCATGTGTTA AAACATCGGC ATCTGAGTGA 9960 

CCTAACAGAC CTTTTTCATA AGGAAGAGTA ACCCCACCAA TAATTAAAGG TCGTTGCGGA 10020 

ACTAATTGAT GAACATCAAA ACCTTGACCG ATTCGAATCA TACTAAGTCC TCACTTTGCT 10080 

TAATTTTTAT CGTTTTTAGG ATGGTTATCC TTGCGATCAT TAATCCCGCG ACCCGCGTGC 10140 

GCCGGTTTGG CAAAAATCAT ACGCCCAGCT GCGGTTTGTA AAGCACTTGT GACGACCACT 10200 

TCGATTTGCT CATTCATGTA ATGCTGGCCA TCTTCTACCA CAACCATTGT GCCATCATCT 10260 

AAGTAAGCGA CCCCTTGTTG ACGCTCTGTT CCGGCTTTTA CT^CCATCAC ATTCATCGTT 10320 

TCACCTGGAA TCACCACTGG TTTCACAGCA TTAGCTAAAG CATTAATGTT TAAAACAGGC 10380 

ACATTTTGAA ATTCAGAGAC TTTATTTAAG TTGTAGTCAT TGGTCACAAC CACGCCATCT 10440 

AATAATTTTG CTAATTTGAT TAATTTACTG TCTACTTCAG AGATATCCTC GAAATCTCCG 10500 

TCATACATCT CAACGGAGAT ACCTTCTTCT TTTTGTAAAG CATTTCAACG CTGGTTCCCT 10560 

GCCCTTTAAA ACTAATAATT TnAACTGTAC CGCCCATGCC GACAACACGT CTCGGAATAT 10620 
TAATTAAGCC ATAACT 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6531 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOUX3Y: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
GA/mGGGTCT ■ TTTTTCGAGT AGACGTTTGA TnTTAGCGTA TCCAAGAAAG GAAAACCCtA 
ACATAAAaAT TTTTTATTTA CGAaGGAGAC CGATTTATAT GGAAAAGAAA GAATTTCACA 
TTGTAGCAGA AACAGGAATT CACGCACGTC CAGCTACTTT ATTAGTACAA ACTGCAAGCA 



60 
120 
180 
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AATTTAACTC AGATATTAAC 


TTAGAATACA AAGGTAAATC 


TGTTAACTTA AAATCAATCA 


240 


TGGGCGTTAT. 


GTCT.TTAGGC 


GTTGGTCAAG GTTCTGACGT 


AACAATCACT 


GTTGATGGTG 


300 


CTGACGAAGC 


TGAAGGAATG 


GCAGCAATCG TTGAAACATT 


ACAAAAAGAA 


GGATTGGCTG 


360 


AATAATGTCT 


GAGATGCTAA 


AAGGGATTGC CGCAAGTGAT 


GGCGTTGCTG 


TTGCTAAAGC 


420 


TTACCTGCTA GTTCAACCTG 


ACTTGTCTTT CAACAAAACT 


TCCGTAGAAG 


ATACTGATGC 


480 


AGAAGCAACT 


CGCTTAGACG 


ATGCTTTAGC AAAATCTACT 


GAAGAGTTGC 


AAGCAATTCG 


540 


CGACAAAGCA 


GCTCAAAGCC 


TTGGTGAAGC AGAAGCGCAA 


GTTTTCGATG 


CTCATTTAAT 


600 


GGTTTTATCT 


GACCCAGAAA 


TGGTTGGTCA AATCAAACAA 


AATATCCAAG 


ACAACAAAGT 


660 


CAATGCAGAA 


GCTGCCTTGA 


AAGAAGTTAC GGATATGTAT 


ATTGGTATGT 


TTGAAGGAAT 


720 


GGaCGACAAT 


GCTTACATGC 


aAGAGCGTGC aGCAGATATT 


CGTGACGTTG 


CCAAACGTAT 


780 


CTTAGCACAT 


TTATTAGGTG 


TGACTCTTCC AAATCCTTCA 


ATGATTAACG 


AAGAAGTAAT 


840 


CGTGGTTGCC 


CATGACTTAA 


CACCTAGTGA TACTGCACAA 


TTAGACCGTA 


CGTATGTTAA 


900 


AGCCTTTGTG 


ACAGACATCG 


GCGGACGTAC TTCTCACTCA 


GCCATCATGG 


CTCGTTCGTT 


960 


AGAAATCCCT 


GCAATCGTTG 


GTACAAAAGA AATTACTGAC 


AAAGTAAAAG 


CAGGCGATAT 


1020 


TTTAGCAGTG 


AACGGAATCA 


TTGGGGATGT TATTATTGAC 


CCAACAGATG 


CTGAAAAATC 


1080 


TGAATTTGAA 


GCAGAAGCAA 


AAGCTTATGC AGATCAAAAA 


GCAGAATGGG 


ATAAACTAAA 


1140 


AAATGCAGAA 


ACAGTGACAG 


CTGACGGCAA ACATGTTGAG 


TTAGCTGCAA 


ACATTGGTAC 


1200 


ACCAAAAGAT 


TTAGAAGGCG 


TACACAAAAA CGGCGGCGAA 


CTGTTGGTTT 


ATATCGTACA 


1260 


GAATTCTTAT 


ACATGGATTC 


ATCTGACTTT CCAACTGAAG 


AGGACCAATA 


CCAAGCGTAT 


1320 


AAAGCAGTTC 


TTGAAGGAAT 


6GAAGGAAAA CCAGTTGTGG 


TTCGTACAAT 


GGATATTGGT 


1380 


GGGGACAAAG 


AGCTTCCTTA 


CCTAACATTA CCACACGAAA 


TGAACCCATT 


CTTAGGTTAC 


1440 


CGTGCATTAC 


GTATTAGCTT 


ATCAGAACTA GGTGACGGCA 


TGTTCCGTAC 


ACAAATGCGT 


1500 


GCATTATTAC 


GTGCGTCTGT 


TCATGGTAAC TTACGTATCA 


TGTTCCCAAT 


GGTTGCTACT 


1560 


TTGAAAGAAT 


TTAGAGCAGC 


GAAAGCAATC TTTGAAGACG 


AAAAACAAAA 


ATTAGTCAAC 


1620 


GAAGGTGTTG 


AAGTTTCAAA 


CGATATCCAA GTAGGTATTA 


TGATTGAAAT 


TCCAGCAGCT 


1680 


GCC6TATTAG 


CTGATAAATT 


TGCCAAAGAA GTTGACTTCT 


TTAGTGTTGG 


AACTAACGAC 


1740 


TTAATCCAAT 


ACACAATGGC 


GGCAGACCGT ATGAACGAAC 


GCGTTTCTTA 


CTTATACCAA 


1800 


CCATACAACC 


CATCAATCTT 


ACGTTTAATT AAAAATGTAA 


TTGATGCAGC 


ACACGCTGAA 


1860 


GGTAAATGGG 


CTGGTATGTG 


TGGTGAAATG GCGGGCGATC 


AAACAGCTGT 


TCCATTACTA 


1920 


TTGGGTATGG 


GCTTAGATGA 


GTTCTCAATG AGTGCTACAT 


CTATCCTTAA 


AACACGTAGC 


1980 


TTGATGAAAC 


• GTCTAGACAC 


AACTAAAATG GCTGAACTTG 


CTGACCGCGC 


ATTAAAAGAA 


2040 


TGCGACACGA 


TGGAAGAAGT 


CTTTGCATTA GTTGAAGAAT 


ATACAAAATA 


ATTAAGTGAT 


2100 


TATGAAAGCA 


CATCGACAAT 


TGTCGATGTG CTTTTTTATT 


GTTGCGATTC 


ACGTATTTTT 


2160 
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TTCTTGAAAT 
ACCAGTTTCC 
TATTTTTTAG 
CCATCGTTTG 
TCAATCCGAT 
TAGCGAGGTT 
ATAGCCGCTT 
GTTTTTACTT 
ACACCGACTA 
TGTTCATCTG 
TCCGGTAAAG 
TTTGGCAATT 
GCATGAGAAT 
ACGGTAATGT 
GGAGCGACAC 
GCTTTTACAG 
AAAATGATGA 
CCTTTTTGAC 
TTGTTTTCTT 
TAGCTGCAAC 
sTCCGCTACC 
CAGGAGAAGT 
AAAGTAACTG 
CAGCGG6TGT 
CAAGATTTAG 
CAGTACTATT 
GTTTTGATCC 
GTTGATGAGG 
GTGGCGTGAC 
CTTGAGCTGC 
TGCTTGAAAT 
TTTTAAAAAA 
CAGTGTAAGA 



AATAAATCGT 
AAGGGAAATT 
CTTCTTTTTC 
CTTTGATTTT 
AATTAAAATT 
TGGTTTCTTT 
GTTTGTTTTG 
CATTTAAAGA 
CATAAGAAAA 
ACTTTTGTTC 
TAAGCTGACA 
GCGCAATCTC 
AATCTGCCAA 
CGTTGGCTGT 
GTAAATCAAA 
AAAAAGTCAT 
CGCTAATAGC 
AATTTTTTAT 
TTAAAGTGGT 
TTTTTTCGTT 
AAAAGCTAAT 
GGCGGCACCA 
TGTTGAATTG 
TGAATCACTT 
ACCACGTTT6 
TTGCACTTGA 
AAAATGGATA 
ATCAGCAGGA 
AGTATTATCA 
TTCAACTTTT 
GCGCACGATC 
GCTAAATCCT 
TTCTTTGACC 



GCTTCCCAAA 
GGTGTGTTTT 
TTGGATGACT 
tACAGTATAA 
TGTATTAGGA 
TTCATAATAC 
GAGATTCATT 
TAATTCTGGC 
ACGATTATTG 
AAAGTAGAGT 
TTCAACAGTC 
ATTAAAAGTA 
GCCATTATCA 
TTGATTCGTT 
ATAACTTGTC 
TTCGCTTGCT 
ATGGAGCCAT 
AACAAGGTAT 
GTATCATCCA 
GTCGCAGGAA 
GTCCAAGTGC 
GTTGGATCCA 
GGTGTTGCTG 
GTAGCAAATT 
TCAGTTACTT 
TCCAGTTGTG 
TTTGAAACAT 
TTAGGTGTCA 
AGAGCAAACG 
GAACTATTTA 
TTTTTCATTT 
AATTAGAGGG 
GTTAAATTAA 



547 
ATGAATAGAA 
TCAGGTGGGA 
AAGTGTTTTT 
TTTCCAGGT^ 
GCCATTGTTA 
AAAGAGGCAT 
AATACGCGGT 
TGAACAGGTT 
ATGGCCACAC 
CCTCCTAAAA 
TTGGTTGAGT 
AAAGGTGCAG 
TTAGTAATCG 
AGTTCGATTT 
TCTTTGGTTC 
TCTACAGAAT 
TTAAAAAGCT 
TGCATGAAGT 
AAATCC7UVGT 
CAGTTAATTG 
CCATTCCTGT 
AGGTTACTGG 
CTAAAAAGGT 
GCGCACTCTG 
GAACATAGTT 
CCGAATAGAT 
AGTCAATACT 
CAGGCTGAGA 
TAATATCAGC 
AAAGCATAAG 
TTTATTCCTT 
GTGTATCAGA 
GGTCAGGAGC 



PCT/US98/08985 

AAACCAGCGT GATGCTACTC 2220 
JGGXT.GACGGC GGTAGCATTG * 2280 

CCCACGAATA GTCTTGATAG 2340 

CAAAAGGTTG TTCCTTTAAG 2400 

ACGACTCTTG GTGATTTTCG 2460 

CGACTTGTAA TTTTTTAATC 2520 

TACGACCGTT AGCTTGATCG 2580 

CATCTGTTTC ACTAAGTAAG 2640 

CGTTTTCTGA GTGAGCAGGT 2700 

CGTACCCATT AAATGGTTTT 2760 

GTTTAGGTAG GTGGATTTCT 2820 

ACGGATCTTT TTTTGTTTCA 2880 

CTGCATTGGC TGAAGCTAGC 2940 

GTAGCTTTTG TGTTTGGTTT 3000 

GTTGATTGTC GGGTAAAATG 3060 

AAGAGCCAAT AAAAAATAAA 3120 

GCTGATTCAT AGTGTGCACT 3180 

TGTTTCATGC AATACCTTGT 3240 

AAGCGTTGTT TTATATTGTT 3300 

AATGCCTTGA GCAGCGGTcg 3360 

TGAAAGAGCG GCAGTCGCCA 3420 

GACCGTTAAT GGCGCCAAAG 3480 

CAAGGATGCA TTATCCAAAA 3540 

TTTAACTGAT AATTTCCAGC 3600 

TGGCACGCTA" ATTAAATCGC 3660 

CGCTGTTCCG GCTTGAATTT 3720 

AAGGGGTCCG GCTGTACCAG 3780 

AGGGTTCGTC GGGTTGACAG 3840 

GTTGGATTGA ATAGAAGCGA 3900 

AGGCGTAgcA ACGAACAAAA 3960 

CTTTCTTGTT TATTTTTAAC 4020 

AAGAATCCAT TGTAATTCGG 4080 

AACGCTTAAA AAAAGTCCAT 4140 
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TTT6TTGGTT 


CCAAGGAATT 


TCTACATTGT 


TATCGTCCGT 


TGCGGTGT3T 


TGATAPJVCTT 


4200 


CGGTTGCTAA AGAGGAGAGA 


GGAAAATCAC 


TGCCATTTTT 


TCGATAAAT3 


AAAGCTTGCr. 


42£0 


GGATCGTTTT 


ATTTGTAGAA 


GTTGAATGCA 


TGGGCTGACT 


CTCTCTTACC 


aacaa;^tgcc 


4320 


ATGGTGTTTT 


TGCAACTCGA 


CCATCGGCAA 


TGAGGAAAGG 


AGCGCTGArS 


GTCGACGGTG 


4380 


AGTATATGGT 


TCGTTTGCTG 


GGGAAAGGTA AATTTTTAAA GGAAACTG?.-. 


GTTGGCATCr 


4440 


GCACAAAACG 


GAGGGTACCG 


TTTTTGATAG 


GTAAAGATTG 


TTCTTTAGTG 


ATTGGCGTA^ 


45C0 


TTGcGTTCCG 


CCACCATCAT 


TGCCAGTAAC 


CGTGCTAGAA 


AATAAAAG^VL 


ATTGGTTTA? 


4560 


CCATTCTTCT 


GTAACCGTTG 


CTGTATATTG 


TAAGGTGAAT 


TGTGTATTGG 


CTTCCGTAAA 


4620 


ATTATCTGCT 


GGCAGCGAAT 


TATGATAACT 


CCAATACGGC 


AAACCAGTGG 


AACrATCAGT 


4660 


TAAAAGTTGA AGCCCAGCTA 


TTGGCGTACT 


TAAACCTGCT 


GCTGATTGTT 


TTGTTAATG? 


4740 


AGCAGCT6AA AGTT6GGACA 


GTCCTTTAGG 


CACCGTTGCT 


CGTAATTGCT 


GATTCATCCa 


4800 


CGTTGCTGGT 


TGATTGGTTA 


AACGTGTGGT 


TAACGTAAAA 


CaAATGATTT 


GACCA.CTTAC 


4860 


GACAGAGTTT 


GCAGCTTCCC 


AGAGCCCTTG 


TGCATTTTTC 


CGCTCAATC^A 


TTT3CGTAAC 


4920 


CGTTAGTTGA 


GGCACAGGAT 


AAGCAACTGT 


TTGAGAAACA 


GTAGAACT3G 


TTAACAAAGG 


4960 


CAGGCCAGTG 


CGATTGCTGA 


GACCTTGGAC 


CCACAAGCTG 


ACCGTTTCrC 


CCCCTTGGAA 


5040 


AGTGG6TAAA 


TTAGCAAATT 


GAAAATGGCC 


ATTGGAATCT 


AAGGACGTIG 


TGGAAr-^TAa^. 


5100 


ACCACCATTT 


ATTTGTACTT 


TAACTTGATA 


GGTTCCGTTA 


GA7ACTTGAG 


AATCXrTACC 


5160 


TGAAAGTTGA GTTGAGCCGG 


CATTGGCGAC 


TACGTTTGTG 


ACTGGTTGGA 


TTGACCAAT? 


5220 


GACACTTTGG 


TCAGCTGAGG 


GGCTGCTTTG 


AATATAGTCG 


GTTGACGTT? 


TTGAACGAGC 


5280 


AAAGACAATT 


GTAGAGAGC6 


TGTCGCCATA 


TTCTAATGGT 


GAAGCTAACG 


ATG7CGAGAA 


5340 


ATTTCCGGCT 


GCATCGACAG 


AAATCCCTTT 


CACCACAATG 


GGGGTAGT3G 


TGCCA7CATG 


5400 


GCG6TTCAAG 


GTTAATTGAA 


CA6AGTAGTT 


ATAGTTTGGG 


TCTTGTAC7T 


GTTGAATGGT 


5460 


CCCTGTTACA 


GTAGTGTCCA 


GATGATTTGG 


TTGAGTTAGT 


TTGGCTGG7G 


TGACTTGAGG 


5520 


AACGGGATAA 


TTACCTAAAC 


TAAGTTGAGC 


TTTGCTAATT 


ATGCCTGT7T 


TTGTATTATT 


5580 


CCCAATGATA ATATTAATTG 


CGCCTAATCC 


aGTGCCGACA 


TCTGTAAA3G 


TTCCTCGAAT 


5640 


CGGATAGTCT 


TTGCCGTTAA 


GAGCAGGAGG 


GATTGAAATT 


CCTCGTTTAT 


AAAGTGCGCC 


5700 


CACATCAAAC 


GAAATGGCGG 


CAGACCATTT 


GGTTCCGGGA 


AGTAACACCA 


AGGTGCTACT 


5760 


TGGAAAAGTC 


AGATAAACAG 


AATGATAAGT 


GCTACTATAA 


GAGAGcTGAC 


GCCGTTAGTT 


5820 


GAACTGTGAA 


TATCTTGAGT 


GGCTGTTCCT 


GGATAACTGA 


CAGTCCCTGT 




□ □ O V/ 


AGAAAATCTT 


GTTGCTTTAG 


AGTGTCGTTA 


TCTAACTGGC 


TTGAAATTTC 


TG7AGGAAGT 


5940 


TGAATAGCAA 


TAATCGGTCG 


ATCATTAATT 


GCTAAGCCGA 


TAAGTGTCAC 


ACTGTATTCA 


6000 


TACGAAAGTG 


TGATAATATC 


TTGGTTATAC 


GTTCCCGTGC 


TTAAGGTACT 


TTTTAAGGTG 


6060 


CTATTACCAA 


CAGAAACACT 


AGCTGGCGTA 


GGTGCCATAG 


GTTC7VTCGTT 


TTCAG.--GGAA 


6120 



wo 98/50555 



PCTAJS98/08985 



549 



GCATGGGTTT GCTCGGTTTC TTGTTCCACG ATATCGGTTG CTIIGAGTTGT 
GTTGAAACTG AAGTAAATGT TTCACTAAAA GTAGAGGCGT GAGTTGTAGG 
TCAGAATGAG CTATCAAAGG TTGAGTCAGC GATAAAAAAA GCCATGATAT 
AAGATTCCCC ATACGAACTT ATTCAAAACC ATCCTCTCCT TTTTTGaTAT 
ATGACGAAAA TATATAATGT GTAAATATTA AGTACATACA TTCTATAATG 
AAAGTAAAGC GATTACATCC GATGGTGnAC AGGATATTTT GTTCACAATC 
TTAAAGAGGG AAATTTCTCG AAATTCTATG CGTTTTnAAT CCGGGAAAGA 
C2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
TTTTCGGGAA AGAAACACCT ACCATTTTTG ATGTTTTGGG CAGGGAATAG 
ATGGAATGGA GTTTTCCTTG GCTATCAATA GATGGCGACA GAATGTACGA 
TTCTC6AAAT TTTTTGAGGG TCTTTTTTCT TATGGCGTTT CTCTAACAAC 
TTAAAGGTTA CTGCCAGTCC TAATGGCGGG ATGAAAGTAC AAGTA6ATTC 
TTTGTCGGTA AAGTCTTCTT GAATAGTACT ACAAAAGCTT TAAGTATTGA 
AGTATGCAAG ACAGGACGGA TAGTATTGTG GTTCGCGTGG ATAAAAGTGT 
TTTTTAGCTG TTAAAA7VGAA CGATACGACA GTAACCCGTA CATCGGACGT 
CAGTTAGCGA CAATAAGAGT GCCCCGAAAT GTTTCTAGTA TTACAGGCGA 
GATAAACGGG CAGATACGAA AGTATGCGGG ^ATTCTTCGC CGTTTCAA7VA 
TCAGGGTTAG AAGAGCAATA CGGTACAATA CTAAAAAAAA TTGCGGAAGC 
AGTTATGAAA AAATCCTAAA TGATTTTAAA AACTATGTTG CAAAAGCGCA 
GATTCTAATA TTGAAGAAAT TATCCAATCA GGTAATGGAA AGGTAAGTGC 
TTAATTCACG AATGGTTTGC AGCTTTAAAA AATGAATTAG ATACAAATCA 
TTACAGAATC AAATCAATGA AATGAAAGCT ACTGAAGAGT TACCAGCTAT 
TTACTCGGTT ATCCTAATGT ACAAGTTTTG TATTGGGAAT ACGGTATTGG 
TTAGCTAATG AGCCAACAGG TCTAGGCGGT AGCAATGTGA AAAAGATTCC 
GAATATCTTG ATTTATTCAG TTTCAAAGTT AAAGTGCCAA TGAACTTTAA 
CCAACAGTAA CAAAAATAGA TAGTCGAACT ATTCGCTTTA TTGAAGCATT 
GAAATTAAAT TTTAGGAGGA AAAGAATGTA TACATTTAAA AAAGGTGATG 
AGTCATGCTA AATGAAAATT TTAAAGAGAT AACTGATTCT TTCGAAGATG 
CAGAAAAAAT TCTAAAATAT TAGATTTAAA CGATGCTATA TTGCCAGGCA 



TTGTTGTTCT 
GTTCAGTGTT 
GATGCAAGAA 
TATATGtATA 
GGCCATnCAT 
TTTTAGTGTT 
T 



GAGGTAATAC 

TGATAGCGAC 

AGCTAACGCA 

AG6CTATGCA 

TGTAGCAAGC 

GCGAGATGTA 

TTACGAGCTA 

TTTAATCACA 

GGTTAACGTT 

CAACAAGACG 

AACCGATATG 

TTTTGATGTT 

AGCATCAAAT 

AGAACATAAT 

CCTATCAGGA 

TCACAGTGTA 

ATTGGTAAAT 

TAAAGTTATA 

CAGACTACCA 

GTTTATTTGT 

TTTATTCAAT 



6180 
6240 
6300 
6360 
6420 
6480 
6531 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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CCCAGCTACA GGAGTTGATA ATAAACCTTT ACCTAATTCT GGAAGTTTAT TCGTTTGTAA 
AGACCCAGGA GGGGTTAGAC AATTATTTCA GACAGAAAGA ACAATATTTA TTCGACAATT 
AGGGGGAGTA CCTTCATCTT GGaCGGATTG GaAAAAAGTA GCTTTTGAAA AAGAACAACC 
TTTTGAAGCT TGGTATTCAC CAGGAACTAA CCATGCTGGA TTCAAAAATA AGGCAAGATA 
TAATTTAGGG CCAGAATTTA GCAACATAGG CCAGCGACTT GGGTTGCCTA TGAAAAGTGA 
ACCGTTGGAG TGGAATAGTG GGCGATGGCA AGCAAAAGTT CTTAGAGACT GCAAGCTAAA 
TATAAGCGGA ACTGTAAAAT ACCATGTTGG TAGTTCGAGA GGTGTTCTCT ATGCTTATAC 
TCATATAGAC AAAGGCCTTG ATGAAGGCGT AGGTGACTTA GGTATTGGAT CAGCAGTCGG 
AGCTGTTGGC GGCTTGGATT ATAAAAATGT CGCAGCTTTT GATTTAAACG TTACACTAAA 
AAAAGGTGAG TATCTTGCGT TTCGCTTAGA ATTAGCAGCA GAT/U^GCAAC TTGATTATAC 
TCAATTATCT TCTATGCATA TCACAGAATT AGTATAGGAA CTGAATTTTA AAATAAAAAC 
CGTTTAGCAA AAAAGCTAAG CGGTTTTTAT TTATAGAGAG AAGGAACTTT ATGTGAAGGA 
TGAACTGATT CAAGATGTTG TAGAGCGTTT AGTGCGTATT GAGACAAAGC TCGATAACTA 
CGAAGTGTTA AGAGAAAAGA CAGAAGATGC TAAGGAAAAG GCTGATTATG CTTTCTCTAT 
TGCTAAAAAT AATGAAGAAG ATATCAAAGA AATAAAGGAA AATCAGAAAT GGTCTTGGCG 
GACAATTGCG GGCATAGGGG TGTCAGTAGT TGTCTATTTA TTTACAAAAT ATTTAGGAGG 
TGTATAAGAA TGATTCTACC AGATAAGTAC TACAAAATTA TCAAATGGGG CGTGCTAACA 
GTGCTACCTG CAAGTTCTGT TTTGGTTGCC ACACTAGGTA AAGCTTATGG ATGGCAGCAA 
ACAGATATGA CTGTTTTAAC TATTAATGCC ATAGCAACTT TTTTAGGAGT GGTAACAGGA 
GTATCAGCAT ATAATTTAAA AGACAAGGAG AAATAAAAAT GAAAAAGAAA ATTTTAGCAG 
GAGCGCTTGT CGCTCTGTTT TTTATGCCTA CAGCTATGTT TGCCGCAAAA GGAGACCAAG 
GTGTGGATTG GGCGATTTAT CAAGGTGAAC AAGGTCGCTT TGGCTATGCA CATGATAAAT 
TCGCTATTGC CCAGATTGGA GGCTACAATG CTAGCGGTAT TTATGAACAA TACACATATA 
AAACGCAAGT GGCAAGTGCT ATTGCCCAAG GTAAACGTGC GCATACCTAT ATTTGGTATG 
ACACTTGGGG AAACATGGAC ATTGCGAAAA CAACAATGGA TTACTTTTTG CCACGTATTC 
AAACGCCTAA AAATTCCATC GTTGCATTAG ATTTTGAACA TGGAGCGTTG GCTAGTGTTC 
CAGATGGATA TGGAGGATAT GTAAGTTCAG. ATGCCGAAAA AGCAGCAAAT ACAGAGACAA 
TTTTGTACGG TATGCGCAGA ATCAAACAGG CTGGCTATAC TCCAATGTAT TACAGCTATA 
AGCCATTTAC ACTAAATCAT GTAAACTATC AACAAATCAT CAAAGAGTTT CCTAACTCTT 
TATGGATTGC TGCGTATCCT ATCGATGGTG TGTCACCATA TCCATTGTAT GCTTATTTCC 
CAAGCATGGA TGGTATTGGT ATTTGGCAAT TCACATCCGC TTATATTGCA GGTGGTTTAG 
ATGGTAACGT AGATTTAACA GGAATTACGG ATAGTGGTTA TACAGATACC AATAAACCAG 
AAACGGATAC GCCAGCAACA GATGCAGGCG AAGAAATTGA AAAAATACCT AATTCTGATG 



1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 
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TTAA/fGTTGG CGATACCGTC AAAGTGAAAT .TTAATGTAGA TGCTTGGGCA ACTGGGGAAG 3300 

CTATTCCGCA ATGGGTAAAA GGAAACAGCT ACAAAGTGCA AGAAGTAACT GGAAGCAGAG 3360 

TATTGCTTGA AGGTATCTTG TCATGGATTA GCAAAGGTGA TATTGAATTA TTGCCAGACG 3420 

CAACAGTCGT CCCTGATAAG CAACCAGAAG CGACTCATGT GGTACAATAC GGAGAAACAT 3480 

TATCAAGTAT TGCTTATCAA TATGGAACAG ACTATCAAAC GTTGGCGGCA TTAAATGGAT 3540 

TGGCTAATCC AAATCTTATT TATCCTGGTC AAGTTTTGAA AGTCAATGGA TCGGCAACAA 3600 

GTAATGTCTA CACGGTTAAA TACGGCGATA ATTTATCTAG TATTGCAGCA AAACTTGGCA 3660 

CTACTTATCA AGCTTTAGCT GCATTAAACG GATTAGCAAA TCCTAACTTG ATTTATCCAG 3720 

GTCAAACATT GAATTATTAA TAGCTTTAAT ATAAAATAAG GATACACTTA TTTAAATTTC 3780 

TCTCGAG-TCG CCGTCGCCAA GGCGGCTCTT TTCAGGACCA TTAGCTCAGT TGGTTAGAGC 3840 

AAACGGCTCA TAACCGTTCG GTCACAGGTT CGAGCCCTGT ATGGTCCATA GCTAAAAAGG 3900 

GCACGAAAAG GGCAAAAGTA GTAATACTTT GTCAGATTAT GTTGTTTTTT ATAAATTTTA 3960 

TTAGTATTAT ATAGCATTAT AAATGCCTGT TTTACGGGCT TTTACAATTT TTGTGAAAAT 4020 

TGGACCAAAT CCTTGGGATC CAATGGGCGC ATTAGCATAA AACAGAAAAA GAAGGCCCCC 4080 

GCCGTCAGGG CGGGGGCCTT CTTTTTACGT TGGGCGGCGA ACTTCACGGA TTTCGCCCAA 4140 

TGTTGTATAA TCATTGCCTG CTAAACGAGC AATCGGTTGG AGTTTTTCAG GTAAAATGTA 4200 

CTGATGTTCT TCATCGAAAA CGGCTGAATC AAAATAGAAA TCGGTAATTT CTACAATGAA 4260 

TAAATCCGTA ATAATTTGAC CTTCATGGTT AGCAATAGGT ACATATTGAT GTAAGCGAGC 4320 

TTCCATCCGA ATCGGCGCGG CTTTGATTGC TGGAACAGCA ACAGTCTGGC TGGCAATGGT 4380 

TTCAATACCA AATGTGTCAA TTTCACTGAT CTCTGCTGCT AAAGACGCCG AAGTCTGATT 4440 

CATTTGAGTA AGAACGTCTT CGTTAACGAA ATGGATGACA AGTTCTTTAG TAGCCAAGAT 4500 

GTTTCGTGCC GTATCTTTAG GCTCTTCGTC TGGTCGT/^ ATGGACAACG TTGCTAAAGG 4560 

AATTTCAGCT GCCGCCGCAT TAAAGAAACT AAAGGGAGCA GCGTTGACGA CACCTGTAGC 4620 

TGTATCTTGT GTTGTAACCC AAGCGATTGG CCGAGGAATA ATACTTCCTG ATAAAAATTT 4680 

ATACGCTGTT TTTGCAGATA ACTCGTTACT ATTGTAATGA ATCATAGGAG CCTCCTAGAA 4740 

AAAATACTAA TTTCTTTGAA CATTCGCATC TGAAGTATCG AAAGGACGCA CAAATTCTTC 4800 

AATCCCTGCC CGTTTTGGTT CTAAAAATGG CGGTAATGAT AATTTTTCGC CAGCGTGTTC 4860 

ATAGGTTTCA TCCTCTAAAA ATCCAGGACC ATCGGTTGCC AGTTCAAATA AAACATGTGT 4920 

TGCTGCCAAG AAaTATTCAG ATTTAAAGTA AAAaCGTtCC ACAAAGCCAG AATTAGGAAA 4980 

TTCCAAACGA TTAATTTTTT Ca^ATCCAGAA ACGTAGTGCT TCCTCATCGG CAACGCGCAA 5040 

GGCTAAGTGA TGGACATTGC CGTAGCCTTC TTGAGCCGCT GGCAAATCTT CCCGATGCTC 5100 

TACGATGATG CTGGCACCAT TCCCACCTTC GCCAACTTCA AATAAATGAA AACTACCTTC 5160 

AGCATCAATC AATTTGAACC CTAAAACTTC AGTTAAAACT AGTTGCAGAT GTTCGTAATT 5220 
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TTCGACiVGTT 


ACAAAGGCTG 


GTCCTAAGCC 


GATTAAAGCA 


TGTTuAU 1 Ala 






TTTTTTCCAA 


GGCGTGCCGG 


CTGCAACACA 


TTGGTTCTTT 


TCATCAitAAA 






ACGTTGATTG 


TCAAAGTCTT 


CAAATTCTAA 


ATACTTTTTA 


CCAAAACGTT 


CTTGAATTTC 


c A nn 


TCCATGATCA 


ATTGCATATT 


CATTAAAGCG 


TGATACCCAG 


TAATCCAAAG 


TAGCATCATC 


54 oU 


TTTTACACGA 


AAGGAAGTAC 


GAGAAATTGT 


ATTGGTTCCT 


TTGGTTCCtT 


TrGGTAAGCC 


5520 


TGGGAAGTCA 


AAGAAAGTCA 


TATCTGTTCC 


GGGAGAGCCG 


GCATCaTCAG 


CAAAAAwTAA 


55dU 


aTGATAJcGTT 


TCAATATCGT 


CTTGGTTGAC 


AGTCTTCTTA ATTAAACGTA GACCTAAAAT 


5b4U 


ATCGGTGAAA 


AAGTGATAAA 


TTTTTTCGGC 


ACTTGAAGTC 


ATGGCAGTGA 


CGTGGTGTAA 


570U 


ACCTAAAATT 


TGTGTATTCA 


TAATAAAAAT 


CCTCCTAAAA 


GAAAAATAAT 


TTTTTGATTA 


5760 


ATTAACTTAC 


TAAAAGTAAG 


TTTAAATGAT 


TAGTCCTAAA 


AATGCAAAAG 


AAATGTCTCG 


5820 


AATTCGAACT 


ATTATTTTTT 


AGTTGATTTT 


TTAAAAAAAT 


AAGAATATAA 


TAGCCTGAGA 


5880 


TATTTTAATA 


CATAAACTTT 


CGTAAACATC 


GCCGGATGCT 


TGCGACTAAA 


GTGTTAATGA 


5940 


ACAGTTCTTA 


GGCCATAGCT 


GAACTGTTTG 


TTGGCACTTT TTTTGTTTGG AAGGGAAGGG 


6000 


AAACAAGATG 


AGTAAAGCTT 


GGTTAAAAGT 


TATTTTTGGC 


GCTTTTTGTG 


AAGTGATTTG 


6060 


GGTCATTGGC 


ATGAAGCATA 


GCACAACATG 


GTGGGAGATT 


TTAGGGACAG 


TAATCGCAAT 


6120 


CTTTATCAGT 


TTCTATGCGC 


TGATTAAAGC 


AGGAGAAGAA 


TTGCCAGTCG 


GTACAGCCTA 


6180 


TGCTGTATTT 


GTTGGCTTAG 


GTAGTGCTGG 


AACTATTGTG 


ACAGATCATT 


TTTTATTCCA 


6240 


CACACCATTG 


GGCATAGGAA 


AAGTTCTTTT 


TCTGTTGCTG 


TTATT7VATAG 


GTaTAATTGG 


6300 


CTTAAAAATG 


GTAACTGGrA 


ATAAAGAGGA 


GGAAACAAGA 


TGAGTTGGTT 


ATACTTAGTT 


6360 


ATTGCAGGCT 


GTTTTGAAAT 


TTTTGGCGTA 


GGTTCAATT 






6399 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3956 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

AACAGGAGGA AAACTTGATG GGCATTCCAA TCACCATTGT TTCTGGCTTT TTAGGCGCAG 60 

GAAAAACAAC CTTAATTAAC CATGCGCTTG CGCAATCACC GTTTCCTAAA GAAGAAATTA 120 

TTATTATTGA AAATGAATTT GGTCAAACAG GCGTTGACCA CGCATTACTT TTACAAACCA 180 

CGGAACGAAT TATTCAATTA AATAACGGTT GCATGTGCTG CAGCTTACGA GGAGATCTTC 240 

TTGCCTCTTT' ATCAGCAATC TTAGAAGTGT ATCAAGAAGA ACAACAACCC ATTGCCCAAG 300 

TCATTTTAGA AACAACGGGC ATTGCTGATC CGCAACCAAT CATTCAAACA ATTTTAACGA 360 

CCCCGCACAT CAAAAATCAT TTCTATATAG ACAGTCTCTT AACGGTCGTT GATGGGCATC 4 20 
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ATTGGCAACA ACAGCTTCAA GAAGCCGAAG CQATCAAACA ACT.GGCCTTA GCAGATCGCC 480 

TCTTJTXTTC TGTCAAGGAA CCAGCAGACA GCACTCAGCT ACCAACGTTC CAAGAAACCC 540 

TTCGTACCAT TAATCCTTTT GCTGATATTT TATCCTTTCA AGCCAACGAG CCTCTTTTGG 600 

CTAGTGACTT TTTTCAATTA AATAAGTTTA CGGCAACACT TACAGAACAC GAAATGACTC 660 

ATTCCCCGCA TGAGCACGCC CACGAGCATC ATCACCATAC CTTTCACTCC CTCACGCTAA 720 

CGGCGTCCTC AGCCATCAAC GAACCTCTTT TCACTCGTTG GCTTGATTGG TTAATGTATA 780 

CGCATCAGGA AAAGTTATAC CGTTTCAAAG GGATTCTCGC CTTACAGGAG CATGACTTGG 840 

CCATTGCCAT GCAAGGCGTC AACCAGCAAG TGGCCTTTCA AATGACGAAT CAACCCCCTC 900 

AAGAAACGAC AATTGTTCTG ATTGGCAAAG AATTAGAGAC AAAAAAAATT CAAGAAACCT 960 

TTCAAACCCT CAACAAAATA GCTACTCCGT AAATTAATTT TTTACGAAAT GGAGGTCCTA 1020 

TATATAATGG AAAAAACTGA CTTATCTAGC GCTTATCGTC GACTGAAAAG TCCCAACATC 1080 

AAAACACGAA AACGCGCACT TAAAATCATT AAAGAACATA AGCGCAATAA ACAAAAGAAA 1140 

ATTGCGTAAT TCACTAAAGG AGCACACCTA TGAAAAAATT TACTCTTCCC CTGTTAGCCG 1200 

CCTTATCGCT AATCCTTTTC GGCGCTTGCG GCAAAACAAA CACCTCTGAT AA7VACCGCTG 1260 

ACGGTAAAGA AAAACTATCC ATTGTCACGA CTTTTTATCC TATGTACGAT TTCACTAAAA 1320 

ATATTGTAGG CGATGAAGGA GACGTCAAAT TGTTAATCCC TGCTGGTTCT GAACCACACG 1380 

ATTATGAACC ATCCGCCAAA GATATGGCTA CCATCCATGA TGCGGATGTT TTCGTTTACC 1440 

ACAATGAAAA TATGGAATCT TGGGTACCAA AAGCTGCTAA AGGTTGGAAA AAAGGAGCCC 1500 

CGAACGTCAT TAAAGGTACC GAAAACATGG TCTTACTTCC CGGCAGTGAC GAAGACGGAC 1560 

ACGACCATGA CCACGAACAT GGCGAAGAAG GCCACCACCA TGAATTAGAC CCGCATACTT 1620 

GGGTTTCGCC TCATCGTGCC ATCCAAGAAG TCACAAACAT CAAAGAACAA TTAGTCAAAC 1680 

TTTACCCTAA AAAAGCCAAA ACATTTGAAA CAAACGCAGA AAAATACTTA ACAAAATTAA 1740 

CAGCCTTAGA CAAAGAGTTC CAAACAGCTT TGAAAGACGC TAAGCAAAAA AGTTTTGTTA 1800 

CCCAACATGC TGCATTTGGT TATCTTGCCT TAGATTACGG CTTAAAACAA GTGCCAATAG 1860 

CTGGTTTAAC ACCTGAACAA GAGCCAACCG CAGGcGCTTG GCAGAGTTGA AAAAATATGT 1920 

CACAGACAAC CAAATTCGCT ATATTTATTT TGAAAAAAAT GCCAACGATA AAATTGCTAA 1980 

AACGTTAGCT GACGAAGCGA ATGTTCAATT GGAAGTCCTA AACCCGCTAG AAAGTTTGAC 2040 

ACAAAAACAA ATGGACAATG GCGAAGATTA TCTTTCTGTA ATGAAAGAAA ACTTAACTGC 2100 

TTTGAAAAAA ACAACAGATA CAGCCGGGAA AGAGGTTCAG CCAGAAACCT CTGAAAAAAC 2160 

AGAAAAAACC GTGGCTAACG GATATTTCAA AGACAGTGAG GTGGCTGAGA GAACACTGAC 2220 

AGATTACGCT- GGAAATTGGC AATCCGTCTA TCCTTTATTA AAAGATGGCA CATTAGACCA 2280 

AGTCTTCGAT TACAAAGCGA AACTGAAAAA AGATAAAACA CCAGCCGAAT ACAAAACCTA 2340 

CTATGATGCC GGCTATCAAA CCGATGTCGA CCACATCAAC ATCACTGATT CCACCATTGA 2400 
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ATTTQTGGTC AATGGCAAAC CACAAAAATT CACCTATAAA GCAGCCGGTT ATAAAATTTT 2460 

AAACTATGCA AAAGGCAACC GTGGCGXGCG- -TTTCCTTTTT GAAACAGACG ATGCCAATGC 2520 

TGGGCGGTTT AAATACGTCC AATTTAGCGA CCACAACATC GCACCAACGA AAGCCGCTCA 2580 

TTTCCACATC TTCTTCGGCG GCGATAGCCA AGAAAGTCTG TTCAATGAAA TGGACAACTG 2640 

GCCAACGTAT TATCCAAGCG ACTTAAGCAA ACAAGAAATT GCCCAAGAAA TGATTGCGCA 2700 

TTAAGCATTC AAAAGAGAGG AGGTCGGGAC AGAAGTGTTT AACTCCGAGA AATAAGAAGA 2760 

AATTTCCGAA AATTGTTCTT TAATTTTTGG AGAATTTCGG CTTATTTCCG AAGGAGTTGC 2820 

TTCTGTTCCC GCCGTTTATC AGTTTTTGAG CGTGGAGCAA AAATCCAAAG TGATTTTTGT 2880 

CCCACGCTCC TCTCTTTTTT TATTTAGTCA AGCACACCTG kTCTGTTtCA AACGTCGCTA 2940 

ACamTGGCGC AcTTCATCCG TTGATTTkGT ATTCATTAAT TGTGCTCTTA AATCACTGGC 3000 

ACCTGGAAAG CCTTTGACAT AAATTTTAAA GAAGCGATGC AGCCCAACGA TGGAGCGAGG 3060 

CACCAATTCC GCATATTTGT CTTGCAAATC CAATTGTAAA CGTAGCAAGC CCAGCAATTC 3120 

TTGTGGCGTA TGTGTTTTGG GTTCTTTTTC AAAGGCATAA GGATTTTTAA AAATCCCCCG 3180 

ACCAATCATG ATGCCATCTA CACCATATTG TTCTGCTAAT TCTAGGCCCT TTTGACGATC 3240 

GGGAATATCC CCATTAATCG TAATTGTCGT TTGCGGTGCG ACACGGTCCC GAATAGCCAT 3300 

AATTTGCGGA ATGACCTCCC AATGGGCATC CACTTTGCTC ATCTCTTTTC GTGTTCGCAA 3360 

ATGAATGGAT AGATTCGCAA TGTCTTGCTC TAATAAATGC GTGATCCACG CCTCCATTTC 3420 

CGCCATTTCA GTAAACCCAA TGCGTGTTTT GACACTAACA GGTAAGCCAC CTGCTTTGGC 3480 

AGCGTCAATC AATTCAGCCG CGACTTCTGG GCGCAAAATT AGCCCGCTGC CTTTTCCACG 3540 

CTCAGCCACG TTAGGCACAG GACAGCCCAT ATTTATGTCT AACCCTTGAA AGCCCATCTC 3600 

CGcTACGCCG ATACTCATTT CACGGAAAAA TTCCGGTTTA TCCCCCCAAA TATGTGCCAC 3660 

CATCGGCTGT TCATCTTCTG TAAAAACGAG GCGCCCGCGT ACACTATCTT TTCCTTCAGG 3720 

ATGACAATAA CTATCCGAAT TAGTAAATTC TGTAAAAAAC ACATCTGGTG CCCCAGCTTC 3780 

TTTCACGACA TGACGAAAGA CCACATCAGT GACATCTTCC ATCGGTGCTA AAACAAAAAA 3840 

GGGCTTTGGC AATGTTGCCC AAAAATTCTT GCTCATCTTC ATATACTCCG TTTCTGATTT 3900 

TTTCyTTAwT TTCAACGACA AviTGTCGACA TAATCTGGGG CGAAACAATG CTAATT 3956 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7053 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

AAnCAATTTT TGgATTAgCy GCTGGGTTTT CCAAnGnGTT TTTAgCkGCT AAAAGCGCTT 60 

TTgGTAAcTC TGTAACGAGG AGGCAGTAAA nTGGTGCCCt TCACTTGGAG TGAACTTnyG 120 
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CGCTCGTtTG ACTACTTCTT GTAATTGGTC JTTATTTACA AGGGGTACTT CTTTTAATTG 


180 


AGCCATTGCA GTTTCCAATG CGTTTAGAGC CTCTGAGACT TCTTCTTTTG TCGCCGTTTC 


240 


TTTCGCTACG ATTGCTTCAG CCGCTTGAAT TGCTTCTTGC AATTGTTTTT CTGTTTCTTT 


300 


TGTAAATTGG TATCCTTCTT 


CCGGCTTTCT 


TCCTCGGGCT TGAGCGATTT 


GCTCTTTTAA 


360 


GGTTTCCTTA TCAACGGCGA CGGGTTGTTC TTTTAATTGT GCAAtTGcTT CACGCAATGT 


420 


TTTAGTCGCT TValalUlAni 




HAPTTTTTCT TGAGCAAATA 


CTTTTTGAGC 

w X X X X X on>wx« 


480 


TGCTGCTAAT GAC iCTTCGA 


oAAC 1 i X 1 1 1 


Aoi 1 xv^i iv«V9 \9X\3nnx av«x i 


GPCCATCACT 


540 


TGGTTTCTTC GClAbCv^CCl 


L# 1 1 oAAl 1 


1 1 X X X X t\Jnt\ X v^/WTV* X X X n X 


Tr*GPTAC!ATC 
X \.)»V9V^ X n\^t\ X \^ 


600 


AGGCATTTTT 1 v^CAAC 1 viAu 


X X VjCAov^^ X X 


TTCTAGGTTT TCCGCGGCAG 


CAGAAACGTC 


660 


TGTCTGTGTG GCTTGTTCAT 


TCjCjCTAvaAAU 


AGCTACTGCC TCATCTAAGA 


TTGGTTTAAA 


720 


GATTGCCCAA GTTTCAGGCG 


TAT ATTCTT T 


TTCGACACGT GCTTGATTTA 


GCGTAATTAG 


780 


CGCTTGTAAT AACTCTTTAT 


CTACTAGCTC 


TGTGGGCTTC ACTGTTACAG 


AAAAATCTGA 


840 


AAGTGTGGCT TGTTGGAATC 


CTTGGCCTAC 


TAATAAGCCT TGACTCATAT 


CAAATTGTAA 


900 


TGGCTGCTCA ACAGTTGTCG 


CACGTTTTAC 


GCGAGTTGTT TCTTTTACTT 


TAACTTTAAA 


960 


GGTAACTAAG TCCATTGAAC 


CATTTAACAA 


TTGTTTATTT CCTTGATTAC 


TAAGAACTAA 


1020 


ATATAGATTC TCCACATCAT 


TTGAATGTTT 


CCGATATTTA GAATAATTTT 


CCATTTGTTG 


1080 


TGTATTATTC GTGGTTGCAG 


GCTGACCGAC 


AAGTTCAAAT TTGCTAGAAT 


CT^GGACAT 


1140 


TTTTGTGCTT AACGCATTAA 


TGTTTTTTAA 


GTCTTGACCT TTAAGTATAA AGGTTAATGT 


1200 


ATCGCCAGGA AGATAAGTAT 


CCTTGCCATT 


TTCATTAACC ACAGCAAGGG 


ATAAACCGCC 


1260 


CGCAATTTCT TCGACATCAG 


GTATCTCAAT 


CCCACCATCT AATTGTCGTA AAACATAACT 


1320 


GATATCATAA GCATCAATTA CCCCATTTTT 


ATTTAAATCA CCTTTTTCAA CATAGCCATT 


1380 


AAAGTCACTG TCCACAGATT 


CTAAGCCTGT 


ATAGTTTCGA TAACTCATTG 


CATCATTTTC 


1440 


ATCAATCGTT CCATCGTTAG 


TAATGTCTCC 


ATGTAAGATT CCTTCTGTTC 


CTGGTTGTTT 


1500 


GTAAAAGAGC ATCTCACGAC 


CAGAGCCAAA 


ATTACCCACC GaTTTTAATA 


CCTTCATTTc 


1560 


AACAAAACGa TACGCTTGGt 


CCTTCGTTTc 


AAkGGTTTTT GtCAAAGCAT 


CTTGcTTCCA 


1620 


GTTAATAGGT TCTGAAAACT 


CTGTCCAATT 


AGCTCCATCT TTCGACGTCC 


GATACTGGAG 


1680 


CTGCAAAATA TTTCCGTTGC 


CGGcATTGTC 


TCTTGGCAAG TATTCAATTT 


TATCCATTTG 


1740 


ATACTCCGCA CCAAGATCAA ATTTTAAAGA 


CAAGAAATTT CCATCACTTG 


GATTAGCGAT 


1800 


GCCTGTTGAC CAGTTCGTAT 


GCCAACCTGT 


ACTCAAATCT TTATCTGTTA 


ATTTCTTCAA 


XodU 


TTCTGCGCCT GGTTGTTCTG GTAAATTACT 


TGTTGCTTTG ACTTGATTAA TTGTTTCTTT 


1920 


ATAGGGATCA TCTTGTGTTT 


TTCCTTTAAT 


CGGTTCACTC CATTCAGAGA 


CCCCGTTTTT 


1980 


TCCAACAGCT CGAACTCGAA AAGTATGTTC GCTTAGGAAG GAAAATCCAT CAAATGTTGC 


2040 


TGTATTCGTT TGAATATTTC CGrAAACAGT CCCGTCACGC TCTACTTCAT 


AGCTTGTTGC 


2100 
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CTCRGTTACT 


TGATCCCATT 


GAAGAGTCAA AGAAGATGGC 


6TTGTTTTTT 


CTTCATTGAT 


2160 


TGCGACATTT 


GCCGGAATAG 


TTAACTGGTC 


ATCGACCGTT 


GTATTTCCTC 


CGTCAACCGT 


2220 


TCCTTTATTA 


ATGAAACCTT 


TGACAGTTAT 


TTGAACATCT 


TTGGCGGTCA 


CAGATTGTTT 


2280 


CGCTAGTTTT 


ACTGATAACG 


CTGATTGGTT 


CAGTTTTTCT 


CCTGAAGCTT 


CAGAAAGATA 


2340 


CTGATTCACT 


TGGAATTCTT 


TATCAAAATA 


ATATAGATTT 


GTGCCAGCTA 


AAAATTCTTC 


2400 


TTTATTAGCT 


GCTtGTTTCA AGGTTACTTC 


TGTCCCCCCC 


ACCATTGCTG 


TTACGCTTTC 


2460 


GGsCGCTtCT 


GAAGCTAGCA AwTCAAGCGt 


GGTGCTtCGT 


tCATCAACAA 


AGnCTTtATA 


2520 


ACTTCCTTTt 




u Ao 1 3A(^oo 1 


CAGATCCCtT 


TTcGTTTGAT 


TTAGGGCCTT 


2580 


GACTATTTAT 


T» T »T» T TV 




CAGCTTCATA 


ACTAGTTGAA 


ATTCCGTCAT 


2640 


CTTCAT7VCAT 


ATTAAAAtiAA- 


ti 1 Cto 1 CiCvJA i 


TTGGATAAAT 


CAAGAAAGAT 


CGTTGATCTC 


2700 


GTTGGATTTC 


CTTTGGATTG 


T^^^rrtcTi fTi ry* 

I Ibi 1 IGviAl 


TCGTCATTGG 


GATAATACTG 


CCATCTTTGA 


2760 


CGAAAACGGG 




AVsCviVS 1 V3 1 1 1 


TTACTCCATT 


TAAGACACGG 


CCACCTTGAT 


2820 


ATTTTTCGCC 




1 L*AA^L> wilaA 


CTTGTTTTTC 


ATCTGGTAAG 


TAGATTCCAT 


2880 


CACGGATTGA 




lUAi^x liaAx 


TGCCATTATA 


AATT6GGGCA 


ACGAGTAAAT 


2940 


TTGGTCCCCA 


LmAIAIAX Xuu 


XAl loAVaAAX 


CTTTTGTATA 


TGCTGTTCCT 


TCATTAGGAA 


3000 


ATTCTAAAGC 


W>A 1 Ou^V4>V«u 1 


HfOZlTTIVSTll 
/\L>v«nX lowXA 


GTCCATCAAC 


TGATTCCTTG 


GC/U^TACTGT 


3060 


AGTTATAAGG 




vsnx X X x/mx X 


TTAGATACGC 


TCGGTTCAAA 


TCTGTGGCTT 


3X20 


CTTGATCAAA 




GTTTTTGGAT 


TGGAGCCCCA 


GCCGTCCATA 


TTTAGCTGCA 


3180 


CTGGTGTAAA 


AGTTTTCCAT 


TGGAAATCAC 


GTATGTTGAT 


CTCTTTATTT 


TTTCCACCAA 


3240 


AAATTCCATC 


CATATCTGAA 


CCGACATTTG 


GTTGTCCAGA 


CAACGAAGTC 


CCGATATATG 


3300 


TTGGGATATG 


GAAGCGAATA 


TATTCCCATT 


GACCACCCGT 


TTGATCGCCT 


GTCCAAATAC 


3360 


CAGCGTGCCG 


TTGGGTTCCG 


GCCCAACCAT 


CTAATGAAAC 


AATCATCGGA 


CGAACTGCGC 


3420 


CGTCTGTTTC 


TTTTACAAAA 


ACATTCGCGG 


CATCTTCTAC 


ACCATTTAAG 


CCAAAGGAAT 


3480 


ACCCATACCC 


GACCCACGCC 


ACGTCAGTTT 


TGAGAGCTTT 


AACACCAGCG 


ACACTAACTT 


3540 


CTTTCGCAAT 


ATCCCGTTCT 


CCTTTTTTAG 


GATTTTTAGG 


ATCCGCTGGA 


TGGAGATTAG 


3600 


ATTGAGTCCA 


CAGCCCCACT 


TCAACACCAT 


TGGCTTGAGC 


ATACTCTGTA 


AATTCTTTTA 


3660 


AGTTTTGTAC 


ATCGCCGTCC 


AATGAATCTG 


TTTGGCCATA 


GCCAGCACCG 


TAGCCGTCAT 


3720 


TTGGCAAGAA 


CCACCCTAAC 


GGCATATCGT 


TTTTCTTATA 


TCGATCAATA 


ACTGCGCGTG 


*3 T O ft 

3780 


CAGA7\AATTG 


ATAATTCTCT 


TTTTCTCCAT 


TTAAAGATTC 


TAGCGTGCCA 


TTTAAGTTTC 


3840 


CTAGATCGCC 


TGGTTGGTAT 


TCTTTGTAAA AATTACCATC 


TTCAAACTTA 


ACTGCTCCCG 


3900 


CQGTTCCTTG 


TGCTACTTTC 


ACCCAATAGT 


CCCGATTATA 


CGCGTTTAAA 


TGCGCTTCAT 


3960 


AGAACCCGTA 


TTCTGGCATC 


AAAGCTGGTT 


TCCCAGTTAA 


TTCATAGTAA 


TCTTTTAAAA 


4020 


TACCTGCTGA 


GGAATCATTA AAGAAATAAA AAGCATCAAA 


ATCTGTTCCt 


TCGTGCGTTG 


4080 
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TTGTGGTTTT TTGTGGATCA TGACTTCCAA. AATCATAGrP GCCAGG7TTC CATGTATrTC 4140 

GfeACAACGCC ATACCCTGCA GTTGACCAAT AAAATCGSTT AGSCGArGCT ACTCCGCCAT 4200 

CTACCCAATT STPCGTATTG ACAATCTSGA TAGCTGTTCC TTTATGCGTA AATCGACCAT 4260 

TTTGAGTTCC TCCGCCAAAA TAArTTTCTT GXCTACTrTG TTTTAACGTT TGTGTAGCTT 4320 

TATCATTTTT AARGGArAGC GGAgCGGTTT CrTCTAAAAT AACTTGATTC TTTTTCTTGT 4380 

CTAGAACCTT CATTAAAGCG CTTTCTTTTT C3AACATAAT TTTTAAAXCa TTATTTtCTA 4440 

GAATAAATTG GTTTCCTGAA TCT6T6ACGT TC6TTTGCTC AASVGGCTTGT GTACCATAGT 4500 

CGGCCATTGT TTTAGCCGTA A7T7TCGCAA CATGCTTTGG ATCATTT3GC GTTGGATATT 4560 

CTTCAAATTT aCCAGT?GGA TCTAGATGAT AGCGAAAAAG ATGGTCGTTT AAAATACTAA 4620 

TTTGTGCTTT rmCCAGAA GCATACGTAA TGACAAAGTI TTTCTCCGCT TTTtCAATCG 4680 

ACACAATTGC TCCGAC7GAC GrTGCTGAAT CSGCTACGTC TTCTTTTGCT GTTTGCTCTT 4740 

GCGCATCAAC CACTTGAGAA TCCATCGTAA CAAACGAATG GAACGCCRAT ACAGAC6AAC 4800 

TCACTAATAA AAATGICAAC CaGTGCTTTT TACCGTAAAT TTTTTTACCC TGATACTCTT 4860 

TGTTCATTTT TTCCTCCTTA AaATTGCTCC CCAIAGAATA GTTCCGftOVA AATTACTCAC 4920 
AAAATCCAAA ACCCaACAAA GTTTTATCCA GATGAAAAGT CAIAACGATT TTCTTA6TTT 4980 
ACTGAGflCTG TTAGAAAGTC ASTTTCAGTA ACCCCTTCRT AAACACATCC tTTCAATAAA 5040 
ATATTAAATG AARACAATCT ATTTTCATCT TATATAAAAA TTTTAG^ACA GAATCATAAT 5100 
TTCCTAA7GA ATGCGTTTTC ATTTGTTGAC TJiCCAATTTT TCACTTTTTT AATAACATAA 5160 
TTATC6TATG ATGGTTTTTT TSTSTTTGTC ASCGGTTAAT TATAGCAGCC ACCATTAACA 5220 
AGTATTAACC CTTGATAAAC CTGTTAArrC ACTTATAATG ATGATTTTAT TGCGAAAATA 5280 
TCTACATCAG AARAAGAAAG ACTTTTTCAA AAATCTAATA CCTGTTGCTC AAATCTAGTC 5340 
TATTTAAAAA TAATCAACAC CCTCCTGCTC AGCAAGAGGG TGTTAATTAC TCGTTTTTTA 5400 
GTGAT7GATT rCACTTGCTG GTGTGAAAAG CAACTAATG ATGATGCCAA CTATCAGTAC 5460 
CGCTAGCAAT ATAAAAATTA ACTTTCCRAA CCGGTGATAG GGATTGATCG AAAGCCCAAT 5520 
CCCATAAACT ?TCGGAACCC ACAAAGGAGG CSirGGATTA TCTTTTTGAT CTTTATAGGC 5580 
TATCTTTAAA ATCTGCCACG CAAAATACAA CPATGGTAAA ACCATAATTA CCCCAACTAT 5640 
ACCTAGCCAT TTTGCTSATA AGTCACTAGT ATGCAAAAGA ATGATTGTCA GAATTGGTAC 5700 
AAACAAAAGA ATAAAGCTAA TrGTGGCTCC CSCTTGTGCT TGCTTAAGCA GTCGTCTCTC 5760 
ACGGAGAGTA AGATTTGCAA A=ATTCCATC CSTCCTTTCA TGGTTTrCAG AITAAATCAA 5820 
CGAATCaCTA AAAAAC7AAA TTAACTGCGA AXATCAATGA CCATGCSCCC GCGAAITTTT 5880 
CCTTGTTCCA TTTCTTCAAA AATAGCATTA A7TTCTTCTG GTTGACGTGT TTGACAAACA 5940 
GGAACAACTT TTCCTTCTGC fiCCGAATTGG ASAGCTTCTC TTAAATCTTG ACGGGTTCCA 6000 
ACTAACGATC CGACTACG7G A.-.T6CCATCA AGAACAAGAC GTGGAAtAGA TAAATCCATT 6060 
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GTCTCAACAG 


GCAATCCAAC 


GGCGACAACv^ - 


GTTCCTCCCG 




AX L-wU.* 1 UCb 




CCATTAAATG 


CTGATTTAGC 


TACCGCAGTG 


ACTACTGCTG 


CATGGACACC 


(jCCAATI TCT 




TTTTGAAGTT 


GTGCAGCAAA 


ATCAGGATCT 


TTAACTGGAT 


TAATTAAAAG 


7% m/^Tk r*f*lK fyf^r* 

ATCAGCACCC 




ATTTCTTTCG 


CAAAAGCTAA 


CTGATTATCA 


TTCACATCAA 


CAGCTACTAC 


TTTTGCATTA 


6300 


AAGACATGCT 


TGGCATATTG 


TAAAGCTAGA 


TTACCTAGCC 


CACCTAATCC 


TGAAATCATT 


6360 


AGCCATTGAC 


CTGGACGAAT 


CCCAGATTCT 


TTTATCGCTT 


TGTAGGTTGT 


TACCCCTGCA 


6420 


CAAGTAATAG 


AGGAAGCTGG 


CTCTGATGCT 


AAATTTTCTG 


GCACTTTCAC 


TGcATAACCG 


6480 


TCTGTGACGA 


TGCATTCTTC 


GGCCATGGCA 


CCATCAACTG 


TATATCCAGC 


ATTTTTTACT 


6540 


GAACGACAAA 


GCGTTTCTCG 


ACCAGAAGTA 


CAATATTCGC 


AATGCCCGCA 


GCCTTCATAA 


6600 


AACCATGCAA 


CAGATGCACG 


ATCACCAATT 


TTTAAACTAG 


TAACGTCTTT 


TCCAACCTCC 


6660 


GTCACAATCC 


CAATTCCTTC 


GTGACCTAAG 


GTACAACCGC 


TAACATCGCC 


AAAATCTGCA 


6720 


TTCTTAACGT 


GTAAATCCGT 


ATGACAGACA 


CCACAGACTT 


CCATTTTCAA 


CCGCGCTTCA 


6780 


TTATCTTTCA 


ATGGACGCGA 


TTCTTTTTCA 


ACAATTTTTG 


TTGTTTTTCC 


TTTTGCTGCG 


6840 


ACAATTGCTT 


TATACATGGT 


TCTTTCCTCC 


TTGTTTCTAT 


TGTGAATCAT 


TGAACAATAA 


6900 


TAGTATAATA 


CTACTTTCCA 


AAATAAGGrA 


CACCAATGAC 


TtGCGTAAAT 


TAACGCTAAT 


6960 


TTGaCAATTT 


ATCCTJcGCTA 


GACAGCTAtc 


TTCCCTTATG 


TCATAGATAA 


TTCATACTGG 


7020 


CATGAnTTTC 


TCTGCATAAT 


TGnCAACTAT 


TTT 






7053 



(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

ACTTATTTTA CTAACGTCGT GACGCTAAGT TTAGGTGGAC TTGTAGCGAT AGTGATGTAT 60 

ATCGATCGAA TTTATACGCC GATAGCCATA TTTAATGTTT TATTTGTTCA ATATAATTTG 120 

GnATAAAGTA GCCTATCAGC GCTTGGAAGA TTTTTATAAA AAAGAAGATG ATCCTGACTT 180 

AGCGGTTTCC GGCAAAGCGT TACCTGAAAT TCAAACAATT TCATTAAAAG ATGTGTGTTT 240 

TAGCGTAGAT AGTCAAACGA TTATGTCCCA ACAAAATCGA CAATTTTCAA TGAATAAAAC 300 

CTATGGCTTG ATTGGAAAAA GTGGGACAGG TAAATCTACC CTGATTAAAT TGATTTTAGG 360 

ATTATTGAAG CCGACAGAGG GGACAGTTTA TGTGAATCAA TTTCCTCTAA CACAATTTAA 420 

TTTGGAAGAT TATTATGAAA AAGTTTTTTA TTTATCTCAA GATGTGCCAA TATTTCAAGG 4 80 

AACGTTAAAA GAAAATATTG TATTTAATCA AGAAATTTCT GATGAACAAG TAATTGAAGC 540 

TATGTATCGT TTTCAGTTGG GGGAACTTTA TGAACGGTTG CCGGAAGtTT AAACACTATt 600 
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GTGAGTGAAA AAGGAtGAAT TTTTCTGGTG GTGAAAAACA ACGGATTGCT TcACACGCTT 660 

AGCATTTACA CAAGCAGAGA TCCTTATTTT- AGATGAGGCG ACTTCTGCaT TGGACGAAAA 720 

GnCCGAAG/^ AAAGTGTTAC AAGAAGTACA AAAATTTACC CACAATAAGT TGACTATTTT 780 

GGTTACTCAT CGACCTAAAA CCCTGCGATT TGTAGATGAA ATTATTGATT TAAATGAGTG 840 

AACTTGTTAG CTCTTTGGCG CTTCTTTTGC TACACTTTTT ATAAGAATTG AGAGAAAGGG 900 

GCAACTGTTG ATGAATAATA ATGAGCAAGA ATTAGCGGAA TTTTGGGATG AATTTGCAGA 960 

AGAATATGAA GAAATTCAAC AAGAATCGCC ATTTCCAATT GCTCGAGAAT TACGTGATTT 1020 

CTTGGTCCAA GAAGGCATTT TTCCGTGTCA GACATTTCTG GATATCGCAG GTGGGACGGG 1080 

ACGATACTTA CCTTTTTTTC AAGAGCAAGT CACGGAGTAT ACGCTAGCCG ATATTTCTCA 1140 

GAGAATGTTG GAGATTGCGG AAGCAAAAGC ACAAAGCAAC GTTGTTTTTC TTCACCAAAG 1200 

CCAAGAAAGA CTGATTGAGA CTGGGAAAAA ATTTCAAGTC GTTTTTTCAG CGATGAACCC 1260 

TGCTTTGGAT ACGCCGGAAA AAGTTAATGC ATTGTGCCAA TTAAGTGAAG 7UVTGGTGTCT 1320 

TATATTTCGG CTAGTGGAAG AGCAAGATTC GTTGTTTTCT CCTTTTGAGC AAGAGAGCAA 1380 

CCCGCAGTTA AAATGGATGG CACAATACAA AGCATTTCTA AAAAAAGAAC AGCGACCTTT 1440 

TTTCACAAAA AAATTTTTCT TTGAAGCATC AGAAGCTATT TCAAAAGACT TTTTCCGCAG 1500 

CTATTTTGAA GAACAATGGT CAGTTCCAAT TTTAGAACAA CGCGTACAAG AAATTTTTGG 1560 

TTCTCATGAA ATTAAGcAAA ATCAGCGTAC CATTATTTAT GAATTAATTG CGATTCCATG 1620 

TAAAAAAACA ACCAGCGATG ACTAATTATC GCTGGTTGTT TTTCGGTTAT TTAATTGTGC 1680 

GAAAGCTTTT TCTGGAATCT CTTTTTTATC AATTTTGACA GGAGATTGGA TGATTCTTTT 1740 

TTTACTAATG GTTGCTTCTA TGTAAGCATT TGGTTCCAAA GGTTGAGGAT TTTTGTCTGT 1800 

CCGTTCGTAG TCCATAACTT GCGTTTTTCC ATTACTTTTA ACAAAAGTTA GTCGATAGTG 1860 

TAAAGAATAT TGGCCATCTA TGCTTTTTTG ATTGTCATCT ACTGTTTGTT TTCGTTCAGG 1920 

AATTTCTAGA GGAACTTGTG CATAGGCTTT TTCACCCTTA TATGTATCTG CATAATAACG 1980 

ATAGCCTGTA TAACCACCAA CAATGAATAA GCCAATAAAT AAAATAGCGC CGATTTTTTT 2040 

CATTTTTTCT CCTCCTAACA ATTTACAAAA ATAGCATAGC AAAAGAAGAA GGAGATGGAA 2100 

CATCGAAAAA ACGAACAGTT ATCTTACAAA ACTGTTCGTT TTTTTTGGTC TATTTAGCTT 2160 

TCATCAGAGA GTAGAAAGGA ATCATTTTCT TACGATAACT AAAGAAAATA ACn 2213 
(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4808 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

ACGTTCAGGA ACTGATACCC TAGATTTGCG CAAGTTGCAT CGTCCGCAAa GATATCaTCA 60 
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ATCACCACCA ACGCCGTGGC ATTATTTTGA TGAAGAATGT CATCAAAAGC CATGTAATCT 


120 


TCAGGTTTTT 


CTAACGCAAC 


GCTTGGTAAA ACAGTTAATC 


CTTGCATCAT 


CATTGCTTTT 


180 


TGATAACCAA AGTAACGTTC AAAGTACAAA CTTTCATGCG TTGTATTGGT 


CACAAATAAA 


240 


ATATTTTTAT 


GGCCGTTTTC 


AATTAAAAAT TCCGTCGCTT 


GTTTCCCTAA 


TAATTGaTTA 


300 


TCATTATCCA CGTAAACTAT 


TTCCGTTTCA TGTTGGTAAG 


GTTGACCGAT 


TAAAGTAAAG 




GGAATGTTGT 


TTTCGTCTAA 


GTAATCAATG ACGGGGTCCT 


TACTATCTGA 


ATACACAAGA 




ATAAAACCGT 


CCACTTGTTT 


TTGAAGATGC ATTCTTTTAA 


CGTTTTCCAA 


TAGTACATCA 


HO\M 


AAACTTTTGG 


CAGTTGCAAT 


AGCAGTAGTC ATATCGTAAA 


GACGGGCTTC 


CTCGTTGATG 




GCTTCCATAA 


TTTCTAAATA 


AAATGGATTC CCTAACCGTT 


CTTTCGAATC 


TAAAGGAGGT 


600 


AAAATTACGC 


CAATGGCACT 


TGAAATTCGT TTACCCTUUVt 


TCCGTGCCGT 


CATATTTGGC 


\f\f\i 


ACATAGCCTA 


AGTCATCCAT 


GACTTTTCGA ACTTTCTTTT 


TTGTCGATTC 


AGAAATACTT 


f c^yJ 


GGATGATCGT 


TAATGACACG 


CGAAACGGTT GAAGTTGCAA 


CCCCCGCTTT 


CTTAGCGACA 




TCTTTTACTG 


TAATTGCCAT 


TTTTGCCACG CCTTTCTTAG 


TAATTATAGA 


AGGTTAAATT 




GATAAACCGT 


TTGGTAGGTT 


TGCTTTTTAT TAGCTGGTAA 


TAAAATAGAG 


CCCCACTGAG 




GATGATGAAC AATGTCTGGA 


TACTCTTGTG GTTCAATTGC 


TAAGCCATAG 


TTTGAATGCA 




TCGCTTGTCC ATTAATAGAA AATGGAGCCT CAAAACCAGT 


CGTAGAAAAT 


AATACCATAC 




TTTGGTTCGT 


TGTTGCTATC 


GTCATTTTTC TTCCAGAAGT 


AGGCTCTTGT 


AAGCATAATT 


1 nfifi 
xuou 


GTGGCTGTCT 


TAAGTCAGAG 


CCAACTAAGG AGAAAACATC 


ATCTAACCCA 


GTTGGATAGC 




GTTGGAGGAT 


TGTTTCGATA GGTGTCAGTT CATTAAAACG 


TATGGATAGT 


TCAGAAGCTG 


1200 


GTAAAATCCG 


TCCAGTAGGC 


AGCTTATTAT CGTCTAATTC 


TAACATACCT 


TGACAGTTCA 


1260 


AAGATAGCAA ATGTGACAGA ATGTCTAACT TTCCATCGCC 


AGATAAATTA 


AAGTAGGCGT 


1320 

X<^b V 


GGTTCGTAGG 


ATTAAAAAGT 


GTTTCCTGTT TACTCCGTCC 


AGTACTGATC 


aTTCGTAAAT 


1380 


GATTATCCTC 


TAGCTGATAA 


GTGATGGTGG CCTCAATTGG 


TCCAGGATAG 


CCAGAAATGG 


1440 


TGTCGGTTAA 


ATGAAAAGCA 


ACGCCAATGA TATCCTTTTT 


TTCAAAGGGC 


TCGACTGTCC 


1500 


AATATTGAAA 


AGACCACCCC 


TTTGAGCCAC CATGGATATG 


ATGACCGTCG 


CTGTTTTGTT 


1560 


CTAATATTCC 


ATTTTGCCAA 


TGGCCATTAC GTATGCGTCC 


AGCAACTGGT 


CCAACAGTGG 


1620 


CACCAAAAAA 


GCTTTGATCC 


TTTAATACAT CTTGATACGA ATCATACGAT 


AAAAGGATAT 


1680 


TTTCCCAATG 


ATTATTTTTG 


TTTGGCATCA AAAGTTGATG 


GAGGCGCGCG 


Vi* V* A L MM 1 1 V3M 


1740 


GAAGAACGAC 


TTCTAAAAAG 


TCGTTCTTCA AAGTAATTTT 


TTTTAAGTTT 


GTGTTTGGAA 


1800 


TATCGGTTAC 


GGTGATACGA TTATCCTTCA TGATCAAGCC 


ACACTTTTTT 


TAAGTAATCA 


1860 


AACGTTAAAG 


CGCTCGTCTC 


TGGCACGATA GGCAACCCTT 


CGCCTAACTC 


TTCTTTGCGG 


1920 


CCAACGCCAA 


CAGGTTGCGC 


TCCACTAGCA AGAATCGCTT 


GAATCCCAGC 


TTTGGCATCT 


1980 


TCAATGCCAA 


GACATTCTGC 


TGGTGTCAAA TCAATTTCTT 


TCGCAGCTAA 


AATGAAAATA 


2040 
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TCTGGAGCGG 


GTTTACCTGC 


CTGGACGTCT 


-GCTGGATTAG 


CAATGGCATC 


AAAAAGGGGC 


2100 


GTCAATTGCA 


TTTTTTCTAA 


AAGAAAAGGC 


CCATTTTTAC 


TTGCAGAAGC 


CAAGGCGATT 


2160 


TTAATATTGG 


CTTCACGTAA 


TTCCGTTAAT 


AAAGAAAGAA 


TTCCTGGGTA 


GACATCTTCA 


2220 


GGCGTAATCG 


CTTGGATCAT 


TTCCAAGTAA 


TAATCGTTTT 


TCCGTTGCGC 


TAACTGGGCA 


2280 


AATTCTTCAG 


AAGAAAACGT 


GCCTTCTTTT 


TTTCCGTATT 


TTAACAGTAA 


CTGCAGAGAA 


2340 


TCTTCACGGC 


TTACACCTTT 


CAGTTGTTCG 


TTGAAAACAC 


GGTCAATGGA 


AATTCCAATT 


2400 


TCGTTCCCTA 


ATTTTTTCCA 


AGCGTGGTAA 


TGAAATTCTG 


CTGTATCAGT 


AATAACGCCA 


2460 


TCTAAATCAA 


ACAAAACGCC 


TTTAAACATG 


AACTTCTTCC 


TTTCTGATCG 


GTAAAACTAA 


2520 


TCGATCTTTT 


AACGTGTATT 


TTTGATCGTA 


AACAGTTAAT 


GGAATTGCAG 


GACCGTCGAG 


2580 


TAAAGTGAAG 


GCCACTTGGT 


CTGCTGCAAC 


TTCAACAAAT 


AGTAAACGAT 


TCCGATAATT 


2640 


AATGTGGAAT 


GAGTAGGCTG 


TCCATGTCGC 


TGGTAAAAAG 


GGTGCAAATT 


TTAGTTGTTG 


2700 


ATGGTCCGTT 


TTCATTTGGG 


CAAAACCTTG 


AACAATCGCT 


AGCC7\ACTAC 


CAGTCATTGA 


2760 


GGTAATGTGT 


AAGCCATCTT 


CTGTATCATT 


ATTGTAATTA 


TCT7\AGTCAA 


GGCGCGCTGT 


2820 


TCGTTGATAC 


ATTTCCACTG 


CTTTTTCTTC 


CATGCCTAAT 


TCAGCAGCTA 


ACACAGCATG 


2880 


AATGCTTGGT 


GAAAGAGAAG 


ATTCATGGAC 


AGTCATTGGT 


TCGTAAAAGT 


TAAAGTTCCG 


2940 


GCGTTTTTCT 


TCTAAAGAAA 


AGGCATCATT 


GAAGAAATAA 


ATTCCTTGAA 


GAACATCGGC 


3000 


TTGTTTAATG 


AAACAAGACC 


GTAAAATTTT 


ATCCCATGAC 


CAATGTTGAT 


TTAAAGGCAG 


3060 


CTCACTAAGT 


6GTAAATCAG 


AAACAGGCAT 


TAAATCTTTA 


TCTAAGAAAG 


TGTCATGTTG 


3120 


TACAAAAATG 


CCTAATTCGT 


TATCCACTGG 


GAAGTACATA 


TTTTCTACAA 


TGTCTGCCCA 


3180 


TTTAGCTAAT 


TCGTCGTCTG 


CAATTTTGAG 


CGTTGTTTCT 


TCTTGGAATT 


TTAAATAAGA 


3240 


CTCACGGGTG 


TAACGTAGAA 


CCCACGCAGC 


GATGGTATTG 


GTATACCAGT 


TATTATTGAT 


3300 


ATTATTTTCA 


TATTCATTTG 


GCCCCGTTAC 


GCCATGAATC 


ATATATTGTT 


TGTGACGTTG 


3360 


GGAAAAGTGG 


ACCCGATCTG 


CCCAAAAACG 


AGCGATTGCT 


ACAAGTACTT 


CTAGACCAGC 


3420 


ATCTTTTAGA 


TAATCTTCGT 


CTCCTGTATA 


GTTGACATAA 


TTATAAATCG 


CATAAGCAAT 


3480 


CGCACCATTT 


CTGTGT^ATCT 


CTTCAAAAGT 


GATTTCCCAC 


TCATTATGAC 


ATTCGACACC 


3540 


AGTAAAAGTA 


ACCATTGGGT 


AAAGCGCACC 


CTTTAACCCT 


TGTTGTTGGG 


CATTATGAAT 


3600 


GGCTTGAGGC 


AATTGATTGT 


GGCGATATTT 


CAGTAGATTT 


TTAGTAACTT 


CGGGCTTAGC 


3660 


CAACGCCAAG 


TAAAGTGGGA 


CAGCATATGC 


TTCTGTATCC 


CAATATGTGG 


CACCTCCATA 


3720 


TTTTTCGCCC 


GTAAAGCCTT 


TTGGTCCGAT 


GTTTAAACGA 


TCGTCTTCTC 


CATAGTAGGT 


3780 


TGAGAAAAGT 


TGGAACAGAT 


TAAAACGAAT 


GCCTTGTTGG 


GCTTCGTCAT 


CTCCTTCAAT 


3840 


AACAACATCA 


GCTAATTGCC 


AAC6TTTTGC 


CCAAGCGGCA 


GCTTGCCCAG 


CTTTTGCTTC 


3900 


AGGGTACAAG 


GTAGTCATTT 


CTGCAAATAA 


TTCGTTTACT 


CGTGTGATTT 


GTTGCGTTTC 


3960 


TGGCACATCA 


CGACTGGTAA 


CGACAAGCAC 


TTCTTTTGtG 


ATGGCGGTCG 


TTtCGCCtTC 


4020 
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GTTTAAAGAC 


aCTGTCAATT 


CTGCGGTTAA 


AGCCAAGGGC 


TGTTGTGTAT 


ATGTAGGCAC 


4080 


AACAGAGGCT 


CCATCGATGA 


AATGGCGCAT 


GCCTGCAGTC 


ACTGTAAAAC 


GCTCTATTTC 


4140 


AAAGTTGTTA 


GGAATGGTCT 


TAGTCGTAAC 


AAAGCCAAGT 


GTTTCTTGTG 


TTTCAATAGC 


4200 


TCTTTCTTCC 


C7VAAAATGTT 


CTTCATAATT 


GCTGTCTTCA TTTTGAACAT CGCCATCCAA 


4260 


TTTTGAGTGA 


AGAGTGATTG 


TGCCTGTACC 


TTCCAGCATT 


TCAATTGTTA 


AGTGGATGTA 


4320 


AGCTGCCTCT 


TTTTTTTCTA 


AGCTTAAAAA 


ACGTTCAAAA 


GAGAAGCGCA 


CTTTATTCTT 


4380 


CGGTGTTTGG 


ATGGTGAATT 


GCCGAGAAAG 


AACCCCGTTT 


TGCATGTCTA 


ATTCTAAAGA 


4440 


AAAATCTTCG 


TAAGGTGTAG 


TGGCTAAATC 


AATTGTTTGT 


CCATCAATTT 


GTAAGTCCAT 


4500 


TGCGATGAAA 


TTAATTGCGT 


TAATCACTTT 


ACCAAAATAT 


TCAGGATAAC 


CATTTTTCCA 


4560 


CCAACaiACA 


CGTGTTTTGT 


CTGGATACCA 


AACACCTGCT 


AAATAGGTTC 


CTTGATGGTG 


4620 


GTCGCCAGAA 


TAGTGTTCTT 


CAAAATTTCC 


CCGCATCCCC 


ATATAGCCGT 


TGCCAATGCT 


4680 


TGTTAATGAT 


TCTTGTAAAC 


GTAAATTTTC 


TTTATCTAAG 


TGTGTAGTAC 


GGATTTTCCA 


4740 


AGGATCAATT 


TGGAATAAAC 


GTTTGATTTG 


TTTCATGAGA ATCCCCCTCA 


GTGATTTTAT 


4800 



TGATAGGT 4808 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15747 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 



GAGTAGGAAA 


CAGATATACT 


TAATATAGAA 


AATTCAGATG 


CACTTATTGG 


GAGGTTATTT 


60 


TATGCAACGA 


AAAGATGTTA 


CACCTAATTC 


AGATAAAAAA 


AGTTTATTGC 


AAAAATTTGG 


120 


GATTGGTTTA 


GCTGGCGGCT 


TACTTGGTGG 


CGCCCTTATT 


CTAGGAGGCG 


CTTATTCTGG 


180 


GATTATTCCC 


ACACCCAACG 


GCGGAAATAA 


TGCGGCGACG 


ACGACATCCA 


CTAATCACGG 


240 


CGACACAAAA 


GTCAGCAATG 


TAAGTTACAA 


TGTGTCTAGC 


GATGTCACAA 


AAGCCGTGAA 


300 


AAAAGTTCAA 


AATTCTGTGG 


TTTCTGTCAT 


TAATATGCAA 


AGT6CTAGCA 


ACAATTCTTC 


360 


GGCAGATGAT 


CCTTTTGGGG 


GATTGTTCGG 


TGGGAATGAA 


GGTACTCAAG 


ACTCTTCCGG 


420 


cAACAATGGT 


AACGATTTAG 


AAGCAGCCAG 


TGAAGGTAGT 


GGGGTTATTT 


ATAAGAAAGA 


480 


TGGTAAAACG 


GCTTATGTCG 


TGACAAATAA 


CCACGTCGTA 


GATAAAGCGC 


AAGGATT6GA 


540 


AGTTGTTTTA 


TCTGATGGTA 


CCAAAGTGAA 


AGGCGAATTA 


GTCGGAACCG 


ATGCTTACAC 


600 


GGATTTAGCC 


GTGATTAAAA 


TTTCTTCCGA 


CAAAGTT6AT 


CAAGTCGCTG 


AGTTTGGAAA 


660 


TTCTAGTAAA 


ATCACAGTCG 


GTGAGCCTGC 


TATTGCAATT 


GGTTCCCCTC 


TAGGCTCTGA 


720 


TTATGCTAAC 


TCTGTCACAC 


AAGGAATCAT 


CTCTTCTGTG 


AATAGAAATA 


TTACCAATAA 


780 
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AAACGAGTCT 


GGTGAACCAT 


CAATATTAAT • 


GCTATTCAAA 






A 


GGAAACTCTG 


GTGGTCCACT 


AATCAATATT 


GAAGGACAAG 




TAACTCAbTA 


7UU 


AAAATTGTGC 


AATCAACTAG 


TCAAGTGAGC 


GTTG7\AGGGA 


TGGGCTTTGC 


CATTCCAAGT 


ft ^ r» 

960 


AATGACGTAG 


tCAACATTAT 


CAACCAATTA 


GAAAAAGATG 


GTAAAGTGAC 


GCGGCCAGCT 


1020 


CTCGGGATTA 


CGATGTCTGA 


TTTAACAGGT 


ATCTCTTCAC 


AACAACAAGA 


ACAAATTTTA 


1080 


AAAATTCCAA 


CTTCTGTAAA 


AACTGGCGTA 


GTGGTTCGTG 


GTGTTGAAGC 


AGCGACCCCT 


1140 


GCTGAAAAGG 


CTGGATTGGA 


AAAATACGAT 


GTTATCACGA 


AAGTTGACGG 


CCAAGACGTA 


1200 


AGCTCTACTA 


CAGATTTACA 


AAGCGCGCTT 


TACAAGAAAA 


AAGTTGGCGA 


CAAAATGGAA 


1260 


GTGACTTATT 


ATCGTGGTTC 


TAAAGAAATG 


AAAGCAACTA 


TCGATTTAAC 


CATTGATAAA 


1320 


TCAGCTTTAA 


CACAACAAAA 


TAATCGCAGC 


AATCAATAAA 


AGAAAGAaAC 


GACCTGTTGT 


1380 


TCAAACAACA 


GGTCGTTTCT 


TTCTTTTATT 


CTCCTACCGC 


AAATGCAGAA 


CCAACTTCTG 


1440 


CATCAAAAGG 


CAATTCTAAG 


ATTCCTTTTT 


TAGCTGGTGC 


ATTTGGTAAA 


CGTAATTCTT 


1500 


TTGCTGAACA 


AATCATTCCA 


AAACTTTCTT 


CACCACGTAA 


AACCCCTGGC 


CAAATCATTA 


1560 


AACCATCAGG 


CATCATTGCG 


CCTGGTTTGG 


CCACAACTAC 


TTTTTGTCCT 


GCTTTGATGT 


1620 


TCGGTGCGCC 


ACAAACAATT 


TGTAAGATTT 


CGCCGTTGTC 


TACTTCTGTT 


TGTGTGATAG 


1680 


ATAAATGATC 


TGAATCTGG6 


TGTTTTTTAC 


ATGATTTGAC 


AAAGCCAACC 


ACAATTTTAG 


1740 


GGGTGTCGTC 


TTTTTCAAGG 


GTTTCTGAAA 


ATCCCGCATC 


TTTAATCGCT 


GTATTGACTT 


1800 


TTTCGATTTG 


TTCGTCTGTT 


AAAGTCACTT 


GGCCGGTGCC 


ATTAATGTCA 


CCTAGAAGCG 


1860 


CTGAAGCATT 


GAAAATATTC 


CAAGCTACCA 


CTGTTTCTGG 


CGCCTCAACT 


AAACTTACAC 


1920 


GTGCCACATC 


GCCGACACGT 


TTTACTTGGT 


TTTCCGCCCC 


TTGATCGTCC 


GCAACGATGA 


1980 


CAAGTAAGAC 


ATCGCCAACG 


TGTTCTTTGT 


TATATGCAAA 


AATCATTTCT 


GTTCCTCCTA 


2040 


AATTTGAAAG 


CTATAGGCTG 


GaTAGCTAAA 


TTCTCTCCTG 


TCCTATAGTA 


ACAAAAAAAG 


2100 


TGGaAAGTTG 


CACTTACAAT 


TATTGAATTT 


TAGGCAATCC 


CTAATGCTGC 


CAAAAAAATA 


2160 


GATAATAGAA 


TAACCCCAAC 


TAAAAGTGTC 


GTTGTTTTCA 


CTTTTTTCCC 


T7UVTAACCAG 


2220 


TAAATAACTA 


AGAAGAATAA 


AGCCGGGAAA 


AAACCTaACA 


TGATTTGGTC 


GATATATTCT 


2280 


TGTAATGGCG 


TGGCGATTTT 


TCCAGAGCCA 


ACTTTGGCAG 


mmik K y^m^^ik ik ik 

TTAACTCAAA 


CACGACATTA 


2340 


GAAGCCGTCA 


TCCCACCAAT 


AACCATTAAC 


CCTAAAATAG 


AGGCAGCTGT 


GGTAACTTTT 


2400 


TCGATTACGC 


CGCTATCTTC 


AACATCACTA 


AAAAATTTAG 


CACCTAATTT 


AAACCCTTTA 


^4oO 


TCCAAACAGA 


TATACCGTAA 


AGCAAAATGC 


GGAATATTGA 


TAATCAGTAA 


AAAGGCAATC 


2520 


GGACCAAAAA 


TATTTCCTTG 


TTTAGACAAT 


GAAATAGCTA 


CACCTGTTGC 


GATAACTTTT 


2580 


AAGGTCCCCC 


AAAAGAAAGA 


ATCCCCAATC 


CCAGACATTG 


GTCCCATTAA 


ACTAGTTTTC 


2640 


ACTGCATTAA 


TACTAGCTGA 


GTTAAATTCA 


TGATTTTTAG 


CATTTTCTTC 


TTCCATTGCT 


2700 


CCAGAAATTC 


CTAATAATAA 


GGTAGAGA7A 


TGCGGGGTGA 


CAGACATAAA 


CTCTAAATGT 


2760 
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CGTTCTAAGG 


CCGCTTTCTT 


CTCTTCATTA 


TTTTTATACA 


ACCTACGTAT 


AATTGGAGCC 


. 2820 


ATTGCATAAG 


CATAAGCTAA 


ATTTTGCATC 


CGCTCATAAT 


TCCATGATGC 


ATCTAACGTT 


2880 


AAAGAGCGAA 


AAAATACAGA 


CCAAAAATCT 


TTTTTAGTAA 


TGACAGATTT 


TGTGTTTTTA 


2940 


GAATTCATTG 


TCATCGACAA 


CAGCCTCCCT 


TGTTTGACCA 


TCACTCATAT 


TCATATATAG 


3000 


TAGGACAATC 


ACGATGCCAA 


ACAAAGCAAT 


CCCCACTACT 


GAAATTTTTA 


AATAAGCAGC 


3060 


GAATAAAAAG 


CCTAATAATA 


GAAATGGAAt 


CaTCTTTTTT 


GTTACCATCA 


TTCTTGCTAA 


3120 


CATCGCAAAA 


CCAATTGCTG 


GaATaATACC 


TGCCGCAATA 


TCTAATCCaT 


GCGTAATAAA 


3180 


GTCTGGGACA 


TAAGACAATA 


CCTTTTCAAT 


AAACGGCACG 


CCTAAGTAAA 


AGCCTAATGC 


3240 


CACAATTAGA 


GCTAACAAAA 


CTTTATTGCC 


AACTCCTGCA 


ATTAGTTGAA 


TTCGCTCTAC 


3300 


CCCTTGAATA 


TTGCCTCGTT 


TTGCTTCCTT 


ATCAGCTATUV 


TGGCTAGCGC 


CTGTAAGTAA 


3360 


7KAATGTCATC 


TGAAGATTAT 


CGATTAAAAG 


TACTAATGTA 


GCGATTGGAA 


CAGCTAAAAC 


3420 


AACCGCACTA 


GAAATTGATT 


GCCCACTCAT 


AATTACATAA 


GCTGTTGCAA 


TAATCGTTCC 


3480 


TGAAACAAAA 


TCTGGTGGAT 


TACTTGCGCC 


AACAGGAAAT 


GCACCCATAA 


ATACCAACTC 


3540 


TAATGTTGCG 


CCAAGCTTAA 


TTCCCGTATC 


TAAGTCTCCT 


AAAGCTAACC 


CTACTAGCGG 


3600 


AGcGACAACA 


ATTGGACGGT 


TCCACATCGT 


TGAACCAAAC 


CAGCTATGCA 


TGTAGGCAAA 


3660 


AAAGGCAATT 


AAAAACATTA 


ATAGAACCTT 


GATAAATGTC 


ACTTCCATCT 


ATTTGACCCT 


3720 


CCTATTTATT 


TAATTTAATA 


ACTTGTTTAT 


AGCAACCTTT 


TGATCGTCAG 


GGACTAATTG 


3780 


AACGGTTAAT 


TCGATTCCTT 


CATTAGTTAG 


TTCTTTAATT 


TGTGCAATGT 


CAGTTTCTGA 


3840 


TAAATGAACA 


GCTTTTGAAA 


TTTGTTTGCG 


CGTTTCCCCA 


TTTTTCATTC 


CACCTAAATT 


3900 


AACTTCCTTA 


ATTGTTGGCA 


CTTCTTTGAC 


CAATTGATAC 


ATATCAGCAA 


CTGATTCACA 


3960 


TAAAATAAAT 


AGTTTATATT 


TATCTGTCAC 


CCCAGAATTT 


AACGCCTTAA 


tTGAATCATC 


4020 


AATCGTTTTC 


ATAACTAATT 


TAACACCTGA 


AGGTTTTGCC 


ATTTTCATAG 


CGCTCATTCT 


4080 


TAATGGATCC 


TCAACCAACG 


TATTACTAGC 


AATCAAAATA 


CAATCTGCTT 


CTAAAAATTT 


4140 


TGTCCAAGTA 


AACCCGACCT 


GACCATGAAT 


CAACCGATGG 


TCCACTCTTG 


TCATTTTAAT 


4200 


CATCCTTATT 


CACCTCTCAG 


TTCTTTCACA 


ATTTTCATAA 


ACGAATCAAC 


ATAAGGAGCA 


4260 


TAAATATCTC 


CCGTATCTTT 


TCTTGTATCT 


TTGAAGTTAC 


TCCCTACAAT 


TGCTGCATCG 


4320 


CAAATCGCCA 


ATTGTTCTTT 


CACACTCTTG 


TCATTTAACC 


CCGCTGCGAC 


AATTAATGGA 


4380 


AACTCAGGTA 


ATTGTTTTCT 


AAACAACTTA 


ATCTTTTCTA 


GTGATGTCTC 


TTCACCCGTT 


4440 


GCATTTj^TG 


TCACTGCAAT 


AGCATCACAA 


CGTTGCTGTG 


CAATTTTCAA 


ATCTTCTTCG 


4500 


ACTGATTTTT 


CAGACAACAT 


TGGTTGATAT 


TTGAAGCGCA 


CGCCACCAAT 


TAATTTTGCT 


4560 


GTTGTTTTTG 


CACGATATAA 


ATCAAAAAAA 


GCTTGTAAGG 


ATGCTTCATC 


CCGCGGTTTC 


4620 


ACATGACCAA 


CCACAGAATC 


AATCTGTAAA 


AATTGTAAAT 


GATATTTATT 


GGCTAAATGA 


4680 


AAACCTAATG 


GGTCAACATT 


TAACACATTT 


ACACCAATAG 


GAATCGGTAA 


ATCTAATGAA 


4740 
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GTAACATATT GGaGCGCTTT TTCTAGCTGA ACATAATCCC CATAATAGTT 


CTCCATCAAA 


4800 


ATkGCATCTA CCCCATGTTC ACTGTAAATT tGAATCTCTT TTTTTGCTCG 


CTCTTGAATC 


4860 


TCTTGATCTG TTTTACCTTT TAAATGAATC ACCCCAATAA TTGGTTTTTC AACAGCAAAC 


4920 


AGCTCTAAAA AATCAGCTTT TTTCTTCATT AATACCCCTC CTAGAAATCA 


TCATTTTCTA 


4 980 


TCTCTTCGAT ATGAACGCGT TGAATTCCTT GTCGTGCATT TCGAATAATT 


ACCTGAATCT 


5040 


TTTCATCTGT TAATTTTTCT GAACCTGCTA GCATAAGCTC TAGTGCCAAA 


TTCAAATTAA 


5100 


CGTCAGAGAA AACGATTACA TTTTCAAGTA CAGTTAGCCC CATCATAGCT 


GTAACCACAC 


5160 


TTGCACCAAA TAAATCACCC ATGAGGATAA ATTCATCTTC TGGCGCTTGT 


TCGGCTTTCT 


5220 


TTTTTAACTC TCCAAAAAAA TCAACTGCAC TTTCACCTGG ATACAGACTA 


TATGTATGAA 


5280 


CGGCAGCTGC AGAGTGTCCC GCAATCATTT CTAACGAGTG TTTAATCCCC 


TTAGAAAAAT 


534 0 


CACCATGTGA AGCAATGTAA AACCTTCTCA TTTCATAAAC CTCCTTTATT 


TATTATATGA 


5400 


ACGAATGGCT ACTTTGTAGG ATTCGCTAAG GATATATTGC TCATTAAAAT 


ATA/U^TTTG 


54150 


ATTATCATTA TCCACAACTG TTTGATTAAT TTTGACCAAC GCTTCTTCTT 


CACTAATCGC 


5520 


TAATGCTTCT GCCAACTTTT CATCGGCCAA GGCAATTTCA ATGATATTAT 


CAAAATAGCT 


5580 


AGCAATTACT CCTCTATATT GCCGTAAAAT ATCTAATAAT GAAGTATCTG 


TTAATTCTAA 


5640 


GTCCAACATA AACTGaTCTT CTTTTCTAAA ATAATCAATC TCGAAAATTG 


CAGGCATGTT 


5700 


ATCTAACGAA CGGAGGCGCT CAATAACAAT AACTTCTGTC GCTTTTTCAA 


CACCCAATGC 


5760 


TTCTGCAACA TGCCCTGTTG CTTTTTTCAA ACAAATTGAT ATGATCTTAG 


TTGTTGGTTT 


5820 


ACGATTCATT TTTAAACCTG ACtCAGTAAA ACTCCCACCA GCTGTCATGC 


TCTCAACATA 


5880 


GGATGGCATT GAAACAAATG TCCCTTTTCC ATGCCGTTTA ATTAAAATAT 


TTTCATCAAC 


5940 


TAATTGTTTA ATCGCATTAC GAACGGTTAC TCGGCTCACT CCGTAAATGG 


TCATCAATTG 


6000 


TTCTTCTGAA GGGATTTTTT CTCCTACTTG ATATTCACCA CTTCTAACTT 


TATTTAAAAT 


6060 


AGCATCGTAT AATTGTTGAT AAAgCGGTGC TTTTTTATTT GATTCTAATT 


TATCCATTAA 


6120 


TCATACACCT CATTTTAATA CGTCTTAATA CGTCTTAAAA ATAGCAAATA 


AAAAAGAGAA 


6180 


TGTCAAGAAA GAGTTTCCTT TCTTGACATT CTCTTTTTGC TTATTTTTCT 


TCAGAGTGAA 


6240 


CAATTAAGAC ATCGCAAGGG GCTTGACGAA TAATGTAGCT ACTGACGCTT 


CCCATGACCA 


6300 


CACGTTCGAn TGCGTTTAAT CCTGATTGTC CCACCATAAT TAAATCAACA 


TTGTATTTTT 


6360 


CAGGTAATTC ATGACTCATG ACTTCTTTGG GTGAACCAAA CATGGTAACT 


GTTTCAACAT 


6420 


TATGAAAATC AACGCTTTCA GCATACTTTT TACAATCGGC CATTAATTCT 


TCCGCAGCGG 


6480 


CTGTTTCTTG ATCAATTAAT TGATCATTTA ACGTCGAATA CCCCATCATT 


GTATACACTT 


6540 


GATTTTCTAA AATATTTGCA ACAATGACTC GGCTTCCGTT TCGACGAGCA ACCTCAATTG 


6600 


CACGTTCATA AGCTAAATTG GCTTGATCAC TACCGTCTAC GCCGACAAGA 


ATATTTTTAT 


6660 


ACATTTGTTC TTCCATTTTC GTCACCTTCC CGATGGAGGC TGATCTAAAA 


GGCTTCTTTG 


6720 
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ACTAA6AAAC 


TAGAGTCAAA 


AATGAATTTA CGACCCTTGC 


GCAGAACGCC 


CTTTTGTCCA 


6780 


GCTGATTCTC 


TCTTAATTTA 


ATGGTTCCTC 


TGGTTCATGA 


AAATTAGCCT 


AAGCTAGTTA 


6840 


AAAATGCACT 


GATTTCTTCT 


TTTGTTTTTC 


GGTCTTTATT 


TACTAAGCGT 


CCTAACTCTT 


6900 


GACCGTTTTC 


AATTACGACA 


AAACTCGGAA 


TACCAAAAAT 


ATTCCACTCG 


GCAGCCACGT 


6960 


CAATAAACTG 


ATCACGGTCG 


ACTTCGATAA ATTGT^CGC 


TGGAAATGCT 


TCTTCAATTT 


7020 


CAGGCATGAC 


TGGTTTAATG 


AAACGACAGT 


CTCCGCACCA 


ATCAGCAGTA AAGAAGAAAA 


7080 


CATTTTTACC 


TTTTTCAACA 


TAAGTTGCTA 


GTTCTTCTAA 


TGAATTGGGa 


TAAATCATTT 


7140 


7WVCATCCTC 


TACTTTCCGT 


AAAC7\ATTGT 


ATTAACTGTT 


GATTTATCCA 


AGCCACGTAA 


7200 


TGCTTGTAAC 


AACATTTCCC 


GCGCTGCTTC 


ATAATCCGCA 


ATACTAAACA 


TTGTTTGATG 


7260 


CGTATGGATG 


TAACGACCAC 


AAACGCCGAT 


GACTGTACTT 


GGTACACCGT 


TATTAGTTGT 


7320 


GTGCGCTGCC 


CCAGCATCCG 


TTCCGCCTTT 


AGAAACAAAA 


TATTGATACG 


GAATATTGTG 


7380 


TGTCGCTGCT 


GTATCTAATA 


AATATTCACG 


TAAGCGAGGC 


AATGTGATCA 


AACCTGGGTC 


7440 


GAAAATCCGT 


AATAACGTTC 


CTTCACCTAA 


ATGACCATAC 


GTCCCTTTTT 


TCGTGTGAAT 


7500 


ATCATCTGCC 


GCTGAACAAT 


CCACCGCAAA 


GAAGATATCT 


GGATTAAATT 


TATGAACAGA 


7560 


TGGTTTAGAA 


CCACGTAAAC 


CAACTTCTTC 


TTGTACGTTT 


GCTCCAGCAA 


TTAATGTGTG 


7620 


GCCTAATTCT 


TCATTTTGTA 


ACGTTTCCAA 


TGCATCTAAT 


ACCAACGTAC AACCATAGCG 


7680 


GTTATCCCAT 


GATTTACTGA 


TAATATTTTT 


GCCATTGGCT 


GTTTTAATCG 


TTTCTGTTTG 


7740 


TGGAACAATG 


GAATCCCCTG 


GACGAACGCC 


AAAGCTTTCA 


GCTTCTTCTT 


TTGAAGAGAA 


7800 


TCCTGCATCA 


AATAACACAT 


CAGAGACTTC 


TAATTGTTTT 


TGACCACTCG 


TACCACGCAA 


7860 


CAAGTGTGGC 


GGAATAGATG 


AAlj AAA 1 AUA 


iCAiAXAl?! 


CCTTTACTTG 


TTTTTAACGT 


7920 


AAAACGTTGT 


GCTGAAACAA 


CATAGGGATT 


CCAGCCACCT 


AAAGGAACCA 


CTTGGAATAA 


7980 


ACCATTATCG 


TTAATTTGAG 


TTAACATGAA 


GCCAACTTCG 


TCCATGTGAG 


CTGCTACCAT 


8040 


TACGCGTGGC 


GCTGCTTCCA 


CTTTTGAACG 


TTTAATCCCA 


AAAATACCAC 


CTAAACCATC 


8100 


ATATTGGACA 


TCGTCCACTA 


ATGGGGTGAT 


ATGTTCTTTC 


ATGTACGCGC 


GAATGTCGTC 


8160 


TTCAAATCCG 


CTGGTTCCTT 


GTAATTCAGT 


TAATTCCTTG 


ATTCGTTGAA 


ATGTTTTTTC 


8220 


TTCCATAGAT 


GTATGTCCTA 


TTGCTTTTGA 


TTTGTTGGAA 


ATTACGCCAA 


CCAAGCGTCA 


8280 


CTTTTTCTCT 


AAAAAGTCAC 


TTTATCCTGT 


CGATAAAAAT 


AGGTACTCCT 


TTCATATGAT 


8340 


TTCACTCTTA 


TTATAACGCA 


AATACTAGAA 


AATGCACATC 


GCTTTCTCTT 


TTTAAACGTG 


8400 


ACAATGAGAA 


TGCCCTAAAA 


GAATGATTGA 


AAAACTTTTT 


TTATTTCTCA GCGGCCCCTT 


8460 


CTCTATGGTA 


AAATAAGAGT 


GATACTGACA AAAGTAATAG 


TAAGCAGAAA CCGACCAAGG 


8520 


GAGGTCATAC 


AAATGAACGA 


AGAAAATGAG 


TTGACATACT 


TTAAAGGCGG 


CTTAGCGCTA 


8580 


GGCGTTGGTC 


TTGGTTTAGT 


AGGCGGCATT 


GCCTCAACCT 


TGTTCTACCA 


T7UVGAAAAAA 


8640 


ACAATCTCCG 


CTGATCTCGT 


CTTAGAAAAT 


GTAAAAGCTG 


CTTTTTTAAA 


AGAAGGACCG 


8700 
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ATTGAAGGTT 
AAAACGTATA 
ACTGCAGATG 
TGTTACAAAA 
TAAAAGGGAT 
ATCGAATGCT 
TTTTCATAAA 
ATTTTTTAAT 
GGAGCGTGTC 
GCTTTAGCnG 
AACCAAGTAT 
ACAGATGAAG 
AAAGACAACA 
TTAGTTTACA 
GCTGCAGATT 
GCCTACATGT 
GAAGAATTGG 
ACACCATATT 
GTTGAACGTT 
TTTACGCTAA 
GAAGAATATT 
GAAGCCCCAA 
GGCGAACTGG 
TCTTTCTATT 
CGTCGTGCTA 
GGCTCTCTTC 
GAAGACTTTG 
GTGGAATACT 
ATGGTAGACG 
GATACTTTGG 
CGTACCTTAA 
CCAATTAACT 
AATCCTGAAT 



CGTGGATTGA 
CTGGAGGAAT 
CCTACACAGG 
GCACTTATTT 
GTTTATTTTA 
TTCTGCATGG 
AAAGATTAGA 
TTCTCCTTTT 
AAATGAAGAA 
CcTGCGGCGG 
ACACAATGGT 
TGAGTTTTAC 
AACCCGCGCC 
AAGTTAAATT 
ACGTTTACGG 
TTGAACCAGT 
GCATTAAAGC 
TTGACGATTT 
TTGGTAAaGA 
CTGAGTTTGA 
GGGACAAAGA 
CTGCCTTAAA 
CTCAACAAAT 
TAGAAATGAA 
TGTCTTATGC 
CTTCACAAGG 
TTAAAGAAGC 
GGAACAAAGC 
ACTCTGAAGG 
AAGGCTTAAA 
AAGGGGATTT 
TCTTAGATTT 
ACGATAAATT 



GTTTGAGAAA 
TTGCCGCATT 
AACTATCATT 
GCTTTTGTAA 
TGTCAAAAAA 
AAATCAGCTT 
CATTTTTTAA 
CATTTTTACA 
ATCTGTTTTA 
TGGCAGTGAC 
TGAATCCCTiA 
CACTTTAAAT 
TGCTGGTGCA 
ACGTGAAGAA 
TTGGCAACGA 
AAAAAATGCT 
AaTCAATGAT 
ATTGGCTTTC 
TTATACaAAG 
TGGTCCCGGA 
GACGGTCAAG 
TCTTTATGAA 
GCAAAACTCG 
TCAAGCAGAT 
CATCGATCGC 
CTTCGTTCCT 
CGGCAGCGAC 
GAAACAAGAA 
CGCTAAAAAA 
AGTAACTGTG 
CCAAATCGCT 
ATTAGAAAGC 
CATTGCTGCG 



AAGCCCTTGC 
GAAGACGACG 
GACATTCAAC 
CAATCTTTTT 
ATCAAATCAA 
AAATCTTTAC 
AAATTGATAA 
TTATACTGAG 
TTTACTTCAT 
GATAAAGGAG 
GAAATGCCTA 
AATGTCTACG 
GCCGAAAAAG 
TCAAAATGGT 
ACAGTGGATC 
GAAAAAATTT 
CATGAATTAG 
CCTTCTTTCT 
AGTAGCGATA 
ACAGATACTA 
TTAGATAAAG 
ACTGGTGAAG 
CCTGACTTGG 
GAAAAATCAC 
GATTCATTAG 
GTGGATGTCG 
AAATTAGTCA 
CTTGGTGTTT 
ATGGGCGAAT 
ACGCCTGTCC 
GTTCGTGGTT 
TCAACTTCTA 
TCCAAAACCA 



GGAAATTCGC 
GCATTGTTCA 
GATTAAAAGA 
TTCTTCTCAT 
TGATGAATAA 
TTATCAACGT 
AAGAATAATC 
AGTATTGCTA 
TACTTGTATT 
CTAGCAACGG 
GTGCCGATCC 
AAGGAATCTA 
CGACTGTTTC 
CTGATGGCAA 
CTGCCACTGC 
CTAAAGGGGA 
AAATCACTCT 
TGCCGCAACG 
AAGCAGTCTA 
AATGGTCTCT 
TCGCTATCAA 
TGGATGATAC 
TCCAATTAAA 
CATTGACTAA 
CTAAAAATAT 
CGAAATCACC 
AATACGACAA 
CCAACTTAAC 
ATCTTCAAGG 
CTATGGCTGT 
GGAGTGCCGA 
ATAACCGTGG 
CAGATGTTAA 



CATCCACTCC 
ATACGAATTC 
TTAAAAAGAT 
TTGTTTTTTG 
ATACTCAAAA 
TTGTCCACTT 
ATTTCTAACG 
AAAATGAAAA 
ATCAAGCTTA 
CGGCAGCGAC 
GTCCCTTGCG 
TCGTTTAGAT 
AGAAGACGGT 
ACCAGTTACT 
TTCAGAATAT 
ACTACCTAAA 
AGAAACAGCA 
TCAAGATATC 
CAaTGGTCCC 
AACT/U^AAAT 
CGTGGTGAAA 
GTATTTATCT 
AGCCGCTTCT 
TGCAAACTTA 
TTTAGCTAAT 
AAAAACGGGT 
GAAAAAAGCT 
GGTTGATTTA 
ATCACTATCT 
TCGCTTAGAT 
CTATTCAGAC 
ACGTTACAGC 
CGATCCTGAA 



8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
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AAACGCTGGG 


AAGATCTAAT 


CAACGCTGAA AAAACAGTAA 


TTGCTGATAT 


GGGTGTTGTG 


, 10740 


CCAATTTACC 


AAAAAGCAGA ATCACACTTA CGTGCACCAA 


ACGTGAAAGA 


AATTATTTAT 


10800 


CATCCAACAG 


GTGCTAAATA 


CGACTTCAAG TGGGCGTATA 


AAGAATAATC 


ACTCGATTAA 


10860 


ATTTAGAAAA AGATTTGGAA CAATAGCGAT TTTGGCTATT 


GTTCCAAaTC 


TTTTTTATAT 


10920 


AATCCCGATT 


AAAG7VACCTA ACAACCCAAA AATCACAGTA AAAATAATTA AAGCAATCGG 


10980 


CGTGCGTCCT 


TTTTTCAACA 


TAATAAAACA AAACAGCGTA AACACTAAAG 


GT7\ATAAATT 


11040 


GGGCATGATT 


GTATCAAAtlA 


ATTGTGTTTG AAGGTCTAAC 


TTAGTGTCTT 


TCAAGCGAAA 


11100 


AACAAAAGTT 


GTCGTTAATT 


GTACACTCTT CGGAATCAAC 


GCGCCCACCA 


CGGAAATTCC 


11160 


TACAATTGTT 


GCTGAACGAG 


ATAATTTTTT AGTACTTTCG 


CTTAAAACTG 


AAATTGCTTT 


11220 


GACACCCATT 


TTATAACCTA 


GATTAATTAA CCCAATTTTG 


GCAGCCATAT 


ACGGAATATT 


11280 


AAATAACAAC 


AAGAAAACAA 


TTGGTCCTAA GATATTGCCT 


TCTAATGATA 


GTTGTGAACC 


11340 


AATCGCTGCT 


GCAATCGGCA AAACGGTTAA ATAAAATAGC 


GCATCCCCAA 


TTCCGCCTAA 


11400 


CGGGCCCATC 


AACGCTGTTT 


TAATCGCTTG AATGGATTTT 


GGTGATTCTT 


TTGACTCTTC 


11460 


CATTGCAATA ATCACACCAT 


GCATGAACGT CACTAATTGT 


GGATTCGTGT 


TCATAAATTG 


11520 


TAAATGATTT 


TCCAAAGAAA 


GAGCTAAGTC TTCTTTATCA 


CGATGAATCT 


TTTTCAATCC 


11580 


TGGAATTAAT 


GAGTAG6TCC 


AGCCGCCTGC TTGCATTCTT 


TCATAATTAA 


ATGAAGATTG 


11640 


TAGCAGATAC 


GAACGGAACA 


TTATTTTTTT TAACTCTTTT 


TTTGTTATTA 


CTTTTTCGGA 


11700 


TATCATATCA 


TTATATGTTT 


CTTCAGATGC CATCTTCTTC 


AAACCCTCCA 


TTATTATTTG 


11760 


ATGCAGTGAC 


TGTATTATTT 


CTTTGTGCGT AAAATTCATA 


TA/UVGCAATT 


GCGACAGCTA 


11820 


AACCTGCAAC 


CGCTAATGTA 


TCTAACCCTA AATAGGCAGC 


CATAACAAAC 


CCAAAAATAA 


11880 


AGAACGGCAC 


GTAGGATGTT 


TCCCACATCG TTTTTAATAA 


CATGGCGAAA 


CCTACTGCTG 


11940 


GCATCATCCC 


ACCTGCAACG 


CTCAAGCCAT GAATACTCCA 


TTCGGGTAGA 


ATAGAAATGA 


12000 


AACTAGAGGC 


TTTATCAGCA 


CCAAAGAAAA TACATAAAAA 


TACTACAAAA 


AAATAACAAA 


12060 


TGAATAAAAC 


GATTGGTAAA 


AAACTAGACA TTCTTTCGAT 


GCCCCTTGTA 


TCTGCTTTTT 


12120 


TTGCATAAAC 


ATCAAATTTA 


TGCATAATTG GTGACATCGC 


TGTATATAAC 


AAGGTAATTA 


12180 


ACGCTTGCGC 


TGCAACTGCG 


AAAGGAACAG CCAAACcTAC 


GGTCACTTGT 


GGGTCAGATT 


12240 


TTGTGATAAT 


TGCTAACGCT 


GTTCCAATAA CTCCCCCAAT 


TACTACGTTA 


GGTGGTTGTG 


12300 


CACCGCCGAT TGCTACTGCA CCAATCCAAA CTAACTCTAG CGTTGCTCCA GTGATTAACC 


12360 


CTGTGTTGAC 


ATCTCCCAAT 


ATTAAACCAA CAATCAGCCC 


TGTAACAATC 


GGACGATGAA 


12420 


TATGTGTCTG 


TAAGACAAGC 


TTATCAACCC CTGCTATCCC 


AGCCCAAATT 


CCTACCAAAA 


12480 


TTGCCTGTAC 


TAACATTTTT 


TTCCTCCTaT TAaCTTCGTC 


TACTTTTTAT 


AAACTGTTTC 


12540 


ACCATTTACT AAGGTTTCTA GCACTTCATA TTTCTCATCA ATAATTGTAA AATTACCTAA 


12600 


AAGCCCTAGT 


TTGATTTCTC 


CCATAGATTC GTTCACATTC 


AACAAACGAG 


CTGGATTTTT 


12660 
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CGTGACTGAA TTAATAGCAA TCACTTCTGG 7VAGTAATGCT 


TTTTCAACAA 


GGTTGCGAAC 


. 12720 


CATATTATTT 


AAATGGTTCG TACTTCCAGC 


TAACTTGCCA 


TTGGCTAAAC 


GGACCACATT 


12780 


TTGTTCATCA ATAACCATTT CGATCCCTTT 


TTCTGGTTTT 


GGATAAACGC 


CTGGTTGACA 


12840 


CCCTTTGGCC CAGATAGAAT CTGTCACAGC AATTAAATAG TCCTTGCCTT 


TTAGTTTGGC 


12900 


TAACACACGT 


ACTAATGAAT AATCAACATG 


TACCCCATCA 


GCAATAATTT 


CAGCAAACGT 


12960 


TTCTTCACTA 


TCCAAAGCAG CAACTACTGC 


GGTTGGTTTC 


CGGTGACTGA 


TATCTTCCAT 


13020 


TCCATTAAAC 


GTATGGGTAA AACTTTTTGC 


ACCTGCTTCG 


ACAGCTGCCA 


TTGCTTGTTC 


13080 


ATAGGTTGCA GCCGTATGGC CAATTGAAAT 


GACTACATCA 


TGTTCATGAC 


AATATGTAGT 


13140 


CAAGGCATTC 


TCTACATCAT TTTCTGGTGC 


CAACGTCATT 


AATTTTATGT 


GCCCTTTTGC 


13200 


TAACTCTTGC CATTTTTTAA ATGTCTCAAC GGATGGTTTT ACTAATAACT CTGGATTATG 


13260 


AGAGCCACGA AATTCTTCAG AAATCATTGG 


TCCTTCCGCA 


TGAATCCCTA 


AGATTTGTGC 


13320 


TCCGTTGGTT 


CCTTGGTCTT CCTCAATAAA 


TGAACCAATC 


ACTTCAAATG 


AATGTTCCAA 


13380 


GTCTTTTGGA AATGTTGTTG ACGTTGTTGG 


CAAAAAAGCT 


GTAATCCCTT 


CTTCTGGAAG 


13440 


GTAAGCTTGC 


CACTCTTTGA TAAATTCATG 


ATTGGCATGA 


TTGGCATCCC 


CACCATGCCA 


13500 


ACCATGATCG 


TGAATATCAA TAAATCCTGG 


AAGAATGCGC 


TTTTCTCCGT 


ATTCTTTGTC 


13560 


GACAGCTTTT 


TCATTGTACG GTAGAATTGC 


AACAATTCGT 


TGTTCTGACA 


CTTCAACTTG 


13620 


AGCTGGTTGA 


AATTGTTCGT TAATCCATAC 


GTTTTTACTT 


TGAATAATCA 


TTCTTCCCAC 


13680 


TTCTTCTCTT 


CAAAATTATT GTCCAACATT 


ACTTTTTTAT 


GAAAATAATG 


CATCTAAACT 


13740 


TTCTTCTGGT 


GTTGACGGTA CACGTTGAAT 


CGAAAGTTTA 


ACACCTAATT 


CTTGTAGCTT 


13800 


TTTAAAGGCG 


TCTACATCTG TTTTATTAAC 


GGCAACAGCG 


GTAGCTACTT 


GTtTTTGcCT 


13860 


TCTTCCATAT 


GCATATTACC AATGTTTAAT 


TTTTCAATTG 


GCACACCACC 


TTCTACCAAG 


13920 


CGCAATGCAT 


CAACAGGATT ATCAACTAAC 


AGAAAAATCT 


TTTGTCTTTC 


TGAGGCTTTA 


13980 


TGAATGACCT 


CAATCGTTTT CTTAATGGAA 


AAAAAACGGG 


TTTGTACTTC 


TGCAGGGGCC 


14040 


GCCATGTTTA ACAATTGTTG GCGCAGTGTA TCCTCAGAAA TTGCATCATT 


TGCGACTAAA 


14100 


ATTAAGTTGG 


CGCCTAAAGA ACCGTTCCAT 


TGTGTTGCGA CTTGACCATG 


AATCAAGCGA 


14160 


TTATCAATTC 


TTGTTAATAA AATATTCGGC 


ATAGTGCTTC 


CTCCTCTTTT 


CTAATTGGTT 


14220 


AGATTCCGTC 


TGCTTCTTCT TGAGTATTTG 


CTTGTGGTTG 


TTCTAGTTTT 


TTATGAACAA 


14280 


TCCCATCTCT 


GGCTGAATCT GTGACCATTG 


AAACAAAATC 


TTCTACGTTC 


GTGCTGATTG 


14340 


TGCGTAATGC 


AAGCCCTTCA AGAACCATCG 


GTAAATTTGT 


TCCTGCTAAC 


ACTTCGACAT 


14400 


TTGGATAATT 


ACTTGCGACA GTCATTGAAA 


TTCTAAACGG 


TGTGCCACCT 


AAAAGGTCAG 


14 4 60 


TTAAAATGAC 


CACTGCTGGA CACGTCTCCA GCATAGAGTC 


AACGGCTTTA 


CGAATGTTTG 


14520 


ACTCTAATAC 


ATCAAGCCCT GTATCATTGA AAAAAGGGAC 


TGCCTGAACA 


TTTGCTTGCT 


14580 


CTCCTGCAAT 


CATACTAACT GATTGATGCA 


TTCCAGTTGA 


AAATACCCCA 


TGCCCCGTTA 


14640 
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AAATT^GACC 


TATCATTTAT 


TCTTTCTCCC 


CTAACTATTT AsTnnGCTTA CTATATATTG 


14700 


GATAAGCAGA 


ACTTATTTCT 


CAAAGAGTGT 


AACAATTCTT 


GTGACAATTG 


AGTATGCTTT 


14760 


CCCATGATTT 


TGGACGATGa 


TTTCTATTTA 


AGTU^CATGTC 


CCTATACAGA 


TACAAGGCGT 


14820 


CCCAACTTAA 


AGATGCTTTT 


GAAAAAATGC 


TCGACTACAA 


CCGTGAGTCC 


AAACGAGCCA 


14880 


ACAAATGTAC 


CCATATACAA 


CACAAGAATA 


ACCAAGCTGA 


CTGGTTGATt 


GGTTAAAAAC 


14940 


GGGTGGACCA 


GTTTCAAGAC 


CACTTGAAAA 


ATAAAAACAT 


TCGCTAAATA 


ACTGCGATAC 


15000 


GCATAAgTGG 


CTATCCATTT 


AACCATTGGT 


AAAAGACGAA 


TCTTTTTGGC 


TATTCCTTGc 


15060 


GTTCCCAAAT 


AAAAAACCAA 


CCCGATAATC 


AATAAACTAT 


AGGCACTTTG 


AAGAGGATCC 


15120 


AAATAAGGTA 


GATCGTTCGG 


AGTCATCTGT 


GTTCCATCAA 


ATAAATTCCC 


TTGGACTGCC 


15180 


TGCCAAAAAA 


TAAAaAGaAA 


CAACAGAAAT 


AGTGCCACTC 


TCGTCTTTTT 


CAaGAAAGTa 


15240 


AAAATTTTCC 


CTTGATACTT 


CGCTGCGATG 


ACCCCGAAAA 


TACCATAAAT 


GAGATAACTA 


15300 


AGCCCTGAAC 


GATCTATTAA 


GTACCAAAGA 


GAAGTTTCTA 


CATATGGAAA AATCATTTTC 


15360 


TGGTACCAGA 


AGCACCAAAG 


GATATGCGCA 


CCGATTGTAA AAAAGACAAT 


TGGTAAAATC 


15420 


ACTTTTTTGT 


TTTTTGTAAT 


TCTGGTTGCC 


AACCAAATAA AATACGGCAT 


AAATAATTGG 


15480 


AATTGCAACA 


TCATTACGGC 


ATACCATAGA 


TGTGCCGCGC 


CATCACCACT 


GAGAACTTTT 


15540 


CCTAGAAATC 


CTAGGGACGT 


TTCATAGGGC 


ATTCGTTGTT 


GTAAACTAGG 


AAAAGCGTAT 


15600 


AGATAGAGAC 


TGGACCAGAC 


CACGTAAGGA 


AGCACCAATT 


CATAAAATTT 


TTCCTTTAAA 


15660 


AATTCAAGAT 


ACGATTCAGC 


TTGACTCTTT 


AACACCATGT 


TGTAAATAAT 


TGCAAAAATA 


15720 


AACATGGGTG 


CCGAATATTT 


AGACATT 








15747 



(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

TGCGTGAAAA GTATAAAGCT TTTGAACTGC CAnATAAGGT AGGAACTTCC CAGCAAAATA 60 

ACCTnGTAAT TGACCATAAG CAAACAATGG CGAAACATTT CTTGAAAAAA ATTGTAACTT 120 

GTAAAATTCT TCATAAGGAT CCCCAAAATC CCATCGATTA A/^TCAATCA CACCGACCTG 180 

CCTAGTTGGC AAGTAAATAA AATTTCCCAC ATGGAAATCT CCATGCTGAT AAACCACAGG 240 

CCGTTGTTGC AATAACGGTA ATTGTTTTTC TAAATATGAA ATTATTGGCT GCTGGTACGG 300 

CATACAAAAT TGAGATTCCT TATACTTATT TAGTTGGCTC TGCTTTTTTT CGTATAAATT 360 

TTGCGCAATC TCTCTTTGAC TAATCCTTTC AGGCi\ACAAA AGCTTATGGA TTTTGTTTAA 420 

ATAGCGGCCT GCCTCTACGC CTAAGTTCAG TTGTCTTTTG GGAGAAAGAG CGGGTAAAAC 480 
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ATCGCTTAAG TCTTCACCTT CTAGATACGT TAATAACATG TAT.CCTTGTT CACTTTCAGG 540 

AATAGGTGCA CATAAAAAAG GTTTAGAACT GGGAAATCCT AACGCCGCTA CTTTTTTGAT 600 

AAATTGATAT TCTTGTTGTT TGGCTAATAA CTCTTCTTGT TGAAAAATTC GCAATAAAAA 660 

GGTTTCTTGA TTTTTGGTAA CTTTAAACTT TAAATCAGTT GACCAACCAC GGTGAATAGC 720 

TTCAACTTTT TGCCATTGCT GAAACGCTGG AATCTTCTCA AACGCCCTCA TTACTTTCGC 780 

CTCCATTTCC CTATTGGTTT TAGTATACCA AAGAAAAAAG GCTGGAGACA ATTGGGGATA 840 

AATTGTCTCT AGCCTTCACT GGTTATCATT TTCTAGTTTT CCTTATATAA AtCGTTCCAC 900 

TCGTTACTAA GAAAGCTCCT CCTAAAACAA CCCAGAAAGT ACTAGGTACT TCTGCACCTG 960 

CTTTTGGTAA TACTTTTTTG GGAAGCTGTT CTTTTTTCCC AATCATATTT TTATCAGAAA 1020 

AAGCATGAGT ACTTTCTGTT GATTTGGTTC CCTGATTTGC TGATGAATTC AGCGAATTAT 1080 

CTAAGGATGA CTCACTAGTA GTTTTTCCAG ATTTGTCTGG GTTTTGGTTT TGGTTTTGGT 1140 

TTTGATCTTT TCCTTCGATT GGTCCTTCAC TTGTTTTTGT TTCTTTGTCT TTATCTGGTT 1200 

CGCCACTTGT ATCTAGTTCT TTATCTTTAT CTGGTTCGCC ACTTGCATCT GGGTCTTTAT 1260 

CTTTGTCTGG TTCGCCACTT GCATCTGGGT CTTTATCTTT ATCCAGTTCG CCACTTGCAT 1320 

TTGGTTCTTT GTCTTTGTCT GGTTCGCCAC TTGTATCTGG TTCTTTGTCT TtGTCTGGkT 1380 

CTTCTACATG TGGTTGATTT ATCTTGTGGT TATTTTCAAA TAAAGCACCT AATTGTAGCA 1440 

ATGTTCGGTC TTCATTTAAT GCACTATTAA ATTGAATCCC CAAAGGCAGT CCTTCTTTTG 1500 

AAACGTAAGT TGGTACACTT AATGCTGGAT GTCCAA 1536 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17250 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS": double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

GCCAAATTTG GTGGATCGTG TCCGCTTGGG AnnGCCTTAT GGGTGGACCC TACGTGCCAT 60 

TTGGGTGAAA ACTAGGCCAT ATTTTAAAnG GAATATCCGT GCTTTCCTTG GCCACCGTTC 120 

ATAATCCATT TCCACTGAAC CGCAAAAAGA ATCGTATCCA GAGAGATGGC AAGGCCAnAC 180 

AGAGAAaTTT TyCTTkGGaA CAACTTCAAG AAACAATTCT TGAAAAAGCC AATCAACAAG 240 

TAGGCAGTCG CCGGaCGACT AAAGAAGTCA TTAAATTTAC TACTAAAAAT AAAATTGGAG 300 

ATAAAATGAC TCTTGGTACG AAGAGCTTGT TAATTTTGCT ACTTTGTATG ATTTTTTATG 360 

TAAGTCTTAC CTTTGTATW^G GGTATTTCAT TAACAACAGC TTTCATTCAT CCAGCGGAAA 420 

CAATTCAACG TATATTTGTT CCTGAGAAAG TAGCAGGGAA GGATACAAAT GCTGTGAAAa 480 

CGACTAAAAC TAAACCAGCT TCAACAAAAG AATCTAGGCA GCAGGAGGAA GTTCAGCCTA 540 

GCCCAACGAA CGTACCAGAA AATAATAATA GCGAACAAGC TGTGTCAACG TATACTGTGG 600 



wo 98/50555 



PCTAJS98/08985 



572 



AAGTTGGAGA 


TTCAGTCAGT 


TTGATTGCTG 


AAAATCACGG 


ACTCACTATA 


GAACAACTTC 


660 


AAACTTTGAA 


TCCAGAAATC 


ATTGAGGTAC 


CTATTTATCC 


TGGACAAGTG 


TTGAAATTGA 


720 


AAGAGGTGAC 


TGAATAATGC 


GCGAGGTAAA 


ATTAATTTTA 


GCTGaACAGT 


TTTCaTACTT 


780 


AGGCsTTGCa 


CAACGAATTG 


CTAAATACAG 


TAATAAAGCA 


TTATATCAAa 


GCCACATCTT 


840 


aGgAAATTTA 


TGGCAAATTT 


TAAGTCCTTT 


AATTCAGTTA 


GGTGTCTATT 


ATTTTGCTTT 


900 


TGGGATTGCC 


CTTGGTGGTG 


CTCGAACCGT 


AAAAGGAGGA 


GTATCCTATA 


TTGCTTGGTT 


960 


GATGGTAGGA 


TTAAGTACAT 


GGATTTTTTT 


AAGTACAGTA 


ACAAAACAAG 


CTTCTAGTTC 


1020 


TGTTTATATG 


CAAGTTGGCA 


TGGTCTCTAG 


AATGAAGTTT 


CCAATAAGTA 


TTTTGCCAAT 


1080 


GGTTAAAATA 


TTTTCTGAGT 


TACCAAGTTA 


TTTTGCATTC 


ACTACGTTAG 


CAGCGATTGT 


1140 


GTCAATCTCG 


ACAGGTGAAA 


AGATTTCTAT 


TTATTGGATT 


CAGCTACCTT 


ACTATATTGT 


1200 


AGCGATGATT 


ATCTTTTTAT 


ATAGTTTTAG 


TCTAATTAAT 


TCAACTATTG 


CTGCATTAAT 


1260 


TAGGGATTAT 


CAAATTTTCT 


TGAATTCTAT 


TATCCAAGTA 


TTAATGTATA 


TGTCAGGGGT 


1320 


CTTTTGGGAT 


TTGAGTACAA 


AGAATTTACC 


GTCTTGGTTA 


TCAAAAATTC 


TTATGTTGAA 


1380 


TCCTTACGCA 


TATATTATTA 


ATGGATTTCG 


AGATACATTT 


TTCTATAAaa 


GATGGTTTTT 


1440 


TGAAGATAAG 


CCCCAGCTTG 


TTGTTTTTTG 


GCTAACAACA 


TTAGTAATAT 


TTATCGTAGG 


1500 


AGCTCATTTA 


CATGTTAAGT 


TCAGAAGTAA 


ATTTGTTGAT 


TATATGTAAC 


ATGTGAAGAA 


1560 


GTAAAACCAC 


CGTATACTGA 


CAAGAATACG 


GTGGTTTTAC 


TCTTTAATTG 


ACAATAGCCT 


1620 


TCTCTTTAAA 


GAGAAGGCTA 


TCTAACGAAT 


CAATATTATT 


ATTTTAAAGT 


TGGATCAGAC 


1680 


TATTTCTAGT 


ATTCAAGTAT 


TTTTCATTTA 


AGCGATGTAT 


TCCCTACCTC 


TAATACATCC 


1740 


TTGTATCTCT 


CTAGACACGC 


CAgTTTTGAA 


TAAAATCATC 


AAAACAAyTG 


AATGGTTCGT 


1800 


TTAAAAATAT 


TTTTCGTGTT 


CTAATTATTA 


TAACATAAAA 


CTCGTCAACT 


GACCACTAGC 


1860 


TGgACGAGTt 


CATAATTTkT 


CCAGATAAAT 


AGACAGCCGT 


TTTCCGCCGA 


CTGTCTAAGA 


1920 


TTAAGATAGA 


TTTGTAAAGA 


AGTGgAATAA 


ATGgCTTCCT 


ACTATCTCtT 


TATAACAkGA 


1980 


GGrAAAgAAA 


ACkGCaAAaG 


ATAGCTGTGT 


TTTTTGAAAA 


AAAGATCAGC 


TGGTTTCCGC 


2040 


CAGCTGATCA 


TCCGTTGAAA 


AGAGATATAA 


AGAAGTGAAT 


AAAGCTTGCT 


TTACCTCTAT 


2100 


TTTATACCAT 


GTTTT.GATTC 


AAATAACTAT 


TCAGTATTAG 


TTATACTTAA 


ATAATAAATA 


2160 


TGTATACGGA 


TGTATTGTTT 


TGAGGTATAG 


TCATAGAGAA 


AAGAGGGAAC 


GTTTATGAAA 


2220 


AAATTTTGTT 


TAAGTTATAG 


CTCTGGTAAA 


GATTGTTTAT 


TGGCGATGGA 


TCGATTGGTA 


2280 


CAAGCAGGAA 


ATCAGCCAGT 


TGCGTTAGTG 


ACAACATTAA 


GTGATGAAAT 


TAATCGTTCT 


2340 


TGGTTTCACG 


GCATACCAAT 


CTCCGTCTTA 


GAAGCAGCAG 


CCGAAGCGTT 


AGATTTACCA 


2400 


CTTGTTATAT 


CACATAACAA 


TGAAACAAAT 


TACACAGAAA 


AAGTGGTTGA 


GGCACTGCAG 


24 60 


GAAACCAAAA 


AATTAGGTGC 


AGAAACGGTT 


TGTTTTGGCG 


ATATAGATAT 


TGAACAAAAT 


2520 


GGTGCTTGGG 


ATCGCCAAGT 


TGCGTTAAGT 


GCTGGACTTG 


AACCACAATT 


ACCTCTTTGG 


2580 
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CAGGAAAATC GCGAacACTT GTGAAAGAAT 


TCTTGGCAAA 


AGGGTACACA 


GCAATTATCA 


OCA n 


AAACGGTCAG CAAAGAGGCA GGCATTCCAA TAAAATTTTT 


AGGAGAACCG 


TTAAATGAAA 


2700 


CATTTATCAC 


GTACCTTAAA GAGCATCAAT 


TAGATATTTG 


CGGAGAAAaC 


GGGGAGTACC 


2760 


ATACGTTGGT AATTGACGGC CcGCTATTTA AAAAGAGGCy CAtTATTATm CTcAGGAATT 


2820 


aTGAAgcCTT ATGCTaTyCT TgATtATtGA TGCTTAAaTG 


AAaGTAAaaG 


GACACAAGCT 


2880 


GCcACTTTTt CTAGGGATAT gCTATAaTTG mCTtAaGAaA AGTAGAGAAA 


GTGAGTGGTT 


2940 


AATTATGGAA 


GAAACTGTCT TTTTTAATTT 


AGGAAATGCC 


CTTGCTTC7VA 


AAAGAGATCA 


3000 


AAAAGAATTA 


ATTAAAGAAG CCCAAATTGC 


GAAAGATACT 


CGAAAGATTC 


CAGGAAAACT 


3060 


GATTATCGTT 


GAAGATGAAG AAAATGGTAC 


CCATATTTTA 


TTTGAATTAG 


CTGATCAAAC 


3120 


AGAACCGTCT 


GCTGAAACAA AAG/UVTTCAA 


AGTGAAAAAA 


GTGATTGATT 


AATAAAAAAG 


3180 


CAACGAATTT 


TTCTAAATTC GTTGCTTTTT 


TATATTTTTT 


TACACAAATG 


ATACAAAAAG 


3240 


AATATCAAAA 


TCAAATAAAT GATTAGTATA 


CTTGAAGaAA 


CAAATAGGAA 


GCAAAGCAAA 


3300 


CGTTTTGAGG 


TGTTCTTTCA TGGAAAATCG 


AGGAATTTTA 


TTTGATAAAG 


ATGGTACGTG 


3360 


TATTCGTTTT 


GACACTCTTT GGCAAGCAGG 


TTTGCAAGCT 


TGTTTTGAAA CACTAAGTAT 


3420 


GTTaGCCCCT CATCATTCAG CAGAAATAAA AAAgATATTA GCTATTCAGG AGCAACGTTT 


3480 


TTTGCAAAAA CATTTACTTG ATGAAGTCCT 


TTATCAGGAA 


CTTTATCAGG AATTGGCGCA 


3540 


ATTTGAGGAA 


TTAGTCGAAC AGGGAATCAG 


CAGTCGATGG 


CTGGAGCAAT 


TTTTTTATGA 


3600 


TTATTTACGA AAAAATCTGA AAAAGATCGA ACCGATTGGT GATTTAAAAC AGTTATTTCT 


3660 


TGAGCTGAAA 


CGGAAGAACT ATAAAATTGG 


ATTAGCAACT 


TCAGATACTT 


TGCCAGCGAC 


3720 


TATGTTGATT 


ATGGAATATC TT.GGTTTAAC 


AGAAATGTTT 


GATTTTATTG 


CGACAGGAGA 


3780 


TCGTTACTTA 


CCGAAACCAG ATGCGGACAT 


GCrCCAaGCC 


TTTTGTCAGT 


CATGTCAATT 


3840 


GAAGGCGACA 


GAAGTAATTA TGGTAGGTGA 


TTCGCTCGTG 


GATGTTTTTA 


TGGGAACGTG 


3900 


TCATGGC7WV 


GCAGGAATTG GTGTTTTAAC 


CGGCAACTGC 


CAGTCAACTG 


ATTTTGAAAA 


3960 


GTTTGAAGTA 


GCCTATTTCC GTGATATCCA 


TGAAATACCT 


TATCAAGAAT 


TATGGGAAAA 


4020 


TACCAAGAAG 


AAAAAAATCT ATTATTTTTG 


ATATATGTAT 


TGTGAAAAAC 


GTTTCTTTTA 


4080 


TTCGAGTATC 


TAAGCAAGCA TACTAGTGTA 


TTTCAATAAA ATACCATGAT 


GTTCATAGGT 




TTTATAATTT 


CAGCTCACAG ATTTTTGTTC 


AATAAGAAAA 


AAATTACAAC 


TAAAATAGAA 


A OAA 


AACGGGCGTG 


GGACAAAAAT CACTTTGGAT 


TTTTGTCCCA 


CGCTCAAAAA 


CTGATAAACG 




GCGGGAACAG 


AAGCAACACT TTCGGAAATA AGCCGAAAGT 


GACTTTCTCA 


CACAAGTGTT 


4320 


CGGGATATCT 


TAATCCTTAC CTACGTGGTG CTCGGAGTTA AACACTTCTG 


TCCTGACTTC 


4380 


GTTGTTAATA 


TTTTACAATG ACAAGTTTTT 


CTTATTTTAA 


TCCTGAACTT 


GATTTAGAAA 


4440 


TTGCACTTCC 


ATTAGAAAAA GAAACAGTAA CAGTAGCACC 


TAAATCACCA 


TCTAAGTTTT 


4500 


TCATCCAGAT 


TAAAGTATCA TTTTTTTCAC 


CATTGATATT 


TGTAGAAGTA 


ATCCCTGTTG 


4560 
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GGTCTtCTAA ATCTTTTCTA GCTTGTTCTT -CAGAATATGA ACCATCAGTA GCAATGTTAT 


4620 


TTACTTGGTC 


TGCAGTAACT 


TTATCATGTT 


TTGCAAUCTT 


CAAGCCACTT 


ACAGATTTGC 




TCGCTGCTTT 


GCCATCAGAG 


AATGAAACAG 


TAATAGATGC 


aAGAAGATCA 


CCACCCTTGA 




CATTTGTCCA AGATAGAGTA GTGGTAGAGA TGCCTTGGAT 


ATCTGTTGTA 


GAAGAAGAGG 


4oUU 


CTGGTTCACC 


TAAAATTGCT 




CTTCTGTAGA 


ACCGCCTTCG 


CTGCTGTTCA 


A Q cn 


TAATATCACC GACATTAATT 




TkGCTTTTAA ATCAGAGTTA 


TCTTTCGATT 


A oon 


TATCTTCTGr 


TTTCTTTTCA 


G AGCTT t CT T 


1 i 1 1 xuXAlk^ 


ATTTTTTGAA 


CTATTCGTAG 


49oU 


CTTTTGAAGA 


AGAGGCTTCT 


Ci r 1 1 1 c AC i 1 


CTTTGGTATC 


ATTGCTTGCT 


TTTTTATCCT 


Cf\A ft 

3U4U 


CTTTGCCAAG 


ACCACTACCA 


GCAA rCCC^CA 


GTACAATAAC 


TACTACAACC 


CAAAACCAAA 


CI nn 
51UU 


TTTTCTTATA 


AAATGGTTTT 


TTTACTTTTC 


CTTCATTTGA 


ATCATTTTTT 


TTCATTTTCA 




CTTTCCCTCC 


CATAGTTTAG 


TTAATGATTT 


GTAGCGCTTA 


CTAAATAGTT 


GAAAACAATC 


C O O A 


CCCAATTAAA 


TAACTTTGGA 


CTACAAATCA 


CTTAGTTTAA 


ATGAAAATAA 


GTATACCGCC 




ATGTGTAAAA ATAACACACC 


ACAGTATACT 


TATTTTATCA 


TTATTTCATA AAAATAACAC 


b J4U 


CAl 1 Al i 1 1 1 


ATGAAATAAT 


TAAAAAAGAT 


AAAAAATCTA 


AAAACTTCGA 


CCGCCGCCAC 


34U0 




GCCGCCAGTT 


GAGTGGGTGG 


TGCTACCACC 


ACCGCCCATT 


CCGCCACTGC 


C A Cf\ 

54 oU 


CGCCATTGTT 


TTTAGGAATA 


CGACGCGTAG 


TGATGAAAGA 


GTTGGTTAAC 


TGATCTGTGC 


C C O A 

5520 


GGGAGGTTAA 


GTTTAAAGTT 


GTTTTTTCCC 


TAAAGGGATA 


TTGATAACTT 


GAAAATTTTA 


5580 


ATTGATATTT 


AGAAATATTA ATGCCTAAGA AGACCAAACT 


GAGTATCAGC GCAgCAGCAA 


C£ A/\ 


AAGCAATTAC 


CATTTCCAGC 


GGGGTAATGA 


CTTTATAACG 


AGTGATTTTA 


CCTGTTTCGC 


D fyjv 


TGTCCACACG 


ATAGTGCCCC 


CCAGGAACCC 


CTTTATTAAC 


AAATGCTTGA 


GTTTCCTGAA 


D / OU 


CAAAGGTTTG 


AGCAGCCGCG 


AAATAATTTC 


CTTGACTCAT 


ATTATCCCAT 


ATTTTATCTA 


coon 


AGGTATCATC 


AATTCGTGCA 


TCTGTCATAT 


AATCAATCAT 


GTTTCCAG7UV 


GTAGAGATGT 


3ooU 


AGATTTTCCG 


TAAGTCCATA 


TCAATGAGAA 


AAAGAATCGC 


ATTTTGGTCC 


TTGCCAACTT 


D5j4U 


TATTTAAAAG 


ATAATGATCT 


GCATATTCTT 


GTTCATCGCC 


ATAGGTATTA 


TTATTTGTGG 


DUUU 


TTACAATAAA 


GACAGAGGCT 


TTTGTTTTTT 


CTTCTAAAGG 


TTGTATCTCT 


TGCTTTAGTT 




GGTTGATTTG 


ATCAGGTGTC 


AGTAATTGTG 


CGCCATCATC 


AATTGAATTT 


TCAGCCGCCG 




AAACGGGTAG 


GGCAAGGCCA 


AAGGTAAGAA 


GTATTAGGAA AAAAATAGGT 


AAAAGTCGTT 




TTTTCATAGG 


AAATAACCTC 


CTAATAAAAA GAGTGCTAAG 


AGTCCAGCGA 


AAATGCCTAA 




CGTTGCTAGC 


CCTAAACGAG 


TGTAACTAAT 


TGGAAGTACA 


CCGCTTACTT 


TACCTGTTTG 


6300 


TCCATTCATG 


GCATAATAAT 


AAACTTTTTT 


TGATTGTTCC 


TGACTGCGAT 


AGGTGACTAA 


6360 


CCAAATTGGC 


AAAAGCATAT 


AGTGATTATT 


TTCTTGGTTT 


AGAGAAATAT 


CCGTGCGAAC 


6420 


ATTTGTCAAC 


GTAGTATAAC 


CAGAAGCAGA 


GTCCCGTAAC 


AATGATTCCG 


AATAATCTTG 


6480 


TAATTCTTGT 


TGGACGTGTT 


TTTTGATTGC 


TTCATATTCA ATATCGCGTT 


TTTCTGCTTG 


6540 



wo 98/50555 PCT/US98/08985 

575 



GAAACCTGCT AGGTATTGGC TTTTGAAAGG CACCGCTTGA 


TCAATTAAAA 


ATGGl TGAAC 


ODUU 


GCCTTCCACC ATTTTTTGTT 


GGACGTTTTT 


TGACAAGGCA 


TTTTTAACTA 


ATTUTTTAAA 


£i€iCf\ 

oboU 


CGATAATTTT CCTGAACGTC CCACATTGAA ACGTTTTGTT 


TCGGTGTATT 


CAATATCACC 


6720 


TACGCGCCAA ATGCGAATAC 


TTGTGCCAGT 


AGCTTGCAGT 


TGTCCGTCGA 


CTTCTGCATC 


6780 


AACGACCCAA 


TAGGGGTUVGT 


AAACCCCAGT 


TAATTTATCG 


ATTTGATTTT 


TGCTGAAAAA 


6840 


ATCTCGGGGA ATG7VACCATT TTTTCTTGGT CCAAGCGAGA AATTTTTCAA 


TCGCTTCGTC 


6900 


TTTTTCAATT 


GTAAAGGGTA 


AGACATTTTC 


TGGTAAAAAT 


TCGCCACT 1 A 


ATCGACCAGC 


6960 


CAAGACAACT 


GGATTATGGC 


7UVTAATAACA 


GTATGTGGCA 


GCCGTTGTTG 


CATCGGTGAC 


7020 


AATTTGAGCG 


CCACAGCTTG 


GACAATTAAA 


AATTTCCATT 


GTGCCTACAT 


CTGCAGTTGT 


7080 


TTCATTCTCA 


GTGGTCTCTG 


TTTGTTGCGT ACGGAgGATT 


CTTCTTCCGC 


TGTGAAAGTC 


7140 


AAATTTTCCT 


CGGCAGTCGT 


TGTTGAGGGA 


TTCTCTAAAT 


GCGCCTCATG 


TTGCTGTTGC 


7200 


TCATAACGAG 


TCACTTCTTC 


TTCCGTGTAA 


ATATTTAAAC 


AATATTCACA 


ATGAAATTTT 


7260 


TGATCTTTGG 


GATTAAATAA 


GAGCGGGCCG 


CCACAGTTGG 


GGCATTTATG 


GGTAAATGTT 


7320 


GTTTCCATTG 


CAACCGTCCT 


TTCTAATAGA 


GTGAGTAAAT 


AACAGGAATC 


TGATTCAAAG 


7380 


AAAAAGGGTA 


GACTTGTGCC 


TGACCCTTTT 


CTTTAATTGT 


TCACTACTTG 


CTTATAAAGG 


7440 


AACCCCTTGG 


AATGGTTTAC 


CACAATGTGG 


ACAGAACTTA 


GGAATACCAT 


TACTTAAGTC 


7500 


AACAATTTCG 


CTACATTCAC 


TACATTTCAT 


TTCTAATTTT 


GGTTTCTCGT 


TTGTTGGTTT 


7560 


TGGTGTACCA 


CAGTTTGAAC 


AAAACTTCCC 


AGTGTTTTCA 


GTCCCACATT 


GTGGGcAAGT 


7620 


CCAAGTGTCT 


GTAGCCCCTT 


GTGCGTTTTG 


CTGGTTTTGT 


TTTTCTTGCA 


TTTGTTGTTG 


7680 


GTTTGTTTGA 


GACGctTGCG 


CAAAGTAGTT 


GCCATTCGCA 


TTCATGCCCA TACCAACGCC 


7740 


CATAAAGCCG 


CTCATAGCGL- 


CACCTTCATT 


TTTACCAGCA 


GCCTCCATAC 


CACGAGCCAA 


7800 


GGAACCTTCiA 




CGCGAACACC 


TGGATCGCTG 


AGCATAGCAC 


CTTCATTTCG 


*» o c: f\ 

7860 


CATGTTGATT 


AATTTTGTTG 


AATCATCTGT 


ATAAGAAATG 


CTCGCAACTG 


CTACAGAAAC 


7920 


AATTTCCATG 


CCGCGATTGG 


TGCGCCAATC 


CTCATCAAGC 


GCAGTTTGCA 


TATATTTGCT 


7980 


TAATTCAACA 


CTTTTTGAGG 


GAACATAAGA AATACGTTGC 


CCATCAGCAG 


ACATTTGATT 


. oU4U 


GATTGCTGCT 


TGCAATGCTG 


TAATGAACTC 


TGCCAAATAT 


TGTTCATTAA 


TATCATTAAT 


o 1 no 


ATCTACTTGC 


GTTTGGTTTT 


TAGGAATCGC 


ATTAGTATAA 


AATAGCAAAG 


GATCTGTAAT 


D J.t5U 


TTTAATTGAG 


TAATTTCCGT 


GTGCACGTAA 


GAAAAGTTCG 


GCATTATAAA AATTATCAAA 


ft oon 


ATAGTTTAAT 


GGCGCAGGTG 


TTCCAAATTT 


AATGCCTTTA 


ATTTCTTGCA 


AATTAATATA 


8280 


AAAAACTTGT 


TGTTTTTGTG 


GAGTTACGCC ACCAAATTTG 


AAACGACTGA AGGTCTCTGC 


8340 


AATCGCATCT 


TTTAAAGAAC 


CGTTAAACAT 


TGAAGGCGCA 


GAATCATTTT 


TGACTGTATA 


8400 


• ATAGCCTTCT 


TCAGCAGTGT 


AATCGATGAT 


TTTGCCACCA 


TCGACAAGAA 


GCATCATCAT 


8460 


GTTGGGATAG 


ACATGTACAA 


CAGAACCATC 


TGTTAAAACA 


TCGTCCGTTC 


CTTTACGATT 


8520 
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AGAACCACGT 


TTATCATCTT 


TACGTACAAA 


AACACCTTTT 


GTCATGACGG 


TTGTGTCCCC 


8580 


CATATTATCA 


GGTTCAATAA 


CCTCTAGCCA 


TTGATCGGCC 


AAGCCACCGC 


CAACTGAACT 


8640 


TGTTGCTGCT 


TTAATAAGTC 


CCATGTTCAT 


TCCTCCTAAT 


AAAAAACTAA 


TTGTTAAAGC 


8700 


TAGTATACCA 


CACAATGAAT 


GCAGAGAAAT 


CCAACGAAAG 


CCTGATGAAA 


TGGAAAAAAT 


8760 


TGATCAAACT 


TTATTAAGAA 


ATTA7\AAAAA 


GTATAGGTGA 


AACACAAGCA 


TTTTAAAAAA 


8820 


AGTTACGTAT 


AATAGAAGAA 


AAGACATATT 


ATATAGTTAG 


GGGATTATTG 


AATGGAACGA 


8880 


AGCAATCGTA 


ATAAAAAATC 


TTCAATW^ 


CCACTTATTC 


TTGGTGTTTC 


TGCCTTGGTT 


8940 


CTAATCGCTG 


CTGCCGGTGG 


CGGGTATTAT 


GCTTATAGTC 


AATGGCAAGC 


CAAACAAGAA 


9000 


TTAGCCGAAG 


CGAAGAAAAC 


AGCTACTACA 


TTTTTAAACG 


TATTGTCAAA 


ACAGGAATTT 


9060 


GATAAGTTAC 


CGTCCGTTGT 


TCAAGAAGCT 


AGCTTAAAGA 


AAAATGGCTA 


TGATACTAAA 


9120 


TCTGTTGTTG 


AAAAATACCA 


AGCAATTTAT 


TCAGGGATTC 


AAGCAGAAGG 


AGTCAAAGCT 


9180 


AGTGATGTTC 


AAGTCAAAAA 


GGCGAAAGAC 


AATCAATACA 


CATTTACCTA 


TAAATTATCG 


9240 


ATGAGCACGC 


CTTTAGGCGA 


AATGAAAGAT 


TTGTCTTATC 


AATCAAGTAT 


CGCCAAAAAA 


9300 


GGCGATACCT 


ACCAAATCGC 


TTGGAAGCCA 


TCTTTAATTT 


TTCCAGATAT 


GTCAGGAAAT 


9360 


GATAAAATTT 


CGATTCAAGT 


AGATAATGCC 


AAACGTGGAG 


AAATTGTCGA 


TCGTAATGGT 


9420 


AGTGGGCTAG 


CAATTAACAA 


AGTGTTTGAC 


GAAGTGGGCG 


TAGTGCCTGG 


CAAACTCGGT 


9480 


TCTGGCGCAG 


AAAAAACAGC 


CAATATCAAA 


GCTTTTAGTG 


ATAAATTCGG 


CGTTTCTGTT 


9540 


GATGAAATCA 


ATCAAAAGTT 


AAGCCAAGGA 


TGGGTCCAAG 


CAGACTCCTT 


TGTACCAATC 


9600 


ACAGTCGCTT 


CTGAACCAGT 


GACAGAATTA 


CCAACAGGGG 


CTGCGACAAA 


AGATACAGAG 


9660 


TCACGTTATT 


ATCCGCTGGG 


GGAAGCAsTG 


CGCAATTAAT 


CGGGTATACG 


GGCACCATTA 


9720 


CTGCAGAAGA 


TATTGAAAAA 


AATCCAGAGT 


TAAGTAGTAC 


GGGCGTAATT 


GGTAAAACTG 


9780 


GCTTAGAACG 


AGCGTTTGAT 


AAAGAATTAA 


GAGGGCAAGA 


TGGTGGCTCA 


CTAGTTATTT 


9840 


TAGATGACAA 


AGAGAaTGTC 


AAAAAAGCTT 


TACAAACCAA 


AGAGAAAAAA 


GATGGTCAAA 


9900 


CGATTAAATT 


AACGATTGAC 


AGTGGCGTAC 


AACAACAAGC 


TTTTGCTATA 


TTTGACAAAC 


9960 


GCCCtGGTAG 


TGCGGTCATT 


ACCGATCCTC 


AAAT^GGCGA 


TTTATTAGCA 


ACGGTTAGTT 


10020 


CACCTTCGTA 


TGATCCTAAT 


AAAATGGCCA 


ATGGCATTTC 


TCAAAAAGAA 


TATGATGCTT 


10080 


ACAATAACAA 


CAAAGATTTA 


CCATTCACAG 


CACGTTTTGC 


GACAGGCTAT 


GCACCAGGCT 


10140 


CTACCTTTAA 


AACAATTACC 


GGTGCCATTG 


GTTTAGATGC 


AGGGACATTG 


AAGCCAGATG 


10200 


AAGAGCTAGA 


AATTAATGGT 


TTGAAATGGC 


AAAAAGATAA 


ATCTTGGGGC 


GGCTATTTTG 


10260 


CGACACGTGT 


AAAAGAAGCA 


AGTCCAGTCA 


ATCTGCGTAC 


CGCTTTAGTT 


AATTCAGATA 


10320 


ATATTTATTT 


TGCCCAACAA 


ACATTACGGA 


TGGGAGAAGA 


CAAATTTAGA 


GCGGGTCTTA 


10380 


ATAAATTCAT 


TTTCGGTGAG 


GAACTAGATT 


TACCAATTGC 


CATGACGCCT 


GCTCAAATTT 


10440 


CAAATGAAGA 


TAAATTTAAT 


TCTGAAATTC 


TTTTAGCAGA 


TACGGGCTAC 


GGTCAAGGTC 


10500 
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AATTGTTAAT 


TTCGCCAATC 


CAGCAAGCTA CCATGTATAG 


TGTTTTCCAA AATAATGGAA 


10560 


CATTAGTCTA 


TCCAAAACTA GTTTTAGACA AAGAAACGAA GAAAAAAGAC AATGTAATCA 


10620 


GCGCCAACGC 


AGCGAATACA ATTGCCACAG ATCTTTTAGG 


TAGTGTGGAA 


GATCCTTCAG 


10680 


GTTATGTTTA 


TAATATGTAC 


AATCCGAACT 


TCTCATTGGC 


TGCTAAAACA GGAACAGCTG 


10740 


AAATAAAAGA 


TAAACAAGAT 


ACAGATGGCA 


AAGAAAACAG 


TTTCCTTCTA ACGTTAGATC 


10800 


GTAGTAACAA 


TAAATTCTTA 


ACAATGATTA 


TGGTTGAAAA 


TTCAGGAGAA 


AATGGTTCTG 


10860 


CAACAGATAT 


CAGTAAACCA 


TTAATTGATT 


ACTTAGAAGC 


AaCCATTAaA 


TAAaAAGAGa 


10920 


AAATGAaCGA 


AcGAGGTTtC 


TGTAAAAAGA 


AcCTCGTTTT 


CCCTTTTTAG 


AAAAAGGAAA 


10980 


GTGTGCTACA 


ATGAAGCGAC 


TAAATTGTAG 


AAGGAAAGGG 


ATTCGTGTGC 


GAATTTTAGC 


11040 


AATAGATACA 


TCTAACCAAA 


CATTGAGTAT 


TGCTGTGTGT 


GAAAATCAAA AAATTCTGGG 


11100 


AAGTTATACA 


GCAACGGTTA AAAGAAATCA 


TAGCTTAACA 


TTAATGCCGG 


CAATTGACTA 


11160 


TTTGATGAGC 


CAATTGAACT 


TGGCGCCAAC 


GGCAATTGAT 


CGTTTTGTCG 


TAGCAGAAGG 


11220 


ACCTGGATCA 


TACACTGGCT 


TAGGTTTAGG 


CGTGACGACT 


GCCAAAACAT 


TGGCCTATAC 


11280 


TTTAAAGAAA 


GAGCTAGTAG 


GCATTTCCAG 


TCTACAAACG 


TTAGCTGCCA 


ATTGTGTTGG 


11340 


ACAAACTGGT 


TTAATTGTGC 


CTCTTTTTGA 


TGCGCGTCGT 


AAAAACGTCT 


ATGCTGGTGC 


11400 


TTATCGGTTT 


GTCAATGGCG 


TGTGGCAAAA 


TGAGTTACCG 


GACCAACATA 


TTTcCCTGCG 


11460 


GGAGTTATTA 


GAACAACTAA 


AAAATGAACC 


AAATCTTTTT 


TTTGTTGGTG 


AAGATGTTGA 


11520 


AAAATTCACT 


GAAGAAATAG 


CTCAAATCAT 


TCCTCATGGA 


GAAATTTGTG 


ATGTTCCACA 


11580 


ATGGCAAATT 


CCTAATGCCG 


CAGTTTTGGC 


TGCGTTAGGT 


AGTGTGGCTG 


AACCTGTTGA 


11640 


GAATGTTCAT 


GGGTTCTTAC 


CTGCGTATTT 


GAAAAAAGTA 


GAAGCGGAAG 


AAAATTGGTT 


11700 


GAAGACACAC 


ACGCCTAATG 


GAGAAAGCTA 


TGTGGAAAAA 


CTTTAATTTT 


GTGAC7VAAAT 


11760 


ACTTTACAAA 


AAGACGGCAA 


CGGTATATAA 


CTAAAACCAT 


TGCAATGAAC 


CAGCAAGAAT 


11820 


TTCAATTGCG 


TGAAATCACC 


TACCAAGATA 


TTAAGGATTT 


ACTAGCAATT 


GAACGTGAAG 


11880 


TTTATGCTGG 


GGAACTTCCT 


TGGACGATGT 


CTGCTTTCAT 


GGCAG/VACTT 


GGCTCAAAAG 


11940 


CGCCGCATCT 


CTATTTGTTA 


GCGGCAATTA ACGGAAAAAC 


AGTTGGCTTT 


ATTGGTTGTC 


12000 


GGATTCAAGG 


AACGGATGCA 


CACATTACGA 


ATGTCGCTGT 


TCACACAGCA 


TACCAAGGCT 


12060 


TAGGTTTAGG 


TCGTTCTTTA 


ATAGAAGAAA 


CGCGAATTTT 


TGCCAAGAAA AATCGGTGTG 


12120 


AAACCTTATC 


GCTAGAAGTA 


CGAATGAGTA 


ATGTTAAAGC 


CCAACGTTTA 


TATCGAAAAA 


12180 


TTGGCTTTGT 


CTCACAGACG 


GTGAAAAAAG 


GCTATTATGA 


CGAAAATAAT 


GAAGACGCAC 


12240 


TAGAAATGAT 


TU^TTGATTTA AAAGAAGAGT 


GAAGAAAAAA 


TGTTAGTAAC 


AAAAAAAGAA 


12300 


ATATCTGTAG 


CAAATATTGG 


TGAAGAATTA 


TGGCGTTTTA 


GTAATTCTTC 


TTATACAACT 


12360 


GGCTCCCCTT 


GGaCAGAAAC 


ACAATTCGCG 


GAAGATTTTG 


CTCAGGaAAA 


TAGCGAATAT 


12420 


CT7TTTTTAG 


TTGAAAATGG 


CCAGTGGTTA 


GGCTATATTG 


CCTATCATTT 


TATTTTAGAT 


12480 
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GAAGCAGAAA 


TTAGTCACGT 


TGTTGTGAAT 


GGTCAGAAAC 


AACACCAAGG 


AATTGGTTGT 


12540 


CCAGTTAATG 


aAAGCTTTTA AAGAATATGT TAAAAGTCgT GACATTACAC AAATTTTcTT 


12600 


AGAGGTCAGA 


GAAyCGAATA TACTAGCyCm AAAACtTTAk 


GrGAAAaCAG 


U 1 X X UVrUU/Vl 


12660 


AGTAGCTGTT 


CGAnAAAATT 


ATTATAAAGA 


ACCGCAAGAA 


AATGCCTTTG 




12720 


GAAATTAAGG 


AAGGAAGCCC 


AATGACAATT 


TTTACAAAAG 


AACGTAAATT 




12780 


GTGGAAAGTA 


GCTGTGATGA 


AACAAGTGTG 


GCTGTCATTG AAGATGGCGA 


UAAAAX 1 V, i 1 


12840 


TCCAATATCG 


TAGCTTCGcA 


AATTAAGAGC 


CATCAACGTT 


TTGGCGGAGT 


Cb 1 1 CCCjGAA 


12900 


GTTGCGAGTC 


GTCATCATGT 


TGAACAAGTG 


ACGATCTGTA 


TTGAAGAAGC 


AT T AAC AG AA 


12960 


GCAAAAGTCA 


CACCAGAAGA 


ACTCAGTGGC 


GTTGCTGTGA 


CCTATGGGCC 


TGGGTTAGTC 


13020 


GGTGCTTTAT 


TAATAGGACT 


TTCTGCAGCA 


AAAGCATTTG 


CTTGGGCGCA 


CCAATTACCA 


13080 


CTTATTCCAG 


TCAATCATAT 


GGCTGGGCAC 


ATTTATGCTG 


CCCGTTTTGT 


GGCACCGCTC 


13140 


GAATTTCCAT 


TGATGGCCTT 


ACTCGTTAGT 


GGAGGACATA 


CGGAATTGGT 


CTACATGAAA 


13200 


GAAGATGGCT 


CTTTCGAAAT 


TGTTGGTGAA 


ACACGTGATG 


ATGCAGCTGG 


TGAAGCGTAT 


13260 


GATAAAGTCG 


GACGTGTGTT 


AGGGTTGCCT 


TATCCTAGTG 


GCAAAGAGAT 


TGATGCTTTG 


13320 


GCCCATGAAG 


GCACAGATAC 


CTATCAATTT 


CCACGGGCGA 


TGCTAAAAGA 


GGATAACTAT 


13380 


GACTTTAGCT 


TTAGTGGTTT 


GAAAAGTGCG 


TTTATTAACA 


CGGTTCATAA 


TGCAGAGCAA 


13440 


CGTGGTGAAG 


CACTGTCTAC AAAGGATTTA GCCGCaAGTT TyCAGGCGAG TGTGGTAGAA 


13500 


GTATTAGTGA 


CTAAAACCAT 


TCGTGCTTGT 


CAAGAATATC 


CTGTTAAGCA 


ATTGCTAATC 


13560 


GCTGGGGGCG 


TTGCTGCGAA 


TCAAGGATTA 


CGAGAAGCAA 


TGAGACATGC 


AATCAGTGaA 


13620 


CAATTACCAG 


CAGTGACATT 


ACTTATTCCG 


CCATTAAAAC 


TTTGTGGGGA 


TAACGCGGCG 


13680 


ATGATTGGCG 


CTGCCGCCTT 


TATCGAAGCT 


GcAAAAGAAT 


CACTTTGCTT 


CGTATAACTT 


13740 


AAATGCAGAA 


CCTGGCGTTA 


GTTTTATGAC 


AATTAGTGAA 


GAAGGCTAGC 


CAGCTAGACA 


13800 


AAAATTTGCT 


ATAATGAAGC 


CAGTTTAACA 


ATGACATTCG 


ATGATAAAGA 


GAGTACTTAT 


13860 


TTCAACGTCA 


GCTTAGCGAT 


CTTGGGAAGG 


TGGGAGCCAA 


GAGTGATCGG 


AATGAGGAAG 


13920 


CGCACTTTTA 


GAATTCTGCT 


GAAAGCAGAC 


GGCAATCACC 


GTTATCGATG 


AAGCAGGTCT 


13980 


ATCCAAGTAG 


TATAGACAAT 


TAGAGTGGTA 


CCGCGGGTAA ATTCkGGGnT 


CTGACAAAAC 


14040 


AAATTATTTT 


GTTTTGTCAG 


AGACGTTTTt 


GTTTTTTTAA 


TACTTACAAA 


TTAnTCAGGA 


14100 


GGGTTCTTAT 


GAaCAACAAA 


GAAATCGTAG 


CAAAAGCGTT 


ACATGATGtT 


TTAAATGAAG 


14160 


AATTAACGAT 


GGATCAAA7C 


GAACAATTAT 


TGGAAAaTCC 


TAAATCTGTT 


GATCATGGAG 


14220 


ATGTTGCTTT 


TCCAGCCTTT 


TCATTAGCTA 


AAATTTATCG 


TAAAGCACCA 


CAACAAATTG 


14280 


CAGCAGAATT 


AGCAGAAAAA 


ATTGATGGCA 


CAAATTTTGA AAAAATTGAA 


GTGGTGGGAC 


14340 


CTTATTTAAA 


CTTCTTCATG 


AATAAAGAAG 


CAGTCAGTCA AGCTGTCATT 


GGCGAAGTAG 


14400 


TCAAAGAAAA 


AAATAACTAC GGAAACAGTA CCTTTGGCAA 


TAATGGCAAT 


GTTCCTATTG 


14460 
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ATATGTCTTC 


ACCAAACATT 


GCTAAACCAA 


TTTCTATGGG 


GCACTTACGT 


TCTACGGTAA 


14520 


TCGGGAATTC 


AATTGCCTTT 


ATTTTAGAAA 


AAATTGGGTA 


TCAGCCAATT 


CGGATTAACC 


14580 


ACTTAGGTGA 


CTGGGGTACA 


CAGTTTGGTA 


AGTTAATTGT 


AGCTTATAAA AAATGGGGTT 


14640 


CTGAAGAAGC 


AGTCCGCCAA 


CAACCAATTA 


ATGAATTATT 


GCGCTTATAT 


GTGCAATTTC 


14700 


ATGAAGAAGC 


AGAAGAAAAA 


CCAGAATTAG 


AAGACGAAGC 


ACGTGCATGG 


TTTAAAAAAT 


14760 


TAGAAGAAGG 


AGACCAAGAA 


GCTAATGAAT 


TATGGAAATG 


GTTCCGTAGC 


GAGTCTCTAA 


14820 


AAGAATTTGA 


TAAAATCTAT 


TCCATGTTAG 


AAGTTGAATT TGATTCTTAT AATGGCGAAg 


14880 


cyTTTTATAA 


TGATAAAATG 


GATGAAATTG 


TTACTTTATT 


AGAAGAAAAA 


CATTTATTAA 


14940 


CAGAAAATCA 


AGGTGCAGAA 


ATTGTTGATT 


TGACAGAATA 


TAATTTAAAT 


CCAGCACTTA 


15000 


TTCGTAAATC 


AGATGGTGCG 


ACGTTATATA 


TCACACGGGA TTTAgCTgcT 


GCGCTTTACC 


15060 


GTAAACGGAC 


GTATGATTTT 


GCCAAATCGC 


TTTATGTAGT 


CGGCAACGAA 


CAAAGCAATC 


15120 


ACTTCAAACA 


ATTAAAAGCA 


GTCTTGAAAG 


AACTAGGGTT 


TGACTGGTCA 


GATGATATGG 


15180 


AACACATTCC 


GTTTGGTTTA 


ATTACTCAAG 


GCGGTAAAAA 


ATTATCCACA 


CGTAAAGGGA 


15240 


AAATCGTTTT 


ATTAGAAGAA 


GTATTGAATG 


AAGCTGTTAC 


TTTAGCAGGA 


AATCAAATTA 


15300 


ACGAGAAAAA 


CCCAGACTTA 


GCAAATCGTG 


AAGAAGTGGC 




GGTGTAGGTG 


15360 


CAGTTATTTT 


CCATGACCTT 


AAAAATGATC 


GTCTAAATAA 


TTTCGATTTT 




15420 


AAGTTGTTCG 


TTTTGAAGGA 


GAAACTGGGC 


CATATGTTCA 


ATATACACAT 




15480 


TGAGTATTTT 


ACGTAAAGCA 


AACTTTACAC 


CAGACGCTAC 


ACAACGTTAT 


cr* a r* T a. a a T 1^ 


15540 


ACAAGGATAG 


CTGGGAAGTC 


GTGAAGTTGT 


TACAAAAATT 


CCCTGAAACT 


la 1 onl V3\i^/ln>3 


15600 


CTGCTGAAAA 


ATATGAACCA 


TCTGTTATTG 


CGAAACATAG 


CATTCATTTA 




15660 


TCAATAAATA 


TTATGCCCAT 


GTTCGGATTT 


TGGATGAAGA 


TGCTCAAAAA 




15720 


TAGCGTTAGT 


TTATGCAGTA 


GCTACTGTTC 


TAAAAGAAGA 


TTTACGTTTA 


CTAGGGTTAC 


15780 


ATGCACCAGA 


AGAAATGTAG 


AAGTTTTTGT 


GAAAATTTTA 


CTGCGTTGAA 


CGCTAAACTT 


15840 


ATTAAGCAAA 


TATTAAAGTT 


AAGTTACAGA 


TTGAGAAATT 


CATAGTAATT 


AAATAGCTGG 


15900 


GGTGCTATAC 


TCGAGTCAAG 


TTATAAAAGG 


ATTAGAAAAG 


GTGGCACCCA 


ATTATATGAC 


15960 


AGAAAAATGG 


CGTGAAGACG 


aAGAATATCT 


TTCTTATGTA 


GAAGATTTAT 


TAGcAACCGA 


16020 


AGAAGTCCAA 


AAATTAGCAG 


AATACACACA 


ACATATGCAT 


TCAACTCGTT 


TAGAACATTC 


16080 


AATTAGTGTT 


TCCTATCATA 


GCTATAAATT 


AGCAAAGCGT 


TGGAACGGCA 


ATGCTCGTGC 


16140 


AACAGCTCGT 


GCTGGTTTAT 


TGCATGATTT 


GTTTTACTAT 


GATTGGCGCA 


CAACAAAGTT 


16200 


TGACGAAGGC 


TCTCATGCCT 


ATGTTCaTCC 


aCGAATTGCA GCCAAAAATG 


CTGAAAAAAT 


16260 


TACAGAGTTG 


TCTGATTTAG 


AACGAGACAT 


TATTATTAAA 


CATATGTGGG 


GCGCAACTAT 


16320 


TGCACCACCA 


AAaTACAAAG 


AAAGCTACAT 


TGTAACCTTT 


GTTGACAAAT 


ATTGTGCTGT 


16380 


CAAAGAAGCG 


gCGTTGCCGA 


TGACGACAGC 


CATGAAAACC 


AAATGGCGTC 


AGTATTTTGG 


16440 
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TCAACAGTCA 


ATCTAAAAAA 


AGCTTATCCG 


TAAGGATAAG 


CTTTTTTTGT 


TTGACGTTGA 


16500 


CTAAAAAGTC 


TAGCATTCAT 


CTATTTAAAA 


TAAATATTAA AAGTAATTTT 


GTTCTGTTAA 


16560 


TTTCTTTTGA 


GCGTATCGAA 


AGACTGCTTC 


TTGATTTTCA 


GTTGTTAACT 


GTTTGTAAAT 


16620 


GTTCATAATA 


TTGGTTGGGA 


CAACAAATGG 


CGTTTCTTCC 


GTTTCAGGAA 


TAGCTAAGAT 


16680 


TTCAAGAGGT 


GATACATCTA 


AAATTTTAGA 


TAAGGCGACA ATTTTATCCC 


TCCCCATATT 


16740 


TTCAATCATT 


CCAGTTTCCC 


ATTTCCTGAC 


GGTACTTTTT 


CCAACACCAA 


CAAAATTTCC 


16800 


AACTTGTTCT 


AATGTAAGAT 


TTTTCTCCAT 


CCTTTTTTGC 


TTCAAAAAAT 


TTTCTTTCAA 


16860 


AAAAACCACC 


TCCGTAAAAA 


CATTCTAGCA 


CAAAAAGTGT 


CGCAAAAGAA 


ACATGTTTTT 


16920 


tTCAAAATGA 


AAGTAACGAA 


TATAATAGnA 


AAATCAGCAT 


TTTTTAATGA 


TATTGACGTT 


16980 


-TTTTTCACAA 


AATGATATAA 


TTTTAAGTGT 


CTTATTAGAC 


ACTGAAAraGG 


GGGAGAGTAA 


17040 


AATGGAATCA 


AATTTAATTC 


ACTCAAAATT 


GaAAATGAAA 


GGAAAAAATG 


TTGaGTGGTT 


17100 


GGCAAATGAA 


ATGAcTGCaT 


ACGGGCAACC 


AATATCCGCT 


TCAACGATTT 


ACAAAAAATT 


17160 


AAAAGGAGAA 


GTTGTTTTCA 


AAGCATCGGA 


TATTAAATTG 


ATTTCACAAT 


TGTTGGATTT 


17220 


GAATAATGAA 


GAAATTATGG 


ACTATnTTTT 








17250 



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7947 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 



AGATTAnCGG 


AAnCTACCTA 


TCTGAACGTA 


CAGATAATAA 


TATTACinATG 


CAAGCAATTC 


60 


GTGATCATTA 


TGAAGTATCC 


AATCGATTTG 


GCGCAAAAGA 


AGTCTTGGCT 


TTTTCTTCTG 


120 


AACGAAAATG 


GGGAGCTATT 


GAATTTCCAG 


AAATTGGGAC 


AGTTTATTTA 


GGGGCTCCTG 


180 


AGCGTTTGGT 


AGACGATAGT 


CGGTTGCCAG 


AAGCTGTTTT 


TACAGCACAA 


GAGAATGGGT 


240 


ATCGCGTCTT 


GATGTTAGCT 


ATTGCGGAGC 


AACAACCGTT 


AAACGAAACC 


AAAATGCCTT 


300 


ATTTAGAGCC 


ATTAGCAATT 


CTAGAAATTG 


ATGATCCAAT 


TCGCCAAAAT 


GCCAAAGAAA 


360 


CACTGGCTTA 


TCTAAAAGAA 


GAAGGAATCG 


ACTTAAAAGT 


AATTTCTGGT 


GATAATCCGG 


420 


TGACTGTTTC 


AAATATTGCT 


CGTCGTGCTG 


GGCTACCCGG 


TTATGAGTCT 


TATATTGATT 


480 


TGTCGACCAA 


AACGACAGAA 


GCAGAAGT6C 


GTGAAGCAGT 


TCAGCAATAC 


ACAGTGTTCG 


540 


GACGTGTATC 


GCCTCAGCAA 


AAACGAACCA 


TTGTGCGTGA 


GTTGAAAGAC 


ACAGAGCATG 


600 


TGGTGGCGAT 


GACCGGCGAT 


GGCGTTAACG 


ATGTTTTAGC 


GTTACGAGAA GCGGATTGTA 


660 


GTATCGCCAT 


GGCAGAGGGA 


GATGGCGCTA 


CTCGGCAAAT 


TTCTAATCTA 


GTTTTGCTAG 


720 


ATTCTGATTT 


TACAACGTTA 


CCAGATGTCT 


TGTTTGAAGG 


ACGACGTGTG 


GTAAATAATG 


780 
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TAACCCGAGT 


TTCCAGCGTT 


TTCTTCATTA AAAC6ATTTA 


TTCGTTTATT 


CTCTCAATCA 


840 


TTTGTGCCTT 


AACGGCCATT 




Xk^AX XV«v»\3nX 


TCAAGTGACC 


TTGATTGATT 


900 


TAGCAATTGA 


AGGATATCCA 




T A TPTTTffi A 
XnXLfX X XVoKan 


AGGGGACAAA 


CGAAAAGTTG 


960 


TCGGGAAGTT 


TTTACCAACA 


XXX \3f\nni\ 


AX uL>u X \^ X l9 X 


TAATGCGCTT 


TTAGTTGTAG 


1020 


CGAATATCAT 


TGCTGTTTAT 


L. X (iAX X Kj^oAK^ 


an a a TP a A flf^ 

AnnnX wAAuo 


ATTTTCTTCT 


TTAGACACCA 


1080 


CGACGTTAAT 


GTATTATTTA 


T x AQil 1 CjVsAA 


i X AbdlsXAX 


GGCTGTTGTT 


CGAGCATGTC 


1140 


TGCCTTTAAA 


TCCACTACGC 


ATTTTCTTGG 


TTTTCAb 1 AL. 


CATTATTGGT 


ATTTATGTAG 


1200 


CAGCTATGTT 


ATTCCACAAC 


ATTTTAGAAA 


TTGGTTTCTT 


AACTTCGCAA 


ACAATGGGTC 


1260 


TCTTCTTTAT 


TATGATGGCC 


ATTAATATTG 


TGGTGCGTGT 


AACAATTGGT 


TTCGTACAAA 


1320 


TGAAACGTGC 


TGGAAAAACA 


ATTAAAGATT 


TATAAATVAGC 


AGAGGTTGAG 


CT7UVAAGCAT 


1380 


TCGTCTCCAG 


AAATGCAGTT 


CTTCTTTAAA 


TCTTGAAGGA 


AGTCGGAGCA 


AGCTTGATGT 


1440 


TTAACTGATT 


ATTTTTUVGAG 


TTAGTGTCTG 


GGGCAGGAGT 


CAGGAAGACT 


TTTGTCCTAG 


1500 


ATACTTTCTT 


TTTTTGTTGA 


GTTGAAACAA 


^> -m rwTfc ^ million iw 

GATACTTGAA 


AAACAGCGAT 


TATGTGATAC 


1560 


AATAAATCAT 


GTGT/^AATGT 


GTCCATTGGA 


ATTCGCACTG 


AAATGACAGT 


GCGTTTTTTA 


1620 


AACGGTCAAT 


CACATGAGTT 


AGCACAATGC 


TATTCTTAGG 


AGGTGAGGAC 


AATGGCACAA 


1680 


CGCATTCCTC 


AGGAAGTCAT 


TGAGGAAGTT 


C6TCACCGAA 


CGAATATAGT 


AGATATTATT 


1740 


GGGCAGTATG 


TCCAGTTAAA 


AAAATCCGGT 




TGGGTTTATG 


TCCGTTTCAC 


1800 


GAAGAACGTT 


CCCCTTCATT 


TTCAGTAGCA 


GAAGACAAAC 


AAATTTTTCA 


CTGTTTTGGC 


1860 


TGCGGGAAAG 


GTGGAACTGT 


TTTTAATTTC 


TTGCAAGAAA 


TTGAAGGCAT 


TAGTTTTCCA 


1920 


GAATCCGTGA 


AACGTGTTGC 


AGATTTGGAA 


CATTTAx Ux vs 


TGGACTTTGA 


TTGGTCAGAG 


1980 


CCGCGTGAAG 


TAGCGGATAC 


CCCAuAAAAC 


CAAuAAAoAC 


GGAGTTTGTT 


ACAACTGCAT 


2040 


AGTAAAGCCG 


CAGAACTTTA 


CCATCATATT 


TTAGTvjAATA 


CTAAAATTGG 


CGAACCCGCA 


2100 


TTAAACTATT 


TATTAGAGCG 


GGGGTTAAUA 


CAAuAAClAA 


TTGAGACCTT 


TCAAATTGGA 


2160 


TTTGcGCCGC 


AAAAAcGTGa 


I 1 1 ^ 1 i AAlj 1 


U AAu X X X X X A 


AAAACGAACA 


ACTGGATGAA 


2220 


ACGCTTTTTG 


AGCCTTCTGG 


oL» XuiXXXuXk^ 


r*2iAACAf^ATA 


ACGGTACgTT 


TTTGGaTCGT 


2280 


TTCTATCAAC 


GAATCATGTT 


TPPAATTAAT 


CATPrCCAAG 


GCAATGTCAT 


CGCCTTTTCT 


2340 


GGTCGCTTGT 


TaAAAACAGC 


X\3t\X X L X 


r;GCCATGAGA 
u\3\3\mx wnun 


tGCCTAAATA 


CTTGAATAGT 


2400 


CCTGAAACAa 


CACTTTTTAA 


TAAGCGAGAA 


ACGCTGTTTA 


ACTTTGATAG 


AGCCAGAAAA 


2460 


GAAATTCGTA 


AAGAAAATAC 


GGTCTTGCTT 


TTTGAAGGAT 


TCATGGATGT 


TATCGCTGCT 


2520 


TGGCAATCAG 


GCGTAAAAAG 


TGGGGTGGCT 


TCAATGGGGA 


CTAGCTTGAC 


CAATGAACAA 


2580 


ATTCGGCGTT 


TGGAACGGGT 


CGCTAAGGAA 


GTAGTTATTT 


GTTACGATGG 


CGATAATGCC 


2640 


GGTGTTCAAG 


CCACAAACCG 


TGCGATTCAA 


TTATTGCAAG 


AAAACAGTCA 


TTTTGACTTG 


2700 


AGTATTGTCA 


GCATCCCTGA 


AAAGCTGGAT 


CCTGATGAAT 


ATGTGCGGAA 


ATATGGCGCA 


2760 
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GAAGCTTTTC 


AAAATTTAGC 


CAATCATGGC 


CGAGAAACAG 


TTTTTAGTTT 


TAAAATGAAT 


2820 


TATCATCGGT 


TAACTAGAAA 


CATGAATAAC 


GAAAAAGAAC 


AATTGGATTA 


TGTGAACGAA 


2880 


TTGTTACGGG 


AGTTAACCAA 


TGTTCAATCT 


CCCTTGGAAC 


GTGATCGTTA 


TTTAAACCAA 


2940 


ATTGCTCAGG 


MTTTCAACT 


TTCTGTTCAT 


AGTTTGGAAG 


AGCAATTCAA 


TCAATTGAAG 


3000 


CAAGAGCAAC 


GTTCAGTCCA 


ACGGCAAGAA 


AGGCAACAAT 


TTTATCAAGA 


TGAAATGATG 


3060 


CCACCACCAA 


TGGAAGAACC 


TGTTTTTGAA 


GAGAATCACG 


TCCAGAACAA 


GTTACCGTTA 


3120 


ACCCAAGTTC 


AAAAAGCAGA 


ACGTTCTTTA 


TTATTTCGTT 


TAATGAATGA 


ACAAGGAGTA 


3180 


CGGCAAACCA 


TTCAACAATT 


GCCAGATTTT 


TCATTCGCTC 


ATGATGAATA 


CCAAGAATTA 


3240 


TATTTCTTGC 


TAGAAAGTTA 


TGCGACACTG 


CATCAAAGTT 


TCGACATTGC 


CGATTTTATT 


3300 


AATTTCTTGC 


AAGATAATCA 


AACCaAACAA 


TTAGCAATTG 


AAATTGCGTA 


TCAGAATTTA 


3360 


TCGGAAGAAA 


GCTCTGAGCG 


AGAAGTTGCC 


GATTTATTAC 


ATGTAATTGC 


CTTATcAAGT 


3420 


ATTGCAGAGG 


CAATTGAGCA 


GAAAAAGATT 


CAACAGCAAG 


AAGCAAAACG 


AGTGGGCAAC 


3480 


CAACAGCTTG 


AAGCCGAATT 


AACAATGGAG 


ATTATCCAAT 


TGGCTCGACA 


GCTTAAAGCT 


3540 


CAACGAACAT 


TTACTTAAGC 


AATATAATGA 


ATAGAACTAA 


GCGTTTTGTT 


CATTTTATTT 


3600 


TGATAAGTCC 


CAGTGATTTA 


TCAGTTATTT 


CAAGCGTGAA 


CCGAAAAATG 


AAGGGGGCCT 


3660 


TCTTTCATGG 


AAAAAGAAAC 


AAGTAAAAAA 


TATGAAGCTG 


CAGTGGCAGC 


ATTTATCAAA 


3720 


GAT^CAAGC 


CTAAAGGAAC 


AGTGGTTTAT 


GATGATTTAG 


CGAATCAATT 


AGCCACACCG 


3780 


TATACTTTGG 


ATGCAGPAGC 


TATGGAAAAA 


TTGATCCaAA 


AAGTTGAAGA 


TGCTGGTATC 


3840 


AGCGTGGTCG 


ATGAAAATGG 


AGACCCAAGT 


GAACATAGCT 


TGAAAAAAGA 


TGAAAAAGAA 


3900 


GCGGAAAAGG 


CCCAAGCAGA 


AGACTTATCT 


GCGCCAACAG 


GTGTTAAAAT 


TAATGATCCC 


3960 


GTGCGTATGT 


ACCTAAAAGA 


AATTGGGCGC 


GTTCAATTAT 


TAACTGCCGC 


AGAAGAAGTT 


4020 


GAATTGGCAC 


TTAAAATTGA 


AGAAGGCGAT 


CAAGAAGCAA 


AACAACGTTT 


AGCAGAAGCT 


4080 


AACTTACGTT 


TGGTTGTTTC 


AATTGCCAAA 


CGATATGTAG 


GTCGTGGTAT 


GCAaTTCTTG 


4140 


GATTTAATTC 


AAGAAGGTAA 


TATGGGCTTA 


ATGAAAGCTG 


TTGAAAAATT 


TGACTACCGT 


4200 


AAAGGATTCA 


AATTCTCTAC 


GTATGCTACT 


TGGTGGATTC 


GTCAAGCGAT 


TACGCGGGCA 


4260 


ATTGCGGACC 


AAGCTAGAAC 


GATTCGAATT 


CCTGTTCACA 


TGGTTGAAAC 


AATCAATAAA 


4320 


TTGATTCGGA 


TTCAACGCCA 


ACTATTGCAA 


GATTTAGGTA 


GAGAACCAAC 


GCCAGAAGAA 


4380 


ATTGGTGCGG 


AAATGGATTT 


ACCAACAGAA 


AAAGTTCGTG 


AAATCCTAAA 


AATCGCACAA 


4440 


GAGCCAGTCT 


CTTTAGAAAC 


ACCAATTGGT 


GAAGAAGATG 


ATTCACATTT 


AGGTGATTTC 


4500 


ATTGAAGACC 


AAGATGCTAC 


CAGTCCTGCT 


GAACATGCAG 


CTTACGAATT 


GTTAAAAGAA 


4560 


CAACTAGAAG 


ATGTTCTGGA 


TACTTTAACA 


GACCGTGAAG 


AAAATGTTTT 


ACGTTTACGT 


4620 


TTTGGTTTAG 


ATGATGGTCG 


GACACGTACG 


CTAGAAGAAG 


TCGGAAAAGT 


TTTCGGTGTC 


4680 


ACTCGCGAAC 


6TATTCGTCA 


AATTGAAGCA 


AAAGCGTTAA 


GAAAACTAAG 


ACATCCTTCT 


4740 
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CGTTCAAAAC 


AATTAAAAGA 


CTTTTTAGAA- 


TAAACATACC 


GTTTAGgGGC 


TCTTTTCGTG 


4800 


CATGCGAGAA 


GAGCCGCTTT 


TGATTTTTAA 


AAAGTCTCTA 


AATGTCATTG 


CTAGTTATTG 


4860 


GAAGAAGGCG 


CAAGTTGTGG 


TAAATTAGAA 


GTAATGGAAT 


ATTAGCAGAA 


TGTGAGGCGG 


4920 


AAGAATGACA 


AAAAAATGTC 


CGAAGTGCGG 


GAACGAATTC 


GATGCAGAAT 


TAACGACCTG 


4980 


CCCAACATGT 


GGCTATTCAC 


TAACTGACAC 


AACAGTTGAT 


AAAGAAGAAG 


CGGAAACGAC 


5040 


TTCAACAAAT 


ATAGATTTTG 


AAACACAGGA 


AAATGT^GAA 


CACGAAGATC 


AATTAAATGA 


5100 


GAATATTGAG 


TGGTCTGAGT 


TAAAAGATAT 


GAGCTTAGGC 


CATGTGATGG 


AACTGTTTGG 


5160 


CGAATCACCA 


GAAGAAGAAA 


GTAATGATGA 


CAAAAAAGAA 


GAGTCGACAG 


AAGATAACTT 


5220 


AATAGTCTCT 


GATTCAGAGG 


ATGTTTCTGG 


TCTAGAAGCA 


TCGCTTCAAG 


AAGGGGCTAG 


5280 


TGAGGAGACT 


CATGATTCGG 


TGGAGGAGTC 


GATACCAGCG 


ACAGAAGAGA 


CGCCTACTCA 


5340 


TTCAACAGAA 


GAAATU^CTAG 


CAACGGACGA 


AGCAGTTAAT 


TTAGAGGAAA 


CAACAGAAGA 


5400 


AACAACGGAA 


GAAACAACTA 


CTGTTGAAGC 


AGAAACAGCT 


GAGGTATCTG 


AGACGGTGAA 


5460 


GTCAGAAGAG 


GAAGCTTTAA 


CAGAGATTTC 


AGGAACTGAA 


GTTATCTCTA 


CCACTTCTGA 


5520 


AGAAGAAATT 


TTTTCACAAC 


CACCGATAGA 


AGATCAAGAT 


GTAACACCAA 


ACGAAACTTT 


5580 


GCAAGCTTAT 


ATTCAAGCGC 


ATCGAGCAGA 


TACAGAAATG 


TCTGAAAATC 


CATCAGAAGA 


5640 


AACAGCAGAG 


ACCCAAGAGT 


TAGAAAACAG 


CGGTGAGGCA 


GTATTAACCC 


AAGCCGAAAC 


5700 


ACCAACTGAA 


TCCATTTCAG 


ATTCTGAAGA 


AGGGTTAACA 


AGTGCAGCGT 


CGCTAGAGGC 


5760 


ACCGAGCGAC 


GCTGATGCAA 


CCGAGGGAAA 


AGTTTCAGAT 


TCAGGTGTGA 


TACCACCGAT 


5820 


GAATGAAACT 


GGGAATGCTC 


AACCAGCGCC 


TGCTCCGAAG 


AAACCATCAA 


AAAAAGTAGT 


5880 


TTTCGTTGCG 


CTTGCGGTTG 


TTTTATTAGC 


TGGAGGTAGT 


GCTTGGGCGT 


ATCATGATCA 


5940 


AACGCAAAAA 


GCTGCCGCGC 


AAGAGGCGGC 


GGCACTAACC 


AAGAAAACAG 


ATACGCTGAA 


6000 


AGATGAACTA 


GCAGCCTTTT 


ATACCACGAA 


GGAGCAAGTT 


TTTATTAAAC 


CCGATATGGT 


6060 


GACTGTTAGT 


CCAGAAAAAC 


TCTCAAAACA 


AGTCGCGGAA 


ATTAAAGATT 


CAGAAGAGTA 


6120 


TTCTCAATTA 


AATAAACAAA 


TTCAAACATT 


GAAAGAGAAG 


CAACAAACCA 


TTCAACAAAT 


6180 


CAACCAATTA 


TTTGAGGCGC 


CGATTGTCAA 


TGGCAATGAG 


TTGAAACCGG 


CGATTCTAGC 


6240 


TGCTGATCAA 


CCAATTTCCG 


TAAAGAAATT 


AACAGGAAAT 


GATCCATTTG 


ACCAATTGAT 


6300 


GAACCAAGCA 


ATCGATCAaG 


CGAACCaACA 


ATACAACCAA 


TTACAAAAAG 


CTAAAAAAGC 


6360 


GGTTGAGGTC 


ATTTATAAAG 


ATGGAAAAAC 


TACCAATCAG 


CTGAATCGAG 


ACACGTATCA 


6420 


AGCTGCTAAG 


GCAGAAGTAG 


ATAAAGTAAC 


AAGTGATAAG 


CTGAAAAAAG 


AACTGGTCAA 


6480 


ACAAGTCACG 


ACAGCTGACC 


AAGCGTTAAC 


CAAGGTTGAA 


GAAGAACAGA 


AAAGAATTGC 


. 6540 


AGAAGAACAA 


GCTGCCGCAG 


AGCAAGCAAA 


ACAAGCAGAA 


GAACAAGCGA 


AACAAGCTGc 


6600 


TGCCGCAAAA 


AAAGA6GACG 


CTAAGAAGGA 


AGAAACAGCA 


AAAACAGAAG 


CGAATGGCTA 


6660 


CACGGCGCCG 


AATAGTGACG 


GGGTTTACAC 


TAGTCCGCTT 


TATGCGCCAG 


ATGCTGCCGA 


6720 
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TATCGCTGAT 


AGTAGCAATC 


CAGCGTGGAC 


ATGGGCACCA 


GGTGTGAAAG 


AAAAAGTCTT 


6780 


AGACACA6TG 


ATTGCTCGTG 


GCTATGTGGT 


TCCTGGCGGA 


TATTCTTTAG 


AACCTGCCAA 


6840 


AATTGTGAAT 


GGCGAAGGTT 


ACTACAATCT 


TTATGCAACA AACAATCAAT 


CAAAATTATT 


6900 


AGAAGGCACT 


ACAGAGAAAA 


ATGTTCACAT 


GTATTTAGTT 


ACAATCAATG 


CTAAAACAGG 


6960 


TTGGTTCAAA 


GGGAACGCTT 


CTCGTAATGC 


AGGGCAATAA AAGAAAATAG 


AAACAATCGA 


7020 


ACAAAAGCGG 


AAATCTTTTG 


TTCGATTGAT 


TTTCCTTGAG 


GAGTTAATGA 


TAAATGAGAG 


7080 


AATTACTAGC 


AACAACATTT 


ACCGGCATTG 


TGGTCGATGA 


AAATGAACAA 


TTTTATTTTG 


7140 


TTCAAAAAAA 


TGGCATCACA 


TTCCGTTTAA 


AAAAAGAAGA 


AGGTACGCAT 


GCTATAGGGG 


7200 


AGGCTGTAGA 


AGGGTTTGGT 


TATTTAAATC 


AAAAACAAGA GCCAGCACTG ACAACTACAA 


7260 


TTCCTACTGT 


GCGCATTGGT 


AGCTATGGCT 


TTGGTACTGT 


CACAGGAACG 


AGACGTGATT 


7320 


TAGGTGTCTT 


TGTAGACGTT 


GGCTTACCAG 


ATAAAGATGT 


CGTTATTTCG 


TTGGATGAAT 


7380 


TACCAACTAT 


GCGGGAACTT 


TGGCCTAAAA 


AAGAAGACCG 


AGTAATGGTT 


ACTTTAAAAG 


7440 


TGGACAGTAA 


AGACCGCATT 


TGGGGCGAaT 


TAGCAGnTGA 


nAAAGTCTTT 


AAAGCAATGG 


7500 


CAAAACGTGG 


GAATGCTGAA 


ATGCAAAACC 


AAAACTTAAC 


CGGGCATGTG 


TATCGTCTGA 


7560 


AACTTGCAGG 


AACGTATATT 


TTAACTGATG 


ATTTATACAT 


TGGATTTATC 


CATCCTTCTG 


7620 


AACGATTCCA 


GGAACCACGT 


TTAGGTGAAA 


AAGTAGTGGG 


GCGTGTGATT 


GGCGTGCGGC 


7680 


CGGACGGCGT 


ATTGAATCTT 


TCATTAAAAC 


CAAGAAGTCA 


TGAAGTAATC 


AGCGATGATG 


7740 


CGTTAATGAT 


TCTAACTTTC 


TTAGAAAGAG 


CAAACGATCA 


CCAAATTCCT 


TTCACCGATA 


7800 


AATCAAGCCC 


TGATGAAATT 


AAACAAACCT 


TTGGCATTAG 


TAAAGCTCAA 


TTTAAACGCG 


7860 


CCTTAGGCCA 


TTTAATGaAA 


CAAAAATTAA 


TTAAACAAGA AgATGGTAAA aCAATTCTAA 


7920 


TTGGcTCGTT 


GGrACAATCm 


AAAGAAA 








7947 



(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7865 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



GCACCAGCGT GGTTGGATCA TTCTGATCAC CCGTTTCATT GGCCCGATAC TGGTGTTTAT 60 

TTATATGAAC CTGGCCAAGA AGTTGTTTTT CAATTCGATC AATTGGTAAC AGATATTCGT 120 

GAATGACCCA GAAACCAGTT AAAAAGCAAT TGCGTTTAGG GTTAACTACT TTATTTGCTG 180 

TACAATTtAT GmAAGAAaTT TCACGATTTT TAaCAACCCA TCCACATGTG AaTTTAATAT 240 

TACAGCArGA CGGCTCGCCA AAaTTACAAa CGATGTTAGC AAATAAAGAA ATTGATATGG 300 

GACTAATTTC TTTCCCAAAT ACCTTACCTG AAATTATTCA TATTGAACCT TTAGAAACGA 360 
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CTACCAAAGG 


CTACCATGTT 


TATGTCGTAG 


TTCCAGAATC 


AAATCCTCTC 


TCCCAATATG 


420 


AAAAATTAAC 


TTTTAAAGAT 


TTGAAAGATC 


AACgATTTTC 


ATCGTTAAGT 


GATAATTTTA 


.480 


TGATTGGTCG 


CCTACTACTA 


GATCGGACTC 


GGAGCTTCGG 


TTATGAGCCG 


AATATCATTT 


540 


TGCATAACGA 


TGATTTACAA 


GTACTTCTTT 


ATAGTTTGCA 


AAAAAATAAT 


TCGATTTGTT 


600 


TGCTGCCGAT 


TGAATATTAT 


GAAGTGGGAA 


AAAGTCAGGG 


ACTGAAATGG 


ATTCCtTTAA 


660 


AAGATAAGTT 


TGACTATTTC 


CCCaTCGGCA 


TTGCTTTGCG 


CCGCGATTTT 


AGTCTGACAG 


720 


AAGATGTTAG 


AGATTTTATC 


CaAATCATTA 


AAGAAAATTA 


ATGGATAATC 


TAGCCTTGAT 


780 


AAGCAAAGAC 


GATTTTACTA 


TCATAAATTT 


TCAACACTTT 


TTCAAATAAG 


AATTTTGAGA 


840 


ATAAACCCGC 


GACAATTGGA 


ACAACAAACC 


AAGAAAAGAG 


TAAGGGTAAG 


ATCCCTAAGC 


900 


CCGCATCCAA 


TGAAGCAAGT 


GGTCCCACGA 


GTCCAACCAA 


GCCAAATCCA 


GCCGATTGTG 


960 


GGGTTCCTGA 


AATGTTGAAT 


AGAACCACAG 


GAATTGCTGA 


AATAGTTGCT 


GTAAATAAAC 


1020 


AAGGAATTAA 


AATAATTGGG 


TATTTAAATA 


AGTTAGGCAT 


CATCATTTTC 


ATGCCGCCCA 


1080 


AAGAGACAGC 


CAGTGTCACT 


CCCGATTGGT 


TGACATTCCA 


AGAATTAATC 


ACAAGAACAA 


1140 


CAGTTGTGGC 


TGCAATTCCC 


ATTGCAGCAG 


CACCTGCAGA 


CAAGCCATTC 


AACTGT^ATCG 


1200 


CCAAACCAAT 


TGCCACAGTA 


GTGATTGGCG 


AAATAATCAG 


TGCAGCAAAT 


GAACAAGCAA 


1260 


TTAAAATACT 


CATGATAATT 


GGCTGAAAAT 


CAGTAAAATT 


GTTAATCACT 


TTCCCGATGG 


1320 


CAACCGTAAT 


TTGTGTCACA 


TATGGATAAA 


TCAACATGCC 


AAACAAACCT 


GCACCTACTC 


1380 


CAACAACGAT 


CGGTAAAGCA 


ATGATTTCCA 


CTGAACCGAA 


TTTTTTATCG 


ATGACTAATA 


1440 


AAAGTAAGAC 


TGCAACAGAA 


GCGGTAATCA 


TAATATTAAT 


GATGTCACCA 


GTGCCTGCAC 


1500 


CAATAAAGCC 


TTTCACxTCT 


GaATTAAATT 


TAATCACGCC 


AGAGCCAGCA 


AATGCTGCAC 


1560 


CGCCAGCAAT 


CATCATTTTC 


TGTGGCGTTA 


AGCCAAATTG 


AAAAGCmATC 


AGTCCACCAA 


1620 


TAATTAAAGG 


GGTGGCTAGT 


TGGAAAATTT 


GCGCCGCATG 


AATAATCATT 


TCGATAATCT 


1680 


TGTATTCTGC 


AAAATATTTT 


AAAATGGCTC 


CTAATACAGC 


ATTGGGAATT 


AAGGCGATAA 


1740 


TGGTGCCGGA 


AgcTGTTCCA 


GCTAACACTT 


TGTTAAAAAA 


TATTCTTGGG 


GTTAATTGAG 


1800 


CATCCAGTTT 


ATCAGTGGGC 


ATTTTCAACA 


CTCCTATTGA 


TTATTTTTGT 


TTTTTTAGAA 


1860 


CAAGTTTGAG 


AATGATTGAA 


CTTTCTATTA 


GATAAAGAGC 


GGCCTCAAAG 


AAGAGCGCTT 


1920 


ATTTCTTTGA 


GGCGCTTTTT 


TTTTAACTAA 


AGCAAGGAAC 


GATTTTTAAT 


ATATATCCGT 


1980 


ATCCGCATTA 


TATTTCGCTA 


AATTATCTTT 


AACATCTCGc 


AAGAACTTAC 


CTGcTTGTTG 


2040 


TCCATCTAAA 


ATTCGATGAT 


CAATAGAAAG 


ACATAAATTA 


ACCATATCTG 


CCACTTTAAA 


2100 


GCCTCCATCT 


GCGGTTGGCA 


CAAGGCGTTT 


ATTAATTGAC 


TCTACTTGTA 


ATATCGCAGC 


2160 


TTGTGGATGG 


TTAATAATTC 


CCATCGATTG 


GACGGAACCT 


AGTGTTCCAG 


TATTATTAAG 


2220 


AGTAAAGGTC 


CCTCCTTGCA 


TTTCTTTGCT 


TGCTAGCGTT 


CCTTGTCGGA 


CTTCTTGTGC 


2280 


CAAGCGATTG 


ATTTCTTTGG 


CTAGCCCTGC 


AATCGAATAG 


TTATCCGCTT 


GTTGAATCAC 


2340 
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TGGCACATAA AGATGCTCGT CTGTCGTTAC AGCTArGGAT AAATTAACAT CTTTATGATA 2400 

AATAATT6AG CCATCATCCC AAGATGTATT ZMTlZTGGk TTTTTCTTTA ACGCTTGAAT 2460 

GACTGCTTTG GCAAAAAATG GGAAGAAGCT ?AGAG.-.TAGG CCCTCTTGTT GTTTAAATTC 2520 

ATCTTTTAAT GAATTTCTAA G7TGG;ICTAA GTTGGTCACA TCCGCTTCCA CCATTAACCA 2580 

A6CGTGAGGG ATTTCGTT6A CGCTT7GGAC CATTTrTTTA GCAATTGCTT TACGTACAGG 2640 

ATCTGCAGAG ACAATTTTAT CTGTGCTTGT TTCAGrTGGA CTAGCTGTTG CGGCGCTTTC 2700 

ATTTTGCGAA gCAACTGGCT CCTCTGAAAT 7GAAGKCTA GTTCCTGGAC TCACTGTTTT 2760 

CTCAGGTGTT CTPGCCTGTG TTGGTGTAAA 6TTXGIAACA 7CTTTTCTAG TAATTCGACC 2820 * 

ATCACGACCT GTCCCTGTAA CTTGTGTTAA ATCAAtTCTTT TTTTCTTGGG CAATTTTAAG 2880 

GACTGCTGGG GAATAACGAC CATTA7TTTT TTGATGGGAC GTCGCAGTGC TTGTTGTTTC 2940 

AGCCGTTTCA TGTTCTTGCG CTTGT7CCGC ACTTGCTTCT TTCACTGGTG CTAAAGTAGC 3000 

TACTTCTGTT TCTTCTGTTG TTTCT7CTGT CTCTMCGTC ATCACCGCTG TTCCGATGGG 3060 

CACATCTGTA TCTAGAGAAA TCAGAAATTC ?TTTACTACA CCATCAAAAT CTGATGGGAC 3120 

TTCCGTGGTG ACTTTATCAG AAACGACTTC CATTAAGGGA TCATAgCGTT TCACTGAATC 3180 

TCCTGGTTTA ACTAACCATT GGACAATGGC TGCCTCTGTG ACACTTTCGC CTAGATGAGG 3240 

CATTTTGATT TCTTTTGTGG CCATT7ATTT TTTCCuCCTT GCTTCAATTA AAATTCTGCT 3300 

AATTCTTTCA TTGCTGCTAA TACTTGCTCT 7CATTGATTA AAAATTCTCT TTCTAAAGGT 3360 

AATGCATAAG GCATGCTTGG ACAATCAGGT CCTGC^AGTC GTTGAATCGG TGCATCTAAA 3420 

TCAAAGAGAG CATCTTCTGA AATCATTGCC GCAATITCAC TCATCACGCT GCCTTCTTTA 3480 

TTGTCTTCTG TTACGAGTAA GACCT7CCCT CTTTTCTTAG CTGCAGCCAC TAATGTTTCG 3540 

CGATCTAAAG GATATAATGA GCGAACATCG ACGATTTCGG CATCAATTCC TTCAGCGGCT 3600 

AGTTTTTCCG CCGCAGCTAA AGCTAATTGC AAGGTCATGC CGTAACTAAT CACGGTTAAA 3660 

TCACTACCCG TTCTAACGAC A7TGGCTTTG TCAATTGGTA CGATATAATC ATCTGCAGGC 3720 

ACTTCATCTT TTAGTAAACG ATATAAACGC TTATGTTCgT AAAAAATGAC TGGATCATCG 3780 

GAAC6AATCG CTGCTTTAAT CATCCCTTTT GCATCMAAG GATTAGAAGG GGTCACCACT 3840 

CTCAATCCT6 GTTGTCCACA AAAGACTTTT TCTGTi^GACT GAGAATGATA CAGCCCACCA 3900 

CGAACACCGC CGCCATAAGG GGTCCGATAA ACGATTGGCG CAGTCCAATC GCCTTTTGTT 3960 

CTATAGCGCA TT6TCCGAGC CTCTGATAAC AATTG^^TTGG TTGCAGGCAA AATGTAATCC 4020 

GCAAATTGAA ATTCACCAAT TGCGCGATAG CCCATTAAAC CTAAGCCGAC AGCTAAACCA 4080 

CCAATTAAGC CTTCTGTTAA CGGTG?ATTA AAACAACGCT CGTCACCGTA TTTAGCAGCC 414Q 

AAGCCCTTTG TTACACCGAA CACGCCGCCT rXGTCGCCGC CGACATCTTC TCCAAAAATA 4200 

ACTACTTTTT CATCACGAGC CATTTCTTCA GAAAT!rCCTA AGTTAATTGC TTCTAAATAA 4260 

GTCATCTCAG CCATTATATT TCTTC?CCTC CTCTCTTATT TTGCATACAC TTCTTCTAAA 4320 
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ATTGATGTCG 


GTACAGGATC 


TGGCATTGCT 


TCTGCTTCAT 


CCGTTGCTTG 


ATTGATTTCT 


4380 


GCACGAATTT 


CTTCATCAAT 


TTTGGCAATG 


TCTTCGTCTG 


TTAAATAGCC 


CTCTTCTAAT 


4440 


AATTGTTTTT 


CAAAAAGCTT 


CACTGCATCG 


TTCTTTTTCA 


TTTCTTCAAT 


TTCTTCTTTT 


4500 


GAACGATAAA 


CAGATTGATC 


ATCGTCAGCG 


GAATGAGACG 


TCAAGCGCGA AACCATTAAT 


4560 


TCAATCAATT 


TTGGTCCTTT 


TTTCCCGCGA 


GCCGCTTTTA 


CTGCTTCTTT 


AAATGCTAGA 


4620 


TAGACTTCAG 


TAAAATCACT 


ACCATCAACG 


GTTACACCTT 


CAAAGCCATA AGCTtTCGCG 


4680 


CGATCGGCCA 


TTCGTTTATT 


GGCATACTGT 


TCTTCAATTG 


GGACAGAAAT 


CGCATATTCA 


4740 


TTATTTTCAA 


CAACAAAAAT 


AACTGGTAAT 


TTTTTTACGC 


CTGCAAAGTT 


CATAGCTTCT 


4800 


TGGACTTCTC 


CTTGATTGGC 


AGAGCCTTCC 


CCAGTGGTGG 


TTAATGCAAC 


AAAATCAGCT 


4860 


TTTTGAAGTT 


GCGCTGCATA 


ACCAACACCT 


GTCGCTAATG 


GCATTTGTGT 


ACTTACTGTT 


4920 


GAAGAGAAGG 


AAACAATATT 


ATGCTCTTTT 


GAACCATAAT 


GATTCGGCAT 


TTGACGACCA 


4980 


TGGGAAGAAG 


GATCCGCTTC 


TTTTCCAAAA 


GAACCCATTA AAATATCTTT 


GGAGGTCATG 


5040 


CCCCAAACCA 


AGCACGCGGT 


CATATCACGA 


TAATACGGTA 


AAAAATAATC 


TTTTTGAGGA 


5100 


TCAAAAGCCA 


TCGCCATCGC 


TACTTGTGCA 


ACTTCTGCCC 


CTTGACCAGA 


AATATTGAAA 


5160 


GAGGTCTTAC 


CAATCCTTGT 


TAATTGCCAC 


AACCGTTCGT 


CTAAACGaCG 


tCCTCTTAGT 


5220 


ACTTcAcGAt 


ATGCCTGaAT 


CAaTtCTTCT 


TTTGATAAAC 


CTGATTTTTT 


AAGCGCTTTC 


5280 


ATTTTCATCA 


ACCTTTCTCT 


ATTTGTGAAT 


CGCTAAGCCA 


TAAGTATCTA 


AGGCTGCTTC 


5340 


TTGCAATACT 


TCTGTCATTG 


TTGGATGCGC 


ATGGATTGCT 


TCACCAATTT 


CAATGGGCGC 


5400 


GGCATCTaAA 


TACATGGCAG 


TACTTGCTTC 


GGCAATTAAA 


TCCGTGACAT 


GTGGGCCAAT 


5460 


CATAGAAACC 


CCTAATAAAT 


CATCGGTCTT 


CTTGTCACGA ATTACTTCTA 


TAAAGCCATC 


5520 


TGTTTCTCCA 


TAAACAAGTG 


ATTTACCATT 


ACCATTAAAA 


TTAAAGGTGC 


CAATTACTAC 


5580 


TTCTTTTTCT 


GCTGGTAAAG 


TTTCTCTAGT 


ATAGCCGACA 


CTTGCTATTT 


CAGGATTTGT 


5640 


ATAAACACCT 


CGAGGTACGT 


TTGTATAATT 


TAAAGGTTCA ACTGTCTCGC 


CTAAAAGATG 


5700 


CTGAACAGCT 


AACTCTCCTT 


CTTTCATAGC 


CACATGTGCT 


AGTTGGAGTG 


TATCGATACA 


5760 


ATCACCAATC 


GCATAAATAT 


GTCCTtCTGT 


GGTTTGATAA AATTCATTTA 


CTTCAATGCC 


5820 


TTTGTCAGTA 


TATTTCACTG 


AGGTATTCTG 


TAACCCTAAC 


TTATTGATAT 


TGGGCTGACG 


5880 


TCCAATTGCT 


ACCATGACTT 


TGTCAACGGT 


GAGGGTTTCT 


TGTCCTGCAA 


CTTCTACTTG 


5940 


AACCTTTTGT 


CCAGTAACTT 


TCGCTTCTTG 


GACTTTGCTT 


CCTT^ATAAAA 


TGTTAATTCC 


6000 
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AGAAATTGTT 


GCACTCTCAT 


TTATAAGTAA 


6060 


ACGATCCAAA 


AATTCAATGA 


TAGTTACGTT 


GACACCTAAA 


CTATTTAATA 


AAGAAGCCCA 


6120 


CTCGACACCA 


• ATAACGCCAC 


CACCAATAAT 


TGCAATTGAT 


TCTGGTAGCT 


CTTCTAATTC 


6180 


AAGCATCCCA 


TCAGAAGATA 


AGATAAATTC 


TTCATCCAAT 


GGCAAATTAG 


GTAACGTTTT 


6240 


AGAACTAGAG 


CCTGTCGCAA 


TAATGACATT 


TTTAGGAcGA 


TAATTTCTtC 


TTCTCGTGTT 


6300 
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GGAnCATTAA 


ACGTAACTGC 


CACTGCGCCA 


GAGACTGGTG 


AAAAAATGGA 


GGGACCTAAA 


6360 


ATCGCACCTT 


QGCCTCCXAA. 


MTTgATTyT 


ATTTTTCTTG 


CATAAGCCTT 


CAACACCTTT 


6420 


GTGTAACTGC 


TCAATAATTC 


CTTCTTTCCG 


TTGTTGTATT 


TTAGAAAAAT 


CAATGGACGC 


6480 


TGCCTCTGTT 


TCAATCCCAA 


AAGAAGCCGC 


TTGTTTTAAG 


GTATCAAAAA 


CCTCTGCGCT 


6540 


TCTTAATAAA 


GCTTTTGTAG 


GGATACAGCC 


TTTGTGCAAA 


CAAGTGCCAC 


CTAATTTGTA 


6600 


TTTTTCAACA 


ATTGTGACGT 


TTAGCCCTTT 


TTGTGCAGCT 


CGAATTGCTG 


CTACATAGCC 


6660 


TCCTGTTCCG 


CCACCTAAAA 


TCAGTAAATC 


TGTTTGTTCC 


GCCATTTTGA 


TCACTCCTGC 


6720 


TCAATTTTTG 


CTTCACTATA 


ATTTAAAGCT 


TGTTCTTCCC 


CTGTTAATAC 


TCGATTGACG 


6780 


CCTTCATATA 


AAGCAGCCAT 


CTCCATTTCA 


CCTGGGTAAA 


CCTTAATTGG 


TGCAATCCAA 


6840 


GTGACTTTTT 


GACTAATTTC 


TTGAACAACG 


GTTTGCGAAT AgcTGCGCCC CCTGTTAAAA 


6900 


TAATTGCATC 


AATTGTGCCT 


TCAAGAACCA 


CGGCCATCTC GCCAATACTT TyyGCGATTT 


6960 


GGTAACACAT 


TCCTTTTAAA 


TAGTAATTTG 


CTGTTTGATC 


ACCTGCAGCT 


ATTTGTGCTT 


7020 


GGATATGCCG 


TAAATCTGTT 


TCACCTAGGT 


ATGATTTAAG 


ACCGCTATTC 


CCCGCAATTA 


7080 


GCTTTTTCAC 


TTGGCTAATC 


GTTAACTCCT 


GTTCAAGTAT 


CCATTGCGCA 


AATTCAACGA 


7140 


GCGGTAAGGC 


ACCACTGCGT 


TCTGGCGTAT 


AAGGaCtTCA CCATCAgCCC ATTTACCACA 


7200 


TCCACawGCG 


nCcTTTTgAT 


GCGCCCCCnA 


GCTAATGCCC 


CCACCTAAAT 


GAACAACGAT 


7260 


AAAATTGCTT 


TGCTCATATG 


TTTTCCCtAA 


ATCCTCAGCG 


ATTTTTCGAG 


CGACCGCTTT 


7320 


TTGATTTAAG 


GCGTGACCcA 


AaCkaCGCCG 


CTGAATGCCT 


TTTAATCCAG 


AAATCCTGGC 


7380 


TAATGGCTGT 


AGCTCATCTA 


CAACAACAGG 


ATCGACAATA 


AAGGCTGGCA 


CGTGATATTT 


7440 


TTCTGCAAAC 


TCATTAGUTA 


AAATCGCCCC 


TAAATTTGAA 


GCATGGGTAT 


TAAAGCGTTC 


7500 


TGTCCGCAAG 


TCTTCTAGCA 


TTTGTTGGTC 


GACAAGATAC 


GTACCGCCAG 


GAATAGGCTT 


7560 


AAGCAAACCA 


CCACGCCCTA 


CAACAGCGGC 


TAATTGTATG 


ATGTTATGTG 


TTTCTAAAAA 


7620 




ATTTGTTTTC 


laAAA 1 isA 1 V9 1 


TTGACTGACA 


ACATTTTCAA AAGGTGCCAA 




TTCTTGGACG 


CTATGTCTAA 


GCGTTTCTTC 


TGCCAAGCAA TCATGATTAG 


CAAAAAGCGC 


7740 


TAACTTCGTG 


GAGGTCGAGC 


CAGGATTAAT 


AACCAATACT GTTTCCATTG 


TtTTCTGCTC 


7800 


CTTkTTTtAA 


ACTTGTCTCA 


TGGCAAATCT 


TAATGAATGG 


AATTTGCTTT 


CTGTCGAATC 


7860 


GCTTC 












7865 



(2) INFORMATION FX)R SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4023 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
TGCTCCnGnT GTGCTCGATG TATTACAAGA AAGATTCCAA ATTCTTCGCA ATATTTATTG 



60 
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GATGCAACCT ATTGGACGTA GAAGTTTATC AGAAACGATG GGAATTACAG AGCGAGTCTT . 120 

GCGTACAGAA ACAGATGTTT TAAAGCAACT CAACCTGATT GAACCTTCAA AAAGCGGGAT 180 

GACATTAACA GAACGCGGTT TGGAAGTCTA CCAAGGCTTA GAATTAGTCA TGAATCAATT 240 

GCTTGGCATG CACCAAATCG AAAAAGAAAT GACACAGTAC TTTGGCATTC AAGGTTGTAT 300 

CGTTGTTGCA GGAGATAGTG ATATCCAAAA GAAAGTTCTT TCGGACTTTG GAGATGTCTT 360 

AACCAATACG CTGAATTTAC TCTTACCAAA TGGTGAGAAT ACAATTGCGG TTATGGGTGG 420 

GACAACAATG GCCATGGTGG CCGAAAATAT GGGATCACTA GAAACTGAAA AACGTCATAA 480 

TTTGTTTGTT CCAGCCAGAG GAGGGATTGG TGAAGCAGTT TCTGTCCAAG CCAATTCAAT 540 

TAGTGCTGTA ATGGCCAACA AAACGGGTGG TAACTACCGG GCGTTATATG TCCCAGAACA 600 

ATTGAGTCGT GAAACGTATA ATAGTTTATT ACAAGAACCC TCGATTCAAG AAGTTCTTAC 660 

GTTAATTAGT CATGCCAACT GTGTGGTGCA CAGTATTGGT CGAGCGTTGC ACATGGCGGC 720 

ACGCCGaAAA ATGTCTGATG ACGAAATGGT TATGCTCAAA CAAAAAAATG CCGTGGCAGA 780 

ATCGTTTGGT TATTTCTTTG ATGAAGAAGG AAAAGTGGTC TATAAAATTC CACGAATTGG 840 

CCTTCAATTA AAAAACTTAC AAGAAATTCC CTATGTCGTC GCAATTGCGG GCGGCAAAAC 900 

GAAAGCCAAA GCCATTCGCG CATATATGAA AAATGCACCA AAACAAACGT GGcTCATCAC 960 

AGATGAAGCT GCCGCAAACG AGATTTTAAA AGGGGTAACC CTTTAAAATA AAAAATTTTT 1020 

GATTTTCATA AGGAGGAAAT CTTTAATGAC AGTTAAAGTA GGTATTAATG GATTTGGACG 1080 

TATCGGACGC TTAGCTTTCC GTCGTATCCA AGATGTAGAA GGAATCGAAG TAGTTGCAAT 1140 

CAACGACTTA ACAGATGCTA AAATGTTAGC ACACTTGTTA AAATATGACA CAACTCAAGG 1200 

CCGTTTCAAC GGTACAGTTG AAGTTCACGA AGGTTCATTC AACGTTAACG GCAAAGAAAT 1260 

CAAAGTTTTA GCTAACCGTA ACCCTGAAGA ATTACCATGG GGCGAATTAG GCGTAGACAT 1320 

CGTTTTAGAA TGTACTGGTT TCTTTACTTC TAAAGAAGCT GCTGAAAAAC ATTTAACTGC 1380 

TGGTGCAAAA CGTGTAGTTA TTTCAGCTCC TGGTGGTAAC GATGTACCAA CAATCGTTTA 1440 

CAACACAAAC CATGAAACAT TAACTGGAGA AGAAACTGTA ATTTCAGGCG CTTCTTGTAC 1500 

TACAAACTGC TTAGCTCCAA TGGCTAAAGC TTTACATGAC AACTTTGGTG TTGTTGAAGG 1560 

TTTAATGACA ACTATCCACG CTTACACAGG TGACCAAATG ACATTAGACG GACCACATCC 1620 

TAAAGGCGAC TTCCGTCGTG CGCGCGCTGC TGCTGCAAAC ATCGTACCTA ACTCAACTGG 1680 

TGCTGCTAAA GCAATCGGCT TAGTAATCCC AGAATTGAAC GGTAAATTAG ATGGCGCTGC 1740 

TCAACGTGTT CCTGTAGCAA CTGGTTCATT AACT6AATTA GTTACTGTAT TAGACAAAGA 1800 

AGTTACTGTT GATGAAGTAA ATGCAGTAAT GGAAAAAGCT GCTAACGAAT CTTATGGTTA 1860 

TAACACAGAC GAAATCGTTT CTTCTGATAT CGTAGGTATG ACTTACGGTT CATTATTCGA 1920 

TGCAACTCAA ACAAAAGTGA TGACAGTTGG CGACAAACAA TTAGTTAAAA CTGTTGCTTG 1980 

GTATGACAAC GAAATGTCAT ACACTGCTCA ATTAGTTCGT ACTTTAGAAT ACTTCGCTAA 2040 
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CTTATAAGAT 
AAGCAACGCG 
AGGAGGTCTA 
CCTTGTCCGT 
TATCGTAGCG 
TTTCTCTCAT 
AGTGGCTGAA 
TGGTAAAGAA 
AAACACTCGT 
TAAATACTGG 
TGCACATGCC 
T^ATGGAAAAA 
AGCAATCTTG 
AAAAGCCGAT 
CTATAGTGTT 
GGAAAAAGCT 
CAATGACGCT 
GGATATTGGT 
TGTTTGGAAT 
TGTTTGTGAA 
TGCTGCTGCA 
CGGAGCAAGT 
TAAATAATCA 
CAAACCAATT 
TGCAGAAGCA 
TCCAGCATTA 
TGCACAAAAC 
AATTGCAGAT 
CCACGAAACA 
ACCAATCTTC 
GATCGAAGGT 
GGTTATCGCT 
TGCAGATGAA 



TCATTCTTAT 
CTTcTCCGCT 
CTCATGGCTA 
GTTGACTTTA 
GCTTTACCAA 
TTAGGTCGTG 
CGTTTAAGCG 
TTAGAAGACG 
TTTGAAGATG 
GCTTCTTTAG 
TCAAACGTTG 
GAAATTAAAT 
GGTGGCGCAA 
AAAATTTTAA 
GGTTTATCTT 
GGCGACAAAT 
CCTTTCCACA 
GAAAAAACAA 
GGACCAATGG 
GCGATTGCTA 
GCAATTCAAT 
TTAGAATTAT 
GTTTAAAAGA 
ATCGCTGGTA 
GTAAAAAAT6 
TTCTTAGCTC 
TGCTACTGGG 
TTAGGTGTAG 
GATGAAGATA 
TGTTGTGGTG 
CAAATCACTA 
TACGAACCAA 
ATCTGTGGTG 



CATTAGTGAA 
TTCTTTTTTT 
AAAAGACAAT 
ACGTTCCGTT 
CAATCAAATA 
TAAAAACAGA 
AATTACTAGG 
CTGtTAACAA 
TTGACGGCAA 
GCGATGTATT 
GAATTGCTTC 
TCATCGGCGA 
AAGTTTCTGA 
TCGGTGGCGG 
TACTTGAAGA 
TAGTTTTACC 
CGGTTCCTTC 
TT6AATTATT 
GCGTATTTGA 
ATTTAGAAGA 
TAGGCTACGA 
TAGAAGGCAA 
CAATATTTTT 
ACTGGAAAAT 
CTGTTCCATC 
CTTTAGCTTG 
AAAACGCTGG 
ACTATGTGAT 
TCAATAAAAA 
AAACTTTAGA 
AAGGCTTAGT 
TTTGGGCAAT 
TTGTTCGTTC 
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ATTTGAACAA 
AAAAAAGAAA 
TAAAGATGTA 
AAAAGATGGC 
CGTGATTGAA 
AGAAGACAAA 
TCAACCAGTT 
TATGAAAGAT 
AAAAGAAAGC 
TGTCAATGAT 
AACTGGTATC 
AGCTGTTGAA 
CAAAATCGCT 
TATGGCTTAC 
AGATAAAGTG 
GGTTGATACA 
AACTGAAATC 
CGCTAACGAA 
AATGAGTAAC 
CGCAACAACA 
AAACAAATTC 
AACTTTACCA 
AAAAAAAGAA 
GAACAAAACT 
AAACGACGTT 
GAACCTAAAA 
TGCCTTCACT 
TATCGGACAC 
AGCCAAAGCA 
AACGTATnAA 
TGGTTTATCA 
CGGAACTGGT 
AACAGTTGAA 



TTGTGAAATA 
AATCCTAAAA 
GATTTAAAAG 
GTGATCACAA 
AATGGCGGAA 
GCTGGCAAAT 
ACTTTTGTTC 
GGCGACGTTT 
GGCAATGACG 
GCTTTTGGTA 
CCAACAGTTG 
AATCCAAAAC 
GTCATTGAAA 
ACATTCATGA 
GACTTAGCGA 
GTTGTCTCTA 
CCAGATGATG 
TTACAAGGTG 
TTTGCCAAAG 
ATCATCGGTG 
TCTCACATCT 
GGTTTAGCTT 
AAGGATGTGC 
TTAAGCGAAG 
GTTGACGCTG 
GATTCTGAAG 
GGTGAAAACT 
TCTGAACGTC 
ATCTTTGCAA 
GCTGGTAAAA 
AACGAACAAG 
AAATCTGCAG 
AAATTATACG 



ATAAGCGGGG 
TTGACAATAC 
ACAAAAAAGT 
ACGATAATCG 
AAGCAATTCT 
CATTAAAACC 
CAGAAACACG 
TAGTATTTGA 
CTGAATTGGG 
CTGCACACCG 
CTGGTTTCTT 
GTCCATTCGT 
ACCTAATTGA 
AAGCACAAGG 
AAAGCTTAAT 
AAGAGTTCAG 
AAGAAGGCTT 
CGAAAACAGT 
GAACAATTGG 
GCGGGGACTC 
CAACAGGTGG 
CTATTAACGA 
TTTTCATGCG 
CACAAAGCTT 
TGATTGGATC 
TGA/^TTAGC 
CACCAGCTGC 
GCGAATATTT 
ATGGTATGAC 
CAGCGGAATG 
TTGCTTCAAT 
ATGCTAATAT 
GCAAAGAAGT 



2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760. 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 
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TTC 4023 
(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 



GGGCGGTGTC 


ACTGCGTCTA 


TTTTCGGTTA 


TCGGCCAACC 


TTTTTCATTA 


CTGGTACGAT 


60 


TTTATwACTC 


GTTTTTGTTT 


TGAGTCTCGT 


CTTTGTCCAT 


GAAGAGTTTG 


TGCCAATTGA 


120 


AAAAAATCAA 


GCCGCATCTG 


GGAAACAAAT 


TTTAAAAAAA 


CTAGAACATC 


CCCACGTGAT 


180 


TCTTGGAATG 


TTTATTACGA 


CATTAATCAT 


TCAAGCTTCC 


AATAATTCAA 


TTAGTCCCAT 


240 


CATCAGTTTA 


TATATTCAAC 


AATTGTTGGG 


TGGTCACGGA 


AATGTCACCT 


TAATTAGCGG 


300 


AGTCATTGCT 


TCTATTCCAG 


GGATTGCAAC 


GTTAATAGCC 


GCCCCTCGTT 


TCGGCCGGTT 


360 


AGGCGATCGT 


ATTGGCAGTG 


AACGCATCTT 


GACAATTGGC 


TTAATTTTAG 


CCATTTTTGT 


420 


TTACCTACCA 


ATGGCCTTTG 


TTCAAAATGT 


CTGGCAGCTT 


GCCATGCTTC 


GTTTCTTAGT 


480 


TGGGATTTCC 


GATGCCTGTT 


TACTACCAGC 


TGTTCAAACG 


TTGATCACTC 


GTTATTCCCC 


540 


AAGTGACGCT 


GCAGGTCGTA 


TTTTCAGTTA 


CAATCAATCT 


TTCCAAGCGA 


CAGGAAATGT 


600 


GATTGGTCCT 


ATGATTGGTT 


CAAGCGTATC 


TGCCGCTTTC 


GGTTATCGTG 


GGGTCTTTAT 


660 


TTCCACTTCT 


TGTCTGGTTC 


TCCTTAACCT 


TCTATGGGTT 


CGTCGAAGCA 


CCGCCGAATT 


720 


AAAAAAAGAG 


AAAAAT&<^TG 


ACTAAGTTAT 


ACACAGAAAA 


GTTATCCACA ATTTGAATTT 


780 


TTAATTTTTC 


TCTTTTTCTT 


ATTTTAAAAA 


AAATAGGCAT 


ATAAAACAAA 


AAAGAAAAGC 


840 


r*/^ n T n n* TT a r* 

\9wilAl 1 1 Aw 


luui 1 1 IA1\3 


AL.L>A\9W\ 1 A 1 


1 1 1 1 L> 1 


TTTGGtTAAC 


TATACGCTTT 


900 


TTGTTTCACA 


ACTCGTGAAA 


CTTTTTAGAC 


GTATTGTAGT 


TCTTTTTCAT 


CTTGGTAGGC 


960 


GCGATCAATT 


AAGCCGCCCC 


CTAAGCATTC 


CATCCCATCA 


TAAAAAACAA 


CAGCTTGACC 


1020 


TGGAGTGACT 


GCACGCGCTG 


GTTCCTTAAA 


CGTAACGGTT 


GCTTTTGTGC 


CATCTTCAGA 


1080 


TAAGGAAACA 


TGCACTGGAA 


TATCTGCTTG 


ACGATAACGG 


AATTTAGCCG 


TACAGTCAAA 


1140 


CTCTTTTGGc 


ATTGGTGTAT 


CAACAGTAAA 


ATGAACCTCA 


CTAGCTTCTA 


AATGTGtTGc 


1200 


ATATAGTTTT 


TCATGGTGAA 


ATCCTTGtCC 


CACATATAAT 


GTATTGGTAG 


TTAAATCTTT 


1260 


CCCAATCACA 


AACCATGGTT 


CTTGTGTTTT 


ACCGCCGCCA 


CCAATTCCTA 


AGCCTTGGCG 


1320 


TTGACCGATA 


GTATAATACA 


TCAGACCATC 


ATGTTGTCCT 


TTGATTTCGC 


CATCTTCTGT 


1380 


CACCATGTTG 


CCTTTTTTAG 


CTGGTAAATA 


ATTACTCAAA 


AATTCTTTGA 


AATTCTTTTC 


1440 


ACCAATAAAA 


CAAACGCCCG 


TTGAGTCTTT 


TTTCTTGGCT 


GTGGCTAATC 


CAGCACGTTC 


1500 


AGCAATTGCA 


CGAACTTCTG 


ATTTTTCCAT 


GCCACCCAAT 


GGAAACATTG 


TTTTTGCCAA 


1560 
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TTGGGCTTGT 


GATAATTGGC 


TTAAAAAATA 


GGTTTGGTCT 


TTIATTATTGT CAATTCCACG 


1620 


TAACATATGC 


ACTACGCCAT 


TTTCATCTGT 


TTCTACTTGC 


GCATAGTGGC 


CAGTkGCAAC 


1680 


ATAGTCAGCG 


CCTAGTTGCA 


TAGCATAATC 


TAGGAATGCT 


TTAAATTTAA 


TTTCTTTATT 


1740 


ACACATGACA 


TCTGGATTTG 


GTGTCCGTCC 


TAACCGATAT 


TCTGCTAAAA 


AATATTCAAA 


1800 


CACACGGTCC 


CAATATTCTT 


TTTCAAAATT 


GACCGAATAA 


TACGGAATCC 


CAATTTGATC 


1860 


CGCTACTTTT 


GCGACATCTT 


TATAATCCTC 


AGTTGCGGTA 


CAGACACCAT 


TTTCGTCTGT 


1920 


ATCGTCCCAA 


TTTTTCATAA 


AGATCCCAAT 


TACATCATAG 


CCTTGCTCTT 


TTAAAAGCAA 


1980 


CGCTGTGACA 


GATGAATCGA 


CACCGCCACT 


CATGCCTACC 


ACGACACGCG 


TTTTGCTGTT 


2040 


ATCTGCCATT 




L iCAAX i 




ATCATCCACG 


ATTTGAAGGT 


91 nn 
^ xuu 


CGTAACTTTT 


CAGTGCTTTT 


CATTATACTA 


TTTTTTTGAA 


ATAAGTAAAG 


CAAGCCGGGC 


2160 


GTATAATGAG 


AAAACACTTG 


GAACAAAGAC 


ACAGCCAACA 


AGCTATTGCT 


TTCAAATAAA 


2220 


CAGAAAACcT 


QoAGGAACAA 


GCTACTTTCG 


CTTGTkTCTT 


CAGGTTTCTC 


TAAATTATTC 


2280 


TACTTTCT 












2288 



(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6691 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 



TTTAATTTTT 


GGGAAaAAAT 


TwCCaAAGGC 


TAAGGaTaGk 


TAAAtTGgGG ACCwTATTTg 


60 


GAATaTTGaT 


TAAATTCTGA 


CCACTTAATT 


GtCTTTkGAA 


AAggTTATGC 


TAAAGTTACT 


120 


TTGTAATcGA 


ATGaGGAGGG 


CTATCACATG 


GaAGAAAAGA 


ATTGGCCTGA 


GTTATTTGCT 


180 


CGTTATCTTA 


TTGTCGAAAG 


AGGCTATTCA 


GAAAAAACAA 


AAAAAGCATA 


TCAAGAAGAT 


240 


ATACAACATT 


TTTTTTCGTT 


TTTAAAGACT 


TCAGGTAATG 


ACAATTATTT 


GACTGTGGAA 


300 


CATTTGGATG 


TTCGGGCTTA 


TTTGAGCGAA 


TTATATGATC 


AGGAATATAG 


CCGCAATTCG 


360 


ATTAGTCGAA 


AAATTGCCAG 


TTTGCGTTCT 


TTTTATCAAT 


TCTTATTAAA 


AAATGAAGCA 


420 


ATCCAAGAGA 


ATCCTTTTTC 


CTATGTTCAT 


ATGAAAAAGA 


AACAGTTACG 


GTTGCCCCGT 


480 


TTTTTTTATG 


AAAAAGAAAT 


GGATGCATTA 


TTTGAGAGCG 


CACAAGGGGA 


GCAACCGCTT 


540 


GATTTGAGAA 


ATCAAGCATT 


ACTAGAAGTT 


CTGTATGGGA 


CTGGTATTCG 


GGTCAGTGAA 


600 


TGTGCGAACT 


TAACG6TGGA 


CGCAGTTGAC 


TTTCAAGCTT 


CTGTTTTATT 


GGTTCACGGA 


660 


AAAGGAAATA 


AAGACCGTTA 


TGTGCCTTTT 


GGTTCATTTG 


CACAAGACGC 


CTTAAAGGAT 


720 


TACTTAGAGA 


ATGGCCGCGC 


GCTATTAATG 


ACAAAATATC 


AAAAAAAACA 


TCCGTATGTC 


780 


TTTGTTAATC 


ATCATGGTGA 


ACAAATCACG 


CCAACAGGGA 


TTGAATATGT 


CTTAAACCAA 


840 


CTAATTAAAA 


AAAGTTCATT 


AAATGCGGAA 


ATTCATCCCC 


ACATGTTACG 


CCATACATTT 


900 
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GCGACACATT 


TATTAAATAA 


TGGCGCCGAT 


ATGCGGACAG 


TTCAAGAATT 


ATTAGGACAT 


960 


GCTAATTTAT 


CAACAACTCA GATATATGCG 


CATGTGACAA AAGAAAGTTT ACAAAAAAAT 


1020 


TATCGAACGT 


TTCATCCTCG 


TGCATAATGG 


AGGTAAACGT 


CTAGGAGGAA 


CAATAATGGT 


1080 


TGAATCACAA 


TTTCACTCTA 


CAACAATTTG 


TGCAGTTGAA AAAGATGGAA AATTTGCAAT 


1140 


GGCTGGTGAC 


GGCCAAGTAA 


CAATGGGCGA 


ATCTGTCGTT 


ATGAAAGGTA CGGCCAAAAA 


1200 


AGTGCGCCGT 


ATTTACAATG 


ATGAAGTCGT 


TGTCGGTTTT 


GCAGGAAGTG 


TGGCAGATGC 


1260 


CTTTACATTA 


GAAGAAAAGT 


TTGAAGGAAA 


ATTAAATGAA 


TACAACGGAA 


ACTTAACGAG 


1320 


AGCTGCGGTT 


GAGTTArCCC 


AAGAGTGGCG 


TACACAACAG 


TCTATGCAAA 


AATTAGAAGC 


1380 


TATGCTGATT 


GTGATGAATA 


AGGAAGAAAT 


GCTCTTAGTT 


TCAGGAACTG 


GGGAAGTCAT 


1440 


CACACCAGAC 


GATGGTATTT 


TAGCAATTGG 


tTCAGGCGGT 


AATTTTGCTT 


TGTCCGCTGC 


1500 


AAGAgCCATG AAAAACTTTG 


GCGATAAAGA AATGCCmGCG 


AAAGAGATTG 


CTAAAAATGC 


1560 


CTTAAATATT 


GCTGCAGATA 


TCTGTGTATT 


TACGAATCAT 


AATATCATTG 


TAGAAGAATT 


1620 


ATAAGCGAGG 


GAAAATCCAT 


GAATGAATTA 


AACAAAACAC 


CAAAAGAAAT 


AGTCAAAGAA 


1680 


CTAGATCAAT 


ATATTGTTGG 


ACAGCAAGCT 


GCCAAAAAAT 


CAGTGGCGGT 


AGCTTTACGT 


1740 


AACCGCTATC 


GTCGCTTGCA ATTAGAAGAA AATATGCAAC 


AAGATATTAC 


GCCTAAAAAC 


1800 


TTACTAATGA 


TTGGACCAAC 


AGGTGTTGGT 


AAAACTGAGA 


TTGCTCGTCG 


TCTCGCAAAA 


1860 


ATTGTCAATG 


CGCCTTTTGT 


AAAAGTTGAA 


GCAACCAAAT 


TTACAGAAGT 


AGGCTATGTT 


1920 


GGTCGGGATG 


TCGAATCAAT 


GGTGCGGGAT 


TTAGTAGAAA ATGCGATCCA AaTTGTTGAA 


1980 


AAACAACAAT 


ACAGTCGCGT 


GTATGCGCAA 


cATTATUUW^ 


GGCCAATCAA 


CGTTTAGTCa 


2040 


AAGTATTAGT 


GCCTGGAATT 


AAAAAAGAAC 


AAAAACAGGC 


TGGCGGTAaT 


CAATTTGAaC 


2100 


AAaTGATGAA 


CATGTTTAAC 


ATGGCTCAGC 


AACAGCAAGA 


AGCACAAGAA 


GAAGTAACGG 


2160 


AAGATATTCG 


AACGAATCGC 


CGAACAATTT 


TGGAACAGCT 


GGAAAAAGGT 


CTATTAGATA 


2220 


ATCGTGAAGT 


AACAATTGAA 


ATTGAAGAAC 


CGAAGAAAAC 


GATGCCAGCT 


ATGAACAATG 


2280 


GCCTAGAACA AATGGGTATC 


GACTTAAATG 


AAACGTTAGG 


CGCTCTGTCA 


CCAAAGAAAA 


2340 


AAATCGAACG 


TACTGTAACG 


GTGAAAGAAG 


CACAAGAATT 


ATTAGTGAAA 


GAAGAATCAG 


2400 


CAAAAATTGT 


CAATGACGCT 


GATATTCATA 


GTGAAGCTAT 


TCGTTTAGCT 


GAATCAAGCG 


2460 


GAATTATTTT 


TATCGATGAG ATTGATAAAA TCACCTCTAA AAGTCAACAA AATTCGGGCG 


2520 


AAGTCTCTCG 


TGAAGGAGTA 


CAAAGAGATA TTTTGCCGAT 


TGTTGAAGGC 


TCCCAAGTTA 


2580 


ACACGAAGTA 


TGGTCCTTTA 


CAAACGGATC 


ACATTTTATT 


TATCGCTTCA GGTGCTTTCC 


2640 


ACTTGTCAAA ACCAAGTGAC 


TTGATTCCAG 


AATTACAAGG 


CCGCTTCCCA 


ATTCGAGTTG 


2700 


AATTAGATGA 


TTTAACGGCG 


GATGAtTCGT 


AAGTATCTTA ACTGAGCCAA ACAATGCTTT 


2760 


AATTAAACAA 


TATGTAGCAT 


TAATTGGCAC AGAAAATGTT 


TCAGTCATCT 


TTACAAAAGA 


2820 


AGCAATTGAA 


CGGTTAGCAC 


ACATCGCTTA 


TGATGTAAAC 


CGTGATACAG 


ATAATATTGG 


2880 
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GGCGCGTCGT 


TTACACACAA 


TTTTAGAGCG 


TTTATTAGAA 


GATTTATTAT 


ATGAAGCACC 


2940 


AGATATGCaA 


ATGGGTGAAA 


TTACGATTAC 


CGAAGCATAT 


GTCAATGAAA 


AATTGAATGA 


3000 


CATTGTTCAA 


AATGAAGATT 


TAAGTCGTTA 


CATCTTATAA 


AAAAGGCTGG 


AGGAGAAAAG 


3060 


ATGGCTACAT 


TATTAGAAAA 


AACACGTCAA 


GTGAACGAGC 


TTTTGCAAAA 


AAACAATCTG 


3120 


TTTGACGTAC 


AAGCAGAACT 


TCCGTACAAC 


AAAATGGCAA 


TGATTTTAGG 


GGATATTTTA 


3180 


GAAAGTAATG 


CGTATATTAT 


TAGCAGTTCA 


GGGGATTTGT 


TGGGGTATAC 


TGAAAAATTA 


3240 


GATGTTAATA 


ATGCCCGTAT 


CAAAAATATG 


TTTAAAGAGA 


AAAAATTTCC 


TCAAGGATAC 


3300 


ACCGAGGCCG 


TGGATATGCT 


CAAAGTAACC 


GAAGCCAATA 


TTCCAATAGA 


TAGTGATTTA 


3360 


ACTGCATTTC 


CTTTTGAATC 


ACGAGAGCTG 


TATCCGTTTG 


GTTTAACCAC 


TATTGTGCCT 


3420 


TTGTACGGTG 


CTGGCAAACG 


CTTGGGTACG 


ATTATTTTAG 


CGCGTGTTGA 


AAAATCCTTT 


3480 


AACGAGGATG 


ATTTAGTGTT 


AGCTGAATAT 


AGTGCGACCG 


TTGTCGGCAT 


GCAAATTTTG 


3540 


TATCATCAAT 


CCAGAACAAT 


TGAAGCAGAA 


GTTCGCAGTG 


CGACTGCCGT 


GCAAATGGCC 


3600 


ATCAACACTC 


TTTCCTATAG 


TGAATTAAAA 


GCCGTCCATG 


CCATTTTTGA 


AGCCTTAGAT 


3660 


GGCGAAGAAG 


GTCGTTTAAC 


TGCTTCAAGT 


ATTGCAGATG 


AAATAGGCAT 


TACTCGTTCC 


3720 


GTAATTGTGA 


ATGCCTTAAG 


AAAATTAGAA 


TCAGCTGGAA 


TTATTGAATC 


ACGTTCTCTT 


3780 


GGTATGAAA6 


GGACTTATTT 


AAAAGTATTA 


AATCAACAAT 


TTATTAAAGA 


GTTGGAAAAA 


3840 


TAAGCAATTG 


CTCGTATCAA 


CTAGGAGGAC 


AACGCAATGA 


CTGTACAAAT 


TGAAAATGAA 


3900 


TTTCTAATCG 


CAACATTTGC 


TGAAGAAGGA 


GCAGAATTAG 


TAAGTTTACA 


ATCAAAAGAA 


3960 


ACCGGCATTG 


AATATATTTG 


GCAAGGAAAT 


CCTGAATTTT 


GGGCCCGCCA 


TGCGCCCGTA 


4020 


CTTTTTCCGA 


TTGTCGGTCG 


TCTAAAAGAA 


GATACGTATA 


TGTACCAAAA 


TCAAGCGTAT 


4080 


CATTTAACTC 


AACATGGTTT 


TGCTCGTGAT 


CAAGTATTTG 


ATGTGATTGA 


AAAAGGTGGA 


4140 


GAAGAAGTTT 


CTTTTTCACT 


AAAATCAACA 


AAAGAAACCA 


AGAAGAAGTA 


TCCTTTTGAT 


4200 


TTTGT^CTAG 


TGATTACTTA 


TACATTAGAA 


CATCAAGAAT 


TGACTGTAAA 


CTACCAAGTA 


4260 


GAATACACAG 


GGAAAGAAGA 


GATGTATTTT 


GGGATTGGTG 


GTCATCCAGC 


CTTTAATGTA 


4320 


CCTTTAGAAT 


CTTCACTAAC 


GTTTGAAGAT 


TATTATCTAA 


GTTTTTCACC 


TAAAAAATCA 


4380 


CGGACACAAA 


TTCCCTTAGC 


AGGCCCGTTT 


ATTGATTTAG 


CGAATAAAAC 


GCTAGCCCAA 


4440 


ACGAATACCA 


GCTTTGACTT 


AACGCATCAA 


TTATTTGAAA 


ATGATGCAAT 


GATTTTTGAA 


4500 


ACAAAAGGTC 


AAACAGCAAT 


TACGATTGCC 


ACAGACGAAA 


GTGACCATCA 


AGTCACTTTG 


4560 


AGTTATCCAG 


AAATGCCTTT 


TGTGGGGATT 


TGGTCACCGA 


CACCGAAAGA 


AGCGCCTTTT 


4620 


GTCTGTATCG 


AACCTTGGTG 


CGGGATTGCT 


GATGCTGTCG 


ACGCAACTGG 


CCAACTTGCT 


4680 


GaAAAAtTTG 


GCATCaATAA 


GTTmCCAGCT 


AACGAACTAT 


TTAAAmCACA 


GTATATGATT 


4740 


AGTGTAAAAT 


AAAGAAAAAG 


GCTGTGAACA 


TTATTTGTTC 


ACAGCCTTTT 


TCTTTATTTT 


4800 


TTTTGCACGT 


AAGCCAAAAG 


AAAGACGACT 


TTCTGTTCCA 


TTTTTAATTC 


GCTGAATGTT 


4860 
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TTCACGATGA 


CGGACAAAGA 


TAAAA6TAGT 


CAAAGCAATC 


GCAATCACGG 


TTAATAACCA 


4920 


GTTAAATGTT 


GGTAAAATAG 


CTGGGACAGT 


GAAAGGGAAA 


ATAATAGTAG 


AAAGTGTAAT 


4980 


TAGTACAGCG 


CTAATCATAC 


TTGTTAAACT 


CACCATACTG 


GTTAGATACA 


GACAAATCAC 


5040 


AAAAATAAGA 


GCAGAATAAA 


TAAAAAATGT 


GGGGCTGTAT 


GCTAATAACA 


TGGCAGCGCT 


5100 


AGTGGCTACG 


GCTTTACCAC 


CTTTGAAATT 


GGCAAAAATA 


GGGAAGGTAT 


GCCCTAAAAC 


5160 


AGCTGCTACC 


CCAAAGAAGA 


GCGGATTCAC 


GCCTTGTAAA 


CCA7VACAAAT 


AGGGTAATGA 


5220 


AGTGGCTAAC 


GTTCCTTTCA 


AGATATCCAT 


TAATAATACC 


GTAATTCCGG 


CAGGTTTCCC 


5280 


TAAGACACGA 


AATGTATTGG 


TTGTTCCTGT 


ATTCCCACTC 


CCAAATTGGC 


GTATATCTTT 


5340 


TTTAAAGAAA 


AGTTTACCAA 


TCCAAACACC 


TGAGGGAATC 


GAACCTAATA 


AATAGGCAAC 


5400 


AAGTAACftAA 


ATGACGATTT 


TCATCAAGTT 


AACCTCTTTC 


TTATTCAAAC 


TAGGATTTAT 


5460 


TTTATCATGA 


ATAGAAAAAC 


GGAACAACGT 


CTAATAACTA 


AAACAAGAAA 


AGTTTTTCTT 


5520 


TTGCTTGAGG 


AAGCGTTGAC 


AAATGGTTGA 


TAACCGTTAT 


AATAAATGTG 


TGTCATTAGA 


5580 


CAAAGACGTC 


TAATCACTGG 


TTTGAaAATT 


AGTTATCAAC 


GAAGATACAA 


TATATATGTA 


5640 


CTTATGAAGT 


GTCAGGGAAA 


GAGGTGCGAA 


TCCTCTACAG 


ACCTACCTAC 


TGTATGGTGG 


5700 


ATGAAACCAA 


TAAGACCACA 


GATTATTCTG 


GAAGGATTGG 


GAGTAAGAAG 


AAGCTAAGTC 


5760 


AGGATACCGG 


CTTGATAAGT 


CTAATCATTC 


TTTTGTAGGA 


AAAAGAAGGG 


GCTTTGAAGA 


5820 


TAACTGGCTG 


AGTATATCAT 


TAGCATAGTT 


AAGTGACTGC 


TTTTTTTCTA 


TTGGAGAAAA 


5880 


AAGTGGTTTT 


TTTGTATTGT 


TTTGACGTTG 


AGACAAAGGA 


GGTTCATTTC 


AGAAAATTTT 


5940 


CCCCAAAATA 


AAATAGACGA 


ATGCGAGGAT 


GAAAAAATGA 


AAAAATTTAC 


TTTAACAATG 


6000 


ATGACTTTAG 


GTTTAGTAGC 


AACACTTGGC 


TTAGCAGGAT 


GTGGTAAACA 


GGAAAAG/U^ 


6060 


GCAACTACCT 


CTTCTGAAAA 


AACAGAAGTA 


ACGTTACCAA 


CCAAAGACCG 


TAGCGGCAAA 


6120 


GAAATTACTT 


TACCCAAAGA 


AGCAACCAAA 


ATTATTTCCC 


TAGTGCCATC 


AACAACAGAA 


6180 


GTGATTGAAG 


ACTTAGGTAA 


AACCGACCAA 


TTAATCGCAG 


TTGATACTCA 


AAGTAGTACA 


6240 


ATGATGACTG 


ATTTAAAAAA 


ATTACCACAA 


ATGGATATGA 


TGGCTGTCGA 


TGCCGAAAAA 


6300 


TTGATTGCCT 


TGAAACCACA 


AATTGTTTAT 


GTGAATGACA 


TCAATTTAGC 


TAGCTCAGAA 


6360 


AGTGTTTGGA 


AGCAAGTGGA 


AGATGCTGGA 


ATTACAGTCG 


TTAATATCCC 


CACTAGTACA 


6420 


AGCATCAAAG 


CAATCAAAGA 


AGACGTCCAA 


TTCATCGCTG 


ATAGCTTATC 


TGAACATGAA 


6480 


AAAGGACAAA 


AGTTAATCAA 


AACAATGGAT 


CAAGAAATCG 


ACGAgTAGCG 


AAAATTGGTA 


6540 


AAACTATTAA 


AArACCAAAA 


ACTGTTTTGT 


TTGAAGTGGC 


TGCCTTACCA 


GACATTTATA 


6600 


GTTTTGGTAA 


TGGGACATTT 


TTAAATGAAT 


GGATTGAACC 


AATTGGGGCA 


AAAAATGTTT 


6660 


TGGCCAATGA 


AAAGGCTGGT 


TnCCAGTGAC 


A 






6691 



(2) INFORMATION FOR SEQ ID NO: 89: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20072 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



PCT/US98/0898S 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

TTAnGGTAnC CAATCGTTCC CCGGTAAATT TGGACAGTTT TAATCTGTTC CACAAACGAT 60 

nAACGGCTCC GTTTGTTAAT AACTGCCGTT CTAACATCTT TTCTT^TTG TGAAACAATT 120 

TCTTGTTTGG CATAGGGCGG ATCCAATAAG ACCAAATCAA ATTGTAACTT CTCTTCATAA 180 

AACTGTTCTA AGGCACGATT AGCATCCATT TTTCTAACTT CAAATTTTTC TGGTTCTTTT 240 

GTAATCGCAA TATTTTCTTT AATTACTTTT AGTGCAGCAA AATTCTTTTC AATACAAATA 300 

CTTTTGTCCA TTCCACGAGA AACGGCTTCA ATCGCTAACC CGCCACTTCC TGAATATAAA 360 

TCCAGCGCCA TTCCGCCATC GAAATATGGA CCAATCATAT TAAAAATAGA TTCTTTTACT 420 

TTATCTGTTG TGGGCCTAGT ATTGTCGCCA TCTAACGCTT TCAAGCGCCG GCCACCATAT 480 

TCTCCAGAAA TAACACGCAT TGTTTCGACT CCTTTATCAA TCATTCTTTT ATAATTCATT 540 

ATAACAAAAA GAGTGAAAAA TGTTTGAAAT TTCTCATAAC ATTTCTCACT CTTTTGCATT 600 

AGACATTCTC TGCTAAGCGA ATCTCTGTAG ATAATTCCTC TACTACATAG CCGTCGTCTT 660 

CTTGTTCTTG ATAAGCCGCT TTGGTACCAA TTTTTTCAGC AAAATTCATT TCGACATCTG 720 

GTCGATACGA TTTATCGACA CTACGAACAA AATGTAACTG ATTAATTTTT TCCATGTTAC 780 

TTTCAATATT TTCTTCATTC ATGTAAATCA CCACATATTT CATTTTTCGT GAAACATAGT 840 

GAATCAGTCC ATATCGGCGT AACTGTTTTA ACGGTTTTAA TGTGTATACC CAAACAATTA 900 

GGCAAC6ACG CTCTTGAAGC GTAAATTCTT TTTCTTCTTT AGCTTGCATG GCAGCTTCCT 960 

CCACAACCTG AATGTTTCTT TCCTCGTTCA AAGAAAGGAT TTCCAGCATC TACTTTAATA 1020 

TCTTCAGAAA TTAGTTGCGC AATCTTTAAA CCAATTGTAT CTAGCAACGT TTGAACGTTT 1080 

GTCTCAGCAA AACGATAATT TGCAACGATT TCATTTAAAT CAAGCGCCCG CTTCGCTTTA 1140 

CGTAGGGCTC GTTGTTTCTC CCGAAAATCG GGTGCATGAA TTCCATAAGC CTCTACTCGT 1200 

TCAAAGGCTG ACTTAGCTTC TAAAAATGCT TTTTGTAACT GAGCAACTTC CGCCGAAGCA 1260 

TACATAGCTT TTTTTGTTTC TTTATAGTTT TGAACAGCAT CACTTTCTAA AAGTGCTGAA 1320 

ATTAAACATT CTGTTTGATC TTCAATTGCA AATAAATGTT CTGTTACAAT CATAACTTTT 1380 

CACTACTCTC CGTGATTTGC CTATTTTCTG GTCGTTTCTC TAATAAGACT AACACACCTT 1440 

CATTGGAAAA GGCAACGCCC AATTCTGTCG TTTCTGACTC AGCAAAATAG CTATCATATT 1500 

GCGGGTCCGA AAACCATGaC AAAATAGTAA AGGTACTATC ATAAACAGGT TCTTGGAAAA 1560 

TACCtGcTGG * TTCTCTCAAC GAAAGTTkGC TCGTTTtAAC TAATAAAGrA AATTCGkCCT 1620 

TAGTCmAATG AAaGGGATGA ATGGCTGTAT GTTCCTCTCT CGTTTGTAAC AATTCTGCTT 1680 

GTCGATTTGC CGACAAAATT GTTGGTGTAT GGTTAGCCAT CGTTTGATAC AATGTTTGGG 1740 
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cATCTTTTTG 


GCTTTGTGTA 


GAAACTGAAA 


AAGCAACTTG 


ACTTTCTTGc 


TGTTTAACCA 


1800 


GATTCATGAC 


ATAAATCGCT 


TGGTTACTTT 


GAACAGGATC 


GAAGGTCATT 


GACTGCGTTA 


1860 


TATCAACTGG 


CAAAGGAGAA 


CCTTCATTTA 


ATTGATACGG 


CATTTCCGTG 


AGCAACGTTT 


1920 


CTTTATTGAC 


ATAATTAACT 


6CCATTAAAC 


CGCCTGTCCG 


CTGATTAAAA 


AATAACACCG 


1980 


CAAAACTGCC 


ATTATCAAAA 


GCAACCAGTG 


GTCGATAGTT 


CATATCTTCT 


TCCATCAGCT 


2040 


CGACATCGTA 


AGCTTCATGA 


TGAAAATTAA 


ATGCAAAATT 


CGAATAAATC 


GTCATTAAAT 


2100 


CAGACAAGTC 


AGCAAGCTTC 


ATATCTATTT 


TAAACGGTTC 


TATCATTTTG 


TCATTGTTAA 


2160 


AGGCCTTAAT 


TGCGATGACT 


TTATTTTCTC 


GAACGTTGAC 


TTCTAAGTAA 


TCATTCATCG 


2220 


TCAAACCGAA 


CGTCCATAAT 


TCATACGTAC 


GACCTGTTTT 


AATTTTTTCT 


TTAGGATTGC 


2280 


CAAACTGGCG 


GATAAAGATT 


TCCACATCTT 


TCCCTATGTA 


TGCTGCGTAC 


CAGAAGCTGG 


2340 


CAATTCTTCA 


TGAGGAACAG 


CGGTGTGAGC 


GCTGGATAAG 


CGCCCACTTT 


GAGCTGTTTC 


2400 


GGTTGGTTTT 


TCAGCAGGGA 


AAAAAACTGG 


CTCTAAATAT 


CCAATGAGTA 


GAACTGTTAA 


2460 


AAAAACAGCA 


GTAAAGGCCC 


AAAATCGCTT 


CATTTTTTCC 


CCTCTCATCT 


TATTTTTCTT 


2520 


TCACTTCTAA 


TAACAAATCA 


CCTGAGCTGA 


TTGcTTCGCC 


TTCTTCAACA 


TAAATGTGAT 


2580 


CTACCGTGCC 


CGCAAAACGT 


GCTTCGATAG 


TCGTTTCCAT 


TTTCATGGCT 


TCTGTAATCA 


2640 


GCAACGGCTG 


ACCTTTTTCT 


ACTTTATCGC 


CACGTTTGAC 


CAATACTTGT 


AGAACAGAAC 


2700 


CAGACATCGT 


TGCGCCGATT 


TGTTCTTTAT 


TAGTTGGTTC 


CGCCTTTTGT 


TTTACTTGAA 


2760 


CAGCA6ACTT 


AATAGAGGCA 


TCTTTAACTA 


AAACCTCACG 


ACGTTGCCCA 


TTCAAATTGA 


2820 


AGAACAAGAC 


ACGGTTTCCG 


TCAATATCAG 


GTTCGCCAAT 


TTCATCTAAA 


CGAATAATTA 


2880 


ACGTTTTTCC 


TCGTTCAATT 


TGAACTTCCA 


ATGTCTCTCC 


TTGACGAATA 


CCATTAAAGA 


2940 


ATGTTGGCGT 


ATCTAATAAG 


GTTATATCTC 


CAAAGGTTTC 


GTATTTTTGA 


CGATACTCTA 


3000 


AAAATACTTG 


TGGATACATT 


AAATAACTTa 


AAACTTCTTC 


CAATTTAGGT 


TGaTACCCAA 


3060 


TTTTTTCGGC 


TAATTCTTCT 


TGTACTTTCG 


CAAAATCAAC 


AGGTGCTGcT 


AAGTCACCTG 


3120 


GcCGCTCTGT 


GAAGGCTGGT 


CGACCTTTTA 


GAATAATTCG 


TTGCAGCTCT 


TTCGGGAACC 


3180 


CACCAACTGG 


TTGACCTAAA 


TCACCTTGGA 


AAAAAGTCAC 


AACTGATTCA 


GGAAAACTTA 


3240 


GTTCTTCACC 


ACGCGCATAA 


ACATCTTGTT 


CTGTCAGATT 


ATTTTGAACC 


ATGAAGAGTG 


3300 


CCATGTCTCC 


CACGACTTTT 


GAAGATGGCG 


TCACTTTTAC 


AATATCGCCA 


AACATCAAAT 


3360 


TCACTGTGTG 


ATATATTTTT 


TTGATTTCAT 


CCCAACGGTG 


CCCTAAGCCT 


ACCGCTTTTG 


3420 


CTTGCTGCTG 


TAGATTAGAG 


TATTGCCCAC 


CAGGCATTTC 


ATGCATATAG 


ACTTCTGTTT 


3480 


GCGGGGCATT 


TAACCCATTT 


TCAAATGGTT 


GATAATACAT 


GCGCACATCT 


TCCCAATAAT 


3540 


GATTGATTTT 


CTGTGCATTA 


TCAATGTTAA 


TAGTTGGCGT 


CCGTTCACCA 


TTGACTAAAG 


3600 


CATAATATAG 


ACTGTTCATA 


CTTGGCTGGC 


TAGTTGCGCC 


ACTCATCGCA 


CTCATTGCGA 


3660 


CGTCAACAAT 


ATCAACGCCC 


GCTTTAGTAG 


CTGCTGAATA 


TGTGATGATC 


CCATTGCCAC 


3720 
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TAGTGTCGTG 


AGTGTGGAGA 


TGAATTGGTA 


AATCCGTGGC 


TQCCTTTAAT 


TCACTAATTA 


3780 


AACGATAAGC AGCTTGTGGT„ 


TTCAATAAGC 


CAGCCATATC 


TTTAATCGCA 


ATGATTTGTG 


3840 


CACCTAAATT 


TTCCAATTCT 


TTAGCCATAT 


CAAGGTAATA 


TTGAACATTA 


TATTTTGCTC 


3900 


GGGCTGGATC 


ATTGATGTCC 


CCAGTATAAC 


AAATTGCTGC 


TTGCGCAATT 


TTTCCGGTAT 


3960 


CCCGAACGAC 


TTGAATACTT 


TTTTCCATTT 


GAGGAATCCA 


GTTTAAGCTA 


TCAAAAATTC 


4020 


GGAAAACATC 


GACTCCTTGG 


CGAGCGGATT 


CTTTAATAAA 


TTCTTCAATA 


ACATTATCAG 


4080 


GATAATTTTG ATAGCCTACC 


GCATTAGATC 


CTCTGAACAG 


CATTTGTAAA AGTGTGTTTG 


4140 


GCATCAACTG ACGAATTTTT 


CTTAAACGTT 


GCCATGGGTC 


TTCGGTTAAG 


AAGCGATAGG 


4200 


CAACATCGAA 


CGTAGCGCCA 


CCCCACATTT 


CACTAGAAAA 


CAGCTCAGGC 


AGGGCTGCAT 


4260 


CAGTTAGACC 


AGCAATTGCT 


TTAAAATCTT 


GTGTTCTCAC 


ACGAGTAGCT 


AATAAACTTT 


4320 


GGTGCGCATC 


ACGGAACGTG 


GTATCTGTCA 


TTAACACACT 


TTCTTGATTC 


TTAACCCAAT 


4380 


CGATGACGGC 


GGTTGCTCCT 


TGAGCGTCTA 


GAATATTTTT 


AGCTGTAATC 


ACTTTTTCTG 


4440 


GAACTTCAAT 


ATCTGTCGGC 


ACGCGCGGGG 


CTTCAAAATA 


TTTTTTCTCC 


GTTCTTTCAA 


4500 


TGCCAGGGAA 


CCCATTGACC 


GTTACTTCTC 


CAATGTATTT 


CATCGTTTTA 


TTGCCACGAT 


4560 


CGCGCATACG 


AGGAAATTCA 


AATAATTCAG 


GTGTATTATC 


AATAAAGGTT 


GTTTTGGctT 


4620 


CGCCAGATTG 


AAACGCTGGA 


TAGCTCACAA 


CATTTTGTAA 


AAACGGAATA 


TTTGTTTTCA 


4680 


CGCCACGAAT 


CCGAAATTCT 


TTTAAGCAGC 


GTTGCATTTT 


ACTGATTGCC 


TGTTCAAAAG 


4740 


AGAAACCATG 


CGTACAGACT 


TTAACCAATA 


AAGAATCAAA 


GTAGGGAGTC ACAGCGTAgC 


4800 


CAGAATAAGC 


ATTCCCTACA 


TCTAAACGCA 


CACCAAAACC 


ACCTGGTGAA 


CGATACGTAT 


4860 


CGATTTTCCC 


TGTGTCTGGC 


ATAAACTGGT 


TCAGGGGATC 


TTCAGTCGTA 


ATCCGACATT 


4920 


GAATAGCCGC 


GCCTTTTAAT 


GTCAATTCGT 


TTTGTTTTGG 


CAAATGCATA 


TCTTTATGCA 


4980 


AATCAAGCCC 


TTGCGCAATT 


TGTAATTGAG 


AGATCACAAT 


ATCAATATCT 


6TAATCATTT 


5040 


CTGTGATAGT 


ATGTTCTACC 


TGAACACGAG 


GATTCACTTC 


AATAAAATAA 


AACTGATCCC 


5100 


CTTCTACTAA 


AAACTCAACC 


GTTCCCGCAT 


TCACGTAGCC 


GACATGTGCC 


ATTAACTGCA 


5160 


CAGCAGCCGA ACAAATGGCT 


GCTCGTTGTT 


-CTTCATTCAT 


TGATACACAT 


GGTGCGACTT 


5220 


CTACCACTTT 


TTGGTGGCGC 


CGTTGAACGG 


AACAATCACG 


TTCAAATAAA 


TGCAAGACGT 


5280 


TCCCATGATG 


ATCGCCTAAA 


ATTTGTACTT 


CGATATGTTT 


AGGATTAGAA ATATACTTTT 


5340 


CAACATAAAC 


CTCGTCAGAA 


CCAAAGGCTG 


CTTTCGCTTC 


ACTTTTTGCT 


CTTTCGTAAC 


5400 


CTTCTCGTGC 


TTCTTTGGCA 






GCGACCGCCG 


CCACCTAAAG 


5460 


CAGCTTTAAT 


CATGATAGGA 


AAGCCATGTG 


TTTCACCAAA 


AGCTACAACC 


TCTTCCACCG 


5520 


TTGCTACTGG 


• CCCGTCTGAG 


CCTGGAATCG 


AAGCAATCCC 


CGCAGCTACA 


GCTGCTTCTT 


5580 


TCGCTTTAAT 


TTTATCGCCA 


AAAATATCTA 


AATGATGTGT 


TTTAGGTCCG 


ACAAAAATGA 


5640 


TTCCTTCTTC 


TTCACAACGT 


TCGGCGAAcG 


TAAGTTCTCT 


GATAAAAAGC 


CATAACCAGG 


5700 
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ATGAATGGCA 'ICTGCTCCC3 ATTTTTT&SC AATTTGGATA .-"ATTTTCAA TGTCTAAATA 5760 

GGCTTCAATT GGTTTTTTCC CCTTACCAAC TMATACGCT rC^ATCTGCTT TGAAACGATG 5820 

AACAGAATAC TCATCTTCTG CAGCATAAAT C^ACTGTA C^GATATCTA ATTCTGTACA 5880 

AGCTCTAAAA ATTCGAATC3 CAATTTCGCC ACGATTGGCA ACTAATACCT TTTTCATAAC 5940 

GTCACTCCTA CCATCAATTT CCTTCAAATT T.-,TTGGTTCT SrXCTPJLAGA TAAATAGTAC 6000 

TCATTTTCTA ATTTCTGAC3 TG7TTCATCT GCACTGATAT TCAGGACAAA GGCCACAGCA 6060 

ATCGAAATAA 7CAGCA.-J^CT ATTCCCACCT TGACTCAftGA SITGGGAAAGT AATTCCTGTT 6120 

AATGGAATAA 7CCCAGTAAT CCCACCAAC?^ T7AATAAASA CTTGAATTAA TAACATTGTA 6180 

CCAATTCCAA TACACAXT5J; T3AATTAAAA GGTTTCTTCG AGCGTACACC GACTAAAATA 6240 

ATTCGCGCAA TCATAAACAT CVyC3UVT:CT A;AATCGCTA GACCGCCGAT AATGCCTAAC 6300 

TCTTCTAAAG TAATCGCA.--A GATAAAGTCC 6TATGTGCTT CTGGTPAAAA GCCTTTTTTC 6360 

TGAACACTAT TCCCTAGACC TTTGCCAAAC CAGCCGCCAT TACTGATTGC ATAATAAGAA 6420 

TTAGCTAACT GA73TCCTA?. GTTCCGTrCS. TCTAAAAATG GATTTTTAAA AACAGCAAAA 6480 

CGATTATAAA TATATTGATA ACGCGCTGGG AAAATTTTTC C7TTAGACAT AATTAATAAC 6540 

TGAATAGCCG TGATACIACC TAAAATGCCC A^GCCACCGA CCAAGTAGGT ATACATATAA 6600 

TTAATCCCAC TTGCTAAAAC CArCACGP-.TG GTAATTAATG T7AGAATGGC CGCATTCCCG 6660 

AAGTCAGGCT GGATTGCA.-r rAG^uGOL^TT AATACAAA^ CGAGCATCAA CGGGCGCCCG 6720 

GCAGCTTGTT TAAATTGATC CATGCCTCCL-- T7AATTGTCT TrTGTCGCCG TGCTAAAATA 6780 

TAAGACAGAT ACCAGACAAC CATAATITTT AAATACTCM CT3GTTGCAT CGAAAATCCA 6840 

CCAATCTCAA TCCAACCAC3 rSCSCCGTTA ATTTCTTTCC CSATACCAGG AATCCGAACG 6900 

GCTAAAACCA TAACCGrAA? TACCGCAAT3 GCAAACATGA TAAAACTTCG ATTCTGAAAA 6960 

ACGGATGTTT TCATCXrArA AATAAAAAAC A7TGCAACTA ACCCTAC7AC CCAAAAGGCA 7020 

ACCTGGTTAA TAACGAAtSA G37AGG73AA AAACC7T1CA 77AC7TGTAA 7GCGGATGTG 7080 

6AACTA7ATA CCATAA7CAA 7CCCAC(S.7A CTXAAAATTA AA7AXGGAA7 AAAAATACXA 7140 

TAG7CTAGCA AATGCXG7rT C777AC77TG TTTGGCA^AG C77AACCC7C C7TGCCAATC 7200 

TC6TTTTTTT CTTGTGTTTC TTCATAAACA TCTGTATATA ATTGATXCAA TTCGCGTTCT 7260 

AAA7CACTTA G7AAGCGA7G TCCTTGATCA TTAGTXAftAA TCCCCAAACG GGTCGCA7AG 7320 

G7CAC77GXC TXaAAAAACC ArACA777GT G7ATCCACCA C77C7TCAAA 7GCTTTACA7 7380 

7GAGAAATGC ACAAACTSTX TTTCTGArXA CGAATAAGCA TTXXAAXGCG TTC7GCATCG 7440 

TCT7CTAA7A ATXGAATGGC AAA^TXCTXT GrJUV7TCC77 CGXCCATTTG 7ACAGAACC7 7500 

CCC7A7CT7A AT7AAArAA7 7A7AACArAA T7C7CAG7TA GA7ACATGAT GAJcT7CAGAA 7560 

AATTTCCTTT AAGAAATTTT C7GAAAC7AA GCGATA7t77 AAG7AAGAAA 7CTC77TATA 7620 

TAACAAGCT7 ATXTTArCAC TAaAACAACA AATACATTAA AAGACX777T <7C77A7T7TC 7680 
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600 



TGCTTTTAAT 


TTTAAATAAA 


ATAACAAACT 


ATATCATTTT 


TTTGTCTCTT 


GATATTCTTT 


7740 


AATTACTTCILAAAAAGGCGG TGCCATATyT ATCTAATTTA TTTTGGCCGA 


CCCCTTTTAT 


7800 


TkGCAATAAT 


TGAATCGTTG 


TTTGCGGTAA 


TTTTTCACAC 


ATTTCTTTAA 


GTGTACTATC 


7860 


GGAAAAGACT 


ACATATGGCG 


GCACGCCTGC 


CTCTTGAGCC 


ATGTCCATAC 


GTAATTCGCG 


7920 


TAATCGTTCA 7\ACAAGGCAT 


CATCTACTGC 


TACTTTTCGA ACTTTTTGGT 


CTTCTTTTCG 


7980 


GAAGACTTTC 


CGTTCCCCTA AAAGCACCTG 


AACACCTTCT 


GTCGAGACTG 


ATAACAACGG 


8040 


AAATTGCCCA 


TCAGAAGGAA 


TTAAATAACG 


TTCTGCTGTC 


AAATAATCAA 


TCAGTTGTGT 


8100 


GACTTCTTTT 


TGCGTGCGAC 


CTTTCATTA6 


TCCGTATGTT 


GGCAAACGAT 


CAAAATGCCA 


8160 


TTGATCAATT 


TTTTGATCTT 


TTGAGCCAGT 


CAAGACTTTA 


CCAACTAAGC 


CCTTACCGAA 


8220 


ACGTTCTCCC ATACGTTTTA CACAAGACAA AACTTTTTGG GCATCGACCG TAATATCGAC 


8280 


TAATTCTCGA 


TTATCCAGAC 


AATTCGAGCA 


GCGACCGCAG 


TCCGTCCCTT 


CTTCACCAAA 


8340 


ATAGCGTAAA ATATATTTTT 


GTAAACACAT 


TTGTGCATTC 


GCATATTGTG 


ACATTTCCCG 


8400 


TAATTTCAAG 


TACTCTTTCT 


GCTTATAATC 


AATGGTCATT 


TCAGATTGTT 


CAATAAAGTA 


8460 


TTGTTGAATT 


TGCAAGTCTT 


GTGGAGCAAA 


CATTAACACA 


GCATCACTAG 


GCAGACCGTC 


8520 


ACGACCAGCT 


CGGCCGGCTT 


CTTGGTAATA 


AGATTCAATA 


TTCCCTGGTA 


CTTGTGCGTG 


8580 


AATGACAAAA 


CGAACATTAC 


TCTTATTAAT 


CCCCATACCA 


AAGGCATTTG 


TTGCTACCAT 


8640 


CACTTGCACT 


TGATCATACA AGAATGCTTC 


TTGATTTTCA 


CTACGTAACT 


GTTCACTCAT 


8700 


GCCCCCATGA 


TACATCCCAG 


CTGCAATTTT 


TTTACTTTCT 


AATAAATGAT 


AAATTCGTTC 


8760 


AACTTCCTTC 


CGCGTGCTTG 


CATAAATAAT 


ACCTGATTGT 


CCAGTATTCA 


TCTTTAAATA 


8820 


TTCCAAAAGA 


AAAACGTCTC 


TATTTTGATC 


TTTCACGACT 


TGAAAAGATA 


AATTTTCTCG 


8880 


AGCGAATCCT 


GTTTTGATTT 


CATTTTCTGA 


TGGAATTCTT 


AATTGCTTGA 


CAATATCCTC 


8940 


CGCTACTTGT 


GGCGTTGCTG 


TTGCTGTTAG 


TGCAATCACA 


GTTGGTTGCT 


GTTGAAATTG 


9000 


ATCGATAATC 


TCTGCTAAAC 


GCAAATAGCT 


TGGACGAAAA 


TCATGCCCCC 


ATTGTGAAAT 


9060 


ACAGTGTGCC 


TCATCTACTG 


CCAATAAATC 


AATCGGTACA 


TGTGTCAGCA 


TCTGTTGAAA 


9120 


ATCATAAGAT 


TCTAATCGCT 


CTGGCGCAAC 


ATACAATAGC 


TTGACTTCTT 


TGTTAACCGC 


9180 


TAATTGAATA 


CGATGATTCA 


TCTCTTGATA GGAAATGGTA 


CTATTGATAT 


ATGTAGCTGG 


9240 


AATCCCCATT 


AAATTCAGGG 


CGTCTACTTG 


ATCTTTCATT 


AAAGAAATTA 


ATGGTGAAAT 


9300 


CACCAATGTT 


AAATTATCCA 


GAAGTAAGGC 


TGGCAACTGA 


TAACAAATCG 


ATTTGCCTCC 


9360 


ACCTGTTGGC 


ATAATGCCTA AGACATTTTC 


TTGTCGCAAT 


ACATGACGAA 


TAATCGTTTC 


9420 


TTGTCCTGGC 


CGAAAATCAT 


CATAGCCAAA 


TGTATCTTTT 


AATAATTCTT 


GTAATGCACT 


9480 


CATTTATTGC 


- GCTCCAATCT 


TTGTTCTACA AGTCATTTTA 


ACACGCTAAT 


CTAACAGGAA 


9540 


CAAGGTGTGT 


TAAAACAAGA CACAATCGTG AAACACCAGT 


TAACGT6AAA 


CATCAAAAAG 


. 9600 


AATGAAAAAT 


TGTGAAAGCT TGCTATCAAA AGCTTTCAAA AATAGCGGCA TTTGCTTGTT 


9660 
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GTGGATAACG 
TTCGTTTTTG 
AATGTTTATT 
TCGTTTTTGT 
TCATCTTCAT 
ACTGATGTTT 
TTCACAGTTA 
GTCGTTGGTT 
6CTTTACCAG 
ATCATTGGTA 
AAGAATTCTG 
TGTCCGTTAC 
ATCACGCTAC 
TTAACTCCAT 
TCACGACGCA 
CAAGAATCTG 
AAACGTTCTT 
TTGTTAACTA 
TCTTGATTGT 
GCAATTAAAT 
AAAATTTTCG 
TCGCCGACTT 
TTGTAGTCTA 
GGCACATGTT 
TCATCTGATT 
TGATCGTCGT 
GGACGAGCAG 
AATGCTTTTT 
AAAACAACAC 
CCTGGTGTAT 
AAGATAGTAA 
TGTGTGTGTC 
CCATGGTCAA 



PCTAJS98/08985 



AGTAGTTATC 
CCATCAAAAA 
TGTCGAAAAA 
TAAGAATTTG 
TCAAGAATTC 
GGTCTTTCGT 
AATCGTTGTC 
CAACGAAAAC 
TATCGATTGA 
AGTATTGATC 
TTGAATAACC 
CGGCATTAAT 
CCATGTATTC 
CAATTTCACG 
TGTTTTCAAT 
GGCCAATTGG 
CGATTTTACG 
AGAAAGTCAT 
GAACATCTAC 
CGCCCGCTTT 
TTACACGGAA 
TCATTGTGCC 
GTAATGAGAC 
CAATAATTTT 
CACTTGAAGT 
CTGCACCTAA 
AAGGTTTGTC 
TCAATACGAA 
CGTCTACCAT 
CTAGAATGTT 
TTCCACGTTC 
CATCTAAAGT 
CGTGGGCGAT 



CACACATAAA 
AACCAACTAT 
TTATTCAGCT 
TTTTCTTAAA 
TAATGATTCT 
TGCCGAACGA 
GCGGTTGTTT 
TGTACCACGT 
AACTAGTGCA 
GAAGGTGTGG 
AATTAAGCCA 
CATATCTTGC 
TTCAGGTGTG 
TTCAATAACT 
TAAAATTGAT 
ATCAACACGT 
AGCGGTGATG 
TTGTAAGGTT 
AACTGTTTCC 
TGCTTCATCA 
ATTTTTCACG 
AC6GAACACA 
TTGGAATTGA 
ATCAAAAATT 
TCCGTTTAAA 
TTCGATGAAT 
AATTTTGTTA 
ACGTGTTTGA 
TTTCATGATA 
GATACGTGTA 
ACTTTCAAGT 
GTCAGATTGT 
AATTGCCACG 



601 
AAAATAACCT 

TTTCGTAG.TA 
TTTTTCTTTT 
CGGATGCTTT 
TCTAACGTTA 
ACGTTAGTCA 
TCCCCAACAA 
TCTTCAATAC 
CCTTGATGAC 
TTCATAATAC 
CGAGCTGGTG 
ATTTCGCCTT 
. TCAATTTGAA 
TCTGGACGAG 
AAATGCAATT 
AAAGATACGT 
TATTTTCCTT 
GGCTCATCAA 
CCAAC6AAAA 
ATTTCCACAC 
CTGCCATCTA 
CGGCCAATCC 
AGTGGTTCGT 
GGGGCCATTG 
GCAGAAGCAT 
AACTCTAAAA 
ACAACCACGA 
GGCATTGTTC 
CGTTCTACTT 
CCGTTATAAT 
GCATTGGAAT 
TTTAAAAGTT 
TTACGAATAT 



TATCTTAAAA 
AAACGAGCAT 
TCTTGCTTGC 
CTGGTGTTAC 
ATTTTTTCGG 
TTTGTTTAGC 
TCATTCCTTC 
TCATGATTGA 
GGCCACCAAT 
CATAACCACG 
CTAAGAAGAT 
TACGTAAACT 
CACGTTCAAA 
AAACTTGTAA 
CGCCACGACC 
CTGTTTGTAA 
CACGTCCCGC 
TGTGTAGAAT 
TGTCTTCCAT 
GTTGTAAGCC 
ATTTCATCAA 
CAATACGTCC 
CTGAATTGTC 
TTGGCTCTTG 
AAACAACTGG 
CTTCATCTAC 
TTGGTGTTAC 
CTTCATACGC 
CACCACCGAA 
CTACGGCTGT 
CCATTGCACG 
CATCTACTAA 
CATCTCTGTA 



TACGCATTAA 
AGTTGGTTTA 
TTTTTCGCGT 
TTCACAGTAT 
TTTTTTAATG 
TTTTGTGATG 
GTAAACTTCT 
GTAAGTCGTT 
CGTTCCTTGA 
AGTCATTGAT 
TAAACGCATT 
TAGTGATTCA 
TGGTTCACAT 
TTCATAGCCT 
TGATACAGTC 
CTCAGCCATT 
AAATGGAGAA 
TGGTAATGCT 
TCCAGAAACG 
AAAGAAACCT 
CGCAACTTGG 
AACGTAATCG 
AACTGGAGCT 
ATCTGCTGGA 
GAAATCTAAT 
TACGTGTTCA 
TTTTTGTTCT 
ATCGACAACT 
GTCCGCGTGT 
ATTTTTTGCT 
TTCTTGTAAT 
GGTTGTTTTA 
TTTCAATTTA 



9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
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602 

ATTGCTCCTT ATCTATATAC GCTTATGTTA CTTAAGTCCA TAAAmGCCGA CTGCTTATTA 
TmGCAAACTT TTTTCATTAA AAAAAGcTTT CGCCTGCAAA AATAATGAAA GTTTTAGAAA 
TTATTCGGCT TTTTCTGAAA ACAAGGACAT TTTTTtACAA ATTTtyCTCT TTTAAAAGAG 
GAAGAGCCTT ACCTTTCAGC AACGCTCTTC TTTGATTATT CCGATTCATT TAAATATCGG 
GTAAATACTT CATCATAGGT TTCAGGAGTT GCTGCAATAA AGTATTCACG ACCCGCAAAA 
GTTAATGGTT TTCCTGTAAT ACCAGAGTAT TTAAACCCAA ATTCTTCCAA AAGTACTAAG 
CCTGCTGCAT AATCCCAAGG ACTTAGATTA GATAAATAGC CATGATGGTT GCCTTTTAAC 
ATAGCAATAA TTTCCAAACC TGCACAACCG CTAATACGCA CGCCCATGCT TGCTTGTCCG 
ATTTCCGTTG CGTGATGAAT ATTTTTTCCA TGCATGTAAG CATTCATGCC CCATAATCCA 
TCTGCCAAGG CTTTCATAGT AGGCGCTTTT AATAATTGAT TATTCCGATA AACGCCAAGC 
CCTTTGCCGC CCCAATAAAG TTCTTCTCGC ATCACATCAT AGATAAAACC TAATTTACCA 
ATGCCGTCTT CATAAACAGC TAACATAATA CAAAAATTTT CTCTTTCCAT CACGAAATTC 
ATTGTGCCAT CAATCGGATC GATGATCCAC ACGCGACCAG CAAAACTTTT TAAAGTATTG 
TAGCCTTTTT CTTCCCCTAA AATTTGATCT TCGGGAAAAT TTGTTTGAAT TTTATTCATT 
AAAAATTCTT GGGTTTGTTC ATCCATATTT GTCACTAAAT CGGTGCGGCC ATTTTTCTGT 
TGAACGGTTA ACTCTGATTC CAAGTTCATT CGAATCACAT CGGCTGCTTC AAAAAGCCAA 
CTTTGAATTG TTTGAATAAA TTTTTGCGTT TCAGTCATCT AATCATCCCC CAGCTTTAAT 
TTACCGCTTC CCTGTTTiTT AGCTTGTTGA AGCGTACGAT ATAACGAATA TCCTGAAGCA 
TTTTCAAATT CACGACCTAA GCGTTTTTCT TCGCCGATAC TTTTAACGAC CGTTTTAAAT 
TGTTGATAAG TTGTTAAAAA TTTGTCAACA GGCAAGCCTG TTTCGTTGGC TTGCTCAACT 
GCTGTCCACA TATTAGkGAC AATCACCATT TCCtCTGTCG TCCAATCTAA ATCTAATGGA 
TATTGATAGT CTTTCATGTC TTCACCTCAT CTTCTCTAGT TTAGCATATT TTCATAAAAn 
AACATCTTAT TTACGTCGAT TTTTTAATGC ATTAAAGAGA GGATATTCTG CTGTTCCTTT 
GCGATACAAA CtTTGTTGTG AAGCATAAAT ACCGCTATTT CCTGAACACA TAAAACTGAT 
TAAGCAAACC ATGAAAAAAT AGACCGCCGC TTCACTACCA AATAGTTCAA TTCCCATAAT 
GAAACAAGCA ATCGGCGTAT TCGTGGCGCC AGAAAAAACA CCAATAAATC CTAAACCTGC 
TAAAAAAGGA ATTGATAGAT GTAACAAAGG GGCCAAAGAA CTCCCTAACG TAGCCCCAAT 
TTCAAACAAT GGCGTCACTT CTCCTCCTTG GAATCCGGCG CCCAAGGATA ACACCGTAAA 
GAATAATTTC CCAACAAAAT CAAAAGGCTG CGCATTACCT GCAAAAGCAT CTTCTAGTAA 
AGGTAAACTC AAGCCTAAAT AACGTTGGTT GTTTAAGACG AGCACAAACA GAACGACAAT 
GGCAGCGCCA'AGAAACGCAC GAAGGTAGAC ATTAGCAAAC CAATTTGCAT AGGTCTTTTT 
TAAAAAGACA ATGGAACGAC TAAAAACCCA ACCTGCCAAA CCAAAACAAA TCGATGCCAA 
AAATAATTTT ACAAATAGTT CAACCGACCA AGTCGGAATT TTGCCCATCG GATAATGCGT 



11700 

IIUQ 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 
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ATGTGTTACA 
GAAAAGT6CC 
CGCTAATGGT 
TTGTCGTTCC 
CATTTGCACC 
GCCAAACAAT 
TTCAATCACT 
AAAAACGACA 
TGTTACATAG 
AATTGCGGTA 
TTCTTTTTTC 
7VAAAGCCTAC 
TAAATAGTTC 
GCAaCTCTGG 
CCTTTGATAA 
GAAATAGTAA 
TTCAACTCGC 
GCCAAAACAA 
TGAAGCCCAG 
AAAAAATAAT 
GGTAAATACG 
CGTTTCACAT 
AACTCTTTCA 
GGCTGATAGC 
AGTCCGATGG 
TCAACATCTT 
GCAscACAGA 
AAAAAACCAT 
TCTGCATCTG 
GCATT6GGAA 
TTGGTTTGGT 
ACCAGTTTTT 
TAGCAGAAAA 



CCAAAACTTT 
TCTGCACGAA 
GTGCCAAAAA 
AAAGCGGACA 
GCCGTCCCTT 
GTTAAAGGAA 
AAGTTATTTC 
CCGCTAATGG 
GTGAGACTCT 
ATGAGCAACC 
ACTTTACATG 
CAAAGGGTAC 
GTGGTGAACA 
AAAGTGCAAT 
ACCGATAATA 
AAGGCAAAAT 
GATAATAGTT 
AACTAATGAC 
TTAAGACAAA 
AGATTCCACA 
CTAGACAATC 
CTCTCAGTGC 
GCATTAATTT 
CTAAGTGTTC 
TTTTTGTTGC 
GTGCAATAAA 
CTGTTGTATC 
CTGCATCTAC 
TTAATACAAT 
ATTGACGAGA 
CCGTTGATCC 
TTAATGTTTC 
ATTCTTCTGC 



CAGTTACAAA 
CTTTTCCAAC 
CAGAACTAAA 
GTTTAAAGAC 
CACGACCAAC 
TTAAGCGCAA 
CTCGGCTAGC 
GGAGGATAAA 
TTAAGAAAAA 
ATTTTGCCAT 
CTCCTTTACA 
ACTTCTCCCG 
CCATCACCAT 
AAAAAaTCaA 
AGCCATCACC 
AATCAGCCCC 
GATTGTAAAA 
AAGCGATAAC 
AACGAGCATC 
CTTCTGCCTT 
TAAATTTCCT 
TTCTTGAGCC 
AATGGTTAAC 
GGTAATCATT 
TCGACGCGCT 
CATAAAATTT 
TGTGAAAACA 
CACACCCACT 
TTGGAAAGCG 
CAGAACTTGG 
ATCATCAATC 
TCCGATGACT 
TACTTCGACT 



603 
ATTAGCAAAT 
CGCCAAAACT 
ACCTGCACTA 
TTTTCCTACA 
AGAACCACCA 
GGGAATATCT 
ATTTTTCCCA 
GAGTAACCAA 
GGCGGATAAC 
ATAACCGCCG 
ATTTTCTTAT 
TTTGkaGGCG 
GTTGCTATTA 
TTGCGTTCCG 
ACACTAACGA 
AGTTTTGCAG 
AAAATGCCGA 
GAAAAGCCTA 
CCCGCTACGA 
AGCGTTAATG 
TGAACCCAAC 
ATAAAGGCCT 
TCATAGTCAT 
GATAACCGAA 
TTTTGAATAA 
GTGACATCTT 
GCAGTTAGCT 
AAAACTTGAT 
TGGTTCAAAG 
ATT6GACCAG 
GCAATTATTT 
TTCTCCTCGT 
TCTCTTCTTG 



AAGCCTGCAA 
TCTAATCCGA 
ATTCCGCTAA 
GCATTTGCGA 
AATAAGTGCG 
TCTTCCCCGC 
AAACGCGTAT 
GGATTTGCTA 
GCCCCCATAA 
ATTGATAGTA 
CGCCAATCCT 
TCATTAATTT 
AACTAATwAa 
TTTTtATCCA 
ATAAAATTAA 
GTACTTTTAG 
AAACTAACGA 
ACCTATACAA 
GATTAATAAA 
ACTTTGATTT 
GACTTCGTTG 
CGTCAATATA 
CCAACAAAGC 
AAAACTGACC 
AATTATTAAT 
TCATTTTGAT 
CATTGACAAT 
CCCAAGAATA 
CGGCCCCTTT 
CAATACTGTC 
TTTTCTTAAT 
TTACACACGG 
TTTTTTTCTT 



AAAAACTTGG 
AAACCGTTCC 
TAATAATTAC 
TTGCGCCCCC 
TAGTAATCGT 
CATTGCCTTG 
AGAAATACGT 
AACGAACATC 
ATAAACXIAAT 
GTTGCTTTGT 
AGCGTACAGA 
ATTCGTCCmA 
ATwATACCaC 
CGTGCTATTA 
GTACATGTAG 
TTGAAAATTT 
AATACTGACG 
GTGTTGCAGT 
GGGTTGCGCT 
AATCACCCTG 
ACGAATAAAA 
AGCAATCGAT 
ATTTCCCCAA 
ATTGCCACCA ' 
CGTAAAAAAC 
CCTTGTTTGC 
ATTTTCACTA 
GTGTCCTTTT 
GCCTTGTTGC 
CATCATTGCT 
TGGTAAAGCA 
TACAAGAAAG 
GTTCGAAAAC 



13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 

14760 

14820 

14880 

14940 

15000 

15060 

15120 

15180 

15240 

15300 

15360 

15420 

15480 

15540 

15600 
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TGAGAAATAA ACTTGTAAAT AAATAGGTTA AGAAGATGAA CATAATTACT 


TTGTATAACT 


15660 


GATACAAATA AATCATGTAA 


TTTGACCTCT 


TTTAAAAAAA TAACGGTATA 


GAATGTAGGC 


15720 


AGTGACAATC 


GAAACTAGAA 


TAACGCCAAT 


TAAATCAACC ACAAAAAAGG 


CGCCTGTTGA 


15780 


AATTACCCCA AGTTTCGGTA AAGCTGATtG GCTCATtAAA TACCCCTTTC 


TCTCTATTTG 


15840 


TTTTTACCTVA CACACCACCA CGATGACCAT 


CACACACCCC CTATAATAAT 


ACTTTGGaGG 


15900 


ATTATAGTAC ACAGGTGGAT TATAGTACGT AAmTGGcGGA ATATAyCCTG GcCAATATGT 


15960 


TGGTGGTGGC 


GGATTATAAA 


CCGGTAwTTT 


TCGGTTTTCT CGAACAATTG 


cAATTGTTTG 


16020 


ACCCGCATGA 


TTTTGCGTCA 


GACTGACGCG 


TGTTGGCGTA GTATTTCGTG AATAACCAGT 


16080 


CGGCGCTTGA ACTTCTCTAA TTTCATATAC CCGGTTGACA GATAGCCCAC 


TAATAGTGAC 


16140 


TGTACCATCC 


CAACCTGTTG 


TTACTCTTTG 


TGTATAGCCA TTAGTTGTAT 


CTTTCCATTC 


16200 


AAAAACAGCG 


CCCGCTAAAC 


GTGCTTTTGT 


TTCAACATCT TGTTTAATAA 


TTCTGATAGC 


16260 


ACCATAGTTT 


GCTTTATTGT 


TTAACACTAA 


CGTATACGTT TTTGTGCCTT 


GACCTCGTAA 


16320 


ATCAATCCGA 


TGGACCCTAC 


TATCCAACGC 


ATACCCAGTT GGCGCTTGTA 


TCTCGGTGAT 


16380 


TTCATAAATA 


CGATCCGACG 


ATAATCCTGG 


AATCAGTACT TCACCATTTC 


CATTTGCTGT 


16440 


AAATGTTGCC 


GTATAGTTAT 


TGGTAGTATC 


GCGAAATCTG AATTTCGCTC 


CTGCTAAACG 


16500 


TTTTCTTGTA 


TCCGTATCTT 


GCTTAATAAT 


TTTGATGGCG CCTTTTTGTG 


CTTGATTATT 


16560 


TAAGACTAAT 


GTATAGGTTT 


TACCACCTTG 


TCCACGTAAA TCTACTTTAT 


GAACACGTCT 


16620 


ATCTATGATA 


TAACCATTCG 


GCGCTTTGAT 


TTCTGTGATC CCATAGATAC 


GATCACCGGC 


16680 


TAATCCAGTA ATCAGTACTT 


CACCGTTCCT 


ATTCGCTGTA AACGTTGCAG 


TATAATTATT 


16740 


GGTTGTATCT 


CGGAATTTAA ATTGTGCGCC 


TGCTAAACGT TGTTTCGTAT 


ATCCATCTTG 


16800 


CTTAACAATT 


TTGATGGCAC 


CTTTTTGTGC 


TTGATTATTT AAGACTAATG 


TATAGGTTTT 


16860 


ACCACCTTGT 


CCACGTAAAT 


CTACTTTATG 


AACACGTCTA TCTATGATAT 


AGCCATTCGG 


16920 


TGCTTTTATT 


TCTGTGACCT 


CATAGATACG 


ATCACCGGCT AATCCAGTAA 


TCAGTACTTC 


16980 


ACCGTTCCTA TTCGCTGTAA ACGTTGCAGT ATAATTATTG GTTGTAtCTC GGAATTTAAA 


17040 


TTGTGCGCCT 


GCTAAACGTT 


GTTTCGTATA 


TCCATCTTGC TTAACAATTT 


TGATGGCGCC 


17100 


TTTTTGAGCT 


TGATTATTTT 


TGGTGAACGT 


GTAGGTTTTG CCAGCATAAT 


TCGCAGGTAA 


17160 


AAGAATTTTA 


TGCACCGTTC 


TGTCAAGTAT 


ATAACCATTT GGTGCTTTTA 


TTTCTGTAAT 


17220 


TTCATAAATC 


CGGTTGACGG 


AAAAATTATA 


CAAAGTTGCT TCACCATTAG 


CATTAGCAGT 


17280 


TACTACGCCT 


GTATTCCCAT 


TCACTGTGTC 


ACGCCATCTA AATTGTGCGC 


CTGCTAAACG 


17340 


TTGTTTTGaG 


TAACCATCTT 


GTTTAATTAA 


TTTTAAATTG CCACGTAGCG 


CAACATTTTC 


17400 


AATGGTATAA 


TAACCAATCT 


TGTCAGCAAA ATTAGTCGTT AGGGTAACTC 


GGTAAGTCTT 


17460 


GTTATTTAAG 


ATATA6CCGT 


TTGGTGCTTT 


AATTTCTGTA ATTTCGTATA 


CACGATTGAC 


17520 


TGGCAAATTC 


GTTATCAGGG 


CTTGTCCGTT 


TTTGTCAGCT ACAACTGTTC 


CTTCTTTTCC 


17580 
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TGTCACCGTA 


TCACGCCATT TGAACTCTGC ACCTGCTAGA CGTTTTCTTG AATCTTTGTC 


17640 


TACTTTGAT-T 


ATTGC-TAGAC TACCTTTTTG CGCTGTATTT TCAATGGTAT AATAACCAAT 


17700 


TTTATCwGCA AAATTAGTCG TTAAGATAAC TCGGTAAGTC TTGTTATTTA AGATATAGCC 


17760 


rTTTGrTGCT TTAATTTCyG TAATTTCGTA TACACGATTG ACTGGCAAAT ctGTTATCAG 


17820 


JcGCTTGwCCr 


TTTTTGTCAG CTACGACTGT TCCTTCTTTT CCTGTCACCG TATCACGCCA 


17880 


TTTAAACTCC 


GCACCTGCTA GACGTTTTCT TGAATCTTTG TCTACTTTGA TTATTGCTAG 


17940 


ACTACCTTTT 


TGCGCTGTAT TTTCAATGAT TACTGTCACT ACTTTATTAG CTTGTGCAGT 


18000 


TGTTAGAGTG 


ACTTTATGCA CAGTTTTATC TAATACATAC CCCACTGGTG CTTTCGTTTC 


18060 


TGTGATTTCA 


TACGTACGAT TGACTGATAA GTTGGGAATT GTGATGGTTC CATCGGTTCC 


18120 


AACTGTCACT 


rTTCCTGTTT TTCCTGTCAC TGTATCyTTC CACTGAAATT CTGCGCCTGT 


18180 


TAAACGTTTT 


TTrCTkTCTT TGTCTTGTTT AACAATTTTG ATGGAGCCTT TTTGCGCTGT 


18240 


ATTTTCAATG 


ATTACTGTCA CTACTTTATT TGCCTGTGCA GTTGTTAAAG TGACTTTATG 


18300 


CACGGTTTTG 


TCTAGTACAT AACCTGTGGG TGCTTTTGTT TCTGTAAGTT CATACGTACG 


18360 


ATTAACTGCT 


AAATTGGGAA TTGTGACCGT TCCGTCGGTT CCAACTGTCA CTGTTCCTGT 


18420 


TTTTCCTGTC 


ACTGTATCCT TCCACTGAAA TTCTGCGCCT GTTAAACGTT TTTTGCTTTC 


18480 


TTTGTCTTGT 


TTAACAATTT TGATGGAGCC AATTTGTACT TGCTGCTTAT TCTTCACCCG 


18540 


ATATGTTAAA GTGTAGTCGG CTTTTCCAGC AAATGAAACA GGATGTTTAG TTTCGTCTAA 


18600 


TTCATAGCCA 


TTAGGCGCCT TCGTTTCAAC TAAATAATAT TCTCCATCAG GTGATAAATT 


18660 


TGAAAATAAC 


AGCTCGCCAT TTTCATTCGT GACACCTTTT TCTATTAGAT TATTTTTTkG 


18720 


GTCATACAAC 


GAAAATTCAG CACCTGcTAA ACGTTTCACC TCATTATTTT CATCTACTTT 


18780 


TTCyACTTTT AACTGGkTTn CTAAACGCGt ATTTcTTTTt tCAAGCGAAT CGTCTGkGTT 


18840 


TGTTCTAcTT 


AATTnCAAAT GGAaCTGGTG TTGCATCTAA CTTGTAACCA GTTGGTGCTT 


18900 


TTGTTTCAAT 


GAATGCATAC TGACCAGCTT CCAAACCAGC TAAACGAATT TGACCTGCTT 


18960 


GATTGGTGGT 


TAATGTTTCT TGAACAACTT GTTGTGTTTT TGTGTTCACG ACTTTGAAAA 


19020 


CAGCACCTTC TAAAACTTTA GCAGTTACTG CAcAATTTTT TCyAAAaCGA CACCCCyTCG 


19080 


ACCTGTTCCA 


TCACTACTCT CGTCTGCTGT GATGTAGGCA ATATCATCAT CTTCCCAaGC 


19140 


ATCGAATCCC 


TCACTAGAAA TACGCATATA CGTGTAATTT GATAGCACGG CATCACTTGG 


19200 


AATATTTTCA 


TTTGGAATCG TTTCTAAATA AATATAATAG GCATTTTTAT TGGAAAGATA 


19260 


ACTGCCCGTG 


GCTGTTGGTA TTCCTTCAAA TTTCTCTGCT TCATAATTAA ATCCAGATTC 


19320 


AGAGGCATTT TTTTGGAAAA ATTGACTGGC ATCATAGCTA TATTCGTCAT GTTCCAAATT 


19380 


CCGAATCATT 


TCACCATTTG TTTCTAAAAT AGGCTGCTTG GTGAGATATG CAGCAATACT 


19440 


GTCAATAGCT 


AATGTTTGGT TTGGACCTAC ATTATCTAAA AATTTAAAGG TTGTTTtTTC 


19500 


TGAACCTGAA AGACCTGCTT GATTATAGTT AAAATAAATG ACCCAAGCAA TTCTTCCCTC 


19560 
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TGAGGTAAAT CCTAACGGGA CTTTTCGAAC CAGTTCAGTT GGCGGAACTT TCTCCACGTA 19620 

TTGCCCGATA TTAATTTTGA CAATTTTTTG TGAATCCCCA -AAAGTAAAAA CGACTTGATC 19680 

TTCAAAATCT TCTGATAAAT TTTGATTATA TCGTAGCTTC AGTTGCAAAT CTGCTGAGTA 19740 

ATCTGTAAGC GTTTGTAAGG CTTTAATCGT CCAAGTACCA TCATCGTTTG GAATCACTTG 19800 

AAGTGCGTCT CCAATTTCAA AAGGTGTCAT TGAATAGGAA TATTTTTCGC TAGCCATCCT 19860 

TAATGTCATC GTCTCTCCGC TTGCTAATTC AGAGATTCCT AATGACAAAG AAACTGTCGC 19920 

CCAATCTTTT GTAGCAACAA TTTCCGTTTT TTCCCCCGGn yATTCAACAG CTATAGACCG 19980 

GATTTGTGTG CCACTTTCAT TATCCATTGC AT/U^GAAATA ATACTTGTAT TAGATCCTAT 20040 

AATAGTAAAG ATAATTAAAC AAATCGAAAA TA 20072 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 688 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 



TTGAnACAGA 


TGGAGTnnGC 


ACTGGCTATA 


CAGTAAGATA 


TGATCTTGGT 


GGAGAAAAAG 


60 


AnAAAGACAT 


AACAGTCAAT 


TTACCCGGAG 


CAGaATTACC 


ACTTAGTTTA AAAGTAAAAG 


120 


CACATTACAC 


AAATGGATTA 


CCACTTGAAG 


CGGGAAAAGA AGTACGCTTA AGAAATTTAA 


180 


CAGATGGATC 


AACAGAAGTA 


mTTAAGAAAC 


ArGTAGACGC 


TAACGGAGAA 


GTTGTATTTA 


240 


CTGAGCAAGA 


TGGTATAAAA 


AAGGAAGTAA 


ATTATGGTAT 


TGAAACAGAT 


GGAGTAAGAA 


300 


ATGGCTATAC 


AGTAAGATAT 


GATCTTGGTG 


GAGAAAAAGA 


AAAAGACATA 


ACAGTCAATT 


360 


TACCCGGAGC 


AGAATTACCA 


CTTAGTTTAA 


AAGTAAAAGC 


ACATTACACA 


AATGGATTAC 


420 


CACTTGAAGC 


GGGAAAAGAA 


GTACGCTTAA 


GAAATTTAAC 


AGATGGATCA 


ACAGAAGTAC 


480 


TTAAGAAACA 


AGTAGACGCT 


AACGGAGAAG 


TTGTATTTAC 


TGAGCAAGAT 


GGTATAAAAA 


540 


AGGAAGTAAA 


TTATGGTATT 


GAAACAGATG 


GAGTAAGAAA 


TGGCTATACA 


GTAAGATATG 


600 


ATCTTGGTGG 


AGAAAAAGAA 


AAAGACATAA 


CAGTCAATTT 


ACCCGGAGCA GAATTACCAC 


660 


TTAGTTTAAA 


AGTAAAAGCA 


CATTACAC 








688 



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10194 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
nGATAAGGTC CTATAAATAG nCCGGATATn TTTTTATTAT TAAGATCTAG TACTTTCCTC 



60 
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TTACTATGAG TGTACTAGAT TCTTTTTTTG CCAACTTTAA 


AGAACTAGAC 


CAATTTTCGG 


120 


TTTTTAGATT TTTGTTCACA AGTACCTACT TTTAATTGTA 


TGATTGTTTA 


TGTTTCTAGG 


-1^0 


GAATATATTT ACAAAGGAGG TATATAGACC AATATCGGAT 


AGCGCCAGAC 


CTGAACGTTC 


240 


AGGTGACGAG GAGAGAGCTT ATCGAAGATT CGGCGGGTGG 


CTCTAGGGAC 


TGCACTCTAC 


300 


AGATAACAAA GAAAAACTAA TTGTGAAGTT AGAACAAAGC 


GGTTATCACG 


CAGGTAGAAA 


360 


CATATAATGG GATTTGCTAG TTAAACAGGA CCTAGTAAAT 


ACAGAATTTC 


AATTGGAAAG 


420 


TAGGAATTAG CATTATGTGT GGAATTGTCG GAATGATTGG 


AAAAGAAAAT 


GTCACGGAAA 


480 


GTTTGATTCA AGGATTAACA AAATTGGAAT ATAGAGGTTA 


TGATTCTGCT 


GGGATTTGTG 


540 


TGAATGrTCm AAGCCAACAG CCCCATGTAG TTAArGCAGT TtGGGGgCAT TkCCsATTTa 


600 


GCAGCAAAAT TAACGCCAGA AATCGATgGA ACAGTTGGTA 


TTGGTnCATA CTCGTTGGGC 


660 


CACTCATGGC GAACCAACTG TCGCCAATGC ACATCCCCAT 


GTTTCAAGCG ATCAACGCTT 


720 


TGCATTAGTG CATAATGGTG TCATTGAAAA CTTTGAAGAA 


TTAAAAGAAC 


AATTTTTAAG 


780 


TGGTGCTCAC TTAATTGGTG ATACAGaTAC TGAAATCGTC 


GTTCAATTAA 


TTGcTCATTT 


840 


TGCTGAAAAT GGTTTTTCAA CAAAAGATGn CTTTTAAAGC 


GGCTTTAGCm 


GAwATTaAAG 


900 


GTTCATACGC TTTTGCATTA ATGGAT/UU^A TGGaCCCCAA 


TATTATCTAT 


GTAGCGAAAA 


960 


ATAAAAGCCC GTTACTTATC GGTCTAgGTG mAGGCtTaAT 


GkTTATtTGT 


AGTGaTGCGA 


1020 


TGGcGATGAT TAAAGAAcCA ATCAATTtGT TGAGATtGTT 


GATGGTGAGA 


TTGTAACTGT 


1080 


TACAGCGGAC AAAGTAGTAA TCGAAACGCC AGAAGGACAA GTAATCAACC GAGAACCTTT 


1140 


TGAAGCGCAA TTAGATTTAA ATGACATTGA AAAAGGAACA 


TATCCGTTTT 


ATATGCTCAA 


1200 


AGAAATTGAT GAACAACCTG CAGTGATGCG CAAAATTGTT 


CAAGCGTATA CTCAAGGTGA 


1260 


AAAAGTGACT TTAGATGAGG CGTTACTGGA AAAAATTAAT 


CAATGTAACC 


GAATTTACAT 


1320 


TATTGCTTGT GGGACAAGTT ATAATGCAGG TTGGGTCGGA 


AAATCTTTAA 


TTGAAACGAT 


1380 


TGCCAGTATT CCTGTGGAAG TTCATTTATC AAGCGAATTT 


GGCTACAACA 


TGCCTATTTT 


1440 


ATCTGAAAAA CCGTTCTTCA TTTTCTTAAG TC7VAAGTGGT 


GAAACGGCAG 


ATAGTCGCCA 


1500 


AGTACTGGTA CAGGTTAATG AATTAGGGCA TCCATCACTT 


ACATTAACAA 


ATGTTGCTGG 


1560 


CTCAACATTA TCACGTGAAG CGGACGATAC CTTGTTATTA 


CACGCTGGAC 


CAGAAATTGC 


1620 


TGTCGCTTCC ACCAAAGCTT ATACTGCTCA AATCGCTGTC 


TTAGCTATTT 


TAGCAAAAGC 


1680 


AATTGGCGAA GAACGCCAAA CATTAGCAGC CGTCTCATTT 


GATGTGGCTC 


ACGAATTGAG 


1740 


CGTGGTAGCA GCTGTCATGG AAACCTTGAT TGATGAAAAA CAACAAATGA AAGCCTTGGT 


1800 


GGAAGATTAT CTGACGTATA CTCGAAATGC CTTCTATATC 


GGGCGAGGAG 


TCGACTATTA 


1860 


TGTGTCGATG* GAAGCGGCCT TGAAGTTGAA AGAAATTTCC 


TACATTCAAG 


CAGAAGGCTT 


1920 


TGCGGCAGGT GAATTGAAGC ATGGAACGAT TGCCTTAATT 


GAAGAAGGAA 


CACCTGTGAT 


1980 


TGGGATTGTG ACAGATGCCA AAGTCGCTGC CCATACAAGA 


GGCAATTTAA 


AGGAAGTCGA 


2040 
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AAGTCGTAGA GCAAGAAATA TTGTGATTGC TTTGGAATCC 


TTGGCAAAAC 


CACAGGACCA 


2100 


ACTGATTATT CCAGATGTTC ATCCGTTACT TTCTGCCTTA 


GTTAGCATTG 


TTCCAGGTCA 


2160 


ATTATTAGCG TACTATGCAA CCTTACAAAG AGGCTACGAT 


GTTGATAAAC 


CAAGGAACTT 


2220 


AGCCAATU^GT GTAcTGTTGA ATAAATCAAG AACCTGAGAG 


AAAGATGTTC 


AACGGGTTTT 


2280 


CATGAGTTTT AATATCTGAG ACAAAAATCA CTTTAGATTT 


TTGTCTCAGA 


TATTTTTTAT 


2340 


AAAAGTVAAGA TATTCAGTCA AAAATGAAAG ATTCCAAAGA AATTCTTTTT TTCCGTTGAA 


2400 


AACTGTTTGC ATTTTAGCAA AATTAGATAT AATTAATAAG AACCGAaaAA TAAGAATTAG 


2460 


CCAGTGAGGT AACTTTGTTT GAAAGAAAGC AAAAAGTAGG TTAAGCTGAA AAGTGCTAAT 


2520 


TTATTTAACA AAAAAAGATT GTGAGTTGTT TGATATGGAT 


CAGTTTAGTA 


CATTAACTGA 


2580 


ACAAGAATTT ACTACCTTTG CAATGACACA TCCTGCAGGT 


AACTTTTTAG 


AAACACCCGA 


2640 


AATGAAACAT TTATTGGAAC GACGTGGCTG GCATTGTGAA 


TACGTTGGCG 


TCAAACGAGA 


2700 


AGGCCAACTA ATTGCTGCGT GTATCCTAAG CAAGAAAAAA 


GTCAAAATTG 


GTTATGCTTT 


2760 


CGACATTGAC GGCGGAATTT TGATGGATTA TACAGATAAA 


AAATGTGTAG 


AAGCCTTTTT 


2820 


CACAGGCTTA AAAAAATATG TGAAAAAAAA TGATGGCCTT 


TATTTAACCT 


TTACTCCAAA 


2880 


TAAACAAATT TGTTTACGTG ATTTTAATGG CGGTGAAGTT 


GAAAAGGTCA 


ATCAAGAAAC 


2940 


ATTTGACTAT TTCACATCAA TTGGTTTTGA ACACCAAGGG 


TTTGATGTGC 


ACAATTTTGA 


3000 


TGGTGCGCCA CGTTGGCTGT TTGTCAAAGA TATGGCTGGA 


TTAACAGAAG 


AAGATCTTTG 


3060 


GAAATCTTAT GGAAAAGATG CTAAATATGA CATCAAGAAA 


ACATGGGAAT 


ATGGAGTAAC 


3120 


AATCCGAGAA CTTCgTTACG AAGAGCTACC TCTCTTTAAG 


AAATTAACGG 


AAGAAACAAG 


3180 


TGCACGGCGT AATTTtGAaG ATAaGGATTt AGCTTATTAC 


CAaGCTGTTT 


ATGAaGAGTT 


3240 


TGgCGAaCGA gCGAAtTTAT GGTGcGGACT AAcTTTGCTA 


CTTATTTaGA 


AAATCTTCAT 


3300 


GAGAAATTAC GCAAATTACA AGAGACGCTC AACGAAgTCA ATGAAGCGTT 


AATTGCCAAT 


3360 


CCTAAAAGCC GTAAAAAAAA TAACCAGAAA CGTGAATTTG 


AAGATGAAGT 


TCGTACCGTG 


3420 


CGCAAACGGA TTGACGAGGC TAAAGAAAT6 AAAACATCAG 


ATGAACCTGA 


AATTTTAGCA 


3480 


GGTGCCTTAT TCATTGTTCA TCCACAAGAA GTTGTTTACT 


TGTTTAGTGG 


TACGTATGAA 


3540 


AAATATAAAC AATACTATGC ACCGTATTTA ATTCAACATA AAATGTTAAC 


TTACACAGTA 


3600 


GT^AAATAACA TTCCCAAATA TAATTTTTAT GGCGTT6ACG 


GTATCTTTGA 


TGGAAGTGAT 


3660 


GGGGTTCTTA AATTCAAACA ATCCTTTGGC GGTCATGTCG 


AAGAATTAAT 


GGGCAATTTC 


3720 


CAATGGAAAG CCAAACCT^T GAAATATGCC TTATATCACG 


CATTAAAAAC 


AATTAAAGAA 


3780 


AAAGTCTAAA ATTAAAACCC AATCCTTAAA AAGGAATGGC 


AAAGTGAGGC 


GTTTAAATGG 


3840 


AAAATCAAGT CGAGGTTATG ACGTATGCAC ATTTAAAAGA 


AATTATGCAG 


GCGTTAGAAG 


3900 


CAAACGAAGC GATTACAGAA GATACAAAAG TCTTTATTGA 


TACAGGATGG 


GATAGCGTGC 


3960 


AAGAAGTAGC CCCAGATGCG GTCAGTATTG AAAAAGTTGC 


TAAATTTACA 


GTAGCAGATG 


4020 
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TTTTAACAAA 




GirGGGTATA GXTTAGAAGA AAi\GGCGGAG AAGATGAACG 


4080 


CTGAGGGCGA 


f — Yr-'^---'"'^ 


GCAJi.TTATTA TTCGTAATCT TTACTAAAAA Ar:TCCGTTTG 


4140 


AATAGTTAAA 




GXACAGAACC GTTTAGCTCC GAGCAACGCG TAsTGTTGGT 


4200 


TCTGTTCCCG 


w «.-jvs - ----- 


GGrtTTGAGC CTGGAGCAAA AATCCAAAGT GATTTTTGTC 


4260 


CCAGGCTCAT 


C~ 3CTT T.- AA 


rAAGAAATAA AGGAGGTTAG ATATGAGTAC CATTCmTGrA 


4320 


TTyCCAAAAA 




TTTTATTGCA CAAGGAGAAG AAGCACTTGT AGAACATAAT 


4380 


CAGATCGCCG 


CrTTaa^iAAA 


TTTTCAACAA GCCTATCAGC wACCAmCAAc GCCGCCAGTA 


4440 






A.rtGTTGGAA ATGGGtGAaG CTGATGAAGC ACTAGCATTG 


4500 


GCAGAAGCG7 


tccaaga:;7c 


TIATTTTGAG AACCTTGAAA CAGCGGCAAT TTATATGCAA 


4560 


ATTTATAGTC 


AAAGTCGTCG 


C7TTATTG7VA GGGTATATTT TATTGAAGCA ATTACTTCAA 


4620 


ACT AAAAAAA 




ACAACAAAAA ACGTTAGAAC AGCAATTAAT GCAAGTGGAA 


4680 


GAGGCGTATC 


AACAATTAGA. 


aIaCGCAGCAA ATTCAAGCAA TTAAAAGGrA CTTGTTAGTA 


4740 


AGCGATCAAT 


TGCCCGrrTA 


TCA-ACAATTA GCGAACATTA AAACCAGTTT ATATTTGCCT 


4800 


AAACCAGTAT 


TrGTGGA_AG7 


CGCAAAAGAT TTAGTTATGA ACCAAGCACT TAGTTATTTT 


4860 


GCTCGGGAAC 


GGTTCArTGA. 


AGA-ATTAGCA CTTTTGCAAT TTTCAGAACC ACTGACTTTT 


4920 


CTTTSGTATG 


AC^JirCA^CC 


7CAGACAGTC TTACTAGAAG GAAAAACTGG ACCGTTAAAT 


4980 


ACGCCGATCT 


ArTCCGA---AT 


T7GTACTGAA CTTAGGAATC GTTTAGAAAA TGATGATCCA 


5040 


ATTATGCTGC 




AG.AAGAAATT CGATTGCATC TTGCCTATTT GTATCCATTA 


5100 


GCAGAGACGG 


TTATTTCAGA 


TCCAACTATC TGGGTCTTGG GTTATTTGGC AACCTATTAT 


5160 


CCTGAATATA 


TTGA-AAAAGA 


A7TGACTGAA GCAAAAGGCC ATCAAATCGA TGCTGTTCAA 


5220 


AAAGTTCAAC 


AGGCGtATACG 


A^-^GGCATTT ACACAAATAA TGCTTTAAAA AATGGCACGT 


5280 


CCTAAAAGCG 


AGGACGT3CC 


A7TTTTTCGG TTTCAGAAAT TTGCTGAATC TGCTCTAATT 


5340 


GCTTTTCiTG 


ACTGAT7.--7T 


GCGGATAATA GAAAAAAGCA AGACAGAAAT AAGTAAAAAG 


5400 


AATTCCTTGA 


ArCAAGC-JiJS. 


TCACAACGAA CAATGGCGTT GCTCCTGGAA AAGAATCAGA 


5460 


AATCCGTGT=. 


A7TAAAAATT 


CAACTAACCA TAAAATCATT GAAAATAAAA ATAATTTTTT 


•5520 


CTGACGAGTT 


CTCATAGTCC 


GCAACAACCT TTCTGGTGAT TTTCACTTCT CTATCATAAC 


5580 


ATAGAAAACG 


AGCTGTTTTT 


TCTCAAAAAT TTTGTTACTT CATAAGAAAA ATGGGAAATA 


5640 


AGCAAGTTTC 


AGGA-AGTACT 


AGCTTTACCG AAAAGAATAC TTAATCCATC TTAATCAAAC 


5700 






AAAAGAAGTG TTTTTGGATG GAACAAATTA TTTCATAAAC 


5760 


ACAAATATT7 


CTTTGTTGTT 


TGTATAAATG TCATAAAAGT GATTACTTAC TACTTTTTGA 


5820 


TTTTTAGACA 


T-ATCCGT.ATC 


GGTCAGTGGT GCCATCTCTA ATCCTAAATT CGATGAAGTA 


5880 


TTGAACCAAA 


CGT.AATT-:UK= 


GAAGTATAGA CCGTCATTTG GTGGCACAAT TITTTGAAAGA 


5940 


ATTGGGTAGT 


T7CGACT7GT 


A7TAGCATAA ACA6TTGAAT TTTTGAAAAA CGTATTTAAA 


6000 
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TAAACTTTTT 
TGGCGATTGA. 
AAAATATAAT 
TGTCGATTGT 
TAGCGAGGCG 
AGAGAAGCAA 
CTATTTGATA 
AAACGATTAC 
CTGGGAATAT 
GGTACAAATT 
AGTGCGACTT 
AGGAATAAGA 
AACAGTGAAA 
GGAAGAAACG 
CATTGGAGGA 
ATTTTTTGGT 
GTTAGAATCG 
GAGAGAAATT 
TGGCGAGCAT 
TGATAAATTA 
TCAAATTTTT 
TTAmCaATTA 
TCTTGACAGA 
GGGATTTTAC 
TTAATTAAAA 
TTTGG6TAAT 
TyCTTTTTTC 
TGTCCTAGCT 
CGGGcATGTC 
TTTATATATT 
GCAAAATTTT 
TATTTTTTTA 
GATTTTGTAG 



TTCGTTCGTT 
ATGQTT.CTTT 
AAGATAACAA 
TAAATAAACT 
CATCACCTGC 
GTCCTAAATA 
GATGACTAAT 
TCTGGTTCAA 
CGTGCAAACT 
TTAACTCGAT 
TTTTGCAAAG 
CGATAACTAT 
AAACAGAAAC 
AACAAAAAAT 
ACCAAGGATT 
CATTGATGAT 
GTGTGGTCAA 
CACTGCCCCA 
TATCATCTAA 
AAATGATCAA 
TTGATTCkGT 
AAATATmCCA 
ACACTAAGGG 
CAAAAAGCCA 
ACAATTCTAT 
TTTTGGGTAT 
TATTTGCTTC 
GTGTTTTGaT 
TAAAACTATG 
TATTCACAGA 
GGGGATTACG 
AATAATAGAA 
GGCCAATTCT 



TGTTACAACA 
TTGATAGTTT 
GCTGGCCACA 
TAACATGCTG 
GATATTTCTT 
CAAGATACAA 
AAAAAATAAA 
TAGCTCTGAA 
TGGTAAGGCC 
AAGGTAGCTG 
AGCAATGAAA 
ATAAATTCCT 
CAAGAAAAAC 
ACTCAATGCC 
GAGGCCAATT 
ATAAATTGCC 
GCCCAGATCT 
ACGTGCGTAA 
GTAAGGGAAA 
TAGAAGATAG 
AGGTAATTGT 
TAACGAGTGG 
TAGGCAAAAC 
CCTGCCAGCT 
ATGTTTTAAT 
TTACAAAATA 
AGTAGCTGAT 
ATATATTGGA 
AGATGAAATT 
TGAATTTGAG 
TAAATCTCCT 
AAGAGAAGAA 
ATACTGGCCA 



610 
TTTTTTAAAT 
AATGTGGAAG 
AGTAACGAAG 
ATTGTAACGA 
GAATAGACAA 
TAAAGGAAAC 
ACAAGCAGTA 
AAATAATGGT 
TGACTGCCCC 
ACTATAGAAA 
TTTTCTCCAG 
GACGAGGCTT 
GTAAAAGAAT 
ATATAGGGAC 
AAAGTAGCTA 
aGTATGCTTG 
GTCAAATGAC 
GTCGTACCAA 
TTGATAATGC 
GCTTTATTTT 
TTCATCATTG 
AATGAAGAAA 
AAAAAAATCC 
TCATAGATAA 
ATATACATCA 
TCTTTGTATT 
GCGTATTCAT 
TCAGCTTCAG 
CtTTCAATAT 
ATGGGCCATG 
CTCAATATTT 
GGGACATCAA 
GCATCATACT 



CATCTGTTAA 
AATACACCAA 
TTAGATAAAT 
AAACAGCAAA 
TAAAAATGCC 
TTAAGCCTAA 
GGAGCAGTAA 
AACTATTTTG 
CAATGTTGTT 
GTAGATAAGC 
CAAGTAACTG 
GGTAAGTATT 
TACGCTGCCA 
TATCAAACCT 
CTAAAGAGGA 
TGAGAGATAA 
GACTTCCTTG 
AATCTGTCAT 
CGATGGATAA 
CAATACACCA 
TACTCCACTC 
TTGCTTTTGT 
CAACTCTTTA 
TTATTCAAGA 
AGATAACAAT 
TGTTTTCATA 
ATGACTGTAT 
CTAGGaTTAA 
TTGcTAGTTC 
AATTTTTTTT 
TTTTCTTTTG 
TATTTCGAAC 
TAtAGGCATT 



AACAGCTGCT 
AACAAACGAT 
TGCTTTCACT 
TCCGTAAATA 
AAAACTTAAG 
GTTGATTTTG 
TAAAAGCCAT 
ATAAGCCACA 
TGTTGCGGAA 
GATAGCAGCA 
CTTGAAAGTA 
AAACATAACA 
CCAATAAAAA 
GAAACTAAGA 
ACCCCAGCTA 
GATAAGACCT 
AACAAACGAT 
TCCAGCGACT 
AACTGCCACT 
ATTCTTTAAA 
CAATAATTAT 
TATTTTCTCA 
TTAAAGAGTT 
ATATTGCTGT 
TTAGATATAC 
TTTTTCTTTG 
TATCTCTTTA 
TAAGTCAGAA 
ACATGTTTCT 
ACTAACAAAA 
TTCCATTTCA 
ACCTGCTTCT 
TTTTATCTTA 



6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 
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ATAACTTCGT TACTTTCATC AATGCTATCC CAAGTCAACG CAC5pAACCTC TTGATAACGA 
GCTCCAGTAT ATAGTTGTAC TAAAATGAAG TATTTACTTG TGGTGTTTTC ACGAGATAAT 
AAAGCTTCTT TAAATTTAGT TAAATCATCA ATAGATATGG TTTTATTTGA TTTCTTTTCT 
GGTACTTGGC CAGTTAACTT TATATTTCTT GTTGGATTTG AATGAATATA TTGCATGTCC 
AACGCTTCTA AAAACATAGA TTTTACAATG TTATGTCTGT TGCTCACCGT CTCAACTGCT 
AGTCCTTTTT CATTGATATT GCCAGGTTTT GCAAGCCAGT TAATCCATTG CTGATATTTG 
ACTTTGGTGA CGCCTTTrAG CAATTGTTTG TCCCCAAAGA AATCATGTAA ATTCTTTCTT 
GTTAAAGCTA TTTTTTGCAT ACGGCCAGCA GACACTTTAT TCTTTTTAAA TGTATCAACC 
CAAATATCAT AAAAATCTAA AAGTTTAATT GCTTCAGTGT TTTCACCACG CATAATTTTT 
GATTTAAACT CTGTTTCAGC ATCTCTACAA GCCTTCTTAG TAGATCTAGT AATCATTTTA 
TTTTTCCATT CATTTGTAAT AGAGTCTTTA TATTTAAATC TTAACTGCCA TTTTCCATTA 
GCTAATTTCT TTGGTTCTGC CATAAATACC AACCTCTCGG ATTTTTTTAA TTTAAATACA 
TTTAAGAACG ATTTTTTATT GAATATGTTA TAAGTAAGTA TCAAGTTATC GTATATCATA 
AATACGAATA CGAATATATG TtCGTTTTTT GTGaTAAAAA GAAAAACCCG AAGGTTTCTC 
TCTTTGTTAA TAAAATAATA AGAAGATATT AATAATCAAT AATTTCTACA AGTCCGCTAT 
CTGCACTTTG TGGAGTTATG TGTATTTCCA CATTAGCTTC TCTTTTTGCT CCATATGCAT 
TTACTATAGT AGCACTTACT TTTTGATACC ATTTACCATC TTTTTGCGTC CATGGTTGGg 
ATGACGCCGA TAATTGAATG CATTTTGCTT CCTTTATAAG GATACATATC ATTAAATTGA 
TGCTTAGCTA AAGTATCTAA GACGGTTTGT TGTTCTAAGG TTGGCTCATC TGCTAGCTTA 
GTAATGTCAT TAGTATTTTT ATCTTCGCTC GGTAAAGTAT TAGTAGCTAC ATCAGACTTC 
GCTTCAGTAC TTACTTCTTT TTTTTCAGAT ACATCAGATT GTTTATTATC TTGGCTACTA 
CTTTTAGAAA CATCAGAGTT TACTGTTTTk TGTTTGCTAG TTTCAGTACT kGATACTGGa 
TTTTTCaTAt ATTCTTGAAA ATCTTTTTCA TTTAAAAATC TAACAACAAA ATCTTTGTCG 
GCAGTTTCGA ATGTGAGAGT ATCTCCCTTT TTAAGAGATA CTTTTTTCTT GTTTCCCACT 
CCGTATAAGT CATGGTTACT ATTTTCTCCG CCTGATTTGA CGGTAATAAG AACTATGCTC 
GATTTATCAT CTAGTTGAGT TAATACAGCA TAGTAATCTC CAGGATCAAT ATCTTTTCCA 
aCTGAATATT CCCCTGACGC AACAAAGtCT TTTGAATTCG GTTGTTTAAC TGTGCTGCTA 
GATTGACTGG TTGACTTTTC AGTATCTTTA TTATCATTAT TTCCCGrACA AGCAGTAAGT 
AGTGAAATAC CACAAACACA AACCAATCCT AAACTAATTA ATTTTTTCAT CATAATTCCT 
CCTAAATAAT TTATTCAAAC CATTCATATA CCCCAATTGT ACCATCACAC CTAAAAATTA 
TTTGATAATT TTTATAGATC ATTCCATTTT CATATAATTT TGAGTAACTT TTGATTGCGT 
CTTGAAGATA TTCAGTAGTT ACTCCTAGGA ACTCTGCACA TTGCCAAACA TATATTAATC 
GCTCATGATA ACAATCTATA AAATCTTGTG 6AGTTATTAA CAATTTACAT CCACAATCTC 



8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 
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TAGCTCGTTG TTCTTGTTGT CTTTTTTCAT TACTATTTTG CTCT^TAATA TCCCCAACAC 
TTGTTAAATG ATGACCAATT TCCTCTGCGA GAGTACCCAC CATTTCTTCG TTTGATTGAT 
TAGGGkTTAA ATATACTACA TTATTAATAT AGAGACCTTT CTGATTTcTG GGCATCATAG 
GTCTATTCTG TAGTTTAGTT CAGGATAATT AGCCATCAAG CTTTCCAATT TATn 
(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2053 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
itATTGGTTGG TACAGGTATT nGAATTGGTT nGCTCCCTTG GCATCAAAAG GTTTGCmAAT 
tCGtTAAwTT mTyCAGATTG GTAATATCTG aTAACTCGkT GtAAGAAcAA TCaATAGAAA 
aTAGATTTGT ATTTTgGCTG CTAACAtCTA GAACAGTTAA TTGATTGCTC aAACAAACTA 
AAATGGCtAG ATCAACATTA TTAGTAACAT CTAAAACGGT TAATTGATTG TAAGAACAGT 
CTAAAAGATG TAACCTAGTA TTTTGGCTGA CATCTAAAGT AGTTAATTGA TTGCTAGAAC 
AGTCTAAATT CCTTAAATAG GTGTTCTGAG TAATATCCAA GTCTGTTAGA CGATTTCCAC 
TACAAGATAG AGTCTCAAGC ATTCTATTCT GACTAAAATC CAGATTGGTC AAATTATTAA 
ATTGACAGCT TAAATCAGTT AATTGTTCTG CATACTTGAT ACCTTTCAAG CTATCTAAAT 
TTTTATTAGT GACAGTAATT TTTGTTAAAC TTGACATATC TGCAATAGTC AATTGATCCT 
TATCTTTCCC AAAGCCTGAC CAGTTATTTC CTTGGCTTTG TACAGTAGAA TCAGTCCTTA 
AAAGTGTTGC GAACCTAGCA TCGATATCTG TTCCTTCAAT TAAATTGGTT GTTTTAGGAT 
CTAATTCTTT ATTTTttTTG TCCAGTTTTG CCGATTCGTT TTCTTGAGTA GGAGTAGTCT 
TTGTAGATAA AGTTGAATTT AATGTGCTCG TTGTTGTTAA TTCCATTGAG TTTTTTGTTC 
CTTGTGATTG ATCCGTATTT GCATCTATAG AGGAATCTGT GCCCGAAAGT AAACTTGAAG 
ATGATGTTGA TTCAACTTTT TTGGATGAAG TAGTTTTTGC CGATACCATC ACACCCATTA 
TTCCTTGTGA AAGGACTAAA CCTAAACACA AAAACAATCT AAAGGATTTA CTCATAGTTA 
TTCTCCTTTT CTACATAATT GTCTAAATAG AAACAGCCAA AAACTACTTA TTAAAAACAA 
TATGCCAAAC CAGCTTAGCC AAGTTATTCG ATTGTCACCA GTTTTAGGTA ATCTTTTTAT 
TTGCGTTCTG CTACGAACGA TTGATAGCGA TTGAAtGGTT GTTTGCGGCA ATACATtCAC 
AACTGGATTT TTTTTTGATG GTATGTCTGA AGTTTTTGTA AACCCGATTC CGACTTCTGT 
GGTCTCCCTA TTTTCTGCAT AAGTTGTTTG AGGAATATAA ATTAATAGAA CCAaCAaTAA 
TAAAAaCkGA AGATTTCyTC yTCTCATTAT GTCACTTCCC CTTCGTTTTT CCACTTCTCT 
CCACGACGAA CTAAGTAAGC AATCAATCCA GCAAGTGCTA CTGTCAAAGA TAATGCTACC 
CATGGTACCC ATTGTGGGAT CACTACTTGA TACGCTGCCG ATTGATTCAC TTTTTTCGCT 



10020 

laoao. 

10140 
10194 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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TGCGCTTCTC CTACTTTAAA TTCTTTTTCC 'CATTTCCAAG TTCGACCATC ACCTGTTGCT . 1500 

TTTCCAGTAA AGTGATAAGT TCCTGGTTTA AAGGTATTCA ATCCAATAGG AATGTTAAAA 1560 

TCGAAATTCG AATTTGGTGC TACAGAGAAT TTCTCCATCT TATGTTTATA TAAAATTTTC 1620 

TTTTCCCCTT TTAGACGTAT CTCCCCTTGA ATCGTTAGTT CTTGAAGAAC CTTTGGTTGA 1680 

TCGTTTTGAA TACGTGCTGC AATCACTTTT TGACCATTCG CTAATTGTAA GCTGGCTTTC 1740 

TTTAAATGAA GCATCGCTCC CTGATTAAAG GGTTCTTCAT CCTCCGTAAG CATAATCGCA 1800 

ATCACGCGCG CATATTGAGG AGTTAAGCCA GATTTCTTTT GTTTCACTTC CGTACTATTC 1860 

TTTCGAACAA AGCGCAGAGA GCCCAGCTTT ACTCCGGAGA AAGCCGATTC AGGTGGCGTG 1920 

GATCTCAAAT GTnATCGGTT TCTCTTCTTT CCCCTTGGAG GGGGAATTTC CTTAACGGCC 1980 

ACCTTTriATT TTTTACTAAC TCACGGAATG 'GGGGCTTTCA AACGGGCATC GAACTTAnGG 2040 

GGGAACATTG GnA 2053 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4178 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 



GCAGTCGAGC CTGTTGGTCC 


ATAAAAATCG 


GTGTATGAAA 


GCAATTCTTT TTAGATATTT 


60 


TTGACTTCCC AATTTGATAT 


ACTGCGGAAA 


GTATCTGAGA 


ATTTATTTAA CACATTTTTT 


120 


ATTTACTAnA GAAAGTAAAG 


ATCaGAGCTG 


TGCaAATAGA 


AATGATTAAA AACATCCCaA 


180 


TATCATCATC TAATGTTTCA 


AAAACTTCAA 


CAGGTTTCAT 


AAAATGTTTA ATTATTTTTT 


240 


TCATTGtATT CaCCTCAAAA 


AGAGAATAGA 


ATGAAATATT 


CTACTTATCC ATTGATATAT 


300 


CTAACGATTG ACTTACAAAA 


TTGAAATAAA 


AGTCTAAAAG 


TCAGTTGGTG TGATTATTAT 


360 


TATAATACAA AAAAGACCTC 


CATTCTTTTA 


AGAATAAGGT 


CTTTTTTTTC GTAGTCGCCT 


420 


TCTAGGCAAA CTAATTCAGT 


TAGCCGTATT 


CTAAACTCAA 


TAAAATGTCA AATACTCATA 


480 


GGTGTATTTT TATCACAAAA 


TTTACCACAG 


ATTTGCGGTA 


TTATTTACAT ATCTTAGATT 


540 


ATTAAAAATG ACAAAACAGC 


TATTTAATGC 


GTATTGTATG 


ATTTTTTTGT TCCTTAAATT 


600 


GTTATGGAAT TTCAGATGGG 


GAGTGGCAAA 


TATACGAAGA 


AAATTAACTA TTTATAGCGG 


660 


TTTTATTTTA TTAGAGAGTC 


TCGTTAGTTT 


TAGTATTGTA 


TGTGTGATAG CAGGGATATT 


720 


CTCCCTGACT ATTACACAAT 


TAATTCAGCA 


AAATTATCAA 


CGGGAGCAAG AATTAACGCG 


780 


CACTCGTCTA* GGATACGAAG 


CAATTTTATT 


TTTAGAGCAG 


ACGGGTGATT TACGTTTCAA 


840 


AGAGCGGATA TATCAAGGAG 


AAACTTACCG 


ATTTTCTTTG 


ATGGAGAACG ATGGAAGAAG 


900 


GATACTAAAA GTCACAGATT 


CAAGAGGAGC 


GATTTTGATT 


GGTCAATAAA AAATATGCGG 


960 
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oX Jl 1 Iriv^uAx 


O X X #iunu X u X 


TTA^^TAf^PAP 


TAGTCGTTTT 

X /Iw X X X X A 


AAQTTGTATG 


TGTCAATTGT 


1020 




wli X WviLiAn 


AuX X X XnXV^^ 


naa&TPAATA 


TTTGAAAAAC 


aATGATTPGA 


1080 




fit ^ fffv fn Iff! fit nvfit fft A 
X AX X i I X 1 XA 


AX Xv«AAX IJhfct 


aaaaAGAATG 


TPAAAAGPTP 


GTTTTPPAAA 

V9X X X XV.rV.pAAA 


1 1 dn 




1 CCAAGAGAx 


X X X 1 X X L» 


papTPacaaa 


PAAAPAACAP 


T* a T"p "p a a *p T 

XAX X X WiAi X 




CAAATCaAAb 


JVJt^lk'PTtJtTVltrp 

AAGATAAAAi 


•p a TT a a pr*r* a 
lAl lAAbUlaA 


pTpaaTf^TA 

u X W.Ari X X A 


AAf^GPTATPa 


21 PPGPTGTT a 
AV«V«UV.F X X X A 




ATTGGTATCIft 


AAAA rLriaAvJA 


#^ nnini m 71 n ]k Tk a T 

isX X XAAAnnl 


na arzriapA AT 


PGTTTaPTTT 
XXX XXX 


GCJVAGTTAPT 
vjVTnnvji X x x 


J. JZU 


TTTACTTCTG 


VaAAAAAUAi X 


XuAi Aol 111 


TTTPP/^aTAA 
X X XC^laAXAA 


agt:/; a n A a TT 


TTAATGAGGP 
X X AAX\3AV7\3V« 




AAAAATATTU 


AbuAAAUT TA 


1 IGl lUAUGo 


r»r* a Tr* r^fr* a T 

CLiii X GGCUA 1 


Xul I XAX X lu 


aT^IAGTTTTP 
AXvsAuX X X XV.* 


1 ddO 


TCGCCCTTCA 


GTTACTAGAA 


^7k 7k /^^T»/^lk/^'P 

GAACGTCAGT 


TAACAt*AAAA 


A fp rpnk A A A 
Ail XACGUAA 


GV« X AUv^lJAGG 




AATACTATGC 


AGGGAAAAGT 


jk »p »p •« rtl /-• 7k T »P 

ATCTT xCATT 


TATTTCTTGC 


A A T/^TI* AAA 

AGATG r xAAA 


oaaaaTa^ar* 
U AAAA 1 AGAv^ 




GAAAGTTAAA 


AACAGAAGAA 


AGGCTCGTAT 


ACGCGCAAGT 


GACCCTCGAT 


rr>7k tn 7k /**7t m 7k /^7k 

TATACATACA 


"1 an 


AAAATGAACA 


It rnrnlt It It Ik nrt 1\ 

ATTAAGAATA 


ACTGTTTTAT 


»T*7k7k7k/^7kA7kT»^ 

T AAACAAA 1 C 


1 GGTUGAAAA 


T AO^ A AT* A 1*0 

1 ACUAAl AiC 


XdoU 


AAGAGAGAGT 


TTCTCATCAA 


It It It Ik 7k 7t ^^f^r^ 

AAAAAAGCGG 


7k 7k It Jt 7k »T» 7k r^Tn 

AAACAATACT 


GGAATAGCG i 


TGTTTTTCTA 


1 / 4U 


CAAATTATTA 


GAT TTTAGAA 


/-» It Tt Tk^Tkr^ik r^*T* 
GAAAGACAGT 


7k 7k 7k^^/^7k»P»n/^ 

AAAGGCATTG 


ATTATTCTTT 


CTTTTTTcGT 


1 Qf\f\ 

loUU 


TAAAATAAAA 


^jk ^^mik m^^vk 

CAGGTATGGA 


Vk 1*/^ 7k O 7k 

AGTTGTTACA 


TTTTGAATAG 


AAT AGAGG J> G 


d AAX X xGx X 


Xobtl 


CCCAGAAAAA 


aTTGAAGAGG 


CTTTCGAGCC 


AATGGAALAA 


GUAAlUGAGi 


»pa*i»n»r*/^a ar*a 
XAX XoCAAuA 


1 QOf\ 


GTCATTGGAT 


ACTTCTTTTT 


TAGATGCGTA 


*t*A 'P'P^A A A An* 

TATTGAAAAx 


r*r*ntr*jk a a a*ra 
GGTGAAAAIA 


X X i X AGAl GA 


1 oan 

X7DU 


TTTTCAAGTA 


CGAGTACTTG 


ACGGAGTALC 


T A A nr*/^ A A ^ 
TAATLGAGAG 


A o A a a a<^ 
AwlG X GAAAU 


a ai^iiva/^a a ar* 
AAi lAGAAAvir 




GCTGTATCAT 


ACGATTAAAA 


AAATTGATTT 


AbL.TL.^xGAA 


GAx G X Gv#GAL* 


GV« X X G 1 CLrGA 


z xuu 


ATTACTTTTG 


/^rriTk jk ^ ik^/^M 7k 

CTAAAAGGAA 


/^/^IkOTk 7k 7k A ^ Tk 

CGAGAAAAGA 


GCAGGx xLAA 


AAA •Pr' A 

GC AAATGA X ^ 


AAX xGAGGuv^ 


ZXDU 


AGACGGTATT 


GGCTTTTTGT 


TTGTTTATTT 


Ar*rp Ar* A A/^A A 

AGTAbAACAA 


T^P A A^A A A'Pa 

1 i AALJ\AA1 A 


a a a r» a ^r*^ 
AA X wiGAGGG 




ATTAAAAAi T 


TTAGATCCTG 


C xl CZAGGAAi 


GGGAAAx 1 X A 


v# X G 1 X AAuGG 


X X L. X GU X GAA 




UCTTGAAAuA 


GUiGGTTACA 


AAGTTTCTGG 


TTAlGGxGxG 


GAXAxUGAXG 


AAAv«Av« X a X X 






1 X i) X oAA Wi 


a T^Jr "•i*"i»^f "'1'^* 

AluWrX XVsoX^ 


apaacppaaT 


aTTPni^TTGT 
AX XV<AuX XlaX 


TPPATPAAGA 






oAX X XVsX XAX 


X \3\3J^ X V^VrfiAu X 


TGATTTAGPA 


TTAAGTGAPT 
X X AAla X vtnx^ X 


TGPPG A TTCC 

X VjV^Vo w/V X X V3V9 


2460 


CTATTATCCA 


AATr^ATrJAfSP 

/WX Unx OnOVi^ 


n AHP A A IkCX^ 


ATTTGPTGPT 

t\X X X X X 


GPTGPGGAGG 


AAGGPPATAG 


2520 


TTATGCCrAT 


CATTTATTAA 

>>.«rVX X XAX LT\r\ 


THHAAPAAf^P 


PATGAAATAP 


GTGAAGPPAG 


PAGGGTTTGG 


2580 


fTTATTCTTA 




A T ATTT TAG A 

r\ X t\ X X X X n>V3n 


GAPTGAGPAA 


AGTGAATTPT 

rWj X Vxr\f \ X X k^ X 


TTAAAAATTG 


2640 


ATTGACAAAA 


AATGTCTATT 


TACAAGGAAT 


GATACAGTTA 


CCTGATGAAT 


TGTTTAAGTC 


2700 


AGAACAATCA 


CGTAAAAGTA 


TCTTGCTTGT 


TCAAAACAAG 


GGTGCAGATG 


CTGAGCAAGT 


2760 


GAAGGAA6TT 


TTGTTAGCAA 


AACTTGCCTC 


TTTGAAAGAT 


ATCAATAAAG 


TAACGGAGTT 


2820 


CTTTAAACAA 


TTTGAAGCTT 


GGAAAGCTTC 


AAATTTAAAA 


TAAAATAGTA 


ACGAAGAGGA 


2880 


GAAAATTATG 


TCTAAAACAA 


TTGCAATTAA 


TGCAGGTAGT 


TCAAGTTTAA 


AATGGCAATT 


2940 
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ATACCAAATG 


CCAAATGAAG 


AAGTGATCGC 


TAAAGGAATC 


GTTGAACGTA 


TTGGCTTAAA 


3000 


AGATTCTATC 


rrCACAATJA. 


.AAIATGGTGA 


AGGACAAAAA 


TATGAAGTAA 


TTGTTGATAT 


3060 


TGATAACCAT 


GAAGTGGCAG 


TAAAAATGTT 


ACTAGACCAA 


TTAATCGACT 


TAAACATTTT 


3120 


AGGTTCTTAT 


GATGAAATTA 


CTGGTGTTGG 


TCACCGTGTA 


GTTGCTGGTG 


GAGAAGAATT 


3180 


TAAAGATTCT 


GTTGTTATTA 


CAGACGAAGT 


ATTAGAAAAA 


ATTGAAGCTT 


TATCTGAATT 


3240 


AGCACCTTTA 


CACAACCCAG 


CGAATGCGAT 


GGGGATTAAA 


GCATTTAAAC 


ATATCTTACG 


3300 


AGAAATCATT 


AGCGTGGCAG 


TCTTTGATAC 


GTCGTTCCAC 


ACAACAATGC 


CAGAACACAA 


3360 


CTATTTATAT 


AGTGTACCGC 


GTGAATACTA 


TGAAAAATTT 


GCTGCACGTA 


AATATGGGGC 


3420 


ACATGGAACA 


AGTCATCGTT 


ATGTCTCTCA 


AC6TGCTGCA 


GAAATGTTAG 


GTCGTCCAAT 


3480 


TGAAGACTTG 


AAAATTATCA 


CTTGTCACTT 


AGGAAATGGT 


GCTTCAATCA 


CTGCAGTTGA 


3540 


TGGTGGT7y\A 


TCAGTCGATA 


CATCAATGGG 


CTTCACTCCT 


CTTGCTGGCG 


TTACAATGGG 


3600 


GACTCGTTCA 


GGGGATGTTG 


ATGCGTCACT 


ATTACCTTAT 


TTAATGACAA 


AACTTGGCTT 


3660 


GACTGATGTG 


CAAGACATGG 


T TGATATTT T 


AAACAAAAAA 


TCTGGTTTAT 


TAGGTTTAAC 


3720 


AAATGGTTTA 


TCAAGCGACA 


TGCGCGATAT 


TCAATCAAAT 


CTAGATAAAC 


CTGAAGTTCA 


3780 


AACAGCATAC 


AACATTTTTA 


TCGATCGTAT 


TCGTAAATAC 


ATTGGTAGCT 


ATGTCACTGT 


3840 


TATGAATGGC 


GTTGATGCGA 


TTGTCTTTAC 


TGCTGGAATC 


GGTGAAAATG 


CAGTTGGCGT 


3900 


TCGTAAAGAT 


ATTATTGATG 


GCATGACTTG 


GTTTGGTTGT 


GA7UVTCGATG 


ATGACAAAAA 


3960 


TAATGTACAC 


GGTGAAGAAG 


CAGTTATCTC 


AACTGATGAT 


TCAAAAATCA AGCTTCTATT 


4020 


AGTACCAACT 


GATGAAGAGT 


TAATGATTGC 


CCGTGACGTA 


GAACGTCTAA 


AATAATTAAA 


4080 


C6AACACACC 


TTGTGAGATG 


CTCTCACAAG 


GTnTCTTAAT 


AGGTAAATAA 


GAAGAAACTG 


4140 


AGCCAACTAT 


CAAAAGTGAT 


AGTTGGCTCA 


nTTnCCTT 






4178 



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21252 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

CAAAATCAGC CACCTATGGC CAGTCTATCC TGCCAAAGCG TTTTTTATTC TTCATCGGTT 60 

GCTTTTTCTw GTTCACGATA JcTTtcTCaTT TyTTTGCTGA ATAATTCACC GTTACTAGGG 120 

GGCGTATATG TAATAGCATT CGCTCCGGCG ATAATGGTTT CATGAATACT CGATTCAGTT 180 

GGTCCTCCTG TTGCAATAAT CGGTAATTCT GGATACTTGC CACGGAAATA GCGTACGGTG 240 

GCTGCAGTGT CTTTTCCTGC ACTGATATTA ATCATATCCA CGCCAGCCGC AAGTTTTTCa 300 

TCAATTGGTG TATGaATAGA AGTTAcTGTG CTGATAATAG GAATATCGAT TGCTTCATCC 360 

ACCATTTTAA TTGTTTCAAT TGGTGTAGGA CCATTCAAAA CGACACCCAT TGAGCCTTGT 420 
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GCTTCTGCAA 


ATAAACTCAT 


ATAAGCAGAA- 


CGCATGCCTT 


GCGTTAAACC 


GCCGCCGACT 


480 


CCTGAAAAGA 


CAGGAATATC 


CGCTGCTTCA 


ATAATGCCTT 


TTGTAATCGC 


TGGATGCGGT 


540 


GTAAAAGGAT 


ATACAGCAAT 


CACTGCCTGT 


GCATCCGTAT 


TGCGGATAAT 


CGCAATATCT 


600 


GTCGTAAAAA 


TAATTGATTT 


TATTTTACGG 


CCAAAAATTT 


TAATGCCACT 


GGCTTGCCGA 


660 


ATAATCTCTG 


GCATTTTTAC 


AATATCTTGT 


TTCAGCGGCG 


TCATAATTTC 


TGGAATCCAC 


720 


TTTTCGCTCA 


TTGTCATCAT 


TCCTCCATGC 


TTAACGGTCG 


TATTTATTTA 


AATTGTATTC 


780 


TTCAATTACA 


TGAATGATCT 


TATCAGCGTA 


TGTAGGATCC 


GTTGCGTATC 


CAGCATCTTG 


840 


AAGCGCTTGT 


GCAGCCTCTT 


TATAATTTTT 


GGCTAACAAG 


ACTTTTTCAT 


ATAAATTTGG 


900 


ATCCCAATCG 


ACCCCATCAA 


CAAATAACTT 


GGTATGATCT 


CGCATGGATT 


CTTCCCAATT 


960 


GTCATAAACT 


CGAAAATCAC 


CCTGAATCGT 


AATCCAGACC 


TCATTGACAT 


ACTCTTTTGT 


1020 


TTCTAAATTG 


AsTTTTGGTT 


GATTCCCATA 


TGCTTTAATC 


CCAAAAAGAT 


TGTTATATTT 


1080 


CGATGCTAAT 


TGACTkTGTC 


CAAAATTTGA 


TTCTAAAACT 


GCTTGCCCCA 


TAATAATACT 


1140 


TGGTAATACC 


CCGTAAGATT 


CTTGTAATTC 


ACGTGCATaC 


GGTGTGaTTT 


CAGCAAcAAA 


1200 


TTCTTCTTTT 


GAGATGTTTT 


CATTTTGGCG 


ATTGACTGAC 


TCGCTTTGTC 


CCTTATCTGA 


1260 


ATCaGAAAGG 


ACAGATAACG 


AAAAGACAAA 


GGCCAAACCG 


ACGACAACTA 


CGCCTAAGAA 


1320 


TAACTTTGGC 


AAATTCAGCT 


GCTTTTTTCT 


TTTTTTCTGA 


TAaGTTGgCT 


CcACTTCATT 


1380 


AATTCCCCaC 


TTCaTCTTTT 


aTTTTCCgAt 


AAAATGTTTc 


aCTAtCTTTc 


ACTTTcgtGA 


1440 


tTGatTTCaT 


TTGtTaCTAt 


aACCGAATAA 


TTGTATCATT 


TTCACAGAAA 


ATTATCAACT 


1500 


TTCATCACTG 


CTTACGCTCG 


TGATTTCAGT 


ACTTGATTCA 


GTGGTAGCAG 


TTGAAGAAAC 


1560 


TGTACTACTT 


TCAGAAATTT 


CAGTGGATGA 


TTTCACTGTT 


GTTTCAGTTG 


ATGTGTGAAC 


1620 


AGATTCTTCA 


ACTGGCTTTT 


TGCCAAAAAA 


GACAAAAATT 


ACCGCACCAA 


ACATAATAAT 


1680 


CACAGATAAA 


ATCCCAGCAA 


TTGTGTATTT 


CAAAAGACCA 


TCCTTCCTAT 


TCGTTATTCA 


1740 


CCTACGTAAG 


TATCCATCAG 


ATTTTCATTG 


ACCCAATCTA 


ATAATTCAAC 


TTTTGAACCT 


1800 


TCCAATTGTC 


GTTGAATCGT 


AGCTGGTAAC 


CGGAATGTAA 


TAATATCATC 


AAAACCCCAT 


1860 


TCACCATAGG 


CCAGCGTCAC 


TTTCAAATCA 


ATATAACCAG 


CAGCAATTTC 


TTGTTCTTGT 


1920 


TTATTGTTAG 


GAATTAATTC 


TACAACGCCC 


AAATCAGATT 


GTAGCCATTT 


TTTGGCAATT 


1980 


GTTGTTTTTA 


ACCGTTTATA 


TTCCGAAACA 


GCTAGGTTAC 


GTTTTTCCGG 


AAATAAGATG 


2040 


GTTTGGCGGA 


GTATTTTTTG 


AGTCAGCATC 


CACTCATAAT 


CCGTAAACAA 


ATTCTTAATT 


2100 


TTTTGCATGA 


CTTCATTGTC 


CAAACCAAAT 


TGGCCTTCTT 


TCCAAGCTTG 


CCAATCGGCT 


2160 


GCTTGAATTT 


GATAAGCGGA 


GTAAAAATTT 


TCTTCTGTTT 


GATACTGTTC 


AATAATTGCG 


2220 


TCAACAATAA 


TATCTACATA 


AACTTGaTGC 


ATAATCATTC 


GCCTCCATTT 


TCTATTTTAC 


2280 


TGAAATTCTC 


TAAAAAAAGC 


ATCTTTTTAT 


CTCTGGaTTT 


TATCATATTT 


ACAAAATTGT 


2340 


CCTTGTTTTC 


TAATTTCTCT 


TGTTTACAGT 


ATAATAAATA 


ATGAAGAAGA 


TTTGAAGAAA 


2400 



wo 98/50555 



PCT/US98/08985 



617 



TTGCATAGAT 


TAGGAGGACT 


TCTTTTGACA 


AAAAAATTAT 


ATCAATGGGG 


CATCGTAGGA 


2460 


TTAGGTGGAA 


TTGCCcACGA 


ATTTGCTTCA 


ACGTTTAAAC 


AAGAAACTAs 


CCAaCTTGCA 


2520 


GCTGTTGCTT 


CACGGACATT 


GTCTAAAGCA 


GAACAATTTG 


CCGCTGATTA 


TTCAATCCCT 


2580 


AAAGCCTATG 


GTTCTTATGA 


AgAAtTGCTA 


gCaGATGaAA 


CAATCGATGT 


TGtCTATATA 


2640 


GCAGTGCCAA 


ATAAACAACA 


TGCTGAACAC 


ATTTTAAAAG 


CGCTCCAGGC 


CGGAAAACAT 


2700 


GTGTTGTGTG 


AAAAAGCTAT 


TACGATGAAC 


GCGGCAGAAT 


TAAAACAAGC 


CTTGGCAGTT 


2760 


GCAGAAAAAA 


ATGACCTTAT 


TTTAGCAGAA 


GCCATGACTA 


TTTTTAACAT 


GCCACTATAC 


2820 


GACGAATTAC 


GTCGCTTGAT 


GGATGCTGGC 


AAATTTGGTC 


AGTTAAAAAT 


GATTCAAGCA 


2880 


CCATTTGGAA 


6TTACAAAGA 


ACCCGATCCT 


AAAAATCGCT 


TTTTTAATCC 


TGACTTAGCA 


2940 


GGCGGCGCAT 


TGTTAGATAT 


TGGTACTTAT 


GCGGTTTCGT 


TTGCTCGTTG 


GTTTCTCAGC 


3000 


TCACAGCCAG 


ACGTTTTAGC 


TTCTAGCATG 


GTTCCCTTCT 


CTACTGGCGT 


TGATGAACAA 


3060 


TCGGTGACAT 


TGCTTCGGAA 


TAAAGAAAAT 


GAGTTGGCTA 


CTATTTCATT 


AACATTCCAA 


3120 


GCAAAAATGC 


CAAAGCAAGG 


AATTGTTGCT 


TTTGAAGATG 


CCTATATTAC 


CATTACAGAT 


3180 


TATCCACGCG 


CAGATGAAGC 


GATTGTTCAC 


TACAACGACG 


GTACGGTTGA 


GACCATCGTT 


3240 


TCTGGGTACT 


CAGCTGAAGC 


CATGAATTAC 


GAAATCTACA 


ATATGGTTGA 


AGCAATCAAA 


3300 


GGTACACAAC 


CCAATCGCTC 


CTTATTTTTC 


ACGACAGAAG 


TCATTGATAT 


TTTGGACCAG 


3360 


ATGCAAAAAT 


TATGGCAGAA 


TTAAAAAAAA 


GAACTTCGGC 


TTTTGCTGAA 


GTTCTTTTTT 


3420 


TTATTTGTTC 


TCTATGGATA 


AATACGGTTT 


AACATACGTG 


GGAATGGAAT 


CGCTTCACGA 


3480 


ATGTGTTCAT 


TACCAGTAAT 


GAACGTAACA 


GCACGTTCTA 


AACCTAATCC 


GAAGCCAGAA 


3540 


TGTGGCACAG 


AACCGTATTT 


ACGTAAATCT 


AAATACCAGC 


TGTACTCTTC 


TTCTGATAAG 


3600 


CCAAACTCAG 


CTACTTTTTC 


TTTCAAATAA 


TCATAATCAG 


TTGCACGCTC 


TGAACCACCG 


3660 


ATAATTTCAC 


CATAACCTTC 


TGGTGCAATT 


AAATCGGCAC 


AAATTACAAC 


ATCTTCACGT 


3720 


GTCGGATGTG 


GTTTCATATA 


GAAGGCTTTA 


ATTGCTTTTG 


GATAATTTAA 


AATGAATGTT 


3780 


GGAACGCCAT 


AGTAATTTGA 


AATAAAGGTT 


TCATGTGGGC 


TACCAAAGTC 


TTCGCCCCAT 


3840 


TCAATGTGAT 


CATAATCATT 


GTTTGCTTCT 


TCTTTTTGTA 


ATAACTCAAT 


CGCATCATCG 


3900 


TAAGTAATAC 


GTTTGAATGG 


CTCCGAAACA 


TATTTTTCCA 


GTAACGCTGT 


ATCACGTTCT 


3960 


AAAACATCCA 


ATGGATATTG 


TTGATTTTTC 


AATACCGATT 


TAATTAAATG 


TTTTACGTAG 


4020 


GCTTCTTGAA 


TATCTAGTGA 


TTCTTCATGA 


GTTGTGTATG 


CCATTTCTGG 


TTCAATCATC 


4080 


CAGAATTCCG 


TTAAGTGACG 


ACGAGTTTTG 


GATTTTTCCG 


CACGGAAAGT 


TGGTCCAAAA 


4140 


GTAAAGACTT 


TTCCAAATGC 


CATCGCACCA 


GCTTCTGCAT 


ATAATTGACC 


TGTTTGTGAT 


4200 


AAGAATGCTG 


GTTGTCCGAA 


GTAATCTGTT 


TCAAATAATT 


CTGTTGTGCC 


TTCTGGTGCT 


4260 


GAGCCTGTTA 


AAATTGGGCT 


ATCAATTTTA 


ATGAATCCTT 


GTTCGTTAAA 


GAACTCGTAA 


4320 


GTAGCACGAA 


TAATTTCGTT 


ACGAACTAAC 


ATGATCGCAT 


GTTGTTTGCT 


TGAACGTAAC 


4380 
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CATAAATGAC GATGATCCAT 
TCTTCTGAAG CGCCAACGAC 
CGGCTATCTT CTTTGATAAC 
ATTTCTTGGA ATTTTTCTGT 
AACACGACAC CTTGGAAAAA 
CTTGAGCGTT TGTTAGCAAT 
GAATCAATAA TTTGAATTTG 
TTAGGCGTTG TGCTGAATTC 
TTCTAACGTC ATT7VAATCTG 
CTCTCCCGTT ACCAGAGCAA 
GTTATCATAC CCACAAATTT 
TTCAGGTTTT ATTAGCGAAA 
GCGTTTTTCA AAGGCCTGaC 
AATTGCCGCA TATTGGCTAA 
CATTGCTTGA ATAATTTTTT 
ATATGtTTTG GAAACACCAT 
TGCAATTGGT GTGAACCGAT 
AATGAGGATA TTATGAGCAA 
AATAACACCA GTTGGGTTAG 
TACAGCTTCT AATTGTGCGA 
TACTGGCTTG CCTTCTGCTA 
GATAATCAcT TCGTCCCCAA 
GGCACCATCT GTCACAATAA 
ATAAGAGACA ATTGCTTGCT 
TCTTCCATCC TCAATTGAAG 
TGGTTCCCCA ACCGTTAAGC 
GGCGGcAGCT GcTAATGTAA 
ACACCTTTCA CTCCCACTTT 
ATCTCTTCGC CTTTATCAAA 
GTGACCACTT CCCATGCAGG 
TCATCGGGAT GGGCTTTAGC 
TGATTTAAGA CCGTGACTTT 
TTATCATTTT TGCCGGTTAA 



618 



T/UVGAAATCT 


GTGCCGTGTT 


CTTTCGGCGT 


AATTGGGTAA 


. 4440 


TTCTAAATCA 


GTGATATCCA 


TTTCATAGCC 


GAATTTAGAA 


4500 


ACCAGTAACC 


ATGACTGCTG 


TTTCTTGkGA 


TAAATGTTTA 


4560 


ACCAGCTTCT 


TCACCAAAGG 


CTTCAATGAA 


ATTTGGTTTG 


4620 


GGCTGTCCCG 


TCACGTAATT 


GTAAAAAGGC 


AATTTTTCCG 


4680 


CCAAGCGCCA 


ATTTTTACTG 


TTTCTCCAAC 


ATGTTTGTTT 


4740 


TTCCACGTTT 


GTCTCTCCGT 


TCATTTcTTA 


GTTTATTCTT 


4800 


GTTTCTTTTC 


TATAAAGTCA 


TTCATTCGCT 


CAATCGCTTT 


4860 


TCGCATAGCT 


CATCCGCACA 


TGTTCTGGTG 


CACCAAAGCC 


4920 


CATGCGCTTC 


TTGTAATAAA 


TCTTCCACCC 


AGTTAGTCAC 


4980 


CTAACGTTTT 


TTTGACATTC 


GGGAATAGAT 


AAAAGGCGCC 


5040 


CGCCTGGCAA 


AGCTGCTACT 


TTTGGATAGA 


TATGATTTAG 


5100 


GCATGTCTTC 


AACTGTCGAC 


TGTTCACCTG 


TTAAAGCCTC 


5160 


CTGCGACTGG 


ATTACTGGTG 


GATTGTGAAG 


CGAGTTGTGT 


5220 


CATCX:CCGAC 


TGCAAAACCA 


ATGCGCCAGC 


CTGTCATTGC 


5280 


TAATAATGAT 


TGTCTGTTGA 


CGAATtGCTT 


CTGAGATGGT 


5340 


GTCCATTGTA 


CACCAAACGA 


CCATAAATAT 


CATCTGCAAT 


5400 


CTGCCCATTC 


ACCAATTTGG 


CGTAATTCTT 


CTTCCGTATA 


5460 


ATGGTGAATT 


TAAAATGATT 


GCTTTTGTTT 


TGTCCGTTCG 


5520 


CTGACACTTT 


AAATGATTGC 


TCTTGCGTGC 


TTGATACAAA 


5580 


ATTTGACTTG 


TTCACCATAA 


CTTACCCAAT 


AAGGGACAGG 


5640 


CGTTCAAAAT 


GGCTTGAAAT 


AATAGATACA 


GCGCATATTT 


5700 


CTTGCTTGGG 


TTGATAACGA 


AgTTGATATT 


CTCGTTCCAC 


5760 


TTAATTCAGG 


AATGCCGCCA 


GAAGGTGTGT 


AATAACTTGC 


5820 


CAATTGCTGC 


TTTTTGAATA 


TTTTTCGGGG 


TTACAAAATC 


5880. 


TTAAAACATC 


TACACCTTTT 


GCTTTTAATG 


CTTTCGCCTT 


5940 


CAGAaGGAGT 


CAATTGCTGT 


GCTCGATTTG 


ATACGTTCAT 


6000 


CATCAAAACC 


TATCTAAAtT 


AAATATCTTT 


GACAACGTTG 


6060 


GCTCATCAGA 


TAATACGTCA 


GTACTCCTTG 


ATCATTTTCG 


6120 


GCGCTTATCA 


TAAAGCCCTA 


ACGTAGCTTT 


CATCACCGAT 


6180 


AACAATGTCT 


GTTATTTGTT 


GTTCAGTTTG 


ACCATCTTTT 


6240 


TTCTCCCGAT 


TTAGGGACAA 


TCACGACAAT 


CTCTGTGCCT 


6300 


AGTAAAATAC 


GTATTTTTGC 


GTGTAAACCA 


GTAAAATTTA 


6360 
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TCAACTGTTT 


CTAAATTAGC 


AGAAGTTTTG 


GCAATTGCAG 


TTGCTTCTTT 


TTTGGCTTGG 


6420 


GcCATTGGTT 


GATTGGtTCG 


TACaAAAAAG 


ATAATTATTG 


TTACAAGAAT 


TGCTAAAAGG 


6480 


ATTGCCAATA 


GGGTTAAAAG 


GACAGTAATC 


CGTTTCTTCT 


CTTTCTTTTC 


TTCTAATTCC 


6540 


TCCAAAAATC 


CTTCCATCCT 


CTCTAAACTT 


ACTGGGTCTT 


ATTATAGCAG 


AAAAATAGCG 


6600 


GCCTTTCACA 


AATATGACCT 


AAAAATAATC 


GATTATTTGT 


TTAAAAATTC 


CTTTAATTCT 


6660 


TCTAAAATTT 


CAGGCATGGG 


CGCTTCTTTT 


ACTGAGAGTC 


CCTTGGGAAA 


CGCTTTGATT 


6720 


AATCGTTTGC 


CATATTTTGC 


AGTGACTAAA 


CGACGATCCA 


GAATCAGCAA 


TGCCCCTCTG 


6780 


TCTTCTTCAG 


AACGGATTAA 


ACGGCCCAGT 


GCTTGTCGCA 


AACGTAAGGC 


TGCTTTCGGT 


6840 


ACCGCTTCTT 


GAGCAAATGG 


TTGTATCCCT 


TCTGCTTCTA 


AATACGCATA 


TTTTGCTTGC 


6900 


ACAAGTGGTC 


GTTTCGGGTT 


TTCAAAAGGC 


AAACGCGTGA 


CTATCAATAA 


CTGTAAAGCT 


6960 


GTTCCTGGTA 


AGTCCACACC 


TTCCCAAAAG 


CTATCGGCTC 


CTAGCAAAAC 


ATTTGGTTTT 


7020 


CCATTAGAGA 


ACCGTTTAAG 


AATTTTTTCA 


CGACTGCCGC 


TAATCCCTTG 


GGCGAGGACC 


7080 


TCTTGTCCTT 


TTtCCAATAA 


TCTTGGCTGG 


ACTTGATGAT 


ACACGGTGGA 


CAAT7ATTCA 


7140 


TGCGAAGTAA 


7UUVGAACCAA 


CATCGAACGT 


TCCTCTTGCG 


CCATCAATTT 


TTCAATGGTA 


7200 


TCTGAAATGT 


AcTGACCCAT 


tTCATAGGmA 


GAGGCATTTT 


GAATGGCTAC 


TGCTTCTTCC 


7260 


GGTACATAGA 


CTCGCGCATT 


TTCTGCATAG 


TTATAAGGCG 


TTGGCAGGGT 


TTTGAACGTC 


7320 


ACCTCTGTTA 


AACCTAACCG 


TTGCGGTAAA 


TACTTTTTAT 


TGTTGCCAAA 


CTTCAATGTG 


7380 


CCGCTTGTAT 


AAATAATTCG 


TTCATACCGC 


GGATACCACT 


TTGTTTCAGG 


AAGCAAACTA 


7440 


GCTTCTAAGT 


CATGAATTTC 


AATCAAGCCA 


TGGTCTTCAT 


TTTGTGGTAA 


ATATTCTTTT 


7500 


ACCCAACGTG 


GACGCCAATC 


ATCAAGATAA 


ATGGTAAAAC 


AGTGATGAAT 


ACGTTCAATT 


7560 


TCATCAAAAA 


ATTGAAACAA 


GCGAACAAAA 


ACAATGCGAT 


CAGCCAAACT 


AAAGGTTTCC 


7620 


AACTGACTCA 


TAATTGTTTG 


CTGTAACTTA 


CTTTGAATAG 


TTAACAGCTC 


CTGCAGTAAA 


7680 


AGCTGGAGCG 


TTTGACTATG 


TTTTTCTCCC 


GCAGCAGATA 


AATAATCAAT 


ACGTTCTTTG 


7740 


GTTAAAAGCA 


ACTCCTGATG 


GTCTATCCGC 


TTCTGTGTTT 


TACTAAATAA 


ACGATGCAAT 


7800 


TCAAAAAGAA 


AATCTTGCCA 


TTCATCCACA AAATCCAGTA 


AACCTTGCTG 


ATAAATTTTT 


7860 


ACAAACCGCT 


GCTCTTCAGA 


AAACTGTTGA 7\AAATTTGCT 


GTAAAGCAAC 


AAAAAGTTGT 


7920 


TCTTTTTCAA 


TAAAATGCGT 


GACTTGTTTT 


TTTAGTATCA 


CACTGTTCAC 


TCGTTCGCTC 


7980 


GCCATCCGAC 


TAGCGATGTC 


TGGTAAATGA 


TGTGCTTCAT 


CAATAATTAG 


CCACGGACTT 


8040 


TCAGGTAATA 


ACGGTTGTTT 


TCTTAGCGAC 


TCTTGCGCCA 


AAAATGCATG 


ATTAACAATC 


8100 


AACACGTTAC 


TTTGAGCGAC 


TTTTTTATAC 


AGATGTCTTA 


AAAAGTCAAC 


TTCATAAAGT 


8160 


TCATCTTTTT 


TTGAAAGAAA 


ATCCAATCCG 


CGATGTGTAA 


CTTCTTTCCA 


GAATAAATGA 


8220 


TTTAAATTCG 


TCAATTGTAG 


TTCATCTAAA 


TCGCCCGTTT 


CTGTTTCTAG 


CAACCAGACG 


8280 


AGAACGCCCA 


TTTGATAAAG 


AGCGTATTGT 


TTTTGCGGAG 


TTGGCTGATT 


CAACGTTGCC 


8340 
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TTAAAGCATT 


GTAAATCAAG 


ATAGTGTCGA 


TGGCTTTTAA 


TAACAATGCC 


TCGTAATTTC 


8400 


CCTTGGCAGA 


TTTGATTGGC 


TAATGGTAAA 


TCTTTTTCCA 


CTAATTGATT 


TTGTAAAACA 


8460 


ATCGAAACTG 


TTGAAATGAT 


GACTGGTTTT 


tCTGGCGTTG 


CTAAATAGCT 


CATCGGCAAC 


8520 


AGATAGCCTA 


ATGTTTTTCC 


CGTTCCAGTC 


GCCGCTTCAA 


TAAATAAATC 


TTTTGTCGTT 


8580 


CCTTCTGTAA 


AATGATCATA 


GACTAAATTC 


ATCATCCGAC 


TCTGTTCCGC 


ACGATAAGCG 


8640 


AAACGTTCGC 


CAAACAGTTT 


TTCTTTGGCT 


TTTTTAGTTT 


TGGGATACGT 


GCTAGTCTGA 


8700 


TAGAACGTTT 


CTTCAAAAAG 


AGGAACTTCT 


TTTTTGCGTA 


AAGCAATGCC 


ACTGACCACT 


8760 


TGATAAGCTG 


GATTTAArGG 


TGTGACTTGT 


TTTTTCATTT 


GTTCATACGT 


TTGTTGAATA 


8820 


AAGGCACTGG 


TTTCCCGCGC 


TGTCTGTTGA 


CTTAATTCCG 


CAATTTTTTC 


CATAGTGACT 


8880 


AATGGTAAAG 


ATTTCATTTT 


TTCTTGAATT 


AATAACAAAA 


GTTCgGCTGT 


AACCTGCGCA 


8940 


TCACTATCTG 


CTTGATGGGG 


ATTCTCATGA 


GACAAACCTA 


AGCTTTCCGA 


CAAATCACTT 


9000 


AAACGGA7UVC 


TTTTTTCTGT 


CGGTAAAAAA 


ATCTGTGCCA 


ATTCCACTGT 


ATCAATGGCT 


9060 


GGAATAGTAA 


GTGGCGGCGT 


GCCACACCGT 


ACCAGTTCAC 


GAGCTAAAAA 


ATTATAATCA 


9120 


AAGTGGACGT 


TATGCGCCAC 


AAAAATAGTA 


TCTTCCAGAT 


AATGATAAAT 


TGTATGTGCT 


9180 


ACATCTTCAA 


AATAAGGCGC 


TTTTTGAACT 


TGCGTATTAC 


TAATCCCCGT 


TAAACTTTGA 


9240 


ATTTGTTTTG 


GTACTACTTG 


ATTGGGATTC 


ACATCTGTCG 


CAAAGTTCGC 


AATAATCTTT 


9300 


CCATCTTGAA 


CGAGAACGCA 


TCCAAACTGA 


ATGATCCGAT 


CACTTGTAGG 


ATCTGTTCCA 


9360 


GTCGTTTCAA 


GGTCAACTAC 


TGCATACACT 


TGTTTGTCTT 


TCAATGACTT 


CACGCCTTTC 


9420 


ATTCATCTTT 


GAAAATTATA 


CTACAATTTA 


TAGGATTTTC 


ACTAGTATTT 


CAAAAATTCT 


9480 


CTTGAAATCT 


TTTCCAGAGA 


TAATAAGAAA 


GrGCCTGAAA 


CAAAAATCCA 


AAGTGATTTT 


9540 


TGTTTCAGGC 


yTAAAAGCTG 


ATAAATTGAA 


CGGTTTTGTC 


TTACCTTCTC 


CTAAATAAAC 


9600 


GATTAATAAC 


AGTCAAGGAT 


TTTCTGCATA 


TACTGGTTTA 


ATTGCCCATA 


GCCTTCCAAT 


9660 


TCTCTGATTA 


CTGGGAGCGG 


ACAACGAACC 


TTTTTATTAA 


ATAGTGTCCG 


CTTTTCAACA 


9720 


TATTCTGTAA 


tTCGTCTAAA 


AATTTATCTA 


ATCCTTTTTC 


TAATTTCATA 


CAACTTAACT 


9780 


GATTGCAAAT 


TCGCACGTCT 


CCTGCCTTTT 


TCTTTTGAAC 


TTGTTTAAAT 


GATTGTGAGA 


9840 


ATGCTTTTGC 


TTCAAACATT 


TGTGCTTCTA 


ACTGTCTTTT 


TTCAATAGTT 


ATTTTCGGTA 


9900 


CTGCTTGATT 


TCCATCAGTA 


TCCTCTAAAT 


GGCGACTCAT 


AAAGCGACGA 


TAAATTTCCT 


9960 


CTTCTAATTC 


GTCCAATTGT 


CGTTGAAAGT 


CCGCAGGAAT 


TGTCCGACTT 


GATGAATTTT 


10020 


CTTTCTCACT 


TTCCATAGA6 


ACAGCTTCTT 


TTTCTGGCTT 


CTGCTCCTCT 


TCTTTTTCGG 


10080 


CAGACTTATT 


CACTGAATTT 


AGCGAGCTTT 


TTTCTGTTTC 


TTGTGGAAAA 


GTAAATAAaG 


10140 


GTTTTGCCGC 


GGTACTCTGC 


TGGTTTTCTT 


GTTTTTTTTC 


TGGCAGCGCT 


TCTTTAATTG 


10200 


GCGCTGGTTC 


ACTTGTTGCT 


ACAGTTGTTT 


CACCTTCTTG 


ATTTTCCCGA 


TTATCAGGCG 


10260 


AGAGATTGAC 


GTCTGGTTGT 


TCCAGATTAG 


TATCCTCTGA 


TACTTTCAGC 


GATGCCTGTT 


10320 
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CTGCGGCTAT 


CACAGGAGAA 


GCTCTTTTCA 


AATGGATCGT 


TTTTATTTGT 


CTAGAAATGA 


10380 


CGGGATTTCT 


CGTTTGATTA 


ACGGCTTCAT 


TTAAGGAGGC 


TAAATTTTCT 


TTTAAAAGTG 


10440 


TCAACATTTC 


ACGTAAGACA 


GCAATTTCAC 


CAGCTAATTC 


CGTTAATTGA 


TTTTCTTGCG 


10500 


GCACTACTTG 


CACATTGGTT 


TCATTCATAC 


GGACTGTTTT 


TTTATTTTTA 


GTtGTCGGTA 


10560 


TTTCATACAT 


CCATTGTTTC 


AGTGCGGATC 


GTAAGTCACG 


AGTTGTCGGC 


TCAAAcTTCC 


10620 


AAAATAAACA 


GTGTAGTACA 


TTAcTCCGTT 


TCAACATAGA 


CATAAATCTT 


TTTAACAATT 


10680 


AACTCTTCAA 


ACTGTTCTGA 


TTCATCCCAA 


CTTTCGCATA 


AAGCCTCAAA 


AAATTGTTCC 


10740 


CCTTCTGCTA 


AATTTTGATA 


CTCTTTCGCT 


AAATAAAGCT 


TGCGrATTAA 


GTAAAAaTTA 


10800 


TACGGTGTTT 


CAACATTATC 


TGCATAAATT 


TCTCGTCCTA 


AAAAATCTGG 


ATTCGTTTCT 


10860 


TTACAAAGTUV 


AGACCGTTTT 


CTCTTCTGGT 


TGTGTTTCAA 


GTGTTTTAGT 


CATGAGATTC 


10920 


ACTTCCTTTT 


TCTCCGTTAA 


TAACCATTTT 


TTAATGATAG 


ATACCTTTAG 


TATACAGGAG 


10980 


AAATTCTTCT 


GATTTTCTCA 


TATTTTTAAG 


AGACAATAGA 


AATTTTAGAG 


AAAAATCGAC 


11040 


AGAAAAAAAA 


CGTGAGATAA 


ACAGTCCTAT 


TGACCATTTA 


TCTCACGTAA 


AATACTTAAA 


11100 


CTGACTATTT 


TATCCTTGGA 


CATGTAACCA 


ATTATTCTGC 


TGAATCAATT 


GGCAAAATCG 


11160 


TGTCTTTTTT 


CTCGCGACGT 


GCAATTGCTG 


GTAAAATCAT 


TCCTAATAGA 


ATTAAGACGA 


11220 


TTGGTGTCAC 


AATGTTAGAA 


GCTAATTGGA 


ACAACCAGCC 


TTGAGGGTCT 


GAGGCATACT 


11280 


CAAATTTAGG 


CACCATTCCT 


AGGATACAAG 


CAAAGGCTGT 


AAATAGGAAA 


CACCAAAGAC 


11340 


CAAACACAAA 


ACCAATTTTA 


GGATTTTTAA 


CAAATTTATA 


TTCTGAATTA 


AATTGTTTGT 


11400 


ATGCTTTGTT 


TAGAAGCATA 


TAGGCAAAGA 


AGACCCACAA 


GTAACGCATT 


GGCATAACTA 


11460 


CAGAATTTAA 


GTTAGTTAGC 


CATTTAACCA 


CTTCGTTCAT 


GTCCTTCAAC 


CCAAACATTG 


11520 


GTAAAGCGAT 


AATAATACTT 


ACTAAAATAC 


CTGTTAATAA 


GTAACCATTA 


ATTAACGTAC 


11580 


CTTTTTTCGT 


ACGTTTACGT 


AACCAACTTG 


GAACAAATTC 


TGGATCTGCT 


TCAGCTAAAA 


11640 


GAATTTTCAG 


TGGAGCATCA 


ATTGAAAAGG 


CTAGCGCAgc 


CACTTGACCC 


ACTGTATTCG 


11700 


TCAAAGCATA 


AACAATCATT 


AAGAGATTAC 


CAACACCATA 


ATATGCGCCT 


AATTTTTGGA _ 


11760 


AGGCACTGTA 


GGCACCATTT 


GCCATTAAAT 


CTTTAGGAAT 


ATTGTCACTT 


GAGAATAACA 


11820 


TGCCCATTGC 


AAATGAACCT 


AAAACAGCCG 


ATAAACCAAC 


CATTGCCGCT 


AGGAAAATCA 


11880 


TTCCTTTAGG 


GAACTCTTTG 


GCTGGATTTT 


TAGTTGCATT 


TACGTAAGGA 


GAAATTTTTT 


11940 


CAGCGCCACC 


TACCGCAAAA 


ACTAACATCG 


ATACGGTTGT 


AAAATAACTA 


AAGTCAAATT 


12000 


TAGGAATATA 


GGTGCTGACT 


TGTGACATGT 


TTGGCGTCGC 


AAATTCAACC 


GTTGAATTAA 


12060 


TAAATG6TGC 


GCCTACCGCT 


AATAAAATGA 


ATAAAATAGA 


CATCACAAAC 


ATGGCGGTTC 


12120 


CAGCTAAACC 


ACCAATGACT 


TTTAAGGTGG 


TTAACCCTTT 


TGTTGATAAT 


AACAAGAAAA 


12180 


CCAAGAAAAT 


TATTAATGAA 


ATAAATGATA 


CCGTTACGAT 


TGGCATCGTA 


TTGACTAGGT 


12240 


TTCCATTCCC 


TTGAACTGCC 


CAACCTAAGG 


CAATTAAAAT 


TGCTTGTGGT 


TTTTGTGCCA 


12300 
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AGTAAGGAAT 


GTGCACTACC 


CAATAAGTCC 


ATGCTGCATA 


ATAGGCTAGA 


CGTTTAGTAC 


12360 


TGGTATTTTG 


CACCCACGAA 


CTTUVCTCCAC 


CATTACTGTC 


TTTAAATGTT 


GAACCTAACT 


12420 


GTCCTACGAT 


TAACGCATAG 


GGAATAAAAT 


ATAA7UVCTAA 


AATCAAAAGC 


CATGAAGTGA 


12480 


CCACTGAAAT 


CCCTTGTTGT 


GCATAATTAT 


TCACAACATT 


TCCAAGACCC 


CAGACCATGT 


12540 


TAAAGGCAAT 


TAACCCGACG 


GTGAACCATC 


GTAATTGCTT 


TTTGTCCATA 


TAACCAATTC 


12600 


CTTTCTTATT 


AAAAAAAATA 


AAGCTCTTCA 


TTTATCATAA 


GCATTTTACT 


TTTTATGCAC 


12660 


AAGCTCAACA 


TAATTAAATA 


AAGATGGCTT 


TTTAATAAGT 


AATTAAAACG 


CTTCTAGCAA 


12720 


TGATTTTCTA 


ATTTATTTGA 


TGACTAAAAA 


AGGCCGGTGA 


ATAAGTTGCT 


TCACCAGCCT 


12780 


TTTTCTAAGA 


GATTTTTCTT 


TTATAACATA 


TCAAATAATG 


ATAATTGGTT 


TTCATCTGGC 


12840 


AAGTCTrtTA 


AGACACCATT 


TTCATTCATG 


TATTCAATTA 


ACGTTTTAGA 


AACTTTGCCT 


12900 


CGTGTTGCCA 


AATCTTCTTT 


CGATAAGAAC 


GGTTGCTCTT 


CTCGCGCAGc 


AACAATGGCT 


12960 


TTCGCTACGT 


TGA6GCCTAA 


ACTTGGCACT 


GCTCGGAAAG 


GTGCAATCAA 


GGTATCCCCT 


13020 


TCAATCACAA 


AATTTTCAGC 


ATCT6ATTTG 


TACAAATCAA 


TCATGCCAAA 


TTTATAGCCC 


13080 


CGTTCCAACA 


TTTCATTACA 


TAATTCTAAA 


ACCGTTAATA 


AGTTTTTCTC 


TTTAGTGGAG 


13140 


GCGTCCATCC 


CTTTGtCCGT 


AATTTCTTTC 


ATTCGTGCTT 


TCACAGCTTC 


TTTTCCTTGT 


13200 


GACATTGCCA 


CTAAGTCGAA 


GTCATCCGCC 


CGAACGGAGA 


AAAACGCACA 


GTAATATAAG 


13260 


ATTGGGAAAT 


AAACTTTAAA 


GTAAGCGACC 


CGCAACGCCA 


TTAATACGTA 


TGCTGCCGCA 


13320 


TGGGCTTTCG 


GGAACATATA 


TTTAATTTTA 


GAACAAGAAT 


CAATATACCA 


GTCAGGGACG 


13380 


TTGTTTTCTT 


TCATCGCATT 


TAAATACGTA 


TCCCGTAGTT 


CATCAGGAAT 


CTTATTCCAT 


13440 


TGTCCTTTAC 


GTACCGTTTC 


CATAATCTTA 


AAGGCCATCC 


CACTGTCCAA 


ACCAGCATGA 


13500 


ATTAAATAAA 


CCATGATATC 


ATCCCGACAA 


CCAATTACTT 


CAGCAAGATT 


GgctTCGCCA 


13560 


CGACGAATCA 


ATTCTTCTGC 


ATTCCCTAGC 


CACACATCGG 


TTCCGTGAGA 


CAGTCCAGAA 


13620 


ATTTGAAGTA 


ATTCTGCAAA 


AGTTGATGGA 


TGGGTCTGTT 


CTAACATACC 


TCGAACGAAA 


13680 


CGTGTCCCAA 


ATTCAGGAAT 


TCCTAAAGTT 


CCTGTTTTAG 


AATAAATTTG 


CTCTGCATCC 


13740 


ACACCTAGAA 


CATCAGGTCC 


TTCAAAAATA 


CGCATCACTT 


CTGGATCGTC 


TGTTGGGATG 


13800 


GTTTTTGGCT 


CAATCCCAGA 


TAAATCTTGT 


AACATCCGAA 


TCACGGTCGG 


ATCATCATGT 


13860 


CCTAGAATAT 


CAAGTTTTAA 


AATATTATCG 


TGAATGGAAT 


GGAAATCAAA 


GTGCGTCGTT 


13920 


TTCCATTCTG 


CCTCTTGATC 


ATCAGCAGGG 


TATTGAATTG 


GCGTAAAGTC 


ATAAACATCC 


13980 


ATGTAATCAG 


GAATTACAAT 


AATACCGCCG 


GGATGCTGCC 


CAGTCGTCCG 


TTTAACCCCT 


14040 


GTTGAACCTT 


TGGCTAAGCG 


ATCAATTTCA 


GCACCACGCA 


AATGTAAATT 


ATGATCTCTT 


14100 


TCATATCCTT 


TAACAAAGCC 


ATACGCTGTT 


TTGTCGGCTA 


CTGTACCAAT 


GGTTCCTGCC 


14160 


CGGTAGACAT 


ATTCTTCACC 


AAAAAG7VACT 


TTCGTGTAGT 


TATGGGCTTC 


GGCTTGATAA 


14220 


TCGCCAGAAA 


AGTTCAAATC 


GATATCGGGA 


ACTTTGTCGC 


CATGGAACCC 


TAAGAACGTT 


14280 



wo 98/50555 



PCTAJS98/08985 



623 



TCAAAAGGAA 


TATCGTGACC 


ATCTTTAAAC 


AATCGAGCGC 


CACATTTTGG 


ACAGGCTTTT 


.1434.0 


TCAGGCATAT 


CAAACCCTGA 


ACCATAGGAA 


CCATCTTCAT 


AGAACTCAGA 


ATATTGACAC 


14400 


TCAGGACAAT 


AATAATGTGG 


CGCTAAAGGA 


TTTACTTCAG 


TAATCCCTGT 


CATTGTCGCA 


14460 


ACAAAACTCG 


AACCGACAGA 


GCCCCGTGAA 


CCAACTAAAT 


AGCCATCTTC 


ATTACTTTTA 


14520 


TGCACCAGTT 


TTTGAGAAAT 


TAAATA/VATG 


ACCGAGAAAC 


CATTCCCGTT 


AATCGAATTC 


14580 


AGTTCCTTTT 


CTAACCGTTT 


TTCGACAATT 


TCTGGTAAAG 


GATCGCCATA 


CAATTCGCGC 


14640 


GCACGATTGT 


AGCTTAAATC 


AGTGATCTCT 


TGTTCAGACC 


CAGGAATTTT 


CGGCGTATAC 


14700 


AAATCGTCTT 


TTACTGGAAT 


AACTTCTTCA 


CATAAATCAG 


CGACTTTGTT 


GGGATTTTCT 


14760 


ACTACAATTC 


GCTTAGCGAT 


GTCTTGTCCT 


AAAAATTGAA 


ATTCCGTTAA 


CATTTCATCG 


14820 


GTGGTTCTGA 


AATGAACCTT 


TGGTAAACTA 


TGGCGATTTA 


ACGGATTAGC 


GCCGCCCATA 


14880 


GACCCGACTA 


AAATTTTCCG 


ATAGATGGCA 


TCTTCTTCAT 


TTAAATAATG 


AACATTTCCA 


14940 


GTGGCGACAA 


CCAGCTTACC 


AAGTTCGTCA 


CCGATTTTCA 


CCAGATTACT 


AATAATTTCT 


15000 


TCCAAATCTG 


CTTCATTTTT 


CACCAATTCT 


TGTTCAATTA 


ATGGCGCGTA 


AACTGGTTTG 


15060 


GGCATAACTT 


CAATATAATC 


GTAGAATTTG 


GCACGATTTT 


TTGCTTCTTC 


GACACCTTTT 


15120 


TGCATCATTG 


CTTCAAAAAT 


TTCGCCGTTT 


GAACATGCGG 


AACCGATTAA 


AAGTCCTTCT 


15180 


CTTAACTTCG 


ATAATTGCGA 


ACGAGGAATC 


CGTGGTACTC 


GGAAAAAATA 


ATCTACATTG 


15240 


GACATGGAAA 


TCAATTTAAA 


AAGATTTTTT 


AATCCTGCTT 


GAGTTGTTGC 


TAAAATCGTT 


15300 


GCATGGAAAG 


GTCGAGCTCG 


TTTGTATGAG 


TCTCCTTCGC 


CAATATGACG 


ATTCAAATCA 


15360 


TCATGGAAAT 


GCATATCATG 


ATTCTCTTTC 


GCTTCTTTTA 


AGAAAATCCA 


GCACAAATGA 


15420 


CCTGTTGATT 


CCGAGTCATA 


GATGGCGCGG 


TGATGTTGTT 


CTAAGTTGAC 


GCCAAATTTT 


15480 


TTCGATAAGG 


TATTTAGACG 


ATGGCTTTTG 


AAATGTGGAT 


ACAAGAATCT 


GGATAATTCT 


15540 


AATGTATCGA 


TGACGGGATT 


AGCCGCTTCA 


GGTATTCCAT 


GTTTGCCATA 


ACTTGTATTT 


15600 


AAGAACCCCA 


TATCAAAAGA 


AGCGTTGTGG 


GCCACTAAAA 


TAGTTCCTTC 


TGAAAACTCT 


15660 


TTAAACATGC 


GTAAAACTTC 


TTCTTCGGAT 


TTTGAACCGC 


GCACCATTTC 


ATCTGTAATC 


15720 


CCAGTTAAAT 


TGATCGTTGT 


TTGAGATAAT 


GGATGGCCAG 


GATCAATAAA 


TTGCTCAAAA 


15780 


GTCTCAATGA 


CATTTCCTTT 


GTGcATTTTG 


ACAGCGGCCA 


ATTCAATAAT 


TGTATCATAA 


15840 


ACAGCGGAAA 


GCCCCGTTGT 


TTCCACGTCA 


AAGACCACAT 


AGGTAGCATC 


TGTTAATTCA 


15900 


ATATGTTCTT 


CGTTGTAAGC 


AATGGGTACA 


CCATCATCAA 


CAATATTGGC 


TTCCACGCCG 


15960 


TATAAAATTT 


TAACGCCTGC 


TTTTTTACCT 


GCAGCATGGG 


CATCAGGAAA 


CGCTTGGGCG 


16020 


CCGCCAT6AT 


CCGTAATTGC 


GATGGCCTTA 


TGACCCCATT 


TCCCCGcTTG 


CGCCACCAAG 


16080 


TCACCGACTT 


TATTGGTCGC 


ATCCATTGTA 


CTCATGTTAC 


TATGCATGTG 


TAATTCTACA 


16140 


CGTTTTTCAC 


CTTCTGGCGC 


GTAGTCTTTT 


CTTGGCGCAT 


GGGCGACTTC 


CATCAAATCT 


16200 


TGAGCATTCA 


TGACTAAATC 


ACGCATGAAG 


GTATCTTCTT 


GTACACTACC 


ACGAACACGA 


16260 
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ATCCAACTTT 


GCGGTTGAAT 


CGCATCAAAA 


ACTTGTTCAT 


CTTTTTCCCC 


ATTCGAAAAT 


. 16320 


TTCTTCACCA 


CGAAAGAGGA 


AGTATAATCA 


GTAATTTTCA 


AAATTAAAAT 


TTTTCGTTTA 


16380 


GAACGCAATT 


CACGCACTTC 


TTTATCAAAA 


ATAAAGCCTT 


CAATCGTTAT 


CCGACGTTCT 


16440 


TCTTCCAGTA 


TGTTTCCCAT 


TGGCATAATG 


GGTTCATCAT 


TGGGAATATT 


CCGACCTAAA 


16500 


CGAATTGGAC 


CATCAAACGC 


AGGCGCTTGT 


TGTTGTTTTT 


CTTTTTTCAT 


TTGTTCATGT 


16560 


TTAACAAGCG 


ATTCAGCAGC 


TTGTTGTTGA 


AAGGCTGCGG 


CTTGTTCTAA 


TTTTTGCTCT 


16620 


TCAAATTTTT 


TCAGCACTTC 


TGCAGCTTGC 


TGTTGATCCA 


TTTTTGGTTC 


AATATGAAAT 


16680 


TTAGGAAACC 


CGTAAGAGAA 


ATAAAGTTCC 


TCAATAATTG 


GTAAATATTG 


TTGCTTCAAA 


16740 


TAAGGAATAA 


CTGCTTCATT 


ATCAACAGGT 


AAGATAATTT 


TCCGATCTTC 


AAATTGTGGT 


16800 


GTCTGTGTTT 


TTAAGACTTT 


CTGAACTAAC 


GGCGTATTGC 


ATTGACTATT 


TAACAAGGCC 


16860 


AATTGCCAAT 


AATCCGTTAA 


TTGTTCCTCT 


GTAAATGTTG 


TCTGATTTGT 


GGTTATTTGA 


16920 


ACGGAAACCT 


GCGCAATTTG 


TTGAAAAGCT 


AGTTCCAATT 


GTTGCATAAA 


TGTTTGATAT 


16980 


AGCATCACTG 


GCATCAATTC 


ATCAAAACTT 


AAATGAAATT 


CCCATAAGCG 


ACTTTGTTGG 


17040 


TGAACAATGA 


CCTTGTCAAT 


CCGGCCATTT 


TGAATTAAGG 


GATGTTCTTT 


CGCTGTGTCA 


17100 


TCTAACTGAA 


TTTGCGTCAT 


CAGTTTATCA 


AATAAATCGC 


GTGCTTTTTC 


TGTCACAAAA 


17160 


TGAACCTCCT 


TCTATCATTG 


AAAAAAGGCC 


CTCAGACAAA 


TCTGTCAAGT 


GCCTTTTTTC 


17220 


GGTTTCTCTG 


CATTCGCTTC 


TCTATTAAGT 


ATACTCGTTT 


TTTGTAAAAA 


AACAATCTTG 


17280 


TTTCTTTTCT 


AAAAAGAGCT 


AAGAAGAGCT 


TGAAGCAAAA 


ATCACTTTAG 


ATTTTTGCTT 


17340 


CAAGCyTAAA 


CGCTGATAAG 


TGGCGGAAAC 


AGTTTATCCT 


AATCATTCTG 


TCCGCCCTTT 


17400 


TCATTATTAT 


TCCACTTCTG 


ATGTTGTGTT 


CATCArAATA 


GAAAGTGTAC 


TTTCTAAttC 


17460 


TTCTTTGCGT 


ACTTCCaACA 


TTTCACCAGT 


ACGTTTGATT 


TkGACTTCTA 


CCACACCATC 


175-20 


GACTGCTTTT 


TTCCCGACTG 


TGATACGGAT 


TGGACAACCA 


ATTAAATCAG 


CATCAGCAAA 


17580 


TTTTACGCCT 


GCGCGCTCGT 


TGCGGTCATC 


AACTAATACT 


TCGTAGCCCG 


CTTCGGTCAT 


17640 


CATTGCTTCA 


ACTTCTTGTG 


ATAGTTTTGT 


TTGGTACTCA 


TCTTTCACAT 


TCATTTGGAC 


17700 


CACGTGTAAA 


TCAAATGGC6 


CAATTCCTGT 


TGGCCAATTA 


ATACCACGCT 


CGTCTGCATT 


17760 


TTGTTCAACG 


ATTGCT6AAA 


GTAAACGACT 


AACGCCGATG 


CCATAGCAAC 


CCATGATCAC 


17820 


TGATTTTTCA 


CGACCGTTTT 


CATCTAAAAC 


TGTTGCGCCC 


ATAGCATCAC 


TATAACGCGT 


17880 


TCCTAATTTA 


AAGATATGAC 


CAATTTCGAT 


TCXTTTTAGTA 


AAGGCTAAGA 


CACCATTGCC 


17940 


ATCAGGAGAA 


GGATCGCCTT 


CTTGAACAAA 


GCGTAAATCT 


TCGTAACTAA 


TTGGTTGGAA 


18000 


ATCACGATCT 


GGATTTACAT 


TTGTCAAATG 


ATAGCCATCT 


TCGTTAGCTC 


CAACAATGGC 


' 18060 


GTTGGCTAAA 


TCTTGAACAG 


CTAAATCTGC 


ATAAATTTTA 


ACATCTTCGG 


AAACATTGAC 


18120 


TGGACCAATT 


GAACCAAAGC 


CCGCTCCTAA 


GACACGGCGA 


GCATCTTCTT 


CTGTTGCTTC 


18180 


ATCTAAGAAA 


TCAGCGCCTA 


AGAAGTTTTT 


CAATTTGACA 


TCATTGACAT 


CATGATCGCC 


18240 



wo 98/50555 



PCTAJS98/08985 



625 



ACGAACCAAG 


ACCATTACTG 


GCTCTTCATC 


-GGCAATAAAT 


AGTACTGATT 


TAATAATTCG 


. 18300 


TTGTGGCTCA 


ACTTCAAAGA 


AGTTTGCCAC 


TTCGGCAATT 


GTACCCACTT 


CTGGTGTTGC 


18360 


AATTTTTTCT 


AAATCTAATT 


GTGTTTCATG 


TGATTTTTTC 


GGCGTATAAA 


GACTTGTTGC 


18420 


CATTTCTAAA 


TTGGCAGCAT 


AATCACTTTC 


TGTTGAATAA 


CAAATAGTAT 


CTTCCCCAAT 


18480 


TTCAGAAATC 


GCCATAA?VTT 


CTTTAGAATC 


TTTACCACCC 


ATGGCACCAC 


CGTCCCCAAT 


18540 


AATGGCACGG 


AATTCTAGTC 


CACAGCGTTC 


AAAAATGCGT 


GAGTAAGCTT 


TTTCGTAATC 


18600 


ACGGTATGAT 


TGATCTAAAC 


TAGCTTCATC 


GGCATGGAAA 


GAATAGCCAT 


CTTTCATGAT 


18660 


AAATTCACGG 


CCTCTTAATA 


AGCCAGAACG 


TGAGCGTTTT 


TCATCACGAT 


ATTTTGTTTG 


18720 


AATTTGGTAT 


AT^TTCAAAG 


GTAATCGTTT 


GTAAGAATTA 


ATTTCATCAC 


GAATTAATTC 


18780 


TGTGAAGGTT 


TCTTCATGCG 


TTGGGCCTAA 


GATATAGTCA 


CGGTCATTGC 


GATCTTTTAA 


18840 


GCGATACAAG 


TTAGGTCCAT 


AAGTTTCATA 


ACGGCCTGAT 


TCTTTCCAAA 


GTTCTGCGGG 


18900 


TAATAACGCA 


GGCATTAACA 


TTTCTUVCTGC 


ATCAATTtTC 


TCAAATTCTT 


CTCGCATAAT 


18960 


CGTTTTTAAT 


TTTTCTAAAA 


CACGATTGGC 


TAAAGGTAAA 


TAAGAATAGA 


TCCCTGCAGC 


19020 


GACTTGACGA 


ATATAGCCTG 


CGCGCAACAG 


AATTTGATGA 


CTTAAGACTT 


CTGCATCATT 


19080 


TGGTACTTCT 


CTTAACGTTG 


GAATTAACAT 


TTTTGACTGT 


TTCATTAAAA 


CTCTCCTCTA 


19140 


TTTTAAAATC 


AACTATTTTT 


GATTTCTTTA 


CACACTTTTT 


TACTTAAACT 


TTCTCCTTAA 


19200 


AAGAAAAAGC 


GTTGAATATC 


GTTCCAAGTA 


ACTaACACCA 


TTaACACCAT 


GACAAACCCA 


19260 


AAGCCAATTA 


ACGTAATGAT 


GCCTTCTTTT 


TCAGGACTAA 


TTGGTTTTCC 


ACGTACACCT 


19320 


TCAATAATGT 


TTAAGACAAT 


TTTCCCGCCA 


TCTAAAGCTG 


GGATCGGCAA 


CAAATTAATA 


19380 


ATCCCTAAGT 


TCATTGACAA 


CATGGCCATT 


AAGAATACAA 


CTGTACTTAC 


TCCAGCATTG 


19440 


GATGCTTCTT 


CCGATAATTT 


AAACATCATG 


ACTGGCCCAC 


CTAGTTTGTT 


TAAACTAAAG 


19500 


CcTGTGAATA 


GTGAGCCGAG 


TGCTTTAAAA 


ATCTGTGTCG 


TACTATTCAA 


AGTATCCTGA 


19560 


ATACCGCCCA 


TCAATTTTGA 


CGGTAAATCG 


GTTTTCATAT 


AAGGATAAAC 


GCCGACTTTA 


19620 


CCAATCGTTT 


GTTTTTCCAC 


TTTTTGTTTT 


TCTGGTGTCA 


CTGTTAGTTG 


CTCTTCTTTG 


19680 


CCGTTACGCT 


CAACTACGAA 


CGTTAACGGC 


TTTTCGGGGT 


TCTTCTGCAC 


AATGGTTGTA 


19740 


AAATCTTCGT 


ATTTTTTGAT 


TTTTTGATTA 


TTAATCGATA 


AGACTTTATC 


GTTTTCTTTC 


19800 


AACCCAGCTT 


CTGCGGCTGG 


GCCATTAGGA 


ATCACTTGTC 


CAATTTGATT 


CGTGTTTAAA 


19860 


TCAGTGACGC 


CTCCTTGTAG 


AAAGACCGCT 


AACGTAAACA 


GAATAAACCC 


TAAGATAAAG 


19920 


TTATTCATGG 


GTCCCGCAAA 


GTTCGTCAAA 


ATGCGTTGCG 


ATAATTTCGC 


TGATTGAAAT 


19980 


TGAACGTCAA 


GTGGCGCAAT 


CCGCACCTCG 


GTTCCATCAC 


TTTCAATAAT 


CGTTGCATCA 


20040 


TGGTCAACTT 


TATAAACGGT 


TTCTTCTTCC 


TCGTTTCCAT 


TGACATAGCC 


CTTGATGAAT 


20100 


AATTCTTTTT 


CAAGATCAAA 


ATCAATGACT 


TCCATCGGAA 


TACTATGAGG 


TAATTGTACT 


20160 


TTTTTACTTG 


TATTAATTTT 


AACCACATTA 


CCCACGGCAT 


TTAACTCAAC 


AGATAGAGGC 


20220 
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ATACCTGGTG 


TGATTTCTGT 


CATGTCTTCG 


CCCATCCCAG 


CCATTCGCAC 


ATAGCCACCA 


20280 


ATTGGCAATA 


AGCGAATCGT 


ATAAGTGGTG 


CCATCTTTTC 


CACGATGCGC 


AAAAATCTTT 


20340 


GGTCCCATTC 


CGATTGCAAA 


CTCACGAACT 


AAAATACCCG 


CTCGTTTTGC 


AAAATAAAAG 


20400 


TGGCCAAATT 


CATGTACGAG 


GACAAGAATA 


CCGAAGACAA 


TAATGAATGT 


GATAATTGTT 


204 60 


TTCATAGATG 


TCCTTCTTCC 


TTCATTCACT 


GACGTTCAGT 


GCTTTTTATT 


ATTACATGGG 


20520 


AAAGTCTCGT 


AAAACTTGCT 


CTGTTCCCTT 


CCCGATAAAC 


CAGTGCAACT 


GTTTACTGAG 


20580 


AACAGTCGCC 


ACTGTTAGTG 


AAC7U\ATCTT 


TCCTTATGTA AGACAAGTTT 


TATTTTAGCA 


20640 


TATTCACTCA 


TAAATAACCA 


TTGGACTTGA 


CCACTTCGCC 


TAGATATTTT 


GTGTTTTTTT 


20700 


AGAAAACTGT 


ATAAATGAGA 


GGmaTTGATT 


ACTACGACCA 


GTATTTATTT 


TAAAGACCTT 


20760 


CAACAATAGT 


AGCTAACTTT 


AGACCATAAA 


CTTTTCCGAC 


AGTTTGTCGT 


ATTTTATCAA 


20820 


CACGCTGATT 


GACATGTGCT 


ACCATGAAAG 


TATTAATAGT 


AATTTTAGGG 


GGAATGAATC 


20880 


AATGTATGGA 


GAATTACATG 


GTAAAGTAGC 


CGTTGTCACT 


GGTGCAGCTA 


CGGGTTTAGG 


20940 


ATTATTTATT 


AcTCTGCGTT 


ATATCTTAGA 


GGGCATGAAT 


GTTGTTGCTG 


ACTATGTCGG 


21000 


TGAGCTGCCT 


AAAGAATTTG 


AAGATGTCCA 


AGCCAAGCAC 


GCAGATCGCG 


TAAAATTCGT 


21060 


GAAGGCTGAT 


GTCTCTAATG 


AAGAAGATAT 


TAAAGCGTTA 


gCTGAAACCG 


CTTTAAAAGA 


21120 


GTTTGGTCAT 


GTTGACATTT 


GGGTAAATAA 


CGCCGGTGTG 


GAGGCTTCTT 


TTCCGACAAT 


21180 


TGATATGCCA 


TTAAAAGAAT 


GGCAACGTGT 


CATTGATGTC 


AATTTAAACG 


GTGTCTTTTT 


21240 


GGGTTCTACG 


AC 










21252 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8726 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEONESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

GCATATCAAC AAGTCAAAGA ACGATATGAA ATTTTACAAA AAGAAGAAAA AGCCTTAAAG 60 

CAACAACTGG CTTCTTATCA TGATttCTAT GAAACGTTTT GCTTGGCTGA AGAAGTCTTA 120 

TTGGCGGTTA TCCAAAGAGA ACAAGCACGT CAACATTTGT TAAATGCTTC ATCATTTTTC 180 

ATCTTGCAAA CGTGGATTCC AGTCGAAGAA AAAGCCGAGA TTTTGACAGC GATTGAAGAA 240 

AAAGTTCCTA AGGACGAAAT AGCTCTTACT TTTGAAAATC CTACGAAGGC GGAAATTGAG 300 

ACGGATATAC CAGTTAAACT AGCGAATAAT AAATTGGTGC AGCCGTTTGA AATGTTAACA 360 

GAGATGTATA* GCTTACCAAA ATATGAAGAA GTTGATCCGA CACCTGCGAT GATGCCGTTT 420 

TATCTAGTCT TTTTTGGTAT GATGGTAGCG GATATTGGCT ATGGCTTACT GATGCTACTC 480 

TTGTCTATTA TTGCTTTGAA AGCATTTGTT TTACCGAGAG GCATGAAACG TTTTGCCGAC 540 
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TTTTTCTTGA 


TTTTATCATT 


TCCAACAATT 


ATTTGGGGCT 


TCATTTACGG 


TTCGTTTTTC 


600 


GGTGCAGCAT 


TACCGCCAAT 


AATGTTTGGA 


ATCAAATCAC 


CATTTCCGAT 


TTTATCAACG 


660 


ACCGAAGATG 


TAAATACGAT 


TCTTATTTTA 


TCGGTTATTT 


TTGGGTTTAT 


CCAATTGGTT 


720 


GTCGGTTTAA 


TGATTAATGG 


GATTCAGTTA 


TCAAAACAAA 


AGCGCTATTT 


AGATAGTATT 


780 


AACGAAAGTT 


ACGCTTGGTT 


AGGGATTTTG 


TTTGGTTTAG 


CATTGTTAGT 


CGTTGGTAAA 


840 


TTAGTGGTTA 


AAAATGAAGG 


CTTATTTACA 


GCAGGTGCGA 


TTTTAGCTAG 


CCTTTCTGCC 


900 


ATTGCAATTA 


TTGTGATTCC 


AATGATTCAA 


TCGAAGGCCA 


AATTAAAAGG 


CTTAGCCAAA 


960 


GGACTGTATG 


GACTATACGG 


TGTGACAGGT 


TATGTCGGTG 


ATTTAGTTAG 


TTATACACGT 


1020 


TTGATGGCGT 


TAGGAATTGC 


TGGGGGAAGT 


ATTGCTTCAG 


CGTTTAATAT 


GCTCGTAGAG 


1080 


TTCATGCCGC 


CAGTTGCCCG 


TTTTAGCGTG 


GGCATCTTGC 


TGTTAATTGT 


TTTACATGCG 


1140 


TTAAACATAT 


TCTTATCGTT 


ATTAGGTGCT 


TATGTTCATG 


GCGCACGTTT 


GCAATATGTT 


1200 


GAGTTCTTTG 


GAAAATTTTA 


CACAGGCGGT 


GGCCGGGCGT 


TTAATCCGCT 


AAAAACAAAA 


1260 


GAAAAGTATG 


TCAATGTTGA 


GAAAAAATAG 


AAAATTATTT 


ATTGGAGGAA 


AATAGAGATG 


1320 


ATGGATTACT 


TAATTAATAA 


TAATGGTGGA 


ATTGTTTTTG 


CAGTATTAGG 


AATGGCAATG 


1380 


GCGACAATTT 


TTGCTGGGAT 


TGGATCAGCC 


AAAGGGGTTG 


GCTTTACTGG 


GGAAGCAGCA 


1440 


GCAGCTTTAA 


CGACGGAGCA 


GCCAGAAAAA 


TTTGGTCAAG 


CGTTAATTTT 


ACAGTTATTA 


1500 


CCAGGTACAC 


AAGGACTATA 


TGGTTTTGTT 


ATTGCTTTCT 


TAATTTATAT 


TAACTTAGGA 


1560 


AACGATATGA 


GTATGGTTCA 


AGGATTGAAC 


TATTTTGTAG 


CAGCTTTACC 


AATTGCTTTT 


1620 


GCAGGATTGT 


TCTCTGGGAT 


TGCCCAAGGT 


CGTGTAGCCG 


CTGCGGGGAT 


CCAAATTTTA 


1680 


GCGAAGAAAC 


CAGAACATGC 


AACAAAAGGA 


ATTATCTATG 


CTGCCATGGT 


TGAAACTTAT 


1740 


GCCATCTTAG 


GGTTCGTAAT 


TTCCTTCTTA 


CTTGTATTGA 


ACGTCAAATA 


ATTGAGGAGG 


1800 


CAGATGGATG 


GATGCCATTG 


AAAAAATCAT 


CAGTGAAATT 


AAGCAGCAAG 


GAAAGCAAGA 


1860 


AGTCGAAGCC 


TATGTCACCA 


GTGAACAAAC 


GCGCATCGAT 


CAAGAATTCC 


AAGCAGCACA 


1920 


ACAAGAAATT 


TTGCTGAAGC 


AAGAACATGA 


AATAGAAAAA 


CGACAACAAC 


AATTATTAAA 


1980 


AGAATTTAAA 


CAACGCCAAC 


AACGGCAAAC 


ACTAGAAATT 


CGTCAGGATG 


CATTAAATAA 


2040 


AAAACAaGCA 


TATCTTAACC 


AaTTATTTGA 


CGAaGTGGtC 


TTAAAGATGA 


GTGAATGGTC 


2100 


AGCGGAAGAa 


TTTCAACAaT 


TCATGAAAGC 


ACAATTAGGT 


TCACTTGAAT 


TAACAGGAAA 


2160 


AGCAACCATT 


CTTTTAGGCG 


AGTATTCTCA 


AACGAAAGTG 


ACGCAAGAAT 


GGCTGACCGC 


2220 


CCTTTCTGAT 


GCTACTGTTC 


AATGGGAGCT 


TTCAGAGGAA 


GTTGTTCCAA 


AAGAAAGTGG 


2280 


CTTTATCGTT 


GCAAAAGACG 


GTTTGGATTA 


TAATTTTCTC 


TTTTCAGCAT 


TAGTTGAAGA 


2340 


GATTCAAAAA* 


ACAGAAGGCT 


TTAAAGTAGC 


AGAAAAACTT 


TTCAGTTAAG 


AAGGGAGGAC 


2400 


GTAGCATGAG 


AAAACCAACG 


TATCATCAAA 


TTAATCCGCT 


GATTCGTTTA 


AAAGAAACGG 


2460 


AGCTGCTTTC 


TGAGCAACAG 


TTTCAACAAC 


TTTTAGAAGC 


GGAAACAGTT 


GAAGACGTTA 


2520 
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GAAATATGTT 


AAAAAGTACG 


GTTTATCAAC 


CTTATTTAAC 


cgAaactttt 


GAAGAAAAAT 


2580 


TTGACTACCA 


TTCTTCTGAA 


GCGCTGGGAA 


GTCTTTATCG 


CTGGTTATAT 


GAAATGGCTC 


2640 


CAGAACCAGC 


TGTAGTAACG 


TTGTATACGA 


TGCGTTTTAC 


CTTCCATAAC 


TTGAAAATTT 


2700 


TGACGAAAGC 


TGAGCGAACA 


GGTAAAGATT 


TTGATTATCT 


TTATTTAGAA 


GATGGTCGTT 


2760 


ATTCTCTAAA 


CACAGTCAAA 


AGTGCCATTC 


ATACAAAAAA 


CTCTTCCG/U^ 


CTTGAGCCAG 


2820 


CATTATTAGA 


AGTGATT VGT 


GATGTTTTTG 


CCTATCTGGA 


AGAGGGCGGT 


TTACCACAAG 


2880 


CAATTGATAT 


CCTTTATGAT 


CGAGCATTTT 


TACAGCAACA 


ACGAAAATTG 


GCGGATGAAT 


2940 


TGGGCTATGA 


AGAGTTAACG 


AAGGAAGTCA 


TCGCTTTTAT 


TGATTTAACC 


AATCTATCAA 


3000 


CGATGGCTAG 


AGGGATTGTT 


CAACATCAAA 


ACAGCAACTT 


TTTATCAACC 


GTACTTTCTA 


3060 


GCGCTGGCAG 


TATCACAAAG 


AAAGAACTGT 


TGACCTATGC 


GGAAAAGTCA 


CTTGTAGAAT 


3120 


TTACAGCATT 


TGTCAGAACA 


ACGAATTACG 


GACAACTATT 


AGAAAAGATT 


ATCAATAAAG 


3180 


AAACCAtGAa 


TTAAATTTAT 


TGGCTTTtGA 


AAAACTCAAA 


GATGATTACC 


TGACCTCAAt 


3240 


GTATGAaAAC 


GGTCGAACGG 


TCGCTTTtGG 


ACCGCTCCCA 


TTACTAGCCT 


TTTTGAATGC 


3300 


GAAAGAGGTT 


GAATGGAAAA 


ATCTACGTTT 


AATTTTGGTC 


GGAAAACATA 


GTGGGTTCTC 


3360 


AAATGAAAAA 


ATTCGAGAAA 


GGATGCGAAA 


AGTCAATGGG 


GTATAAAATT 


GGTGTCATTG 


3420 


GAGACAAAAA 


TTCAATCTTA 


CCATTTAAAC 


TGTTTGGCTT 


T6AGGTCCAT 


CATGCCATTT 


3480 


CAGAAATGCA 


AGTTAGAGAA 


GCCATTGAAA 


CTATGGCGAA 


AAATAAGTTT 


GGGGTCATTT 


3540 


TCATAACGGA 


AGAAGCCTCT 


ACTTTAGCAG 


AAGAAACGAT 


TGAACGATAC 


AAAGAACAAG 


3600 


TGACACCAGC 


GATTATTTTA 


ATCCCAAGTC 


ATAATGGTAC 


AATCGGAATC 


GGTTTAAGTG 


3660 


AGATTGAAAA 


AAATGTTGAA 


AAAGCAATTG 


GACAAAATAT 


TTTATAATGC 


AAAGGAGAAT 


3720 


GAGCGTGCAA 


ATTGGAAAAA 


TTGTCAAAGT 


TTCAGGTCCT_ 


TTGATTTTAG 


CTGAAAACAT 


3780 


GTCAGATGCT 


AGTATCCAAG 


ACATTTGTCA 


TGTAGGAGAT 


TTAGGCGTTA 


TCGGAGAGAT 


3840 


TATTGAAATG 


CGAGGCGACG 


TCGCTTCGAT 


TCAAGTATAT 


GAAGAAACAA 


CAGGCATTGG 


3900 


ACCAGGAGAA 


CCAGTTATTT 


CAACAGGAGA 


ACCATTATCT 


GTTGAATTAG 


CCCCAGGTTT 


3960 


AATTGCCGAA 


ATGTTTGATG 


GTATTCAACG 


ACCATTGGAT 


ACATTTCAAG 


AAGTAACCCA 


4020 


CAGTAACTTT 


TTAGGCCGTG 


GCGTTAAAAT 


TGATGCGTTA 


GATCGTGAGA 


AAAAATGGAC 


4080 


GTTTGAACCA 


ACTGTGGCAG 


TTGGTGAAGA 


AGTGTCGGCA 


GGTGACATCG 


TCGGTGTGGT 


4140 


TCAAGAAACA 


CCGATTATTC 


AACATAAAAT 


TATGGTGCCT 


TTCGGCGTTT 


CAGGAACGAT 


4200 


TGCCGAAATT 


AAAGCAGGTG 


ACTTTGCCAT 


TGATGAAACA 


GTTTACTCAG 


TGGAAACGGC 


4260 


TAAAGGAAC6 


GAAAGTTTTA 


GCATGATGCA 


AAAATGGCCC 


GTTCGGCGGG 


GACGTCCCAT 


4320 


TTTAGAAAAA 


CTAAGTCCCA 


AA6TACCGAT 


GGTGACCGGA 


CAACGCGTAA 


TTGATACCTT 


4380 


TTTCCCAATT 


ACGAAAGGCG 


GAgcGGCAGC 


AGTTcCAGGA 


CCATTTGGCG 


CTGGAAAAAC 


4440 


AGTCGTTCAG 


CACCAAATTG 


CTAAGTGGGC 


CGATGTCGAC 


TTAGTCGTTT 


ACGTTGGTTG 


4500 
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TTTaaaTfaa 

1 1 XAAAiuAA 


TTTfr'ar^a aT 


•na aTTr'a^^r* 
lAAl li3AL.CL> 


430U 






•pr* a aTT'/^r' ar* 


^^a'T"i*i"i'a a 
iiAi i 1 lAAi 1 


rr^r^a aTar'OT 


CAAATATGuC 


A on 
4ozO 


GGTAGCGGCA 


CGGGAAGCCT 


r'/^aTTTaTa/^ 


a r**^/^ a TT a 


a TT/ir" ar* a a T 
All uuAuAAT 


Jk m»prp^r>^fpr^» jV 

ATT TCCGTGA 


4doU 


TATGGGTTAC 


TCAGTCGCAA 


TT A 1 (sVaUlJviA 


1 ik^lAUl 1^1 


UOl IGbGCAG 


jv jko/^^mHYk^o 

AAGCGTTACG 


4740 


AGAAATGAGT 


GGTCGGTTAG 




J, 1 IsA 1 uAA 


GisL. 1 A 1 CCAG 


CCT AT TTAGG 


ji 0 An 
4oUU 


TAGTCGCTTA 


gctg;\atatt 


ATCaAACbAUU 


AbCiALi AAu 1 C 


Jk rff~*f~* z^/** mm Ik /** 

A 1 CGCGTT AG 


Ik Ik IK Ik ^ nk m^ )k 

GAAAAGATCA 


4o60 


TCGTGAAGGA AGCATTACAG 


CGATTAGTGC 


GGTTTCGCCA 


TCTGGTGGGG 


ACATATCAGA 


4920 






TACGCGTTGT 


m Ik % Ik ^m Ik mm^ 

TAAAGTATTC 


TGGGGCTTAG 


ATTCCCAATT 


4980 


AGCACAAAAA 


CGTCATTTTC 


CTTCTATTAA 


CTGGTTGCAA 


AGTTATTCTC 


TTTATTCCAC 


5040 


AGAAGTAGGG 


CAATATTTAG 


ACTTGGAATT 


GCAAGGAAAC 


TGGGCCgCTA 


TGGTAGCTGA 


5100 


AGGGATGCGG 


ATTTTACAAG 


AGGAATCTCA 


ATTGGAAGAA 


ATTGTTCGCT 


TGGTTGGGAT 


5160 


TGATTCCTTG 


TCGGATAAAG 


ACCGTTTAAC 


gTTGGAAACA 


GCC7UUVTCAT 


TACGGGAAGA 


5220 


CTATTTGCAA 


CAAAATGCTT 


TTGATGACGT 


GGATACGTTC 


ACTTCTCGAA 


CAA7VACAAGC 


5280 


GAAAATGTTG 


CAATTGATTC 


TAACTTTTGG 


TGAAGAAGGT 


CAAAAAGCCT 


TAAGTTTAGG 


5340 


CACTTATTTC 


TCTGAGTTAA 


TGGCGGGAAC 


AGTTGAAATC 


CGCGATCGCA 


TTGCTCGTAG 


5400 


CAAGTATTTA 


CCAGAAGAAG 


AATTAGAATU^ 


ATTGGATCGT 


TTACAAGCAG 


AAATTAAAAC 


5460 


AACGATAAAA 


GAAATCATTG 


CTGAAGGAGG 


AATGACGAAT 


GATTAAAGAA 


TATCGTACAA 


5520 


TCAATGAAGT 


CGTTGGTCCT 


CTGATGATTG 


TTGAAAAAGT 


GGCAGGCGTA 


AAGTACGAAG 


5580 


AATTAATTGA 


AGTACGCATG 


CAAAATGGCG 


AAATTCGCCA 


AGGGCAAGTT 


TTAGAAATCA 


5640 


ATGGAGATAA AGCGATGGTC 


CAAATTTTTG 


AAGGAACGAG 


TAACATCAAT 


ATTCGTGATT 


5700 


CAAAAGTTCG 


CTTTCTGGGA 


CATCCTTTAG 


iM Ik mm jk f^r^t^ m 

AATTAGGGGT 


mm^^^^^fk ^ M m 

TTCGCCAGAT 


ATGATGGGGC 


5760 


GCGTTTTTGA 


CGGCTTAGGT 


/■limit K ^ Ik Ik ^ 

CGCTTAAAAG 


ATAATGGACC 


AGAATTATTA 


CCTGAGAAAA 


5820 


AATTAGATAT 


TAACGGCGAA 


GTTATCAATC 


CAGTTGCTCG 


m#^ik mm>k m^^*A> 

TGATTATCCC 


GATGAGTTCA 


CO A A 

5880 


TCCAAACAGG 


GATTTCAGCG 


ATTGACCATT 


m Ik Ik Ik m Ik ^^mm 

TAAATACCTT 


AGTTCGTGGT 


^ 9k ^ w Ik w mm ih rs 

CAAAAATTAC 


5940 


CTGTTTTTTC 


AGCATCTGGC 


TTACCTCATA 




f~*/^ TV ^ A 7\ Jk H1> 

GGCACAAAT T 


r^r^r^yi iv jv tv 

GCCAGACAAG 


Cf\f\r\ 
dUUU 


CCAACGTGTT 


AAATAGTGaA 




rTT"P a /^i»i»i»i* 
L'CoxAul 1x1 


1 bcovjCCA r r 


QibCaATTACd 


bUoU 


TTGAAGAAGC 


GGAATATTTT 


AX (jVsAAUAI r 


rp /-»/^/^rp/^ 7\ Jk 

TCCb i CAAAL 


a/^/^o#^/^a TV *T"T» 
AtKaCC^C-AAT T 


GATUvallUAb 


CI on 


TCTTGTTTAT 


GAACTTAGCG 


AA 1 oA 1 L> Uou 


rr^aT'pr'a Jirr* 
UL.AX XbAA(^V9 


a Ti*i^/^ a a 0 a 
V3AI 1 UCAAUA 


or»T a ^ a a TO/^ 
1 ACiAA 1 


CI QH 

o±a U 


CTTTGACAGC 


CGCTGAATAT 


TTAGCTTATG 


AAAAGGGCAT 


GCATGTCTTA 


GTTATCATGA 


6240 


CGGATATGAC 


AAATTATTGC 


GAAgCGTTGC 


GAGAAATTTC 


AGCAGCACGC 


CGTGAAGTTC 


6300 


CAGGACGTCG' 


TGGTTACCCA 


GGTTATCTTT 


ACACGAACTT 


AGCAACGTTG 


TATGAACGGG 


6360 


CAGGCCGAAT 


TCGTGGCTCA 


AAAGGTTCCG 


TAACACAGAT 


TCCTATTTTA 


ACAATGCCAG 


6420 


AGGAGGATAA 


AACACATCCA 


ATTCCCGATT 


TAACTGGCTA 


TATTACAGAA 


GGGCAAATTA 


6480 
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TCTTGTCCCG 


GGAACTATAT 


AAGAGCGGTA 


TCCAACCACC 


AATTGATGTA 


TTGCCATCAC 


6540 


TTTCCCGTCT 


TAAAGACAAA 


•GGAACTG6CG 


AAGGCAAAAC 


GCGCGGGGAT 


CATGCaGCGA 


6600 


CGATGAATCA 


ATTGTTCTCA 


GCCTATGCAC 


AAGGGAAACA 


AGCCAAAGAA 


TTAGCTGTCA 


6660 


TTTTAGGAGA 


ATCAGCTCTT 


TCCGATGTCG 


ATAAAATTTA 


CGCAGCTTTT 


GCCCAACGTT 


6720 


TTGAAGAAGA 


ATATGTCAAT 


CAAGGATTCG 


ATACAAACCG 


TTCGATTGAA 


GAAACGTTAG 


6780 


ATCTTGGGTG 


GGAACTATTA 


AGTATGCTAC 


CGAGAACAGA 


ACTAAAACGA 


ATTAAAGAGG 


6840 


ATATGCTCGA 


TCAATATTTA 


ACTGAAGGGA 


AGTAGAAAAA 


TGGCTCGATT 


AAATGTTAAT 


6900 


CCAACCCGTA 


TGGAACTTTC 


CCGATTGAAA 


AAACAGTTAA 


CTACGGCCAC 


GAGAGGTCAT 


6960 


AAGTTGCTGA 


AAGACAAACA 


AGATGAATTG 


ATGCGACGGT 


TTATTGCGCT 


AGTCAAAGAA 


7020 


AACAATGAGT 


TACGTATTCA 


GGTTGAACAA 


GAAGTGACGG 


ATGCGTTATC 


AAATTTTGTA 


7080 


TTGGCAAACG 


CAACGTTGAA 


CGAAGCCTTT 


ATTGAAGAGT 


TAGTGGCAAT 


TCCAGCTGAA 


7140 


AAAGTTGAAT 


TAGAAATTAT 


TGAACAGAAC 


ATCTTAAGTG 


TGCCAGTTCC 


TAAAATGATT 


7200 


TTTGATTACG 


ATGAATCGGT 


TCAAGAAGCA 


CCCTTAGATT 


ATGGCTACGT 


GAATTCAAAC 


7260 


AGCGAGCTTG 


ATCAAGCTTT 


TGCTAAAATT 


TCAAGCATTT 


TACCGAAATT 


ATTAGCACTT 


7320 


GCAAATGTTG 


AAAAGACGTG 


CCAATTATTA 


TCAAAAGAAA 


TTGAAAAAAC 


GCGTCGTCGT 


7380 


GTCAACGCCT 


TAGAGTATAT 


GACGATTCCA 


CAATTrGArG 


AAaCAaTTTA 


CTmCaTTCmA 


7440 


ATGAAATTwG 


rAGrAAATGA 


ACGTGGcGAA 


ATTACACGTC 


TAATTAAAAT 


TAAAAGCATG 


7500 


AACAAgAAAA 


TTAGTGGTTA 


GGAAGAACGA 


GTGCGAAcCa 


GTTGGTAGCT 


TTCGTGCTCG 


7560 


TTTTTAATAA 


AAAAGGGAGT 


GGAAAAAATG 


AAAAATGAAG 


AACGCCGCAA 


AGCGATTGCA 


7620 


CTGAATTGTC 


AAAAATACGA 


AAGTGACTAT 


GCTCGCTTGG 


TGGAACCAAT 


CAATGaGCTA 


7680 


CTGTTAAACT 


TGGGAGCGGC 


TATTTCAGAA 


GAAGCCgCGA 


AcAAATCATT 


CTGAATGTAA 


7740 


AACGATATCA 


TCACGGCGTG 


AAATATCTTC 


CTGAGTGTCA 


TTTAGATGAA 


AGTAACCAAT 


7800 


TTATCGAAGA 


TGGGCTTGAA 


GCTCTTAAAA 


AGGGAGATTT 


AGGCAACGGT 


GCGTTACAAC 


7860 


TATTTGGGGC 


CGGCTTAAAC 


TTTGCTAGTT 


TTGCAGCAAA 


AGCATAAGGA 


ACCAAAAAAA 


7920 


TTGATGCCCA 


CCAAATGTTG 


GCAGAACGTT 


TtACGAAGTT 


GTTGTCTGTA 


CaAACGGATA 


7980 


ATGATAaCAA 


ACAATAAGrA 


AAATACCCTk 


GGTTTATTAA 


GTCGACTTAG 


TAAACCAAGG 


8040 


syATTTTTTT 


AACTGTTATT 


GATTAGGGTA 


TTCAGACACC 


TGTAATTGCC 


ATAACTTATC 


8100 


TGTCACTGAA 


GCAATAAACG 


CAGCACTCTT 


AGCATTATCA 


GGTCCTTTAG 


TAAAAATTGC 


8160 


TAAAGCAAAG 


GGGTGAGGTG 


TTTCTAAAAT 


CCCCATATCA 


TGAATAAATT 


CATCATAGGA 


8220 


ACCAATTTTA 


TGAGCAACTT 


TTCCTTGAGT 


TGTGGGAGTT 


TCCATTCTTT 


CATGGAAAAC 


8280 


ACTTTGTTTC 


ATAGAATCAT 


AAATCGCTTG 


GTAATCTGGT 


TTTGTTGCTT 


TTTCAGTATA 


8340 


TAATTTTTGT 


AGAATAACAA 


GCGCATCTTC 


AGAAGAAAAT 


TGTGGCTCTT 


CaATCGAAGG 


8400 


TGTCTTGTGC 


AAATAACGCT 


GATACATTTC 


TCGTTTTGCT 


TTTGCATCAC 


CACCCAATGT 


8460 



wo 98/50555 PCT/US98/08985 



631 

GTCATAGAGC ATATTTTTAG CGATATTATC AGAATAGGTG ATGGCATATT CTTGTAACGT 8520 

TTTTAATGGA TATTCTGGTT GGATATTATA GGCAATGATC CCcAQJ.TCqX TCTXQATAAT 8580 

CTTCTTCGGC GTTATAGGGA ATCAGGTCTG TCCATTTTTT TTGACCAGAA GCAACCGTGT 8640 

CGGCAACTAG CATCGTCAGG GGCACTTTAA TAGTACTAGC TGTATAAAAT TCATGTTGGC 8700 

CATTCACGGA ACGCTGTTCG CCAGTT 8726 



(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 



GTAAAGACTT 


TTCACAAGTA 


TCAAAGAAAT 


CCGTCGTTAA 


TCTTTACTAT 


TCGACAAACG 


60 


AACACCCTCT 


TGCTCAGCAA 


GAGGGTGTTT 


TTTTGTGTAA 


TTAAGTGGAA 


AATCAACATT 


120 


GACTTTTTTT 


GAAGTCACAT 


AAGTGTGAAA 


ATTAGGTTTT 


TGGACTUVTGA 


TTCAGAAATC 


180 


TTTTGAAAAT 


TGAAGGTGTG 


AGGAAATTTA 


GAAAGATAAA 


AAGAGGGAGT 


ACTGAAAACC 


240 


GAAATTTCAG 


TTAAGTGCTT 


GGATCAAGAA 


ACAAAAACTA 


CGTTAkTTTG 


TCAATATCAG 


300 


TTGAAATTGG 


AGAAGAAGCA 


GAATGGGGTG 


ATAGTATTGG 


CAAATTAATT 


ATTTAAGGTC 


360 


ATCGTTACAG 


GATTTTTGTT 


TTTTATATyC 


aATCmAAAAG 


AAATAAATAC AAAGaACACA 


420 


GATAAGTATT 


TTAGCAA'STA 


ACCAATAAAT 


AGCAGGACTG 


TTGGTGCTAT 


AAGAATACAA 


480 


GAAAATAGTA 


GTAAACTTAA 


TTGAGAATCG 


TTTGCTATTT 


AAATAGTGTT 


CTTTTCCTTT 


540 


TAATATATAT 


ACGATAAAAT 


AAGTATATTA 


ATTGTAAaAG 


GACATGATAA 


TTATGGAAGA 


600 


ATTAGAATGG 


AATwwTTTTT 


aTGTTTGCTA 


TAGTTTTGAG 


GGTAGTGATT 


CTTTTAATAA 


660 


CTGTC7UVGAT 


AGTAGATTAC 


TTTGnGAATA 


CTCCTGAAGT 


ACAAATACAG 


AGAAAACTTC 


720 


CTGAGTTTGA 


AAAAGTGCCA 


CCTTATGATC 


CAAGAGATAA 


TTCATTTCTG 


ATACAAATTA 


780 


TTCTTTAAGT 


AATACAACGA 


CTAAGTTTAT 


TTACCAAACC 


ATTACAATAA GAAAAGGTGC 


840 


TUVGTATACCT 


AATTACTATT 


GGTATGATGA 


TGGTCAATTT 


AGAGGAAATC 


TAAAACTTCA 


900 


ACGTTGGCAA 


GACTACGGAG 


AAGATCCAAA 


CATAAGAGGA 


TATATCTGTA 


CTTATGGAGG 


960 


CTACGTTACC 


AATGGTTCAA 


GCCCGGCGCA 


AAGAATTGTC 


GATATTAATG 


AACAATAGCT 


1020 


ATTCTAACTT 


TATGTAATAG 


GAGGGAGTAA 


TCATGAAAAA 


AATCGGGTAT 


TTTAGTTGTA 


1080 


TTATTTTTTT 


CATGTTTTTG 


GTAGGTTGTA 


GTAATAACAA 


AAAAGAAAAC 


GGCAATCTTT 


1140 


TGAATGCCAG 


TTCGTTTCCT 


TTAATACTCA 


CCACGATTAT 


TGAAAAAGAA 


GAAGACCTAA 


1200 


CGAAAGGTTC AATTTTTTTC 


AACAAGGATA 


AAACCATGAC 


GCTTGAAAAA GAATATTTAG 


1260 


TTAATCCCAA 


TAATGAAGAC 


ACAAAAAAAA 


CAAGTAGAAC 


AGAAAAAAAG 


GTATATTWJV 


1320 


ATATTAAAAT 


ACAAGAAAAT 


AAAGAGAGCT 


ATGAAATTAT 


AGGTCAATTG 


GACAAAAAAA 


1380 
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CGAAAAAAAT 


AGAGTTTAAA 


AAAGTTGATG 


AAGGTAAACG 


TATATCTGAT 


GCAGAAGGTA 


1440 


ATGTGTATGG 


TGATTTTGGT 


GGTAAATAGG 


ATAGTTAAGT 


CATAAAAAAG 


AAGGCTAAAA 


1500 


ATTTTTATTT 


TTTTCGTCTC 


TATACTTTCT 


GATGTTAGTT 


TTTTTACCAA 


CATCAGAAAG 


1560 


TATAGAGACG 


TTTTTGTGTA 


ATTAAACAAA 


AAATCAACAT 


TGACTAAGAC 


CAGAAATACT 


1620 


TCCTTTAGTG 


CTCCGAGTTG 


CTCTAATATA 


AGACACCAAA 


TTGATAACCA 


TTTGCTTGAT 


1680 


AAAACTCTAT 


TAATTGTGGC 


AATGCTTGTT 


TTGTTAATTC 


CTTATCTACT 


GAATCATGCA 


1740 


TTAAGACTAC 


TCTAATATTA 


TAATCCGGAT 


AAACCTCAAG 


TGAATGTTGA 


TGGAAAGCGA 


1800 


GCATTTCTGC 


GACTGTAGTA 


GGCTGTCTGT 


CTAACGGCTC 


CGCATCTCCA 


ACCATCGCAT 


1860 


TCCAGTCAAT 


CCAATGGATG 


CCAAGTTGTT 


TGAATT^TTC 


ATCAGATTTT 


TCTGTACCTC 


1920 


CCCATGiCCAT 


ATGACCTCCT 


GGGTATCGCC 


AGAGAGTTGT 


TTGAAATGTT 


TCGCCCAATA 


1980 


TTTTTTTAAG 


AGAGTTCTCC 


ATGGTTGCTA 


TTTCCGCTTG 


TATTTCTTTA 


GTGTTGACAA 


2040 


TCCCGTTTGG 


ATATAACGTT 


GCATAATCAT 


GAGTTGAACT 


ATGAAAGCCT 


ATTGAATGTC 


2100 


CTTCTGCAAC 


TTCGCGTTTC 


ACTATCTTTT 


GATTTTCGGA 


AGTTAGGGTA 


TTTCCTACAA 


2160 


GAAAAAAAGT 


TGCGTGGACG 


TTATACTTTT 


TTAATGTATC 


TAGGACTTGT 


GGGGTtATTT 


2220 


GATTATTAAC 


GCCATCATCA 


AAGGTTAAGA 


ATACTAATTT 


TTCACCTTTG 


TAAGGTTCTT 


2280 


CTCCAGTTAT 


ATATTTTCTG 


ATTTTGTCTG' 


CTGAGTAAGC 


ATATTTTTTT 


GATGAAATAG 


2340 


TCATGTGTGT 


TGATTCCTTT 


TCTAACTTGG 


TCGGGGAATC 


TTTTTTTGCA 


GTTGAATTTT 


2400 


TAGTCGATGT 


CTGCGTAGTA 


TCATCATTTT 


TACTTGTTAT 


CTTGGGTTGA 


GTTAATAGGA 


2460 


ACGTTTGAAT 


GAAAACAAGA 


ACAATGAGGA 


TT7UVCGCACC 


TAAATnATAC 


ATACCCGTnT 


2520 


ATTTTGAATT 


T 










2531 



(2) information" FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

ACCGCCGCcG CCAArGAtGa AcGAAAATAT CTcGAATCCA CCAAAACCAC TACTGCTAAA 60 

TCCGCCGCCA CCGAAACCGC CAAAGCCaCC aGCGCCGcCA CCGwAATTAG GATCAGTTCC 120 

TGCATGGCCA TATTGGTCAT AGGCCGCTTT TTTCTGTGGA TCACTTAATA CTTCATACGC 180 

TTCAGAAACT TCTTTAAATT TTTCTTCCGC ATCGGCTTCT TTGTTAATAT CTGGATGGTA 240 

CTTTTTGGAA" AGTTTGCGGT ATGCTTTTTT AATTTCATCA TCTGAAGCAC CTTTTGCTAA 300 

TCCTAGCACT TCATAATAAT CTCGTTTCGT CGCCATTGAC TTCCCTCCAA CTTTCTTCAA 360 

ACTTTACAAT GTGTTTAGAA CCAGCCAAAT TCATCGATTG CTTTCGGTCT CTCTGGTTGG 420 
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ATTATCTGTC 


ACACCTTGAA 


TCAGTGTTAC 


TAGCTGTGAA 


TAC5GTATTTT 


CAGCTTGTTT 


480 


AACATCGTTG 


TAATCATACC 


ACAGTTCTTT 


CAAAACCTAA 


ATAGTTTCTT 


GCTAACTTGA 


540 


TTGTGGTTGA 


TTTTCTCACA 


AAAGACAAGG 


GACTGGGACA 


TTAGTCATTG 


TACTAATGAT 


600 


CCAGTCCCTA 


AAATCaATTG 


TTATTGATTA 


TTTGTCATCA 


CCATTTACTT 


CTTCAAAATC 


660 


AGCATCAACA 


ACATCATCAG 


CGCCACCTtG 


gAGCAGCTTC 


AGGgATTTTC 


TTGTGCTTGT 


720 


TGTTGCGCat 


TGgTTCGTAA 


AGTTTTACTG 


TTAAGTTTTG 


GACGATTTCA 


TTTAATGAAT 

X X X •XC* X ^yVaC^ X 


780 


CACGTTTTGC 


TTTCATT TGT 


TCGATATCGT 


TTGCTTCGAT 


GGCTGCTTTT 


AATTCATCAC 


840 


GCGCATCTTC 


TGCTTTT TTC 

X X X X X X X %^ 


ACTTCTTCAG 

X X X X 


CATCTACTTT 

^^Fm X X Ma^0 XXX 


GCCTTCTAAT 

%7 X X \^ X m»mm X 


TCTTTCAATG 

X w XXX v*nnx w 


900 


TTTTATCAAC 


AGTGAATAAT 

mm\^ X ^^mrnrnm X Mmmm X 


AAAGCATCTG 


CATCGTTACG 


TAAGTCAACT 


TCTTPTTTAP 

X w X X V« XXX t^\^ 


960 


GTTGTTTATC 


TGCTTCAGCG 


TTTGCTTCAG 


CATCTTTCAC 


CATACGTTCG 


ATTTCATCAT 


1020 


CAGATAAACC 


TGAAGAAGAT 


TTAATPGTGA 


XXOXX X\9X 


X X X X X V3n\9 X U 






TCGCACGAAC 


ATTTACAATC 


A XXX XF\X 


CAATGTCAAA 


GCTTACTTCA 


ATTTnTf;r;r'A 

nx X XwXOvTV^n 


1140 


CACCACGAGG 


AGCTGCAGGA 


ATATCTGTTA 

rx X n. X X vj X X n. 


ATT GG AAACG 


ACCTAATf^TT 

/Aw^ X r\t\ x\D X X 


TT ATT ATP An 

X xT\x xr\x\^f\\J 


1200 


CGGCCATtGG 


ACGTTCACrT 


TGTAAAACGT 


GAATATCAAC 


AGCTGGT TG A 


X I. V7 X v»/AvjV-»/Av3 


1260 


CAGTTGAGAA 


T ACTTGT GAT 

X mm\^ X X ^7 X ^J£m X 


TTACwTGtTG 

X X ^ X W i» X \J 


GAATCGTAGT 


GTTArGATPa 


ATTAATTTTR 

n X xtV\x 1 X X \3 


1320 


TAAATAcGCC 


ACCCaTTGTT 


TCAATCCCTA 


ATGACAACGG 


TGTTACGTCT 

X W X X J^^\3 X X 


AATAAoACAA 


1380 


CGTCTT TAAC 

X XXX rnmma^^ 


ATCACCA6TG 


ATTACACCAC 


CTTGGATTGC 

w X X X X 


AGCACCCATT 


GCT APT fTTP 
o\« X X ^ X X v« 


1440 


GTCAGGGTTA 


ACTGATTTGT 

X ^^mm XXX \9 X 


TTGGTTCTTT 

X A X X \« XXX 


GTTTGTTTCT 

W XXX W XXX W X 


TTGCGAACAG 


CTTCAACTAC 


1500 


AGCTGGAATA 


CGTGTTGAAC 


CACCAACTAG 


GATAACTTCG 


TCGATTTCAG 


AAGGATTTAA 
^%n\3^S£^x X xnn 


1560 


TCCAGCATCT 


TTTAAGGCTT 


GACGTACTGG 


TACTTTTGTA 


CGTTCAACTA 


AATCACTTGT 

r^rx X vmkw X x \7 x 


1620 


TAATTCATCA 


AATTTTGCAC 


GAGTTAAGTT 


CATTTCTAAG 

XXX X *««X\J 


TGCAATGGTC 


CTG cTTCGCC 


1680 


AGCAGTAATA 


AATGGCAAGC 


TGATTTGTGT 

X %«V«» XXX \ji X W X 


GCTTGTTACA 

%7V# X X Vii X X mrnS^trm 


CCTGATAAGT 


CTTTTTTPCP 

w X X X X X X V* wV» 


1740 


TTTTTCAGCA 


GCATCTTTCA 


AAC6TTGTAA 




TCGTTAGCTA 

X X«%9 X X aX^TX^ X 


A6TCAATGCC 

mm\^ X XrfwV^ X 17%^^^ 


1800 


ATTTTCTTTT 


TTAAATTCTG 


CAACCATGTA 


GTCAATGATC 


TTATTATCAA 


AGTCATCCCC 

mm%^ X m X 


1860 


ACCTAGGTTG 


TTATCACCGG 


CTGTTGATAA 


TACATCGAAA 


ACGCCATCGC 


CTAATTCAAG 

%^X»a«XX X %M^»XX# 


1920 


GATTGACACG 


TCGAATGTAC 


CACCACCAAG 


GTCA7UVGACA 


AGAATTTTTT 


CATCTTTGTC 


1980 


TGTTTTATCT 


AAGCCATAAG 


CTAAAGCTGC 


TGCTGTTGGT 


TCGTTAACGA 


TCCGTTCTAC 


2040 


TTCTAAACCA 


GCAATTTTAC 


CAGCATCTTT 


TGTTGCTTGA 


CGTTGCGCAT 


CGTTGAAATA 


2100 








TTTTTCACCT 


AAATAQiTCTT 


CAGCAAAACC 


^IbU 


TTTTAAATAT 


TGTAAGATCA 


TTGCAGAAAC 


TTCTTGTGGT 


GTATATGATT 


TTCCTTC7UVC 


2220 


ATCAACTTTG* 


TAGCCAGCTT 


CACCCATGTG 


ACGTTTAATA 


GATGAAATCG 


TGTTAGGATT 


2280 


TGTAACAGCT 


TGACGTTTTG 


CTACTTCGCC 


CACTTgAATT 


TCACCGTTTT 


TGAATGAAAC 


2340 


GACCGAAGgA 


GTTGTGCGGT 


TACCTTCTGG 


GTTTGCAATA 


ATTTTCgctT 


CGCCGCCTTC 


2400 
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TAGTACTGCG ACAGCTGAGT TTGTTGTTCC tAAGTCAATA CGAATAATTT TACTCATAGT 24 60 

GAATATCTCC. AATCTGTATT AGTTTTCTAT AATAATATAT TTTTAATTAA CGTTTGTTAT 2520 

TGTGCAACAA TGACCATTGA TGGTCGTAAC ACACsATCTT GTAATTTGTA CCCTTTTTGT 2580 

AGAACTTCAA CAATGGTATC TGCTGGGGTT TCTTCAGAAG CAGGGACTGT TTGAACGGCT 2640 

TGATGTAAAT TTGGATCAAA GGTTCACCCA TTGCTGGGAT TTCTTCAATT CCTTCTTCTT 2700 

TTAAAGCCAC AGTAATA 2717 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16870 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 



GAATTTAGAA 


ATGAGGACTA 


CTGATGATTT 


TATTACAAGC 


AAATCAAGTT 


GCCCGGCATT 


60 


TTGGCTCGGA 


AACATTGTTT 


GAAAACATAC 


ATTTAGAAAT 


TGCAACAAAA 


AGTCGGATTG 


120 


CCTTAGTTGG 


TCGTAATGGT 


GCTGGAAAAT 


CGACTTTTTT 


AAAAATCATT 


GCAGGCATTG 


180 


ATGCTCCCGA 


TAGCGGAACC 


ATTGCCAAAA 


ATaAAACTGC 


TACGTTAGGT 


TATTTAGCTC 


240 


AAAATACCGG 


CTTAGAATCA 


GATAAAACCG 


TTTGGGAAGA 


AATGACAAAA 


GCCTTTGCTG 


300 


ACATCCTAGA 


AATGGAACAg 


CGTATGCGAG 


aATTAGAAAC 


TAAAATTAGT 


GAAATGGAGC 


360 


CAACCACTTC 


CGTTTATGAA 


GGAATTTTAA 


AAGAGTACGA 


TCAATTGCAA 


CATACGTTTT 


420 


CTGAAAAAAA 


TGGCTACGGC 


TATGAAAATG 


AAATTCGCTC 


AGTCCTTCAC 


GGCTTTGGCT 


480 


TTGATGAATC 


CTTTTACACG 


AAAGATATTC 


AAACCTTATC 


TGGTGGTCAA 


AAAACCCGGC 


540 


TTGCATTAGC 


GAGAATGCTT 


TTACAAAAAC 


CAGACATTTT 


AATTCTGGAC 


GAGCCTACAA 


600 


ACCACTTAGA 


TATCGAGACG 


CTTtCTTGGC 


TGGAATCTTA 


TTTGCCAAGT 


TATGCCGGCG 


660 


CCCTATTAAT 


TGTTTCCCAC 


GATCGtTATT 


TTTTAGATAA 


GGTAGTTAAT 


GAAGTTTATG 


720 


AACTGAGTCG 


CAAAAAAATG 


ACTCACTACA 


AAGGaAACTA 


TTCCAAATAC 


TTAGAGTTAA 


780 


AAGCAGAACA 


ATTAGCCAGT 


GAATGGAAAG 


CGTATGAAAA 


GCAACAAGAA 


GAAATCAATA 


840 


AGTTAGAAGA 


TTTCGTTGCC 


AAAAATCTGG 


TTCGTGCATC 


TACAACGAAA 


CGTGCACAAA 


900 


GTCGCCGAAA 


AGTATTAGAA 


AAAATGGACC 


GTTTAGACCG 


ACCTCAAGGA 


GATGAAAAAT 


960 


CGGCGCATTT 


TCTTTTCGAT 


AGTGAAAAAG 


TCTCGGGAAA 


TGTTGTTTTA 


CAAGTCGAAG 


1020 


ATGCCGCCAT 


TGGTTACGAC 


CAAGAACATA 


TTTTATCCGA 


ACCTATTCAC 


TTGGATATTC 


1080 


GTCGCAAAGA 


AGCCATTGCC 


TTAGTCGGAC 


CGAACGGAAT 


TGGTAAATCC 


ACTCTCTTGA 


1140 


AATCAATTAT 


TGACCGCATT 


CCTTTCATTA 


AAGGAAGTAA 


AACTTTTGGC 


ACCAATGTTT 


1200 


CTGTAGGTTA 


CTATGACCAA 


GAGCAAGCCA 


ATTTACATGG 


CAATAAAACG 


GTCTTAGCGG 


1260 


AATTATGGGA 


TGAACACCCA 


ACCACACCTG 


AAAAAGAGAT 


TCGAAGTATT 


TTAGGCGGCT 


1320 
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TTCTCTTCAG 


TGGAGACGAT GTTGAAAAAA 


*CGATTCCTTT 


ATTAAGCGGT 


GGCGAAAAA6 


1380 


CCCGTGTGGC ATTAGCAAAA CTAGCGATGG 


ATCGTGACAA 


TTTCTTGATT 


CTCGATGAGC 


1440 


CAACCAATCA 


CTTGGATATC GATAATAAAG 


AAGTTTTAGA 


AAATGCGCTG 


ATTGATTATG 


1500 


AAGGAACCAT 


CCTCTTCGTT TCCCATGACC 


GTTACTTTAT 


CAATCGAATT 


GCAACAAAAG 


1560 


TTGTTGAGCT 


TTCTGAAAAA GGCAGCAAAC 


TTTATTTAGG 


CGACTATGAT 


TATTATTTAG 


1620 


AAAAGAAACA 


aGAGGAAGAA GAAATCGCTG 


CCCTCTTAGC 


TAATGAAGAA 


GCGGCGAAAA 


1680 


AACCCGAACC 


AGTTACAGCC AAAAATACCT 


TTTATCAAAA 


CAAGGAGCAA 


CAAAAATTAC 


1740 


TCCGTACTTT 


GCAAAGAAAA ATAACACAAG 


TCGAAGAAAA 


TCTTGCTCAG 


TTAGATACGA 


1800 


CAATTGCACA 


ATTAGAAGCA CAAATGAGTC 


AACCAGACAT. TCTAGAAAAT 


CATGTTGAAT 


1860 


TGCTGGCTTT 


AAACCAACAA TTAGATGAAG 


TTCGTCAGCA ACAAGATGAA 


CTACTTGAGC 


1920 


AATGGGAAAA 


TTTCAGTTTA GAATTAGAAG 


AAATGGAAAA 


TAACAATTAA 


CGGAGGGATT 


1980 


CACTATGGCT 


TCCAAAAAAA TCAGTTTAAC 


GAaGCCCAAA 


TATCAACAAA 


TCGCCGTTGA 


2040 


TGTTGCGGAA 


AAAATTGCAG AAGGCAAATT 


ACATGTTGGA 


GATAAAATCC 


ATGCCCGTTC 


2100 


CACCCTAGCT 


AACCAATACC AAGTGTCGCC 


AGAAACAGCT 


CGCAAAGCCA 


TTATCGTTTT 


2160 


AGTTGATTTA GAAATCGTTA AAGCCAAACA CGGCAGCGGC 


TTCTATGTGG 


CATCAAAAGA 


2220 


AAAAGCGCAG 


GACTTTGTGA CACAATATCA AGACGTTCAA ACGATTGCTG 


AAATTAAAGA 


2280 


AGAATTGCTA 


GATAGCGTAG cGAAACAAAA AGAAGAACTT 


ACTCATTTTT 


CTAGCATTCT 


2340 


AGATACTCTT 


GTTGAACAAA CAAAGCGTTT 


TGATTCTTTC 


AATCCAATGA 


ATCCCTATTC 


2400 


ATTAGTTTTA 


ACAGAAGAAG CTGCTTATCT 


TGAGGCGACG 


ATTAGCGAAA 


TGAATTTTTG 


2460 


GCAAAACACT 


TCGGCAACGA TTATCGCCAT 


CAAACACAAA 


GAAGAGCTTC 


TGGTCTCACC 


2520 


TGGTCCATAT 


GCAAAGATTT CGTTGAACGA 


CACACTTTAT 


TTTGTTGGAC 


ATGACGAATC 


2580 


AACCTTACAA 


CGTGTCCAAA ATTTCTTTTA 


TCCTTAAACA 


AAAAAGTGAC 


TACTTATCTT 


2640 


GCGGTAAATA 


GTCACTTTTT TTATTTATTA 


TTTCTATTTT 


TCATAAAAAC 


CCACTATAAA 


2700 


ACCTAGTTAA 


AATACTGTTT TTCTTCATTT 


TCAGAAAGAT 


TTTCATCTAA 


TTTGACAAAG 


2760 


TGACAACTTT 


ATAGTAGTCT GATGTAGTCA 


CTTTTTTTAA 


TGAGGAGGAA 


CAATGATTTG 


2820 


CCAAAAGTAA AAgTAAATCA CCTTACCAAG ATCTTTGGtA AGAAAACCAA 


ACCGGCACTT 


2880 


GAAATGATTC 


GTGCCaACAA AAGCAAAACA GAAATTTTAG AAAAAACGGG 


TGCTACAGTC 


2940 


GGTGTTTATG 


ATGTAAATTT CGAAGTTGAA 


GAAGGCGAAA 


TTTTCGTTAT 


TATGGGGTTA 


3000 


TCAG6AAGTG GGAAATCAAC ATTAATTCGT TTATTATUVTC GTTTGATTGA 


ACCAACTTCA 


3060 


GGAAATATTT 


ATATTGATGG ACAAGATATT 


TCTTCTTTAG 


ATAAAGAAGG 


ATTACGTGAA 


3120 


GTACGTCGAA 


ATAAAATGAG CATGGTTTTC 


CAAAACTTTG 


GTCTTTTCCC 


CCATCGAACT 


3180 


ATCTTAGAAA 


ATACGGAATA TGGTTTAGAA 


ATTCGTGGTG 


TTCCTAAAGA 


AGAACGCCAA 


3240 


GCAAAAGCTG 


AAAAAGCCTT AGAAAATTCT 


AGCTTAATTG 


CTTTTAAAGA 


CCAATTACCA 


3300 
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AGTCAATTAT 


CTGGTGGGAT 


GCAACAACGT 


GTCGGCCTTG 


CTCGTGCCTT 


GGCTAACGAT 


■ 3360 


CCAGAAATTT 


TATTAATGGA 


CGAAGCGTTC 


TCTGCTCTTG 


ATCCGTTAAT 


TCGTCGTGAA 


3420 


ATGCAAGACG 


AACTATTAGA 


TTTACAAGAA 


AATGTCAAAA 


AAACCATTAT 


CTTTATCACA 


3480 


CATGATTTGA 


ATGAAGCTTT 


ACGAATCGGC 


GACCGGATTG 


CTTTGATGAA 


AGATGGTCAA 


3540 


ATTATGCAAA 


TTGGGACTGG 


AGAAGAAATT 


TTAACCAACC 


CAGCCAACGA 


ATATGTGCGA 


3600 


ACCTTCGTGG 


AAGATGTGGA 


TCGTTCAAAA 


GTCTTAACTG 


CCCAAAATAT 


TATGGTACCT 


3660 


GCTTT7VACTA 


CTAACATTGA 


AATTGATGGC 


CCAACTGTTG 


CTTTAAAACG 


GATGCGTCAA 


3720 


GAAGAAGTGA 


GTATGTTACT 


TGCCGTGGAT 


AAAAAACGCC 


AACTAAAAGG 


CGTTGTACGC 


3780 


GCAGAAAAAG 


CTTTAGAAGC 


GCGTAAAAAT 


GGGACTTCTC 


TTGTAGAATG 


TGTGGATCCA 


3840 


GAAATTCAAA 


CAATCGACAA 


AGATATGTTA 


GTGAATGATA 


TTTTCCCACT 


GATTTATGAT 


3900 


GCACAAACGC 


CTCTTGCCGT 


GACTGATAAC 


GGCAAACTAT 


TAGGCGTGGT 


CATCCGTGGT 


3960 


AGCGTACTTG 


AAGCACTTGC 


AGAAACAGAG 


GTGAACGAAC 


ATGAATAAGT 


ATCAATTACC 


4020 


TGTCGCGTCT 


TGGGTTGAAA 


GTTTTACTGA 


TTGGTTAACT 


TCAACATTTG 


CTGGATTATT 


4080 


CAGCTTTTTA 


CAAACAATTG 


GTCAAAGTGT 


AATGGACAAC 


ATTACTGCCT 


TACTAACAGC 


4140 


CGTACCACCA 


CTTGTGCTGA 


TCGTGCTATT 


AACGATTGCT 


GCTTTCTTTA 


TTTCTAATAA 


4200 


AAAAATTGGC 


TTAAGTTTAT 


TCACGTTTAT 


TGGCTTAATG 


TTTATCTATA 


ATCAAAACTT 


4260 


ATGGAACGAT 


TTAATGAGTA 


CCGTTACGTT 


AGTTTTACTT 


TCTAGTGTTA 


TTTCCATTAT 


4320 


CATTjGGTGTG 


CCACTAGGGA 


TTTTAATGGC 


GAAAAGCGAA 


AAAGCCAAGA 


GTATCATTAC 


4380 


ACCGATTTTG 


GACTTCATGC 


AAACAATGCC 


TGGTTTCGTT 


TACTTGATTC 


CAGCCGTTGC 


4440 


ATTCTTCGGA 


ATTGGTATGG 


TTCCTGGGGT 


CTTCGCCTCT 


GTTATCTTCG 


CTTTACCACC 


4500 


AACCGTACGC 


TTTACGAACT 


TAGGAATTCG 


ACAAGTGCCA 


AATGAACTTG 


TTGAAGCAGC 


4560 


TGATTCTTAC 


GGTAGTACTG 


GCTGGCAAAA 


ATTATTCAAA 


TTAGAATTAC 


CTTTAGCCAA 


4620 


AAACACGATT 


ATGGCTGGTG 


TTAACCAAAC 


AACCATGCTA 


GCCCTATCAA 


TGGTAGTTAT 


4680 


TGCTTCAATG 


ATTGGTGCGC CTGGATTAGG 


ACGTGGCGTC 


CTTTCCGCTT 


TACAACGTGC 


4740 


ACAAGTCGGG 


AACGGTTTCG 


TAAATGGTGT 


GGCTTTAGTT 


ATTTTAGCGA 


TTATCATTGA 


4800 


CCGTTTTACT 


CAACATTTAA ATAATAAAAA 


AGCAGCTCCT 


AAAGCAGCTG 


GTGCCACCTC 


4860 


TAAAAAGAAA 


AAATACX^GGA 


TTATTGCAGC 


GGTTGTTGTA 


ATTGTCGCTG 


GCTTAGTTGG 


4920 


TGCTTCAATT 


TTTACAACAA CAAATGATAA 


GCAAATTTCC 


CTTTCTTATG 


TGGAATGGGA 


4980 


TACCGAAGTA 


GCCTCCACAC 


ATGTGGTTGC 


CGAAGTCTTA 


AAAGATATGG 


GGTATGATGT 


5040 


TAAAACGACC 


CCTTTAGATA ACGCAATCAT 


GTGGGAATCT 


GTAGCTAAAG 


GTGAAACAGA 


5100 


TGCCATGGTT 


GGTGCGTGGC 


TGCCAGGAAC 


TCACGCAGAG 


CAATATAAAC 


AATATAAAGA 


5160 


TAAATTAGAC 


GACCTTGGTG AAAACCTTAA 


AGGCGCGAAG 


TTAGGAATTG 


TTGTCCCTTC 


5220 


TTATATGGAT 


GTTGATTCTA 


TCGAAGATTT 


ATCTGATCAA 


GCTGGGAAAA 


AGATTACTGG 


5280 
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AATTGAACCT 


GGGGCTGGTG 


TTGTTGCAGC AGCAGAGAAA 


ACTAAAGAAG 


CTTATCCTAA 


5340 


TTTGAAAGAT 


TGGTCTGTTG 


AAACTTCTTC 


ATCAGGTGCC 


ATGACTGTCG 


CCTTAGGACA 


5400 


AGCAATTAAA 


AACAACGAAG 


ACATCGTTAT 


TACTGGCTGG 


TCTCCTCACT 


GGATGTTTGC 


54 60 


AAAATATGAT 


TTAAAATATT 


TAGCTGATCC 


AAAAGGCACT 


ATGGGCGGCG 


AAGAAGCGAT 


5520 


TCACACAATG 


GCTCGCCAAG 


GCCTAAAAGA 


AGATCAACCA 


GAAGCTTACA 


AAGTGTTAGA 


5580 


TAATTTCCAT 


TGGACCACTA 


AGGATATGGA ATCTGTCATG 


TTAGAAATTA 


ACGAAGGCAA 


5640 


AGACCCACAA 


GAAGCTGCAC 


GTGATTGGGT 


TGATTCACAT 


AAAGACCAAG 


TAGCAGAATG 


5700 


GnAAAAATAA 


TTCTTAACAA 


ATGACTTCCC 


CATCTATTAA 


AAGrGGGCGT 


TGAACAAATT 


5760 


GTTCAACGCC 


CTCTTTTTTC 


GCAAAAAAAA 


CCACTCAGTT 


GGGTAGACTG 


AGTGGAAAGG 


5820 


AGTTATTTTT 


ACTTGAGGAG 


TAAAAATGAA 


AAGTGTTTAT 


GTTGGGTGTT 


TTGTTGGTAT 


5880 


GCTTATATAT 


TAACGAGAAC 


CTGTGAATAA 


TTTGTGACGA 


AAACCTTGGA 


AATTTGCTAC 


5940 


GAATTACTTA 


AAAAAGTTTA 


AATGTGTCTC 


ACCCTAGTTA 


ATCGATACAG 


TGATTATTTC 


6000 


TTTTTAGCAT 


TTTGGATGGC 


TTTAAATAAT 


GTTGAAAATT 


GTCCTTTGAA 


AGACATCACA 


6060 


TCATCAAAGA 


TATCAACACC 


ATCGTTTAAC 


ATTTTGACGG 


TAACCCAGCC 


AAGCGCTTCC 


6120 


GTAATTGCTA 


CCGCAACTGA 


AGCATTGACC 


ATGCCACCAG 


TCACAGTTCC 


GACAACAGGA 


6180 


ATCACTTTTA 


GAATATTCCC 


GACTGCACTC 


TTACCTAAAC 


TAACCACTAC 


TAATTCTTTG 


6240 


GTTAAGCTTT 


TCCCTAAGCT 


TTCAGACCAA 


GACTGACCAA 


AAATTTTATG 


CAGCCGTGAC 


6300 


ATCATAGTTA 


ATTGAACAGG 


TACCAGTAAA 


AAAGCATCCG 


AAAACGGAAT 


TGGCGAAAAC 


6360 


CCAATAATTG 


CTGCTGACAA 


TGCAGCAAAA 


TGAATCGTTT 


GATGACTTTT 


CTTTTTCACG 


6420 


TCAGGTGAAA 


CACCAGCCAA 


AAATTCATCA 


AATACTTTGT 


CGAGTTGTGT 


TTGACTGTTC 


6480 


GTCACTAATT 


GTTCTGCTTT 


ACCTTTAGAA GTTGCGAAAG 


CATTCATTAT 


AGCCGTAGAC 


6540 


GTTTTTTGTG 


6AAATTTTTT 


CAAGACATCT 


TGAAAAAAAG 


AACGAAATTC 


TGGGTCTTCG 


6600 


ATTTCTTCAT 


TCAATTGCTT 


AGTTTCTTGT 


TCTGCTAATT 


CTTGTTTTTC 


AATCTCTTCA 


6660 


AAAAGTTCTT 


GTTTCTTTTT 


GTTTTTATGT 


TTTTTTCTTC 


TCATCTCATT 


AACTCCTTTA 


6720 


TCTATCAATA 


TGTAGAAACA 


AAGTATAGCA 


TAGCCCCTGA 


TTGATTCTCT 


TTTAGATGAT 


6780 


TTGAAAAGCC 


AGAACTTATC 


TGTTTTTTAA 


ACAAATAGAA 


ATGTTCCAGC 


TTTTCTCAAA 


6840 


AAAATCAGCT 


ATTTTGTTTG 


TTCTTCACCG 


CAATAAACCG 


TCAAGTGTTT 


AGGCAAAATC 


6900 


TTCAGCGTAA 


TTGGCAAAGC 


AGCTCCTTCA 


TCACCATCCA 


CATTAGTGGT 


TAATTCTTCT 


6960 


TGGTTTTCTA 


AGGAGATTGT 


TCCTTCTTTA AAGGTTAGGT 


AAACTAAATT 


ATCAGTGGAC 


7020 


TGGTCGACGC 


CTTTTAACAA 


ATCTGGGACT 


GCTTTTACTG 


CATCCCATAA 


TGATTGATcT 


7080 


TTTAAATAAA 


CTAAATGAAG 


TTTGCCATCA 


TCCACTTGGG 


CCTCTGGTAA 


TAATGTTTCA 


7140 


AACCCACCGA 


TTGAATTCGT 


TAAGCCAACT 


AAAACGGTAC 


TACTTTCAAT 


GGTTTGTTCT 


7200 


TTTTGATCCA 


AGCTTAAATG 


AAAAGGATAC 


GTTTGAGCAT 


TCGCTAAATG 


TTTGGCACCT 


7260 



wo 98/50555 



638 



PCTAJS98/08985 



GAAATAAAAT 


AAGCGAGTTT 


TCCTAACTTC 


GTTTTTTGTT 


CAACGTCCAC 


GTCGTTAATT 


7320 


GACTCTGGAA 


TCGTCCCAAT 


TGCAACGACA 


TTCATAAAGT 


AGTCGTCATT 


GATTTTCCCC 


7380 


ACATCTAAAG 


CACTGGTTTT 


TTCTAAATCT 


AATTGTTGAA 


TCGCTTCTTC 


AGGGTCCATC 


7440 


GGTAAATTCA 


AGGCTCGCGC 


TAAATCGTTC 


ACCGTTCCTA 


AAGGAAAAAA 


GCCAAATTTT 


7500 


GGGCGGTAAG 


CTTGTTCTGC 


CAACCCGCTA 


ATTCCTTCAT 


TAACGGTTCC 


ATCTCCCCCC 


7560 


ATTACGAAAA 


CACTGTCGAA 


GTGACTTTCC 


GCAGCTTCAC 


GGGCAAATTG 


TTCTGCATCG 


7620 


CCGCCTTTTT 


CTGTTTGTTT 


CACTACAACT 


TCATCAAATA 


ACTGCTTTAA 


TTTTTCTTCT 


7680 


GCTAACGTTT 


CAAATTCTTT 


GGCTTTTTCA 


CCACCAGAAC 


TGGGGTTAAC 


AATTAAAACT 


7740 


GCTTTTTTCA 


CTGGTACCAC 


TCCTCATTGA 


TATCTAAATC 


GTTTGATTCC 


"TTACAATTAT 


7800 


TATAAAAGAT 


ATTTTTGATT 


CTGGCGAAAA 


ATATGCTGAC 


ACTTGCTTAA 


CAAACAAGCA 


7860 


TTCCTGCTCC 


TCACTACTTT 


TCAAAACGTT 


CTTTAAAGCA 


ATAGGCAAAA 


TAATAGAACA 


7920 


ATTTCCAAAA 


ATCTTGCCAA 


ACTTTTGGAT 


CCATCCCCGT 


CAATTCATGA 


ATTCGTTTCA 


7980 


AACGGTAATT 


CAATGTATTT 


CGATGGATAT 


GTAATACTTG 


TGAGGTTTCA 


TTGATGTTTA 


8040 


AGTTATGaTT 


GGCGAAAAAA 


ATAACGGTTT 


CATAGTATTC 


ATCATAAATC 


GTCTGGATGG 


8100 


TTTCAAAAAA 


ATCCAATTCA 


ATATGAAAAG 


GAAAATTAAA 


AATTCCTTGA 


ATGAAAAAAT 


8160 


CTTCTGCTTG 


GTAAAACTTA 


GCCTTTTCTA 


ATTGGAGAAA 


CGATAGAAGT 


AAATACGTGT 


8220 


TTTGTGCCTC 


TTGGACCATT 


TGTTGAATCG 


TGGCTTTTTC 


CGAACTTATC 


ACAAGGTATT 


8280 


CTGTTACTGG 


CTTTGTTATC 


TGAGGTGGTT 


CAGAAGTAAT 


CACCAGATAA 


CAATTTTCCC 


8340 


ATTCAAATAT 


TTCATGTCCA 


GGATAATAAC 


TGGATAATAA 


GCGCTTGTCC 


TTGGTCAGTA 


8400 


CACAATGATT 


CTGGTCATCT 


AATTGAAGGT 


TATATAGTTC 


CAACGCTTCA 


AGTTTTAAGG 


8460 


CTTCTGAGTA 


AGAAGAATCA 


ACATGCATTA 


AGTGCTGGAT TVAACTGATTT 


TTTTGTTTCT 


8520 


TCTTTTTTTC 


GCGTTCTTGT 


TGTGTATTTA 


ATTCTTCTGT 


CAGTAAAAGA 


ACAATCGA6C 


8580 


GCACAAGCTT 


GGTAAATTTG 


CGCACCTCGT 


CTGGTTCTCC 


TGAAATGCCT 


ACGACACCTA 


8640 


AAATGTGATT 


ATTCAAAATA 


ATCGGTTCAT 


TGGTGCCTTT 


TTTCTCTAAA 


GAAGTATCTT 


8700 


TATAAATTTC 


AATATTTTTT 


CGTTCATTGA 


TGGCTCTTTC 


TGCACCACGA 


TGGCGCTCCC 


8760 


CAATTCGAGC 


AGAATCTCCA 


CTCGCAATAA 


TTTTTCCTCG 


TTCATCCATA 


ATATTGATGT 


8820 


TATAAGGAAT 


ATCTGCCATC 


AACTTTTCAA 


CAATTTTATT 


TGCTTGGATT 


TTGTCTAATT 


8880 


TTAACATTGC 


TATCGCCAAC 


TTTCTAAAAA 


GCGAGTaGCG 


TTTGGGACAT 


AAGCCGCAAT 


8940 


GACTTATGTC 


CCAAACgCTT 


AATTCTTCAG 


ATCAACGACT 


GAACATTTTT 


ATTTCAGCTT 


9000 


TCATTACTGA 


TGATACCATG 


GTTAGTTCTT 


AGTCACAAGG 


CACCAGCTGT 


AATGTACGAG 


9060 


CAATATTTTC 


AACGGTTCTT 


TCTAAATTAA 


CTGGTCCATT 


TTTCAATGCT 


GCTTCCAATG 


9120 


GTTCGGCTTT 


GGCCAAAATT 


CCAAAAATAG 


CCGTCATCCC 


TTCTTCATAG 


AGGACATCCA 


9180 


CATCTTTGCC 


GATATAGCCA 


GCACAAGCGA 


AAACCGGTTT 


TTGATACTTC 


TTGGCCAAAC 


9240 
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GCGAGACGCC 


ATAAGGGGCT 


TTGCCGAATT 


TTGTTTGAAA 


ATCCATACCG 


CCTTCACCAG 


. 9300 


TAAAGACATA 


ATCAGCGGCT 


TTTATTTTTT 


CTTCTAATTG 


ATTCACTTCT 


ACGACAATAT 


9360 


CAATTCCTGG 


GCGTAACTGA 


GCTTTTGTGA 


ATGCTAGTAA 


CCCTGCACCC 


A7UVCCACCAG 


9420 


CAGCTCCTGC 


GCCTGGCATC 


ATCTCAACAT 


CAATGCCGAT 


CTCTTTTTTG 


ATGATCGCTG 


9480 


CATAATGGGC 


TAAGTTTTTA 


TCTAATTGAG 


CCACCATTTC 


TTCTGTCGCA 


CCTTTTTGCG 


9540 


GACCAAAGAC 


ATGAGACGCC 


CCTGTTGGCC 


CTGTTAGTGG 


GTTATTCACA 


TCACTTGCTA 


9600 


CTAGAACTTC 


GGTAGCAAAA 


ATGCGTTGAT 


CAAACTGTGT 


TAAATCGATC 


TGCGCCAGTT 


9660 


TATCTAATGC 


ACCACCGCCA 


CGTGTCAATT 


CTTGCCCTTC 


CTTGTCTAAC 


AAACGCGCAC 


9720 


CAAGTGCCTG 


AATCATACCT 


GCACCCCCAT 


CATTTGTCAC 


GCTACCACCA 


ATGCCGATAA 


9780 


TGATTTT-TTT 


TGCCCCATGA 


TTGAGCGCAT 


GTTGAATCAT 


CTCTCCTGTA 


CCATAAGTAG 


9840 


AAGTGATCAA 


GGGATTGCGT 


TCTGCTGCTT 


TAACCAATTC 


AATTCCGTTG 


GCTTTCGCCA 


9900 


TTTCAATGAC 


CACCGTTTCT 


TGGTCACCTA 


ATAAGCCGTA 


ATAGGTTTCA 


ACGATTTGAT 


9960 


CAGGAAGCGG 


TCCTGCCACG 


CGCACTGGTA 


CTCTTGTACC 


TCCTGTTGCA 


TCCACTAAGG 


10020 


AATCCACCGT 


TCCTTCCCCG 


CCGTCTGCCA 


TCGGGACATG 


TTGAATGTCT 


GCATCTTTAA 


10080 


ATACAGCTAA 


AATCCCTTTT 


TCCATTGCCA 


GACAGGCTTC 


TTTTGCACTC 


ATACTTTCTT 


10140 


TAAATGAGTC 


TGGAGCCAAA ACAAATTTTT 


TCGTCATCTT 


TTCGCGCCTA 


CTTTCCAACT 


10200 


AGTTTAAAAT 


AAAGCCATAT 


AAAATGGTTG 


CAACAATCGT 


CATTACCCCA 


CCGACACACA 


10260 


TTTCATAAGG 


AATCACTCGA ATTCTTTCTT 


TAATCGTCAT 


ATTCATACTT 


TGTGCCGTAA 


10320 


TATGGAAATA 


ATTGCCTTGA 


GGAACACTAT 


CAATCACAGT 


TGCACCAGTA 


TGCACCATGA 


10380 


CTGCTGCTGA 


AAGTGAAGAT 


GTGCCCATAT 


CTAAAATCGC 


TGGTCCAAAA 


GAACCTGTCG 


10440 


CTAAAATAAC 


CGCTGTGGAT 


GTTGAACCAA 


CTGCTAAGCC 


CATTAAAATA 


CCAGAAATCG 


10500 - 


GTGCCAAGAA 


GGTTCCAGAA ATTCCCATAA 


CAGAAATTAA 


GTGGACAATT 


TGTGTTGATA 


10560 


AATCAGAAAC 


AGAAATTAAT 


CCAGCAATTG 


CACCTGCACC 


AATTAAAATT 


AAAACGGTAT 


10620 


CGGTTACTCT 


CTTCAAGCCA 


GAAGATGCAT 


AAAAGGCCAG 


TTGTTTTGTT 


TGCCCAATCG 


10680 


CAATTGTCCC 


AACAACTGAG 


GCGAACGGCA 


AAATATACAG 


AGCATCAATA 


TTCAAACTTT 


10740 


TGATGAAAGA 


AATATTTAAA ACGGCGCCAA 


TCGGATTAAG 


TAATAATAAG 


ACAAXCGCTA 


10800 


ACACTGGCGC 


GATCAACGCC 


TTTTTCAAAG 


GTAATAATTC 


TTTTGTTTGT 


TCCTCTACAA 


10860 


GATCTGTTTC 


TTTGACCATT 


GATCCTTTGT 


TTCTCATGAT 


ATTTGCAAGA 


ACAACAGCTG 


10920 


TAATTAAGCC 


AAATACGGCT 


GGAATAAAGC 


CAGCAATCAT 


AACGTCACTT 


AGAGGTAAAT 


10980 


CAAAGCCATT 


CGCCGCTGCA ATAGTATTTG 


GATTAGGCGA 


AATAATATTG 


CCCGCTTTGC 


11040 


CGCCACCTGA 


TAAGGCAATC AATAATGATA 


ACTTACTGAT 


ACCCGTACTT 


TTCCCAACAG 


11100 


ATAGAGCAAT 


CGGGGCAACA ATCAACACTG 


CAACAGGAAT 


GAATACGCCC 


ACTGCTGTAA 


11160 


TTACCATTGT 


GGATAATGCC 


AAAGATAAAA 


TAGCTTTTTT 


CTCGCCAAGC 


TTTTGTACAA 


11220 
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TGGCGTTGGC 


AATTGATTCT 


GCCGCACCAG 


ATTCCATCAT 


CACACCCGCC 


AACATGCCGG 


. 11280 


CTGCTAAGAC 


ACGGATTGCG 


GTTCCCATAA 


TACTTTGTGT 


TCCGTTTAAT 


AGAACAGCGA 


11340 


CTGTATCTGA 


CAGGGAAGCG 


CCTCCAATAA 


GACAACCAAC 


AATTGCACCA 


AAAAACAAGG 


11400 


AATACACAGG 


ATTAAATTTT 


TTCAAAATTA 


ATACTATCGC 


TACAACTAAT 


CCTAAAATTG 


11460 


CAGCAACCCA 


AGATATTGTA 


ATATCTGGCA 


TTCAAAATCA 


GCTCCTTAAA 


TTGTTTACGC 


11520 


TTTCATTCTA 


TCAAAGTCCT 


TTCTTTTTAA 


AATTGTAAAT 


TTAACAGGAT 


TTATTGTTCA 


11580 


AATGCACAAA 


AAAACCGTTC 


TTAAAGAACA 


GTTTTTTGAA 


ATCTTTTTCT 


ATCTTTCTCA 


11640 


ATAATGACCA 


CCAATCCAAC 


GaACAAAAGA 


ACACTCACAA 


TTAACAATAC 


TGCCGTTTTA 


11700 


TTATATCCAT 


GAAATAGCkT 


cGCCTCCACA 


TGCGTAAAGT 


ATTGAGGAAG 


GAATTACACC 


11760 


AATGAATATA 


GCGGCTATTA 


AATTTTTTAA 


TGGCAACTGC 


ATAGTATCTG 


CAGCAAAATT 


11820 


AATCACAGAA 


GAAGGAATAA 


TTGGAATCAT 


GTAGCCCACC 


ATAATCCCAA 


TTTTTGGATG 


11880 


CTTCATCCGT 


CGAATGGACT 


GGACCCAATG 


ATTGGTCTCC 


GTTTTTTTAT 


CTAAAAATTT 


11940 


TATATGATGC 


ATCAAAAAGA 


TTGCAACCAA 


ATTTCCCATA 


CTATTCCCTA 


AAACGTTAAT 


12000 


GAGTGTGCCG 


CTGAAGGGAC 


CAAAACTCAA 


TCCTGCAATC 


ACACCAATTA 


CTGAGGTCGG 


12060 


AATCCCTGGT 


ACTGCACACA 


TGATCGCGGT 


CA7U\;VAAACC 


AACAATAGCG 


CTGCTGAAAA 


12120 


ACCATGTGAA 


CGAATTCGCT 


CTAATAATTG 


TGTCTTACTG 


GCTTTAGGAT 


TCAAAAATAA 


12180 


TTGAATATCT 


GTTTGATATT 


CTAGATACAA 


ACGGTAACCT 


ACCAGCAGGA 


TTAAAATAAG 


12240 


CCCGCTACTG 


ATTAGTAGTA 


TTCGGATTGT 


TTTCCTCTTC 


ATTTTCTTTG 


CTCCTTTTAT 


12300 


GCTTTATACA 


AAGTGTAGCA 


QAGGATTCTT 


TGTGCCTCAA 


TTAGAAGCTT 


GTATGAATAA 


12360 


AAGCCGCCCC 


TAAAAGTTAG 


ACTAAAAATC 


TAACTTTTAG 


GGGCGATTCG 


AAAGTTACGA 


12420 


AGATACTTTT 


TTTATTTCCG 


ATACATCTTA 


AACTCCAAGA 


ACAGATCGTT 


ATAAATTCCT 


12480 


AAGTATTCTT 


GACCTAAGTC 


TTGGTAAACT 


TCTAAATGTT 


TGATAGTTTC 


ATCATCTGGA 


12540 


TAAAATTGTT 


TATCTTCGGC 


AACTTCTTTT 


GGTAATAGTT 


TTTTAGCTTC 


TTTATTTGGT 


12600 


GTGGAATAAC 


CAATATATTC 


TGCATTTTGT 


GCCGCATTTT 


CTGGTCGTAA 


CATAAAGTTC 


12660 


ATAAATGCAT 


AGGCACCCTC 


TTTATTTTTG 


GCTGTCTTAG 


GCATCACAAT 


GTTATCAAAC 


12720 


CAGAGATTAG 


AGCCTTCACT 


GGGAATCACA 


TAATGTAGAT 


GTTCATTGTT 


TTCTAGCATT 


12780 


TCAGCAGCTT 


CACCAGAGAA 


AGTTACAGCA 


ACTGCACTTT 


CTTCATTAGc 


CATGTACATT 


12840 


TTGATTTCAT 


CGGCAACAAT 


TGCTTTGACA 


TTGTTCGTTA 


ATCGGTTTAA 


CTTATCGGTA 


12900 


GCCTGACGTA 


ATTGTTGGTC 


GTTTTTACTG 


TTTAACGAAT 


AGCCTAAACT 


GTTCAAAGAT 


12960 


AATCCTAACA 


CTTCGCGAGC 


GCCATCAATC 


AGCATGACAT 


TATTTTTTAA 


TTCCGGGCGC 


13020 


CATAAATCAT 


CCCAATGTTG 


GATCTGACGG 


CCGTCAATAA 


ATTTATCATT 


ATAAATAATC 


13080 


CCCAACGTGC 


CCCAGAAGTA 


CGGAACGGAA 


AACTTATTTT 


TGGGATCAAA 


GGATTGATCT 


13140 


AAAAAGCGTG 


CATCAATGTT 


TTCTAAGCCT 


TTTAATTTGC 


TGTGATCAAG 


TGGTAAAAGC 


13200 
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ATCTTCGCTT 


TCATCATTTT 


TTGAATCATA 


TATTCAGAAG 


GAATGGCAAT 


ATCATAGGCT 


. 13260 


GTGCCACCTT 


GCTGAATTTT 


TGTATACATA 


GCTTCATTAG 


AATCAAAGGT 


TTCGTAATTG 


13320 


ACTTTATAGC 


CTGTTTCTTT 


TTCAAATTTC 


TTAATCAAGG 


CCGGATCTAT 


ATAGTCCCCC 


13380 


CAATTGTAAA 


TGGTCAAGGT 


ATCTGCTCCT 


GCCATGCCAC 


TTGCTTTCTC 


CAATTGGCGC 


13440 


ACGCCAAAAA 


ACAAGATGAC 


AATAATAGCG 


ATAATTCCAA 


TAAAAAGTGA 


CTGTAATTTT 


13500 


TTCATTTTAA 


CTTGGCCACC 


TCACTTTGTT 


CCCGACGCAT 


TTTCTTTAAA 


CGTTTTGGCG 


13560 


TATTGTCTTT 


ACTGATAAAG 


TAATAACCGA 


CAACTAGAAT 


CATTGAGAAT 


AGGAATACTA 


13620 


ACGCACTTAA 


GGCATTGATT 


TCTAAACTAA 


TTCCCTGACG 


TGCTCGTGAA 


TAAATTTCAA 


13680 


CAGATAATGT 


TGAAAAACCA 


TTGCCTGTCA 


CAAAGAATGT 


TACGGCAAAA 


TCGTCCAACG 


13740 


AATAAGTAAA 


GGCCATAAAG 


TAGCCAGCAA 


TAATTCCAGG 


CGCTAAAAAA 


GGCAAAATAA 


13800 


TATTTTTAAT 


CACTTGAACA 


TTATT6GCCC 


CTAAATCCCG 


TGCCGCATCT 


ACCATAGAAT 


13860 


CGTTCATCTC 


TTGTAGTTTT 


GGTAAAACCA 


TTAAAACAAC 


AATTGGAATA 


CTAAAGGCAA 


13920 


TATGCGACAA 


TAATACACTT 


GTAAAACCTA 


AACTGATAAA 


CCCTACTGCC 


GTGAAGAAAA 


13980 


TTAAGAAACT 


GGCTCCAATA 


ATAACATCTG 


GTGAAACGAG 


TAAAATGTTA 


TTAAAGCTAA 


14040 


GCAACGTGTT 


CCGGTTTTTA 


CGACGTTTTG 


TATAATAAAT 


CCCCATCGCA 


CCAAATGTTC 


14100 


CAATGATGGT 


TGCTATTAAA 


GCCGATAAGA 


AAGCTAGCAA 


AAACGTATTC 


AGAACAATGG 


14160 


TTAACAAGCG 


TGTGTCTTCG 


AAAACGTTCC 


GATAATTATC 


TAACGTAAAA 


CCAGTAAAcT 


14220 


GATTCATCGT 


TTGCGTGTCG 


TTAAACGAAT 


AAAAAAtCAA 


ATAAAAGATT 


GGTGCGTACA 


14280 


GTAGCGCAAA 


GACAACGAAT 


AAATACACAT 


AAGACCATTT 


AAACTTTTTC 


TTTTGCATTA 


14340 


TTTGCCACGC 


CCTTTCTTCT 


TCTCACCCGT 


CAGCAACATC 


ACAATAAACA 


TGGCAACAAT 


14400 


TAAAATCACA 


CCTATTGTTG 


AACCCATGCC 


CCAGTTTTGC 


GTTACTAAGA 


AGTGTTCCTC 


14460 


AATCGCCGTT 


CCCMCGTAA 


TCACCCGATT 


GCCGCCAATT 


AAACGCGTTA 


ACATAAAGAG 


14520 


CGAGAGCGAA 


GGAATAAAGA 


CAGCTTGAAC 


GCCACTTTTC 


ACACCGTTTA 


GGGAGAGAGG 


14580 


AAAAATGACA 


CGGCGGAACG 


TCTCCGCACT 


GCTAGCTCCT 


AAGTCCCGAC 


TAGCACTAAT 


14640 


CAATGAAGGA 


TTCATCTCTT 


CTAAGGCATT 


AAAAATCGGC 


ATAATCATAA 


ATGGAATTTC 


14700 


AATATACGTT 


GCGACAAATA 


AAAAACTGAA 


ATCTGTAAAC 


AGAATCTGGT 


GCGTACCCAA 


14760 


ACCCATAAAA 


CTCATAAACT 


GATTGATACT 


CCCATGAATA 


CTGAAAATCC 


CAATAAAGGC 


14820 


ATAAGCTTTT 


AATAATAAAT 


TCACCCACGT 


TGGTAAAATA 


ACAAGCATTA 


ACCACAACTG 


14880 


TTTATGCTTG 


AGCTTTGTCA 


AAAAATAAGC 


CGTAGGATAA 


CTTACCAATA 


ACGTAAAAAT 


14940 


GGTAATCAGA 


AAAGCATACC 


ACACTGAATT 


AATGGTCATA 


CTTAAATACG 


TTCCTGATGT 


15000 


AAAATACGTT 


TGGTAATTCG 


CCAAGGTAAA 


CTGGCCATTC 


ATATCGAAAA 


ATGATTGATA 


15060 


AATAATCATT 


AGCACTGGCG 


CAATGACGAA 


TAAAAGTAAC 


CACATTACAT 


AGGGAATCGA 


15120 


ATAAATTCGA 


CGCATTGTTT 


TCATTTTATT 


TTCCCCTTTC 


GTTTACTCGT 


CGTAACTTTC 


15180 
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TAAACGGGCA 


TCAAAGTCTT CTTCTGATTC 


•ATTGAAGCGC 


ATCACGTGAA 


TATCTTCTGG 


. 15240 


CTCAAAAGAG 


AGCCCTACTT GAGCGCCTTC 


TTTGGCTTTT 


CGAGTGGAAT 


GCACCATCCA 


15300 


TTCGTTTCCT TGTTCATCAA AACAAATAAT TTCATAATGA ACGCCGCGGA ATAACTGCGT 


15360 


GTCCACCGTT 


ACGGTTAGCT TGCCTTTTTC AAGCGTGGTA 


ATCGTTAAGT 


CTTCTGGACG 


15420 


TAAAACGACT 


TCTACAGviTT CATTTTTCCG 


CATGCCGCCA 


TCGACACATT 


CAAACTTTTT 


15480 


GCCCACAAAT 


GAAACTAAAT TATCTTCCAG 


CATCACGCCA 


TCCACAATAT 


TACTTTCCCC 


15540 


CACAAAGTCA 


GCAACAAAGT GATTAATCGG 


TTCATCATAA 


ATATCGACTG 


GTGTGmCACT 


15600 


TTGCACGATA 


TGACCTTTAT TCATGACAAA AATTTCATCA 


CTCATCGCTA 


ATGCTTCTTC 


15660 


TTGATCaTGG 


GTCACAAAAA TAAAAGTAAT 


TCCTAAACGT 


TGTTGTAAAT CACGTAGTTC 


15720 


ATATTGCATC 


GCTGTGCGTA ATTTCAAATC 


TAAAGCGGAT 


AACGGCTCAT 


CTAAAAGCAA 


15780 


TACTTTTGGC 


TCATTGACAA TCGCTCGTGC 


AATTGCGACC 


CGCTGACGTT 


GGCCCCcAGa 


15840 


CATTTCACTA ATTTCaCGTG TTTCGTAGCC 


CGGTAATTGA 


ACCAAACGTA 


ATGCCTCTTT 


15900 


CACTTTTTTT 


TCAATTTCTG CTTTGGATAG 


TTTTTTGATT 


TTTAAACCAA 


ACGCAACATT 


15960 


ATCAAAAACA TTCATGTGAG GAAACAAAGC 


ATAATCCTGA 


AACACGGTAT 


TGACTTGGCG 


16020 


TTTATTGGCA GGTACATCAT TTATTCTTTT 


ACCATCAAAA 


TAAACATCGC 


CAGAAGTCAC 


16080 


GTCATTGAAT 


CCCGCAATAA TACGCAAAAT 


CGTTGTTTTG 


CCACATCCTG AAGGTCCTAA 


16140 


TAACGTATAA AATTTTCCTT CTTCAATTTC 


AAAGCTGACA 


TTTTTTAATA 


CTGGTTCATC 


16200 


GTCAAATTGT 


TTGACAACAT TTTCAAATGA AATCACATGG 


TTTCTCACTG 


GATAAACTCC 


16260 


TCCATTTCGT 


CCTTTATAAA TACGATTCTG 


TCGCCACAAT 


TAACGCACGA 


CTTTTTCCTC 


16320 


GACCATTGTT 


GCTAATTTGG TGAGGTTCTG 


TTGCTTGGTA 


ATACATGGTT 


TGACCTTTTT 


16380 


TAGCAATATA 


CGTGGCTTCC CCTAATGTCA 


AACGAATGGA 


GCCATCAATC 


ACATAAATAA 


16440 


ATGTTTCTGA 


TAGAGAAGGT TCAAACGTTT 


TGTATTCGCC 


ATTTTTATCA 


AAAGTAATAA 


16500 


TGACAGGTTC 


CaTTTCTTTT TCATTCGAGG 


CTGGAATTAA 


CCATTTTAGT 


TCATAGCCAT 


16560 


GTTCTTCATC 


ATAATAGAGG GTACTTTCTT 


CTTCATTGTA 


GACAATTTTC 


TGTTCAAGCG 


16620 


GTTGCTGACT 


AAAAAACTGC TCTGGCGTCA 


CGCCTAGTAC 


TTCTAAAATT 


GAGAAAAATG 


16680 


TTTCCATTGA TGGTGAGCTT AAATCCCGCT CCAATTGGGA AATATAGCCT TTGGATAAAT 


16740 


CCGTTCTTTC GCCTAATTCT TCTTGTGTTA AATTTTTCTG AATGCGTAAA TTTCTCAACT 


16800 


TCTCACCAAT 


TTCCATGGCA CACCTCGTTT 


CTATTTTCAG 


CTAATCTTTT 


CAACAAAAGT 


16860 


TTTGTAATAC 










16870 


(2) INFORMATION FOR SEQ ID NO: 99: 









(i) SEQUENCE CHARACTERISTICS:- 

(A) LENGTH: 3905 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 



GAATTATTTC 


AATTACCTGT 


TGACTATCAA 


GATGAACGAT 


TAACAACTGT 


CCAAGCAGAG 


60 


CGTTTCTTGG 


TGGAGCAAGC 


AGATGCTTCA 


AGAGCCAAAA 


GAAAAAAAGT 


GATTGATAAA 
on X X on x nnn 




CTAGCGGCAG 


TAATGATTCT 


ACAAAATTAT 


TTAGATGCAC 


ATAGCCGATA 

n X now won X t\ 


ATTATCGAGP 

nx XAX^OAOV^ 


X.OU 


TTTGAATACA 


TCAnGGGAGA 


GAAGAAGAAT 


GACAGAACAT 


AAPPAPGATP 

AAV^V^AWOAX Vtf 


ATGATPAPCa 
A X o A X t.«A\y» V9 A 






ACATTACTTT 


AGTAGACGTVA 


PAAGGAAATG 


AAAPATTATA 
AAAV,>AX JIAIA 


PGAGATPTTA 
V^oAoAX ki* X X A 


oUU 


PTAACTGTTG 


ACGCTCAAGA 


AG AAT TPGGT 

nOnn X X V^OVa X 


AZVaAATTATG 
nncuvix xnxo 


T* *p "PT A *p a T* a 

XXX X AX XAXA 


TPPA/^pan/:a 

X v,..l,.rAok«AooA 




AT TCCTGAGG 
t\ X X ^ w X vtnou 


ATGAAGATGT 


TG A AT TGr* A A 


GPT"P APTPPT 
o^ X X X 1 


ATGTGGAAAA 
Al o 1 ooAAAA 


PGPana APPT 

VfOk^AoAAoo X 


ii on 


Ar*AGAAGGrG 


ATTTAPAGPA 




GAPGPTGAAT 


GGGATATGAT 
oooAlAioAx 


TGa apa a/^*PT 

XoAAoAAoX X 




X XV«nnXnV«nx 


X \^nX S3\3\,0 X \3r\ 


no A HP a A/^ 11 f 




ar*^ar*ar*aar" 
AU oAoAo AAo 


KJak* X xTCT 




PPTATTGAGA 
v.>^lr\X loxaon 


n\3 X X X x\. 


i lx\nxnl\xiv\ 


1 X oi XAi/lAx 


oiAUAlls J. l\j 




OUU 


AAGATAPTAT 


PAf TATPATT 


n A PTfi Tr* B TP 
o no 1 o 1 \m\ X Sj 


oA 1 AAA oilA o 


oAL»AoUAl xij 


1 I AVj 1 i X X A 


bbU 


CAAGATGTTG 


TAAAAAPGTA 


A P A TCC**rTfZ'V 


AAT/^AP/^TTa 


aTTTar'Taar* 

A 1 1 X Ao X AAL* 


XAiL.1 Al9W\l 




AATTTATAAT 
fv\x X xTxxnnx 


X X o X ox XI\Xn 


PTAAGTATr^A 


PPTTTTAPTa 
11X1 Ao 1 A 


aTar'Tr'TTaa 

AXAl^XoX xAA 


XXX JCAJCAdi A 


Ton 
/oU 


T T AAAT A A AA 

X XTVVxxjnnJnn 


AAATGGfTPP 


APPTAPT^2/2P 


ATA^nTAI^AfZ 
Al Aoo 1 AoAo 


pppTPapaaa 

wCo X WioAAA 


T'pr'a^T'pa a a 
X X Wioi XAAA 


Q4 n 


CAAACACGTA 


ATTATTGCTG 

AX XAX X\9\9XS9 


TGGTf2PATT»^2 
XV9\3l\9V«>AA 1 o 


PPAPCa a PTfi 


aaaTTPpa/iT 

AAAX X^WloX 


TaaTaaappa 

XAAXAAAoLnA 




AGTAAATC!AA 


AV*nXr\x/ax X X 


PAPP'PPPT'n'P 
X\^\^ X X X 


ATTATGATAG 
Ax lAluAlAo 


PTTPaTTaTB 
^X X WiX XAXA 


AL>AX XAxUio 


you 


TAAAATGATG 


GTATGTTTAA 

\3 xfxxKM X X xrSk/t% 


TAt!AAAAAAA 


ATAGAAAGAT 

AX AO AAAOA X 


TGGAGTTTTG 
xovaAoX X X Xo 


APTCTCanAn 
A^ X o X o Ao Ao 


xu«.u 


TAGCGGGAGA 


AAAAATCAAA 


ACAGPGPGTA 


AAAAGAAAAA 

AAnAOAAAAA 


a TT A TP/:Pm A 
AX X AXI^oVkIUA 


oV^XoAAX Xoo 


1 nfln 


CAAAAGGAaT 


TTGTACACAA 


GCmACAATTA 
ox^umK^fmx xr\ 


GTAAPATTGA 

O X AAWA X X OA 


AAAPAAAAAT 

AAAV^AAAAAX 


GTGTGTGAPA 

O X O X O X OAltfA 


1 1 An 


GTTTAGACAT 


TTTCTCATCT 

AAA. A A A 


GTATGTCTAC 


GTTTAGACTT 

O X X X J^OA^ X X 


APAAGTAGAA 

AAO X AOAA 


GAATGTATPG 

OAAX V3 X AX WO 




AAGGTTCTAG 


TGAGAAAAAA 


CTCGAAAGCT 


TATTAAATAA 


AGTAGAAGAA 


TTATGPTTTT 

X X X O W X X X X 


1260 


ACTTTAAGCA 


CGATGAAGCC 


TATGATTTGT 


TGAAAGACTA 


TCCAGATGAT 


ATCGAAAGTT 


1320 


CAAACAGAAT 


TTTAGAAACA 


AAATTTTTCT 


ATTATAAAGG 


AATTACGAGT 


TTATTAGGTA 


1380 


AGAAAAATAA 


TAGCGAACAT 


TGTTCTATTT 


ACATCGTGGT 


TCTGAAATTA 


GTCGTGATAT 

A X^X^ A ^Jfl A A 


1440 


TAATATTTAT 


AATATTtTGA 


GTATGAATGC 


GATCGGTATT 


TTATATGAAT 


TAGAAGATGA 


1500 


TATTGAAAAA 


GCAAAAGTTT 


ATTACGATAA 


ATCTTTACAA 


TTGCTAAGCG 


AATTCAAATT 


1560 


AGATTATCCA 


TTAGAGCAAT 


GCCGTATTTA 


TTATAATACC 


GCAAAATTCT 


ATTCATTAAT 


1620 


TAAAGACTAT 


GCAAAGAGTA 


TTGAATTAAG 


TGATAAAGGA 


ATCGAAATTA 


ATCGGACGCA 


1680 


CAGTTCTATT 


TACTCTTTAG 


ATTGCTTATT 


ATACGAAAAA 


GCGTTTAATA 


AACAAATGCT 


1740 


AGGACTAGAT 


GCTGTTGAAG 


ATTATCGTAT 


TGCGTACTAC 


TTTACACGTT 


TCTTTGAAAA 


1800 


TAAAAAATTA 


CTTACTTATA 


TTGAAAAAGA 


TATGCAAGAG 


TTTAATATCA 


GTTTTAAATA 


1860 
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ATAAATTATT 


GTTTCAGATT 

\9 AAA \»J»%Jf» A A 


TtAAAAAGAA 


CAACCTAACT 


CAAGAGGAGT 


TAGGTTGTTC 


1920 


fw ffi IW ^^l^^^if* 


TTTTiVATTTC 
A X X xfmx X X vj 


X A X ^ X X X 


AGnCTTACGG 


ACGCTTTCGG 


CGACGACTTT 

w w«»\^^JfliW AAA 


1980 




xv#x x\j)\3mnx\9 


GATrCRGAAT 
vinx \a>vyvvu/v% X 


AATATTAGTC 


GGCGTGAGTT 


CAT T ATPCGfi 


2040 


AAlL'/vii- 111 


/zr'AATAPr'TT 
\3^AA X nv<\^ X X 




AATTTflPATT 
nnX X XwV^r&x x 


TPAATPGT AA 


X \^ X X X X 1 






AAolawAV^WlV* 


flf^ A A C AT Ar'P 


AC/^nAAf^PT 


AAPAPATTAT 


*p a a T TT a •PT 

X/Uil X XV7AX X 


z xou 




1 AOvsAuVrAu 


TTprrsar'aaT 


A 1 AAu^o^^i^ X 


fZCCCU^T A A An 


fT Tr* a Tr* ar»p 

i X l^Al ^A\3V3 




AAAAATTTuA 


ovaAAU 1 uiau 1 


•na^^r^aTT^^ 
i AbUwiI 1 u\« 


aaaaaTAAPT 

AAAAAX AA\^l 


1 oL> 1 W\ 1 


TCATCTGTTG 




AATCCACTCA 


GGTTTTAAAA 


CACCTxGGTGC 


r^r* A A AP Ar*Pll 
^GAAAuACuA 


Ar*A A A^A/^A»n 

AL^AAAlaAvJAT 


CAGC^GCCJTTG 




AAGAGCAGTA 


GCTAAATCAC 


CGGTACGgTG 


TTCCC^faAx 1 \9 


bl lAvaCTTCG 


TV A mrprpom/^/^ 

CAATTTCTGC 




GTGGTGTGGT 


GGCAGAGCTG 


TATCTGTTTC 


GCTCAbAa 1 U 


CU 1 VarCACCib 1 


r*A Ar^ A AT»0 AT» 

CAAuAAx bAI 


£** DU 


AATATGTTTT 


ACGCCAGCAG 


/"•fp 7\ A TV TV TV mmrp 

CTAAAAATTT 




^ A A A^A/^^A/~* 

VaAAAGACCAb 


/^/^/^ A /^^/^ A t^f~' 

CGGAGCCAC U 




GCCGTTGATT 


A A ^A f*^ A A 

ACCACaTGAA 


TGTCCTCTAT 


TTTCTTGTTG 


ATTAATTTTA 


AGCiCrl xCaxA 






A^A A/^A A/^A A 

ACaAAC^AACAA 


TCGC TGTCCC 


m/^ T A T* ^ A 


T/^ ATOr* A A A A 


CLi tsCi AA 1 A 1 U 


ZD4 u 


TAGT TCATCG 


ATTAAACGCT 


GTTCAATTTC 


A A A A A A A 

AAAACAACGA 


GGACjCGCTGA 


rp TV 'P/^'P'P/^/^ TV TV 

T ATC T T CCAA 


Z /UU 


GTTAATCCCC 


A A A f^f^ n*mf^ 

ccaaaggttg 


/^A^^^A A A A A 

GAGCCAAAAA 


TTTCACUGTT 


TGAATAATTT 


CTTCGGTATC 


Z / bU 


TTGTGTATCT 


A A A A /^A TV f*f^f* 

aaaacaaggg 


/*A A1>0/^A A rn/** 

GAATCGAATC 


/*A/^Ar*^A^^A 

CACACCAbCA 


A A ^^^mmwi TV TV 

AAGCGTTTAA 


TV ^ IV TV A 

ACAATGCAGC 




TTTTCCTTCC 


A A A ^^/^ A 

atgacaggca 


TCGCTGCTTC 


A A A A fP/^ 

AGGACuAATla 


'P'PA/^01*A A A/^ 

TTACrCTAAAC 


^■PA A A TV r*n*^f^ 

GTAAAAvTTGU 


OQ on 


tP^A TV r^t^ A 'P/^ A 


^1*A AH* A JV f^f^l^ 

CTAATAACGG 


CGACGGTGTT 


TTTCTTTGTC 


/^rpm TV TV rp fT»r^/**'P 

O xTAATTCGT 


A A^/^/^r^^'P'P/* 


Z74U 


TTTTTTTTCA 


I (axATUGuALv 


A Ar*A A A^A^r* 

AAGAAACAGC 


CbUAAwibb 1 


bV9 X U X b r AG\3 


r* A A fpn*pr*'r* A a 
LAAiTw.dAA 


JUUU 


ATCTbuT A r 1 


(jTTTCAATCG 


f'ts, TV ^»¥»m»p TV f~*r* 
GAACTTTAIGG 


GCAGACTTCT 


AATT TCCCXrC 


C7\AATTTTTT 




1 laL* 1 1 1 A 


AI UiTU 1 AATx 




1 1 i X u i i uAC 


AJ. AiCAx ill 


c*f*TC*c*T a a a T 
L'CK.'CiAAAl 


■^1 90 


A A iMp/^ A r*'pn* A 
AAl ruAL>l lA 


*l*A/^*I*'PA A A A A 

i AuTT AAAAA 


A#^^A'P1*^1*A A 

AvsCATTCxAA 


A A A»PAP/*AfP 

AAA xAbUAb 1 


A 1* n* r*i* T A 1* 1* 
CAT rtiTTATx 


ACx X AATbo 


JloU 


^«A(^W>V9V«V.0 1 nA 


A\^\3 1 13 1 l^lsLiA 


AUl llallaWiA 


ar*/2/:r'aTT a a 

A^iMwil lAA 


l9l 1 w\l ICuA 


1 1 VaoV^ 1 u 1 Al,* 




A A A A T A /^/^ 

1 lAAAAlAoV^ 


V.*AW\ I i^uUtJA 


o<pr'r*r'ar*r*r*a 
u X Ck-UAUmJA 


X Av«LiAL» 1 1 u 


OL.AAuCV3l3AV9 


a^a aTPf^a^: 
Av# AA 1 buCAo 






Al XL«A1A1 lk« 


Al oAAisv^u 1 \3 


AAAP/zaaaAA 


X 1 o 1 AItv* A 


a T a a \7i-ir*Tr2a 

A 1 AMyWVj 1 v»i\ 


vU 




A A TT* A rr* A pa 


ar^a a a aTa TT* 


r*r'r'ar*r'T A A*^ 




apaTr"rTTTA 

nV^n X X X 1 i M 


3420 


AAf^AATATA 


TAACATAPviT 


AAnrPHAPPA 


TTAArf^nnAA 


PnTAAAATTf^ 


PPAf^AfiATAA 


3480 


ATTTATAAAH 


^ X oX X X X vxni 




X X Xv^X X XV^uO 


TAPTAPnTTT 
X X riV^vj XIX 


AAf^TATTTTA 
X r\ X X X X n 


3540 


AAAAGGCTGC 


gacaacaatc 


ATCAAAACAG 


GACCAGGAAA 


GCCTGTTAAG 


TGTTGCAATA 


3600 


AACCGCCTGT 


AATAAATAAG 


6TACACGCAA 


TTAAAACACC 


AGCGCCCATT 


AATTTTACAT 


3660 


CAATTGGTGC* 


TTTGTCTTCT 


TCTAAAGCAT 


CACTTAAATC 


ATCTGAATTG 


GTAATTTTAA 


3720 


TCAACTGACC 


TTGTCCAGAA 


AGTTCAGGGC 


GTTTTTCTCC 


TAAACGACTA 


AGTAAACCA6 


3780 


AACACATAAT 


AGCAAAAAAA 


TTACCAATAA 


TGGTTGCGGG 


AATTAATTGC 


CCCACTAATT 


3840 
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GTTCACTTGG CAATCCAGTA ATGGCCACTA TAACCAAGGG ATAAAGGAAG AATGCCTTCA 3900 
CCGAT 3905 
(2) INFORMATION FOR S£Q ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32768 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 



TAAAAAGCCC 


CTTAGAATCA 


TAGGTnTTTT 


TGATGGTTGT 


AATTGTTGAT 


GTTTTCTTnA 


60 


AGTTGTAAAG 


ATTTGATTTA 


ATAGTGATCG 


TGTCACCTAA 


TTGATAATCT 


TCACCTAATT 


120 


TAAACaGTTT 


AGAGCTTGTG 


GGAACCTCAC 


CACTTAGTGT 


TAAAATTCGT 


TTACGTTCTG 


180 


7UyW3TTTGTT 


ATTCCCTCTA 


TTTTTCaAGA 


CTTCGAGGTA 


TTGTGAATCT 


GTCAAAGTGA 


240 


CCTCGTCAGT 


TGTCTTTTGT 


AAGTCTCGAG 


CATCTACGTA 


TAACTCTTTT 


CTTTGTAGAC 


300 


CTGTTTTCTC 


GTTACCCACC 


ACTACACTTT 


TACGTTCCGA 


GCCTTCACCT 


TCACCGAAAA 


360 


CAATAGCGGT 


TGAAGATTCA 


TCAAAGTTAT 


TGTTCTGATA 


CCCCGCTTTT 


GTTAGATTTT 


420 


CGTATTCGTC 


TGAAAACTCA 


ATCGTTCTCG 


AAACATCTCG 


ACCTTTAAAA ATAGATAACG 


480 


TATTACCAGG 


AACGCCTAAT 


TTAGTAGCTG 


TCTCTCTTAT 


CCCAAAATCG 


TAGGATGTGC 


540 


AAAGAGTTTC 


CACTTCTTCG 


GCTACGACAC 


CATAACTATT 


CTGATACTGG 


ATAGACGTAT 


600 


TCCCTAAATT 


AGCTCGGGAA 


TCCAATTTTA 


AATATTGAAT 


TTTTCTTTTT 


ACATCCGAAG 


660 


GgTTTACTAC TTCATTA7TA 


AGATGTTCCC 


AAACAATTTG 


CTCTGGCGGT 


GCCGTTTTAT 


720 


TATAAATTCG 


GTT^CAATA 


CGATCTAATG 


ATTTCCCAAG 


AAGTGATTTC 


CCCGACACCT 


780 


TTATTTGTGC 


TGTTGCCTGA 


TCGTCTACAA 


CTACGGCATC 


AACATAGAAA 


TAGAGCCCTT 


840 


CCATATAAAT 


AACAGTGTCT 


GGAACAAATA 


TTTGTATATT 


TTCAGGTGTC 


AAACCAACAA 


900 


ACAGCTCAAA 


CGTTGAATAG 


GTATAGTAAT 


TTAATTTAAC 


AGTTAAACTT 


TTAAACCCAT 


960 


CAAATACTTT 


TTCGGAAATA 


AATTTCCATT 


GATTATCTCG 


ATCCTGAGTG 


AAAATTTCTA 


1020 


ACTCCATTAC 


ACACCACCTA 


CCAACGGCCG 


GAAGTCAAGT 


TGTACAATTA 


CGTTACCTUVT 


1080 


ACCAGTAGTG 


GCTTTCACTT 


GAAAATAGTT 


ATCACCCTTT 


TCAAGCTGCA GAAAGGTAGA 


1140 


ATTAGGGTCT 


CGCAATGGCA 


TAGCGTTAGA 


TTCAACACCG 


TTTGGGTCCG 


TCATGATTGC 


1200 


TTCTTTCTGT 


CCACGAGTAG 


TTACCAGTTT 


AAACTTAGTA 


CCTGCTTCAA ATGAACCTTT 


1260 


AAATCGGAAA AATTCCTGAG 


TAATTACGTT 


ATACACTTCT 


GGATCGGTAG 


CTTCTGCACC 


1320 


TAAAGAAAAA 


TAGAACGTGC 


CTCCCACTGA 


TACATCACCA 


TCATTGGTTA 


CAGGAACAAT 


1380 


CTCACCACTT 


TTTAGTGTAG 


CAAACTCATA 


CCCTTTCGTT 


ATTTCTAACG 


GCCACATGTG 


1440 


TTTTTTTGTG 


GCAACGGCTA 


GGGGAATTAA 


CGTATTATAT 


TTACTCATAT 


CAGACCAATA 


1500 


CGGATCTAAA 


GCTAGAAAGG 


TACACGTAAA 


AGCCTGTGTA 


ATATTTTTAG 


CTGGATCTGG 


1560 
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CAGGTCGGGA 


GCTTTTACTA 


CTAACACATC AATTTGAAAC 


TCACGATCAA 


ATACTCTGTA 


1620 


AATCAACGTA 


CCTGGTGTTT 


TGGGATTTAA 


CGTTTTTATC 


ATTTCATGCT 


GTAAATTGAA 


1680 


AAGATCCTTT 


GTTCCTCTAG 


CAATTATTTC 


ACCTTTTATC 


GTT7UVATCGC 


GCTTATCTAA 


1740 


ACGCTCCGAA 


ACTTT/^TAA 


CACCATCTAG 


TCCATATTGT 


TCTTCTGTTA 


TGATCTTATT 


1800 


TTCAACAGCT 


CCGAAGCCTG 


TTTTACTTTT 


AACGGTAAAA 


GGTGGTTGAA 


CACCGAAACT 


1860 


AATCAAAGAA 


CCTTCCGAGT 


TCTTGTATAT 


AAGTTCGTAC 


ATCCAACCTA 


CCTCCTATCT 


1920 


TCTCAGTGTT 


GCAAGGTCTT 


GCAACTTGTA 


TTTTGTTTCT 


CTTGCAATTT 


CTCTAGGCGT 


1980 


TAAAGGCTCG 


GGACTTGTAA 


TATATTGTGT 


AACATTGATA 


TCTCCGTCTT 


TTTGTTCAGC 


2040 


CATTACACCT 


CTCACTGCAG 


TTTCAACATA 


TCCTAACAGC 


GTGTCAATCG 


GTGCGACAGC 


2100 


CTCAGCCCCT 


GCTTCTCCAC 


CTATCATTGC 


GTTATTTCCG 


TTCATGCCAA 


ATATGGTAGG 


2160 


TTGCGTCATG 


ATACCGCCGT 


CTTTGTACCA 


TTCTATGCCC 


AAACTTGGTA 


TTTTACCTTT 


2220 


CAATAAATCG 


CCCGCAGACC 


AACCGGCCGG 


ACTAATAGAA 


AAGTGAGGCA 


GCGGAATTTT 


2280 


CGGCCAGCTA 


ATATTAAAGT 


TAAAGAAACC 


TTTAATCGCA 


TCTACTACAT 


TTTTCACTAG 


2340 


ATTTTTAGCC 


GCATTCATTG 


GCCCATCAAT 


AGCATTTTTT 


ACACCATTAA 


AGATATTAGA 


2400 


AACTGTTCCT 


GTAATACCAC 


CCCATACATT 


TGAAATA6TA 


TTTTTCACTG 


CGTTAACTAC 


2460 


ATTACTAATA 


GTATTTCTTA 


CACCATCCCA 


AATAGATGAA 


ACGGTATTTT 


TCATACCGTT 


2520 


AAATAAATTC 


TTAACGGTGT 


TCAcCGTTGC 


ACTAACGATA 


TTAGACACGG 


TCGATTTAAT 


2580 


ACTATTCCAC 


ACCGTGGATG 


CTGTATTTCT 


AATTGCGTTA 


AAAATGTTTG 


TAATTGAATT 


2640 


TTTTAAAGCA 


TTAAAAATAT 


TTATCACTGT 


GTTTTTAACA 


GAAGTAACGA 


TATTGGAAAT 


2700 


AGTCGTTTTA 


ATACTATTCC 


ATACACTAGA 


AGCCGTGTTT 


CTAATGGCAT 


TAAAAATGTT 


2760 


TGTTATCGTG 


TTTTTCATAG 


TATTAAATAT 


CGGTGTAACA 


AAGTTAAGGA 


TAGTCTGCAC 


2820 


AACATTAGAG 


ATTGTTGTTT 


TAATAGCATT 


CCAAACGTTA 


GTTGCTGTGT 


TTTTAATTGC 


2880 


GTTAAAGACA 


TTAGTAATCG 


TCGTTTTAAT 


AGCATTAAAT 


ATA6GAGTGA 


CAAAATTTTT 


2940 


AATAGCCGTA 


GAAACTGAAG 


TAACTGTATT 


TTTAATTCCG 


TTCCAAGCGA 


TTGTTACAGA 


3000 


GTAAACGACA 


GCATCCCATA 


TCTGTTGAAA 


GAAAGTTTTA 


AACGCATCAA 


AAATACGCTT 


3060 


AATAACTCCA 


ATGAAACCGT 


TAATTATTCC 


TGTGACGGTA 


TCACCTATTT 


TTTCCCATAC 


3120 


ACCGGTAGCA 


ATTGCTTTTA 


TGAGATCCCA 


TAGATATTGA 


AAATACGCTT 


TAAAACCATC 


3180 


AAAAATACCT 


TTCCAGATAT 


TAATTGTATT 


AACGATTATC 


GCCCCTACTA 


CAGCGTATAT 


3240 


TACATTCCAC 


ACTGTTTGTA 


GAAAATTCTT 


AATGCCGTTA 


AAAACGTTTT 


CTACTGTTTT 


3300 


TTTGAAACCA 


TCAAATGTTT 


TCTTGCCAGC 


ATCCGTGAAT 


TTCTTCCAAA 


TTTCTGAGAC 


3360 


TTTAGCGGTA 


ATATCATTCC 


ACGTTTTTAC 


TAAGCTACCT 


TTTAACCCAC 


TAGCTGTTTT 


3420 


AACGATTGAA 


TCCCACGTAT 


TTTTTATAAA 


ATCGGCTATA 


CCATTAAATA 


TCTCTACTGC 


3480 


TTTATCTTTA 


ACGGAGTTCC 


ATGTATCAAC 


TAGAAACTTA 


GTAAAAGATT 


TCCAAATCTT 


3540 
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TTGTCCTGTT 


TCAGTTTGGG TGAAAAAATA' AATTAACGCC 


GCTACTAACC 


CCACGATTGC 


3600 


CGAAATAATC 


AACACAAAAG 


GGTTGGCGTT 


TAAcACTGCG 


TTAAAAGCTG 


TTTGAATTGC 


3660 


AGTATAGGCT 


TTTGTGACAC 


CTGAAGCTAT 


TTTAGTAGCT 


GTTAATACAC 


CTTTCATGAT 


3720 


AATTTTATAG 


GTTGCTAAGG 


CGGCTCCTAC 


TCCAACTATA 


ATAGCTTTTA 


AA6AACTAAA 


3780 


AATAGCTTTA 


TTTTTAGTTA 


TAGCCTTTGT 


AAAATTAGAA 


AAAGCAGGGA 


TGACATTATT 


3840 


TACAATAACC 


TCACTCACAG* 


CTCCTGATAT 


GTCCCAAATC 


ACCTCTACGA 


TTTTTTCAAT 


3900 


CGCAGGTAAT 


ACTACTGTAT 


TTATTGTTTT 


AAATGCAGAT 


TTAAAAATTT 


CTATCGCTTT 


3960 


AGGTACTACA 


GCTGTGAAAA 


TATCGGCAAA 


AGTAATTGAC 


CATGTACTAA 


ATACTCCTGC 


4020 


CAATTTCGTA 


AATATCATAG 


ACACTTGACT 


GCCCGTATCC 


TCAAAAATCC 


CTTTTATTTC 


4080 


AGGCAGTTCT 


TTTTGTACAA 


TCTGAGCTAC 


ATTTTTTATA 


GCATCACTAA 


TCCACTTAAA 


4140 


AGCCCACTGA 


AGATTCATCA 


CAAAACCTGA 


ATTTTGAATG 


CCGTTAAATG 


CATCCATAGC 


4200 


CGATTTTTGG 


AATCCCCCAA 


AGTCTGGTAT 


AACTGCAGCT 


ACCTTATCAC 


CTAAGCCACT 


4260 


TAAGCCTTTC 


GTAATGCTTT 


GAATAATCGA 


AACGACCGTT 


CCTAAAATAG 


GAGTGCCTAT 


4320 


TTTTGCTAAA 


AAGTCCTGCC 


ACGCTTGTTT 


TAAGTTACCC 


ATAACGTTTT 


CATATCCGTC 


4380 


GGCTTCGCGA 


GCAGCCTGTC 


CTACAGCTCC 


GGCTTGTTTA 


AGCATGTTGC 


TAGCATAATC 


4440 


TAAACGAGTA 


GCTTGTTTCG 


TCGCTTCATC 


TAAATTTTGC 


CAATCTTTTG 


TAGAACCTAC 


4500 


TACTCCCTGA 


GAAATAGCAA 


ACTGCGCCTIT 


TTGAGTATCA 


TTTGCAAAAA 


TCCCTATCGC 


4560 


TTCACCGGCT 


TCGTAGTTTC 


CTTTTAGAAA 


CGAAGTAAGA 


GAACTTGAAG 


CATCTTCCAG 


4620 


TGAAACATCA 


TAGAAAGCTG 


CAGCATCTGC 


CGCTAAGGTA 


GTTGCTTGCT 


CTGATTTCTT 


4680 


CATTGCGGCT 


TCTGTATCTA 


GCCCTAAGCC 


TTTAAACATT 


GAAGTAATTC 


GAGAAAATGT 


4740 


CGGCTTTATA 


CGGTTAGGTA GGATATTCAT 


TTTTTTACCC 


ATGCCATCTA 


CCATTTTTTG 


4800 


TGCTAATGGT 


TCTAACTTAC 


CAAAAACCTG 


GGTAAATTGT 


GCATTCAAAG 


CTTGTGCACT 


4860 


TGCCGCGGCT 


TCTACGCTTA 


GCTTCCCAAA 


ATTAACGATT 


TTTTCTACGG 


CAAAAGCAGC 


4920 


ACCAATTGCT 


AGCGCTGCTT 


TTTTAAAGAA 


CCCCACTAAT 


TTACTTGTTG 


TTGATTGTCC 


4980 


CTTAGATTCC 


GTCTCGTCCA AAGCCTTATT 


GGCTTCACCA 


TTATCGACAC 


CGATTGTACC 


5040 


AAAAAGCTT6 


AAAATTTCGC 


CCATTAAGAC 


ACCTCACCTT 


CTTTTCGAGG 


TTTAACAAAT 


5100 


TGACTAGCAA AATCTAATGC TTTTTGTTGT TCTTCTTTTG 


TGGTAGATTT 


CGCTTTATTT 


5160 


TTAC6TAAAG 


GTCTGTATTT 


TTGTTGGTCT 


TTAAATTCTT 


TTAAAGACTG 


CGTCATCTGT 


5220 


GTGTGTAACC ACTGATTGTA GATTTTTTCT TCTTGTTCCA 


CTTCTTGAAC 


ATAAAAAATA 


5280 


AATTCCATTA 


AGTCACACAA 


ACGATAGGTA 


CGAAGTAACG 


ATATCGGATC 


TCCGTACCTT 


5340 


TTGAACAACA AATCTTTAAT 


TCGGTGTTCG 


CCGTCTTTTA 


TTTCAATAAG 


CTGGCGATAG 


5400 


AAGAGAAAAA 


ATCAGCTAGT 


TCTGGTTTCT 


TAAAGAAATC 


GACTAATAAA 


GTCGTATACT 


5460 


CCTTTAATCC 


AAGAGCTGAA ATTTCTTGTA 


TAGAAACGCC 


TGTTAACTCT 


GCTAAAAGTC 


5520 



wo 98/50555 



PCT/US98/08985 



648 

CGTTAATArC ACTC7TAATA TCTTTTAAGT TAAGCATGAC TTTTTGCAAC ATCGctGCGC 5580 

CCATTTCAAr GCCACGCTTA GTAGCTTCTT TGTCAGCTTT TTCTTTGGCT GCATCAATTT 5640 

TTTTCTGCTC TGCT7TTGTT GGCTCTTTTG ATTTGTGGTC CAACGGTACA ACTTTTTCAG 5700 

CAGArrCAAr ATTTTTTTCA AATAAATTGA TAAACTCGTC TTTCACATCA AGCTTTCCTA 5760 

TGATGGCAA3 7AATGTAAA7 AAGTCATCAC CTTTTAGTTC ACGCATTTCT AAAGTCATTT 5820 

TSTTATTCC7 CCATGtAAAA AAGAACGACT GATAAGCCGT TCTTTAATTA TTTCAGTTCG 5880 

TCTAAATT7G TTATTTCo^A ATATTTGATT CCTCAATAAT CTACAAGCTC TCAGTTAGCT 5940 

CTATAGGAT7 TTTCGA7GTA CTTGGGTAAT AAATACGGAA TGGGAATTCG TCATTTATTA 6000 

GCTGTTCATA AGAAGCGTTA GCTGTGATTT CCTGTTCAAT TACTACCTCT TTGCCATCCT 6060 

CTGTTTTAA? TTCARGTCC6 CTTTTCACTA ATCCGTTATC -TAACGCAACG ATGATAGGCT 6120 

GTTTCGTTCC ATTATGAATA CCCAGAACGG CCATGTTAGG GATATAGTCC CCTTCTTCTA 6180 

AATAGCGTT? AGGTTTAATA ATTTTATAAC CAGCAGGTGC TTCATCTGTA GTAGCATCTG 6240 

TCATCGTAGC ATTTiyiTGAT CGGCGTAAAT TTTCTGCTGT TAATTCAATC AAGTTGGCTT 6300 

TCATCGTAGC TGTGGCTGAT TCAAGCACAT TCAAACCTAC TACGTCCATA ATATAAGCAC 6360 

CGTCTACTTC CACTTTACGG TAAGAAAGTT CTGTTTTGAT TTCTACACCA CCAGAAGTTG 6420 

CCCCCATGGG AATACCGGTA AACTCCTCTT TCTCTTTATC GTATTTAAAA TCAGTAAATA 6480 

CTGTCGCAGA ATCAATGACA AAATTGTCTG CGGTTGTTTT TGTATAACCT GTTTTAGGTA 6540 

GTGCCATAT7 CTTTTTTCCT CCAATCGACC GCTACATAAA AGCGGACATT TCTTCTTTTT 6600 

AACAGCTCT7 CTTTTGTTGG TATTTTATTT GATCCTTGAA AACTAAAAAT GAGATTTAGT 6660 

TCTG6AGTTA GTACCCTTTT GTAACAAAGA GCTGTTTTAA ACTTGTCTTC TAGAACAAGC 6720 

AAGTTTAAA^. ATGAATTATT TTTGTCAAAA ATATCAATAT CCAGATAAAA ACCATCTTGA 6780 

TTTCTACGAA TAGGCTCGGA ATCAAAATCA AAGGTGGCAT ATGGATAGAT GACCTCTTTT 6840 

CGAGGATTCG TTTCTAAAAA TGTTTCTGGG TGTATCTCTT GTAAGATTCT TACCAGCTCT 6900 

C6TATAAACT CTTCCATCAC TTATCCCCCT TTAAATGTAG CTCCGTATTC TTTCCCTAGA 6960 

ATT7CGATGA CTTTTTCTTT ATTCTGCCGA AACGCGTTTC TTAAAAACTT TTGTGGTTTC 7020 

TGACCTC7T6 TAAAATGCCA TTCTCCATCT GGTCCCTCAA AGAACCATCC ACCTTTACGT 7080 

CCTGAGCCAT TTTCAGCAAA TTCTCCGGTT CCGAATTCAA CATAAATGGC GTATTGTTCT 7140 

GGCGAGCCGA CAATGCCGAC CACTTTTCCT CCAACAGCCT TAAGTTGATA GTCTATCCGG 7200 

TCCXGCAACT GTCCTGTTTC AACTGGCGCA CCAGACTTTG CAGCAGCTTT GATAATCATG 7260 

AGGACTTTAG TCATCCCCCG TTCTGCCGTC GCGTCTAACT CTTTTTTTAC CTTGTCTTTG 7320 

TAGCTGATAA ATTTAAAGTC ATTCTTCGCC ACTTAAAACA CCTTCAAACT TCAAATAAAT 7380 

TTCATTGTGA TGCTTTATGC CAACAGGATT ATCGGAATAC GTAATATCGT AATAACGATT 7440 

TGTTTGATCA ACCACTCGCA TATTGTCTGT TATCCCCTCA ATAAATTCGG GAATAATAAG 7500 
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AATGTGTGTA 


GATTCTTCTA 


CAAATGCGTT 


CTGTTTCGTT 


GTTTCATCTG 


TTCCTGTCAC 


7560 


CAAGTCGATA 


TATCCAGAAA 


CTTCTTTAAA 


CAAGCCCCAA 


TCGTCAATAA 


GCCCGCCTAA 


7620 


TCCGTCAGGC 


TTTTGGCCTT 


TGACCTCTTG 


TAGAAAAAAG 


GTTTGAGGTG 


AGTACATTAT 


7680 


CCCCACCTCA 


TTTTTTTATA 


TTTATTTAAA 


AAACTGAACT 


TTGCTGCAGG 


AAAGCCCTCG 


7740 


ATATTAnTCA 


CTAGCGTTCA 


CATCGTAATA 


CGTGATAGAC 


ATTCTAGCGA 


TAGATTCTGA 


7800 


CTTAATGCCT 


AACTTTGCGC 


CCATCGATTT 


TTTAAATCGT 


AAAAGTTCTT 


CAATACCCAT 


7860 


TTTAATATCT 


GCTGGATATT 


CTACTTTTGT 


AATAAAAGCA 


CCAGAAAAAG 


AGCCATCAAA 


7920 


AAAAGGCTCT 


TCAACATCTG 


TTTCCAGTGT 


CTTTTCTGTG 


ATAGCTTCAA 


CAGTTGTTAG 


7980 


GCAATCGTTT 


ACTTTAGAAT 


TACTAATTTG 


AATCGTATCA 


CCTACTCGTA 


AACCAAGAGG 


8040 


GTTATCTTTT 


AACACAATAG 


AATTACCCTT 


AAAAACAAC7V 


TCCCTAAAGC 


GTATATGAAC 


8100 


GTTTTGAAAA 


TTATTATTGG 


TTAGTTGCCT 


AACTGTCTGT 


TCGAACGCGT 


CTAAGTCACT 


8160 


CTGAGTAACC 


GATGGATCAA 


TTTTTTTTGC 


TTCTTCAATG 


CTGATAATCA 


AGTTAGCCAC 


. 8220 


CTCCTAAGAT 


GCAGGTGTAC 


CTGTTACTGC 


AATAGAGGTT 


GTAAACGCAC 


CGGAAGTAAA 


8280 


TTCAAATGTC 


GCTTTACCTT 


CTGCAATAAT 


TGAGCCATCA 


AATCCACCGT 


TTTTATTTTT 


8340 


AGTGATAGTA 


GCAACACTCT 


CATTACTAGA 


TATCACTGTA 


ATTGCTGAAA 


TTACAGCTTG 


8400 


TGCGTTTGTG 


GCATCTACTG 


GATTAGCTGC 


AATAGTAAAT 


GTCTTAGTTG 


CGCCTACTGC 


. 8460 


ACCCGTCCAT 


GTTTTCTGAC 


TAGGCGTGAT 


ACCCGTTGCT 


GGATTAGTAG 


GCGTTACGCT 


8520 


TTTTTTACTG 


CTTTTAAAAT 


AGCTACTTTA 


TTTTTTTCAG 


GGATGTGTTT 


ACCATATTTT 


8580 


CCAGCTCCCT 


GGATTGCTAC 


TCCGTTAAAA 


TCTTCGGAGT 


CAATAGCACG 


TGTTACAGAA 


8640 


ATACCAACGC 


CCGCAATACC 


TACGCTATCC 


GCGGAAAATT 


GAGCCATTTC 


CCCAGTTTGT 


8700 


AATTTGCTTT 


CTGGAATTTC 


AACCAATACA 


AAACCTTTAA 


ATTTATATAT 


TGTTTGTTCA 


8760 


TCAATGTTAG 


CACTTGAATT 


TTTAGTTGTT 


GTTGCTAAAC 


CATTGTCTAC 


TAAAAAATCA 


8820 


TAAACATCAG 


GGTGAACATA 


AGCAACCCAA 


GCAAGAGACT 


TAGAAACTAA 


ATTATTCACA 


8880 


AAAGTCTTAT 


GCGCCGTTGA 


GAACAGCTTA 


GTTACTCCTT 


CACTGGTTAA 


TTCAAACCGA 


8940 


ATCGTTTCAG 


AGGCTGCATC 


AGATAATGCT 


TTACCTAATA 


AACCGTCGAT 


ATACTCAGCC 


9000 


CAAGCAATCG 


CTTGTTCCTC 


TAATCTTTCA 


GCTACTACTG 


CGTCAGCGTC 


aTCGTTTACT 


9060 


GTGATATTAT 


CCACTCCTTC 


GTGAATCGCT 


AACGGA6ATT 


CATAAGGAAC 


CGTTGTATCA 


9120 


ATAGATTTAA 


TTTCTTTACG 


TGGCCCAAAA 


CGATTAGAGT 


TGCCAGTTCC 


TGTACCAAAC 


9180 


GCTACGTTTT 


CACCAGTGTT 


GTAGGGTTGC 


ATTACCACAG 


GTGTATCTGA 


AGTTTTTAGC 


9240 


ATCAAAAATG 


TGTCGCTATC 


TTTTACAGCA 


TCC6ACGTCT 


GCAACTCTCC 


TCCAAACGCA 


9300 


CGTAAAAAAT 


AAGATTTTTT 


CTCTGTAATT 


TTGGCTAATA 


AGCCTGCATA 


TTGTTTCGTG 


9360 


TAAAATTTTG 


TAGCCATAAA 


AAATTCCTCC 


AATTACTCGT 


ATTTTGATAA 


GATTGCTTCA 


9420 


AATGGATCTT 


TTTCGTTTGG 


TTCCTTCCCT 


TTTGGCAGTT 


TGTTATCAAT 


TGGTTGATAG 


9480 
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CCtGCTTTAT 


TTTCAGTCTT 


TTTTTTCGAC 


TTATCATCGA 


GTGTATCCTT 


CTTTTCAAAG 


9540 


TAATCAGGGA 


TGGAAGCTTT 


CAAGTCTTTC 


ACTTTACTTT 


CTAAGTCCTT 


CACATTCCCA 


9600 


TCTTTGTCAA 


GTTCTAACTC 


ACCTAATTTA 


AATAAAGCAT 


AATCAATATC 


CTTAGCCCCT 


9660 


GCACTTTGTA 


ATGCAGCAGA 


TACTTGGCTA 


TTGATTTTCA 


AATCTTTGTT 


TTGCTTTTCC 


9720 


AACGCGTCTG 


CCTTTTCTTT 


TAAATCAGCT 


AGTTCTTTTT 


GAACATCCGG 


ATTGTCCTTT 


9780 


GTTTTAGCTT 


CTAGCGATTT 


TAAGGTTTTA 


TTTGCTTCAG 


CTAACTGGCT 


TGAAAGATTA 


9840 


TTATATTGGT 


CTTTTGGTAC 


TGCATTTACA 


GGAAATTCTT 


TGTCAATTTC 


TTTATTAGCA 


9900 


GCCTCCATAT 


CCAACGTACC 


ATCTTCTTTC 


ATATGTTTTG 


CTAAAATTTG 


TTTGATCCAT 


9960 


TCCATCGTTT 


TCTACCTCCA 


TAGCATTTAT 


ATAGCGGTCG 


CTGCCGCTTA 


GAGTGTCAGA 


10020 


ATATACCGTC 


TGCTCGGTAG 


CGGTGeCCTT 


TTAACGTCAT 


GGCTCCTGGA 


CAAAATAAAA 


10080 


AGACTTATCA 


ATAAGCAAGT 


CTTAACTCCT 


CATATCATCT 


AATACTTGTT 


GTTTTGCTAT 


10140 


ATCTAGCATG 


CCTAGCAGCT 


GCAAACTTCC 


ATTTTGACTG 


TAATAAATAG 


TGACCTCTCC 


10200 


ATTTGGCCGC 


TGTACAACAA 


TTGTGGCCTG 


TTCAAAATCA 


GGCTCTTGGC 


TTGATACCCA 


10260 


TTCGTAAAAC 


TCTTCGTAGT 


CTACATCTAA 


ACAAGCGTAA 


GGATCTTCCA 


TCTTATGGCC 


10320 


TCCCTTAGAT 


ATAAAAATAG 


CACTCAGCCG 


CTTAATGGGT 


GAGTGCTACT 


ATTCGTTATA 


10380 


ATTATCTGCA 


TCCGCTTGCC 


ATCTATCTAG 


CAAAGTGGGT 


TGATTAAGTT 


TATGTTGTTG 


10440 


ATAATATTCC 


TGTCTCAACC 


GCTCACGTTC 


CTCAGGCGTT 


TTCGCTGTGT 


CTAGCAGCTT 


10500 


ATATAACTCA 


TCAAATTCCT 


TATCAGTCAT 


CTCTAAAGCG 


CTACGCCTAT 


AATCCACATC 


10560 


TACCTTACCC 


AAGAATTGTC 


ACCTCCAATA 


TATTATCGAC 


TATCTCCACT 


GTATACTTCA 


10620 


AGTTTCTGTT 


CAGTAGGAAT 


TCACTTTCCT 


TTTTCATCGA 


ACTGTTCGTT 


CCTATATAAG 


10680 


CGCCTTGCGT 


TCCTTTAGGA 


ATCAGGAATT 


TAACTACTCT 


ACCATCCTTA 


TTTGCACTGG 


10740 


TAAAATTTAA 


AAAATCCTCG 


GCAACTTTTT 


TGTTGATACT 


AGTAGACTTG 


AATTCTTTGA 


10800 


ATACATTTCC 


GTTCAGAATA 


GCATCGTATT 


CATTTGCAGA 


TACACCTCTA 


TAAGTTATTA 


10860 


TATCATGTTT 


CAATTTAAAT 


TTACTAATAG 


CATCATCTAA 


ATTTTGAATA 


ATATTGACAA 


10920 


GATAAGGTTT 


TTCTCTTGGA 


TTATATTTTT 


CATTTCGGAG 


TATCTTATTC 


ATTTTAGCAA 


10980 


ATGCGTCTGA 


TGTATAATCT 


TTCATAGCAT 


TAACTTGTTC 


AGGCTTCAAT 


CCAGCAACCC 


11040 


ATTTATCACT 


ATCATCTTGC 


AGATCAGATT 


GACCGAAGAA 


GTCCAGCTCA 


GTAAAATTCG 


11100 


GCTTTGGTTT 


CGATTCATTT 


TGTCTGATTC 


TATCTGCATA 


CCACTCATCA 


TAGCTTTTAA 


11160 


ACTCCGACAT 


CTCTTTAGAT 


TCATTATCTT 


TTCTAAGTTC 


AGGGCTGATA 


CCGTCGACAA 


11220 


CTTCAATTGT 


TGTACATCGG 


CAATTCACAT 


CTTCTGATGC 


TACCCCAAAC 


ATCCGCGGAC 


11280 


CTTTTGCCGA 


ATGCCCTCTA 


ATGGTAAACT 


CTTCGTCTAC 


ATCAACCTCT 


TTACCGTCTA 


11340 


ACTCTTGGTG 


GGTTCGGCGT 


6TATGTTTAT 


CGATCGTGGC 


AAGCCATTTC 


TTTTTAATAT 


11400 


TGATGCCCAG 


CTCTTTTGCT 


TCTTCATATC 


CTTTTTGGGT 


AGTGACAGAC 


TGAGTACGTC 


11460 
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CTGCTTCTGT 


TCTTGCAATA 


CGTAATGCTT 


GTTTGTAGCT 


AGCTTCTGTT 


TCCTCATTAA 


11520 


TCCATCTAGC 


TATTTCAGCA 


TAACTTTTAC 


CCTCGAATAA 


GCCCGTTATG 


ATATTGTTAG 


11580 


TCACATTTTG 


GGCTAATTCA 


TCACGGTACT 


TGTATAAACG 


CTTTGAGAGC 


CTCTTACCCG 


11640 


CTACAGGTGC 


ATTGACGAGA 


TTCATAATAT 


AATCATGGTT 


AATTAACGGC 


ATACTTAGTG 


11700 


CTATATTCTG 


TGACTGCTCT 


AGCGTATACC 


ACAGTCCATA 


ATATCCCTGT 


TCAGCTTGTT 


11760 


TTGCAGAATA 


GCCTTTGATA 


GTTTTTTCAA 


CTTTCGGTGA 


GTTTAACTGA 


AGAATGGCAT 


11820 


TAATTTCATC 


AGCAACACTA 


AAAAGTCTCT 


CTACTTCCAA 


ACGGGTGGAA 


AAAGAAAGAC 


11880 


TTTCTGCATT 


TTCTGTATAA 


ACTTTTAGCC 


GCTTTTTGAT 


GTCGATTAAT 


GATTGACGAT 


11940 


AAACATTAAA 


TAGCTGATTA 


TCCGTTTCCT 


GGTAATTGGC 


TTTCTGTAGA 


GCCTGTAGCT 


12000 


CCTTCTGCCA 


TTTGTTGAGT 


TGGGACATTT 


GGGGTCACCG 


CCGTTTCTTC 


TTCATCGGTG 


12060 


TTGTCCGTCA 


AACCTTGCGT 


ATATTCCGCC 


TCTTCTAATG 


CCTCTTTCAC 


GTCTTCCCAG 


12120 


TCAAGGTCAA 


ACTGTTCGCA 


GATAAGGCGT 


AAAACGTTGT 


CGTCGTCAAG 


GCGAGGTGCT 


12180 


ACTTGCAAGA 


TAGATTCAAG 


GATAATTTTT 


CTTGTCTCTG 


CTTCGGTTTT 


CTCATTGTTA 


12240 


ACAATATCTG 


TTTCATTCAC 


CATTACTTCT 


CTAGTAAACG 


TAAATGAAAC 


TTCTGTTGGG 


12300 


TCAAATGCTT 


TAGTATAACG 


ACGATTAATG 


TCATCAATAA 


CAAGCTTATT 


CATCCATTCT 


12360 


AGTAAAGCTC 


GTAGTCTGGC 


TTCAGTCTTG 


TTGGCTTTCA 


TGTTTAATAG 


TGTATAACGT 


12420 


GCTTTTATCA 


CTATGTTGGT 


GATATTCCCG 


TCGCCTACTT 


GAGTTGAATC 


AAAGGCCATT 


12480 


CCAAACTTAT 


AGATATTTTC 


TTTGTCAATC 


TCCATTTTGG 


TTTTACGCCC 


TTCAGTTGGA 


12540 


ATCGTTACTG 


TTTTAACATC 


TAAGCCCCCA 


TCTGAACCAG 


TACCAACAAC 


CTTTTTAGAT 


12600 


TTAACATTTT 


GTCTTAATTT 


TGAAAGATCG 


TCACCTTGGA 


AACCTGATAC 


AACGTAAATG 


12660 


GCTTCCGCAA 


AATCTTGTAA 


ATTGTTTGAT 


AAAAAACAAT 


TCATTAAGTC 


GTAATCATCT 


12720 


ATCAGTGCCT 


TAATAGACTT 


TAAATCCGTC 


GTTTCCTGTT 


TATTATTTGA 


TAACCGATAA 


12780 


AAAGGTATTT 


GTCCGTAGCT 


TCGTTGCAAT 


AAGCTTTCAT 


TATCGCTATC 


AACTGCTAAA 


12840 


ACATGTGGCC 


TTGGATTAAT 


TGGTTCAGCT 


TCATCTAATT 


CATAATCTTT 


GTTATCCTCA 


12900 


GCTACGAAAA 


AATAAACGTT 


CTGGTCAGTC 


CACACTTCCG 


CGTGATGGAT 


ATCGACTGTC 


12960 


TCGCCGTCTT 


TCTCGATTTC 


AGTGATATAA 


TGACGGCAAA 


TACGTTGTAA 


TTCATTGTAT 


13020 


TCATTGTATA 


CACCAAACAC 


GTTTAAACTA 


TCAGCCACTT 


GAAAACACAA 


TCGATCTTCT 


13080 


GCATTGGTTC 


TTGCATAAAC 


ATACTCAAAA 


CCTTTTTGGC 


TTGAACCTTC 


CACAAGCTCT 


13140 


TGTAGCACTA 


CTTGAAATTC 


AGAATTGTAA 


TACTCTGCTA 


AATACTCTTT 


AAGTTCTTCG 


13200 


TTTTCTGTTT 


CGTATTCAAC 


AGGATTAGAT 


AAAAGGTATT 


GAGTTTTTTG 


GTCAACAATT 


13260 


TCTGGGAAAA 


AGCCGTGCGG 


GATTCGTACA 


TTCGAGGCGT 


ATTTGTCTTC 


TCGCAGTACC 


13320 


CCCTCATCGT 


CTACGTAAAA 


GATACGATTG 


TTCATGATAT 


CATTTTCATG 


GTTATAGTAC 


13380 


CGAATACCTG 


TCTCTGCCTC 


TCTTTTTGAG 


GTAGATTTTC 


GATCTTTGTC 


AATCGCCGCC 


13440 
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TTTAAAGCAC TAGCGATAAT 


CTTCACGTCT* TCACTAAGTA 


AAGCTTCCAT 


TTACAGCCTC 


13500 


CTTTCTAGTA TAGCCATTTA 


TTTGTGTTCT 


TAATTTCTCT 


AAGCAGACTA 


GCTGCACTGT 


13560 


CTGGTGCGTC GTCGTGCTCT 


GCGTTCTCTG 


TATAGTCTAG 


TATCTCGGCT 


ATATATTCTT 


13620 


TGTCTGTGTC TTCAAGCCAA 


ATTATTCGGC 


TCCAATACTT 


CCGCAAATAA 


GAAGATATCT 


13680 


TGATAAACTT ATTTGTTTTT 


TCGTGATACT 


TTTGCACGTA 


TTGACTTCGT 


TCGATTAGGT 


13740 


GTTTAGCTAA ATAACCTTTA 


TCGCCGTTCG 


TTTCAGTGTA 


AAATGTTCCA 


GCTTGGTAGT 


13800 


ATTGATGTAG TTGCAATATT 


TCAGGTAGGC 


AATCATCAAC 


GTGTTTCTGC 


CATTTCTTGC 


13860 


CAAACCCGAT AATGGTTCCG 


TCCTTTTGCT 


CTTTAAAAAT 


GGTAAATGCT 


GTGCTATCAC 


13920 


CACCTCCGTA TGCCGCGTCG 


ATGTGTGCTA 


CACCGTTATA 


AATAAGGTTA 


GTGTTATCGA 


13980 


TATAGGTCGG TGCAGTAAAT 


AGCGATTCAC 


TATCAGCGAT 


GTGTTTTAAC 


TCGTAGTTCG 


14040 


CTGCAAAGAG CGACGGTGTC 


ATAGACTGCT 


GTAACGCTTT 


ACGTTGTTCT 


TTGTCAATTA 


14100 


ATCCTGTTTC GTAACAATCG AATTTCTTGA CGTTAGGCAT TTTAGAAATT GCATCTTCCT 


14160 


TATGCCAAGG CGTGCCTGTG 


TTAATAAAAC 


GGCCCTCACG 


ATTCTTCACA 


TTCTGCAATT 


14220 


CTTGATACTG CAGCTTTGTT 


TTCTCTCGTT 


CAGCACGACT 


TACTCGGTCT 


TTAATGTTAA 


14280 


CAATATCATC AGTGATAACG ATATCCGCGT GTTTACCTGT TAATGAAGCA TAAATCCCCA 


14340 


TACCAAGTAA TTGAGATGTT 


CCCCGGGTAG 


ATGTTTTTAA 


GTTGGTATCT 


ATTTCTGTTG 


14400 


TAGTCTCTTT CAAAAGCACC AATTCAACAC 


CATATAATGC 


AAATACAAGC 


GTTTTAAAGT 


14460 


ATTTGCTAGA TAAAACCTTA 


GCCACTTGTA AAATAATCTC 


TACGACGTCT 


GTATCGGTCT 


14520 


TACGTAAGAA GATAATATTT 


TTGTTGGGAA 


AAAGAACCAT 


CAACAATGCA 


ATAGCAATCG 


14580 


CCAATGTGGT TGTTTTAAAT 


GAGCCACGAT 


GAGCCAATAG 


CGTTTGATCG 


TCTTTCTCGA 


14640 


ACAAAAAAGA CTTTAGCCAA 


TCATTGTGCA 


GTTCTGCTAA 


GTCGGTAAAC 


CCAACTAAAT 


14700 


TTCCGAAAAT TACCGGGTTC 


GTTTTGATTA 


AGTTCAAGTA 


TTTTCGCTTT 


TTAGGGCTCA 


14760 


TGTAGAATCA TCCTC6AAAA ATTGCTCGAT 


TTCGACAGCT 


GCATCGGAGA 


TATTAAGCAC 


14820 


GCCTGAAACT TCAGTTTCTT 


TTCGATCTCG 


CCATTCGTCC 


GGCTTTCTAT 


TTTTCAACCA 


14880 


GAAAATAGCT GCAGTTGGAT 


TAGGAGCTAC 


TTGTTTTTTA ACCTTTTTTG 


TAATTTCCAT 


14940 


ACCGTTCTCT GTTAGTTCTT 


CTGTAACTTC 


GGTATATTCG 


TAGCCTGTAG 


CACTCTTAAA 


15000 


TAAAGCATTT TCAACTTGTC 


TATCAACGAC 


TTCTTTTCCC 


TTTTTTAAGG 


ATGCCGAAAG 


15060 


TGCCGGAAAT TTTTTTACCC 


ATTCTCTGAA 


AGTTG/^TAA 


GCTACCCCGA 


TATTCTGGGC 


15120 


TATCTGCTTA TCGATGAGGC 


CATCTCGTGC 


CCATCCTTCG ATTTTGATTA ACCCTTCCTC 


15180 


GGTTAGCCAC TCTGTGTAcT 


TCGCCATGAC 


CTCACCGCCT 


TTCTGTATCA 


AAAATAGACA 


15240 


CCTTCCCATT TTCTTGGGAA 


GGTGCCTTTT 


CTGCGTAATT 


TTACTGATAA 


CAGTATATCA 


15300 


CGATTTATCA ATGGGAATTG 


TCACATCATT 


GTCACTTTTT 


TCGATTTTCT 


TCAAAATCTG 


15360 


GTTATGTTTA TTTCTGATAT 


AGCCATAGCT 


ATGCCCAAAT 


TCTCCCGCAA 


TTTGTTTTAA 


15420 
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CGATAGCCCT 


TCGATATACT 


TTGCTACTAA GATTTTTTGA 


TCTAATCCTT 


TGAAACTATA 


15480 


AACTAGCTTT 


CGTAAATCGT 


AAATTGaGTT 


CATTTTCCAC 


GCTAAACGTT 


TTTCTAAGTC 


15540 


TTCGATAATA 


TCTTCCAACT 


TAGAAGCTTT 


GGAATCTTTT 


GTTAGTGTGT 


ATCTTCCTAA 


15600 


ATCCTCTGGG 


TCACACCATC 


TTTCTAACTC 


TTTTTTATAC 


GTTTCTAGCT 


CCGAGTCTAA 


15660 


GTAATAGATT 


TCTTGTTCTA 


ATTTTTGATA 


GCTGACTAAC 


CATTCGTACA 


ATGTACTTTC 


15720 


ACCTACCTTC 


TTGTTCATCG 


CTTTCACTAT 


TAACCGTAAC 


TTGTACATTT 


TTATACAGAA 


15780 


TTAGAGTTAA 


TAATGCGGAT 


AGCAATATAA 


ATCTAACATC 


ATACGGTATC 


TTAGCTAATA 


15840 


TCGAAATAAT 


CCACTCCGAC 


ACTTTGTATA 


TCATCATAGC 


TAAACCTAAA 


CATAACCAGA 


15900 


ArGATATTAA 


TTTATCCAAG 


TTTATCCCCC 


CTATACTCCT 


CAATTTTGGC 


TTTCACTGCA 


15960 


GCCATTAGCG 


CTGATTGCCC 


TTGTTCTTTT 


GCTTGTAAAG 


CTTTGATTAC 


TTGTTCGTCA 


16020 


ATAGTTCCTT 


TCGTCACTAA 


GTGATGAATA 


ATAACAGGCT 


GTGTTTGTCC 


TTGGCGGTCT 


16080 


AATCTGGCAT 


TGGCTTGTTG 


ATAAAACTCC 


AAAGACCATG 


TAAGCCCAAA 


CCACACGATA 


16140 


ATGTGCCCGC 


CTTTTTGTAA ATTTAGTCCG 


TGCCCTGCTG 


ATTGTGGATG 


GGCTAAAAGT 


16200 


AAAGAAATTT 


TTCCCTCGTT 


CCATTTCTCA 


ATATCCCCGT 


CAGACACATT 


TAAAGCCTTT 


16260 


GCTTGTTTAA ATCGTGCTTG 


GATTCTTTCT 


AGGTCATGTT 


GGTATTGATA 


AAAGACTAAA 


16320 


ACCGATTGCC 


CTTGTGCGTC 


CTCTATAACG 


CGTTCTAGCG 


CATTTAACTT 


TTCTTGGTGT 


16380 


ATTTCCCTTC 


CGTCGCCGTT 


TTCGTCGTAT 


ACAGCGCCGT 


TAGACAGCTG 


TAAAAGCTTG 


16440 


TTTGATAGTG 


TAGCCGCATT 


ACTAGCTACA ACGTCTGTTC 


CTTCGAGTTC 


TAACACGTAT 


16500 


TCCCGTTCTA 


GCTCTTTGTA 


CTGTTTCCAG 


CTTGTCGGGT 


TCAAGTCTAG 


CTCGATAATA 


16560 


TTTTCCGTTC 


TCGGTGGCAG 


TCGOUUVTAA 


TCTTTTGCTT 


TCATACTCAC 


GCATATATCG 


16620 


CTTATTTTGT 


TGTAAATCGC 


TTCTTCTGCT 


CCTGGGATTA 


ATTGCCAAGA 


ATACACGATA 


16680 


TGCCCGTTTT 


TTTGTGCGGG 


GACAAAATAT 


TTATTACGGT 


ATTGTGTGAT 


AGTCTTTCCT 


16740 


AGCCGTTCGC 


CTTGGTCCAA 


TAAATACATC 


TGCGCCCACA 


AATCCATCAA 


GCTGTTTGGG 


16800 


GAAGGTGTGC 


CTGTTAGCCC 


GATAACACGT 


TCCATCTTCG 


GCCGTACTTT 


TCGTAATGCT 


16860 


TTAAATCGTT 


TGGCACTACT 


TGACTTAAAA 


CTGGATAACT 


CGTCTATAAT 


CACGGTTTTA 


16920 


AAGGGCCAGT 


TTCGTTGATA 


GTAATTGACT 


AACCACTCCA 


CATTTTCACG 


ATTGATTAAA 


16980 


TAGACATCTG CCTTTTTGM TAAAGCTTCT TCACGTTGCT 


TTGGGTTTCC 


TAACACTTTT 


17040 


GAAAAAGTGA GGTGCTGCAG ATGGTCCCAT TTTTCAATTT CATCGGTCCA AGTTTTCTCG 


17100 


GCAACTGATA ATGGGGCAAT 


TACTAACACA 


TTTTCAATAA 


TTTCAAACGT 


GTGTAACAAC 


17160 


TCGTCAATAG CTGTCAAACT GGATAAGGTT TTCCCTAGTC CCATATCTAA CATCAAAGCA 


17220 


CAGTAAGGAT 


GATCAAGAAT 


AAAATTCTTA 


GAGTATTCCT 


GATAGGGATG 


TAACGTTGCT 


17280 


TTCAAACTCA ATCACTCCTT TTTATGGGAC ATATCAGGCA 


TAACTCTTGC 


TATACCGCTT 


17340 


TGCATTGCGT 


AGATTAACAT 


ATCCACTCCT 


TGCTTGCTGT 


CAATGACAAA 


CACTGGGATA 


17400 
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AGCTGCTCTT TGaATTTCTC TATAATTTTT AACTGGTCTT TTCTTGGTTT ACCTCCAGGC . 17460 

CTCTTTAATT CAACGAAAAA CACGTTGCCT TGATATAAAA TAATCCTGTC aGGAACACCC 17520 

CTAGTTCCTG GGGAAGTGAA CTTATAACAT AAAgCTCCtA TGCGTTTTAT CTGCCTAACT 17580 

AGATACTTTT CAATGTCGTT CTCAATTTGC ATTACTTCGG CCTCCCATCT GATAAGATAT 17640 

TGGTTGATGT GTTCCATCAG ATCTATTTTC TTCAACCTCC GTTTAAAAAT ACCGTGTCAC 17700 

ATGTAACACA AAAATTCTTC AACTTATATA TATTCCCCAT GTACATATAT ACATATACGC 17760 

CCTATATATG TATATTAATT ATTTCTATAT ATGAATAGAA AATAGTGTTA CAGGTGTTAC 17820 

GATTGTnCTT CATTCTTACT CTCCCAAGAT GTAATAGCGT AACACTCCCT GTAACACCAA 17880 

CGGGAAATAA GGTGTTACAG GTGTTACAGT GAATATTTTA GGTGTAACAC TCGTAGCTTT 17940 

- TTTAGGTGTT ACACCTAACC CCGCGTCATT ACTACCTTTT TGTTTTAAGG TGTAACACGT 18000 

AAATAGGCTA CTTGTACTCC ATACCCAGGG CCAAATCTTA AACGTCCTTT ATTACCTTTG 18060 

CTGTTTTTCT CCCAACCAGA TAAATGGCTT AAAACTTGAC GAATTTCTGC AGCTTTTGCA 18120 

GGATGTATGT TTTTAGAATC TCCGTTATAC AATTCATTCC AAACCTCTGC AATACATACT 18180 

TTGTTTCTAA CAATCTCGCC TTCTTCCTGA ATATCCGTCC CCCACCCCTG AATATACTCT 18240 

CGTCTTTCTT GCTTACTTCG TTTGTACCAA TCTTCCGTTA TTGGGATTTC CAAATATTCA 18300 

AGGATTTCGC CTTCCATGCT AGAGGTTTCC GTGTGCATTT CTTGGGCTTC TAGTGCTAGT 18360 

TTTTCTTGTT CGTCTGTTAG ATACAAAGGT TCTCCTGCTT GCCACAACTC AACGGCTTCT 18420 

GcCCATATCT GATTGCGCGT TTCGTCGGTC ATTTCCCACA CTTTATTTTT AATCGGTTGA 18480 

ATGCCGACAT CTACTGGCCA AAAACGCCGA TTTCCTGTTT TATCTCTTAG AAACTCGTTG 18540 

TCGTTCGTTG TTCCCCAAAA GACACAACGG CGTTTAAAAT AGGACTTATG CCGACCGTAA 18600 

"GCAACGCGGA AAATGTCTTC TTGCTTGCTA ATAAAATGCT TGGTGGCTTC AATGTCTGCT 18660 

TTTTTTGTCG CTGACAATTC GCCCATTTCC ATGATCCAAA CACCTTGCAA CGCTTCGTAA 18720 

GAGTCTTTCC CCGTCACACC TTCTAAGCTG TTAGAAAACC AGTCTCCCGC CAATTTTGCA 18780 

GGCAATAGCG TTTTTCCTAT TCCTTGCGGA CCGCTAGTCA CTAGCATGTA GTCAAACTTA 18840 

ATCCCTGGCA CAAAAATACG CCCTACGGCA GCTGTAAGAA ATTTTTTCGT AACTACTCGA 18900 

ATGAACGGTG TATCTTCTGC GCCTAAGTAG TCTATTAGTA AAGTTTCTAC GCGAGGCACT 18960 

CCGTCCCAGT GCAAGCTCTC TAGGTATTCT TTCACTGGAT CATATGAGTT TCTTTCGATT 19020 

TCTTGGACTA GCGCATCATC GATTTTGCCA CGGTTCACAA TACCGTAAAT TTTTTCGATA 19080 

TATACCCGTA ACCCCGCATC GTCGCTGTCC TTCCACATCT TGTCTCGATC TAACTTCCGC 19140 

CAAGGCAAGT TGTCTTTCAC CTCAATACGA TTAGAAAAAC TGTTCATAAA AATTTTCTTT 19200 

TTGAGGTTCG GGTCATTCAG CATAATGACT TCTAAATTTT TCGCGGAAGC CATAATCTGA 19260 

CCGTATTCGT CAATTTCGAG GTCCAATTTT GTAAACCAGT CTTTGTCGTC CTCTTCCAAT 19320 

TCGTCTAGCT CGCCGTCAAA GTCTTCTAGT GCTTGTGACA GCTTTTCACT CTGAATCAAG 19380 
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GTCTTAACTT 


GCTTGTCTTC CATTGCGAAC TCTCGCATCG CTTTAAACGA TGGATAACGG 


. 19440 


TCAATCCGCG 


TCGTAGGTTT GACGTCTTCA TCTAAATCGC CAAATAGATG GATACGGACA 


19500 


AGGTCAAACG 


CGTTCACCAG TTGATCGCCT ACAGGGTCCG TACCGTGATG AGAATAAGCG 


19560 


AACTTATCAT 


CATAAATCAC TAGCCCACCA CTAGTAGAAC CTTCTAAAAA CGTCCAACGG 


19620 


TCCTCTCTTC 


CTGTGGACCC GTAGATGTCA GGCAAAAAGG TTTCAATCGC TGAAATGATG 


19680 


TCATACGTGC 


GACAAAATGC CCCGACAATC CCTTTTTTCT CTAATGGGTC GCCTGCTTTT 


19740 


TTTGCTTGTC 


TTTCGCGGAT AGAATGCCCA CGGGAACTTT CTGGCCAAAA ACTTGCATCT 


19800 


CGCCAGTCTT 


CGTATTGGCT TAGAATTTCG TCAGGATCGA CCCACGGCAA ATCGATGTTA 


19860 


TCGGTGAAAT 


ATTCCCCGTC TATCGAATGA CTGGGCCAAT ACATTAAACG TTCTGCCTGA 


19920 


TAGGTCGTAT 


CATCGAAGTT ATCCATGCCG AAAAACTCCG CAAGCTTTCT CGCTAGGGGC 


19980 


TCATACTCTT 


CGGCGGTCAC TGGGCGACTT AACGGAATAA TCAAGCGATA ACGTGGACCT 


20040 


TTTACTAAAT 


GGCTGTGTGT CGTGTAGACA GCTGCAGCGT GGTCAAATT^ CAGCTGTACA 


20100 


TCGTCCCAAA AGTCTAACGT CGTGCTGTCG GCATCTAACG TAACAAGGcT ACGCTGTTGC 


20160 


GTGTTTCCTC 


TTTTCCGTCG TCCTTCTTTT AACCAACCAC CAATAAACGC CCCAACGTCT 


20220 


TTAACTTCGC 


CCTGCTTAGA CTTTGGCATT TTTTTGTAGT CTTCGACAGT CTCTTGTGTC 


20280 


ACTGTCGGCG 


TTTTCAATCG TTGGGTAAAA TCCGACCAAG ACATCTGCCG ATTTTTCCAT 


20340 


TTTTTCTCGG 


TTTTTGAAGC ACCCACTGCA AGATGGATTT CACCATCATA GGCTAATTTG 


20400 


ATGTTTTTTT 


CTGGCTGTTC CATCTTTCCG CTTCCTTTCT TTTATTGTTT AAGGATGAAA 


20460 


CCTAAACAGT 


CAAGCCATCC TTCTTGATAG TCACACGTAG AGGATTTATG ATCATAATTA 


20520 


AACGCAGCTC TCATTGCTTT AAGATATTTA CGCTTACCAA CAACGTAGCC CAACAAAAAG 


20580 


CAAAGTACTA AGTTTATTAG TATAAGTATG TTATTTCCCA TTTATTAACA ATCCTTTCCA 


20640 


TATTCGTACT 


CGTAAATTAG ATTAGTTAAC CAGCTATAAT CTAGCTCTTG CTCTTCGTCA 


20700 


AACTCCTGTT 


CGTTGACATA AGCAGCAAAA TCTACGATAT TTTCGATATT TTCCTCATCA 


20760 


TAAAAACCCC 


AACTCCAAAT GAATGCTTTA ACACGAACGA GTTCCCCAAT ATCTTTTTTA 


20820 


AAAACAAAAA 


CGGGCTCATT GACCATTTCA ACACTCATAC AAAATTCACA TGTCCCCACT 


20880 


TCTTCTTCGT 


GAGGATTAGT TTCAACATCA ACTAGTTTTA GTTTCATTTT CTATCGCTCC 


20940 


TTTTTCTAAT 


TTTTAGTGAT TTTCAATTCC CAATCATCGC TATCAAAAAA TACAAATTCT 


21000 


AATTCCTCAA AATCGTTTGT ATTTTTTATC AAATCTTTAA ATTTCTTATT TGAACAAATA 


21060 


TTTCCAAAGC GTTTTAAAAA GCGGTGCTTA AAATTTTCAT AATCACTGTA ACTAAATTCA 


21120 


AAAACTTTCC 


CAGTTGAAAA AACTCCACCA GTTGTCAGTT CTAACTTTAC ATAATTTAGT 


21180 


GCCATTATCT GAACCTCCTA ATCTTTCATG TAATATTTCG TTTCAAATCC TTCGGCGTTT 


21240 


AACGGCAAGC 


CTTCCGCCCA ATCAGGAACT ACAGACATGA TTTTGTTCAT TTCTTCAATA 


21300 


GATTTTTCCC 


CTTCTGGCAC TTCGGCTACT GCCTCATCGT GCACATGGAA AACAATAGGA 


21360 
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tagccctca: 


: GTTCrAGCCT CTGCATCGCT TCTGCTAAAA CGTCTCTTGC CGTTGCTTGG 


21420 


ACAATGrrrr ccaccaattt accgccgtag gtttgctgtt ccgtgaaagc gactttatcg 


21480 


CCTTGCCCrr c^t.-j^uvaat agcaggaccg tagtcacctt cttttaaatg cgcttttgca 


21540 




21600 


T T CAAC C Z 


: GAGGGCCTTT TTTAATGCCG CCGTTTTGTA AACAGTCAAT CACAGCTTTT 


21660 




^-CAAAAGTT CTTAATGCGT TTATTGGCTG TACGCCAACG GTCCACAATG 


21720 


TCTTGTAA- T 


C3TG7TCTTC AATGCCGTTT TCTAATGCAC CCATTGCTTT AAGCGCCCCT 


21780 


GGTCCGCCTr 


GGTAACCAAG CGCCAATGTC GCCACTTTTC CACGTTGGCG CATGTCTTTA 


21840 


CcTTCGTGGC 


TTTTCCAGTC GTAGTCCGTC ACTtCGCCTA AATGGAACAT CTGCGCCGCT 


21900 


GTTGCTCtGr 


AJiATTTTGCC GTGTGTGCGG AATACTTCTA ATACCCAATC TTGTTTGGCA 


21960 


TACCAAGCAA 


TCAC7CGGGC CTCAATGGCT GAAAAGTCAG ACACGATGAA CCGATGCCCT 


22020 


TCTTTGGCAA 


CTAAACCTGT TCGGATAAGT TGTTTCAATG TGTCTGGCAC ATCTTCATAC 


22080 


ATTAATCCG.-. 


TGCCTTCAAC ATCTTTTGCT TTCACAAGCT GACGGGCAAA ATCAATTTCA 


22140 


CTTAAATAG? 


TTC7AGGCAA GTTCTGTACT TGTAATAATC GCCCTGCCCA CCGTCCTGTG 


22200 


CGGTTGGCAC 


CGTAAAATTG TAAAATGCCA TGAATGCGAT TATCTGAACA ACGTGCATTA 


22260 


rCCATCATCA GATATTTTTT CGTGCTTGAA TTAGACAAAC TAAGGCGCAA TTTTAATACC 


22320 


TCCGCCACAT 


TCTCGGGCAA GTTTCCTAAT GCTAGAGCCT TTAACACAAC CTCTTTTCCT 


22380 


AACTTCTCGA 


AAGGCGTTCC TTGTTCTTCT AGCCACTTTT TCAGTTGCGC TAAGCTGTTA 


22440 


GGATTTTCCi 


ATCCTGTTAA AGCCTTCATT TCGTTTAATC CCGCTTCGCT TAAATCGGCC 


22500 


ATAATATCAA 


TAGCAGCCGT TGCCAATTCG TGGTCAATTT CTGCCCCTCT GTCGTTAATC 


22560 


CGCTGATCCA 


AGGTGTAAAG CTTCCACTCA CTTTCGGGGA CAGGGAAGCG ATTCAACTTA 


22620 


TTCGCGATAg CCA7TTCTAC ATTGACGTCT TGAATACAAT ACTCTATAAA CGTTTGCCAT 


22680 


TTTTCGGGAG 


CGTGTTCAGG TAAATTCCTT GTACGCATGC CGTTTTTCTT GGTCGGCTTA 


22740 


CACGGTTTAG 


AAAAGAAGTT AATCAGTTGC GTACCTCTTG TGTCTTTTTG CTGTTCGATG 


22800 


TTTAAATATT 


TCGCACATTG CCCTAAAGAA GCTGGCAACC CTAATTCGTT TGCGTGAACC 


22860 


ATCGTACAGT 


GCCATTGTGC AGGGTCTAAG TAGTACGGAA TACCTAAATA CTTAGACAAA 


22920 


CAAACGCGCT 


CAAATTGGGC ATTAAATGCA ATTTTAAGCA CGTTCTTATC TGTTAGCGCA 


22980 


GCCACTATCT 


CGTCAGGTAT TTCGCTAATT GTTAAGTCCT CACATTCGAC AGGCTGACCA 


23040 


TCAACCGAA? 


AAGCAAAAAG TAAAATTTCA AAGTTGGGGC TATCAGCGTA TTTATAAACC 


23100 


CCAACT7TGG 


TTAAGTCTTC ATCCGAATAC GTTTCAATAT CAATATTTAA CGTTTTCATT 


23160 


TGCTATCAGC 


CGT7TCAGAT GGTAACCGCA ATTTACTTTT ATCAACGGCA ATCATCGGAA 


23220 


GCTGTGCTTC 


TTCCTCTTTA AAAACATCAA CGTTGATCCC TAGCTCTTCA AGATAAGCCA 


23280 


CTCCTTGTT? 


AGCAGGCACT CTTTCTACAT TTGCAATAAT CGCATTGTAC GTTTCTCTTA 


23340 



wo 98/50555 



PCT/US98/08985 



657 



CCTCTTGGAT TTCTTGATCG TAAACAGCTT CTACTTTCGC AAGCCCTTTT ATTGAGTTAT 
ACTTAAGTTT TTCAAGAAAC CCTTTCTTAT AGGTTTCAAA ATCTCTAGCA TCCGTATCGC 
GATAAAAGCT CCATATTTCC AAACTGTTTT CTTCTAACAA ATCGACAAGT TCATTTTTTA 
CGTTTTGGCG AATTCTAAGT GAGCTGAAAT ATCTATGCAA TTTCTCAAAT TCAGGATCAA 
TTTTCAGCCA CTCTGATTCA ATATGTGCAC GAATAGCCTt CGTCTTTTTC GCTCTTATTT 
CTTTTTCCAC TGAATTTTTT AAAGTAATCA CATGGTTTCT TAAACGTGCT TTAGTTGGTT 
TCATTTTTTC CTACCCCCTG ATCTTCTACT GGTGTATATT CTTGTTTGAC ATAAAAATTA 
GTATCTCCGA ATTCTTTGAA TTCCCAACGG TGCAATCTGA CGGATTCTAA ACGGCTACCG 
TTTTCAAAAT ATATAGAATC CAGTTTTCTC TCTTCAATTA GCTCAAAAAC ATCTTGTAAT 
TCAATAG^SCA AATAAATTGG TCGTTTACCC ATTTCAATTT CTCCTTTCAA AAGAAAAGGG 
GCATTTTAGC CCCTTATTCA ATTTTTAGCT AAACATATCG TCTTCTTCGT CTTCCCATTC 
TAGGTCACCA AAATCGGATT CAGCATTGGC ACGACCGCCT AAGAAGTCCC CTTTACATAA 
CGTTAAAATG TTGTTTAGCC CAGCAGTAAC CCCTTTGTTT CCTGcAGTAC TGTAAGCATA 
AAAGTTAATA GATGCAATCG CATAAACACC TGAATAGACT TCGTCTGGAT CGTCTGTTTT 
TACAAGCACA CCATCTTCAC GTTTTACGAC TTGTGGTTTC GTTTTACTTG ATACGTTGAT 
AAACATTGCG TTTTCAAATT CTGGGCGTTC TTCGGTGTCC ATTTCTTCAT CGCCATCGCG 
TAGtAgTTTT CAAGCGCTCA AATTTAACGC CTTTTAATTT ATCCCCTTTT GCGCCTTCAT 
AGGCTGTTTT GATTGCTTCT TTCATCGCTT TAAGGGTTTC CTTATCGTCT TTTGGAATAA 
TCAACATGCA AGAATATTTT TTCTCTTGCC CTTCTTCCAT CGCATGAGGT TCTAACACAT 
GCACAAAACT TAATCTCACT TGATTTGTAA TTACTTTCGT TCCAGTTACT TTTGCCATTT 
TAAATTCCTA CTTTCTTATA TTTTTTAGTT AAATAATTCG TCTTCTTCTA AATCATCTAA 
CGAATTAGAA GCTTGTTCTG TAAAATACTG TTTAATGTCC GAGATGATTT GTTCTTTCAT 
CTCTAAAATA GATTCTTTAT AGTTTTTTAG GTTCATTGTC CACGTTTCAT CTACAAGCAA 
TGAGTTAGTA ACTAACGACG ATAACGCCGC TTCAATCGTG TTAAAATATC CTTTAAATAA 
CACTTTATTC TCACTGAATA CATAAAGCCC AACATTGCGT GTATCTTCTC TTGTGATGTA 
ATAACCTTTA CCGCTAAAAA TCACAGGCTT TTCTTTACAT GATTTTTGAG GCATTTTATC 
CCCCACCTTC TACTTAAATA CATCCAAAGT ACCTTCTTGA CGTTCT7VAGA TAACCACCGT 
ATCTGTCACC TTAATATTTA CAGGGCGATT TCTTGTGCTG TTTTCAACGG TAACAATAAC 
CGCATTTAAA TCATCAAAAG GCTGTATGAC AGTAAATTCA TCCTCAGaTA CGTTAATTCC 
TGAATGCTCC GCTTTCTCCA TTCCCTTTTT TAGAGTTGAA GCTATTGCCT CTTGTGTGTC 
AATTTTTTTC ATAGCCATTA CTCCACCCCT TCAAAATCAT TAAGCGCATT TTCTACGCTG 
TTTAACGCTG GTCGCTTGTC ACTTkCAGGA ACAAGAACAG GCTTTCCTTC TGGTTTGATA 
ATGAAGTCTG CAGCTAATTC CGCAAATTTT TTCTTGCCGA CAACTTTTTC TAATTGCCCA 



2340.0 

23460 

23520 

23580 

23640 

23700 

23760 

23820 

23880 

23940 

24000 

24060 

24120 

24180 

24240 

24300 

24360 

24420 

24480 

24540 

24600 

24660 

24720 

24780 

24840 

24900 

24960 

25020 

25080 

25140 

25200 

25260 

25320 
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ATGGCTTTCA GTTCCTGCGG TTTTAAAATG TCTTCATCTT CAAATCCTTC GGCTTCTAAA 
ACCATCAGTA GCCCTTCATT ATCTGAAATT TTTCGATTAC TtCGCCCAGC AACTACTTTC 
CAACCAGGGA ACTCTTCGCC TTCGTCCCTT GCTTTCTGTA AAGCGTAAGT TTCCACATGT 
TCCAGCCATT TTTTTATCTC TGGCGCACGT TCTAAAATTT CAGCAATTTC CTCATTGGTT 
AAAAGCGGTG CTTCTTTCAG TTCGTGCTTA TCAATCAGCT GGAAATTTCT CTCCGCACGT 
GGCCGTAGTT GGGCACGAAC TTTTGAAAAC TTCACAACGT CGTCTGTAAT CGTCCATTCT 
CCTGTACCTT CCCAAGCCTG CACTGCTCGA GGCGCCACGT AATTATCAGC CCAGTATAAT 
AATTCTTCTT TTTCAATTTC AAAGGTCGAA ATATTGTCTA 7VACGAGGTTG AATAATCGTC 
ATTCGCACGG TTTCAAATTC GTAAATAATG TCGTACTTAT CCACCGCACC AAGCGCATAT 
AACATCAGTT GAGGGTTTAA ATACGCATCA ACAGGAACAC CTTTGCCGTA TTTCAAGTCG 
ATAATTTCAA TCGTCTTATC TGATAAGACA ACCACGTCCG AAGTTCCAAA TCCTTCTGGG 
ACCCATTTTG AAAAATCTAC TTTTTGTTCT AATTCGACTG TGGCATTTTC GTATTGATTG 
ACACGTTCTT CTACTAAATC ACAATAAGCG GTGACGTATT CTTTCATGGA TTCATCACAA 
TACGGATGGT CTTGtTCAAA AAATTTCAAT CGTGAGTTTA CCGCTCTAGC GGTTAATAGC 
TTAAACCGct TCGCCAAATA CAACTCTGCT AATTCGTGTG CAGTTGTTCC CTCCTCCGCA 
TAAGAGcTGC CGCGATCTTT TACTTTTTCC TCTAGTCGTG CCAAAGGTGG GCATATTAGC 
CACCGATGCG CACTACTCGC CCCTAATAAG GCATGATTTC CTACTGGCAT GCTTATTCAC 
CTACCAACGT TTCTAAGTCA GTGATAAATT TACCGTAATG CTCTTCTTTC AAGTCTGAAA 
g^TTTTCTGC GTTGTAACGA CTAAAGCAGG TCTTTATCAT AGGTCGCTTG CCTTTATCCA 
TAGCCTTTCT CATCGCTTCT TGCACATCTG CTTTTGTTGC CCCTAGGTGT AAATCGGCTG 
TTGCTGATAC TTCTTCTGTT TCTGAATCAG TAGGCGCGAC TTTCTCCCCA GTTTTCGATG 
GTTCGGTGTC TGAGGTTGAG GCCGTAGAAA CTTCTTCTTT CTTCTTCGCA GTTGCTTTTT 
TCTTTTTTGC AGGTtCTTTC TTCGCTGCTT TTTCTACTTC TTCTGTAGCT TTTTTATCCG 
CTGTATTTAA AGATTGCGTG ACTTCTACAA CGTTGTCTGT TGCTAACTGC GATAAAATTT 
CTTTTAACTC TTGTGCTGAA TCTGCTTCAA TATTTAAATA GATGTTTGCC ATTTTTTATT 
CCTCCAGTTA TTTTAATTGG TTAATGTCTT CATCTGTTAC ATTGAAAATC GCTTTAAAAT 
TTGCGTGATA ATGCAACGGT GGGTCTACAT TACGTTTTTC ATAAACTTTT ATCAGCTTCA 
CAGGAAGTAG CATTCTTTTT GCTAATTCTT CTTGTGACAT GCCCGCTTTT CGTCTAATTT 
TCTTGTACGG CGATAACATG AGTTGCTACA CTTCCTATCT GGTTTAGCTC TAGGGTTAAA 
CATTCCCATG CTTCAAATGC TTGGATTCCC TGTTTTCTAA GGCAATTAAC AAGCGTTGTT 
TTTCCTGTAG GCCCcTGCTG ACCTTTCACA ATGATTGGTT TTCCCCATTT ATATGCTTTT 
AATATCTGCT CTTTTTGCTT CGATGTTAAA AAGTCAGGAA GTACTATCTT TTTATCGCTC 
ATCAACTTTC ACCGCCAATT CCCGATAAAA TCCAGGTACT TCTTCTTTTA GTTCTGTTTC 



25380 

25440 

25500 

25560 

25620 

25680 

25740 

25800 

25860 

25920 

25980 

26040 

26100 

26160 

26220 

26280* 

26340 

26400 

26460 

26520 

26580 

26640 

26700 

26760 

26820 

26880 

26940 

27000 

27060 

27120 

27180 

27240 

27300 
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TGTGAAACGT 
GTTGTAAGAT 
ATACATTGGT 
ATTTTCTTCT 
AATAGGATCT 
CTTATAATCA 
TTCAATTGAT 
ATGCTCTAGA 
AACTGTTTTT 
CCAGACATAC 
TACGTGATAA 
TTTTTTGTTT 
GCAATCATCA 
TCTAACTCTG 
CTTTCGTGAC 
AAAAAAGCTT 
TAAATTAAAA 
TTGCCACCTA 
TGATATATTT 
CATCGTACAC 
TAATCtTTCT 
CATAAAATGT 
TTGCTTTCAC 
TGGCAAACGT 
CCCTTTAAAC 
ATCTTCGCCT 
ATGCGGTTCT 
CTCTGTCTGT 
GATCTTTTTG 
ATCCGCCCTT 
CATCTAACTC 
GCGTGTCCTG 
TTTGTACTAC 



TGTTTTAGCT 
AAATATTGCT 
TCTTTTTTCA 
ACTTCTTTCG 
TCTAACCAAT 
ATATGTGCTT 
TTTTTAGGTG 
AAAGGAGCAC 
TCTTTATGCC 
CCTTGTTCTT 
TAGTTTCTCA 
AGTAGCTTAC 
CCTAACACCC 
CTTGAAAATG 
GTACTGCATC 
CATTAGCTGT 
GCCAAATGAT 
CCAATAGACC 
TTTTCACTGT 
CTCATAAATT 
TTCAAATTCT 
TTCATAAAAT 
CTCCCTAAAA 
TCATCGCCGC 
AACACAACTG 
TTGTAGCAAA 
TATCTTCTAA 
TAATAAATCA 
AATTTGGCAT 
TCTTTTCTTG 
AATTAATACA 
CATTAAAGCC 
GGCTTTATTT 



TGTCTGATTT 
GTTTTCCCGT 
CTGTGTAACC 
TCCACTCAAG 
GATCCCATCC 
CTTTACTTTT 
AAGTTGCTGT 
TAATAATTGA 
ATTTCCATAA 
CCAAAAATGC 
TTTGCGTTTC 
TTCGTTGGCG 
CACTTCGTTG 
CTTTAGCGTG 
TCTTAACGTT 
TAACATTTAT 
AAAAACCGCT 
TAGTCCGAAT 
AATCATCCTT 
GAGATATTTT 
TTAAATTGAG 
TCCACGCTAG 
CCATCACAGA 
CTACAATTTC 
AAATGTTGTC 
T7ATCATTTT 
ACGACTTAAA 
GCGCTTCTAA 
AATATCCACG 
TAACGGTTTT 
ATCTTATGTG 
AGCCATTTCT 
CCATATTTAA 



TGCTGTACAA 
AGCATTTATC 
ATCAAGAATC 
GCCTTTCCCG 
CATCTTATTA 
CTGTACCCAT 
CCATTCAATA 
CAATTCTCTA 
ATCGGTTTTT 
CATTAAGTTA 
TCCTTTTTCG 
GACGAGGTAG 
AGCATTTTTT 
CGGAGGTCCT 
TCAATCTTTT 
TTCACCTCAC 
ATATAAATGT 
ACAACCAACA 
TCTTTCCAAT 
TTTCTGGATC 
CGGCATCTAA 
CCTTTTGCAA 
ACCGTTAAAA 
ATTTGTTCGA 
GTATGTGCTA 
TGTAAAACCT 
AAATCTAATT 
GAGCTGCCCT 
CTTCTTGTTC 
TATCCATCCA 
TTGGATGAAT 
CTAAACTAGA 
TTGACGCACG 



ACAAAACGAT 
GCGTATGGCA 
GCACGTGTAT 
GTAAGCAACT 
ATTTGCCAAA 
TTATCAAATT 
AAATCAGTAA 
TTCGCTTTAT 
GTAGGCTCAA 
TCATATGCTT 
TGCTATTATT 
GCTCTTTTTG 
GATAAGCAAC 
GCATTGTCGG 
CTCGTACTGC 
GATCTTCAAG 
TTTGGATTAA 
GTGCCGCTAT 
GTAAACATCT 
TGTGCAGCAA 
AAGTTTGTAG 
TATTTTGTGT 
TTTCGGATtG 
TGGTTATACA 
CAAGTGTCTT 
CCTACGGAAT 
CGTTTTGAAT 
ATCTGATTGT 
TGTCGTATAA 
GCGAACAAAC 
TACACCAGAC 
TTTAGTGATT 
ATCTGGATTC 



TCTTTTCAGT 
AAGGAATTTC 
GCAGTTCTCG 
CTTCATCTGT 
TACACCAACT 
CTTTTGAGAC 
TTCCTAATAT 
TTACTTCAAT 
GTCCTGTAGC 
CTTGTGTTTT 
TACTTGTATA 
TTTCAATACA 
TTCGTAACGT 
ATGCTTTGCA 
TTCACGTACT 
CGCTAAATCA 
TGGTCCAATA 
ACGTCTTAAT 
GCAAATCTTT 
TAACGTAAAT 
TCAATTGATC 
AATTTTTTCA 
CTTTTGCTTC 
CTCGTTTTTG 
GCACTTTTAA 
TAATCTGTGC 
TTTTCCGCTT 
TTTTGCTTGC 
GCCACGTACT 
TCATCAAACA 
GAAAATTGCG 
GAGTACATTT 
TGTTCGACAG 



27360 

27420 

27480 

27540 

27600 

27660 

27720 

27780 

27840 

27900 

27960 

28020 

28080 

28140 

28200 

28260 

28320 

28380 

28440 

28500 

28560 

28620 

28680 

28740 

28800 

28860 

28920 

28980 

29040 

29100 

29160 

29220 

29280 



wo 98/50555 



PCTAJS98/08985 



660 



CTTCCAGATC AACCTTAACC GGTTTTGGCA TACCCCTTAC CCCCTCTCAT GCTTTAGCTC .29340 

GTTTAGTTCA ACTAGCACAG CCACATAATC GGTAACAAAA GAATAACGGT CCTGATGTAG 29400 

TTTTTCTGTC CATCGATAAG ACACATCTAA CTGCAGTAGT TGATGAAGTA AAAAACTTTT 29460 

TTCACGAATT GATGCATCTA GATTTTCTTG TGCTTTCCAA TAACGATTTG TTCCATCGCA 29520 

TAATGAGCTT GATGAAATTT TTTCTCTGCA TCGATTAATT CTGTTGTTCG TTTTAAAAAC 29580 

TTCTTTAAAT AGATCTTTGA CAATTAAATG CAGTCTTGAT TTAACAATGT CTTCTGCCGT 29640 

TAGTGCGGGC ATTTCGTGTG CATAGTTCAT TTTCTAAGCT CCTTTTGTTA AAAAAATAGT 29700 

AATTACCACT TGCGTAAATA AGAATTACGT TTTATAATAA AAAAAGAATT AAGAATTACG 29760 

AAAATCGTAT TCCCACAAAG TTTCAAAATC TACATTAAGT AATTTTGCTA ACATAATTGA 29820 

TAGTTTG)VAA CTCGGGCAGG AACGGCCATT CTCTATATTA CGAATAGTTG TTTCAGTAAC 29880 

ACCaGTTAAC TCAGCTAAAC GTTTTTGAGT TAATGACTTT TTGATTCTCG CTGCTCGCAA 29940 

AGACCCTCTT GTACAATTCA TATGAACATC TCCTCTCATT TAATTTACGT AATTGCCAAT 30000 

TACATAAATG AGTATACAAT GTAATTAAGr ATTACGCAAG AGGTTTATTT GCTATTTTTA 30060 

CAAAAATAAC ATAAAGGAGC TTTAAAATGA CCTCATTTGC GCTGAGATTA AAGGAATTAA 30120 

GAAAAAATAA AAAAATGTCA CAGCAAGAAC TTGCAACTTT TTTAGGTATG AAAAGAGAAA 30180 

ATATTTCTAA CTATGAGCGA GGGGTAATTA CCAATGTATC TAGTGATACA TTAGAAAAGA 30240 

TTTCAGAACT TTTTGATGTA TCTATAGACT ATTTGCTAGG AAAATCAGAT ACATCTAATC 30300 

AGAAAAATGA ATCGATGGCT AAACAAGTAA TGATGCGCAT GGATACAGAT GGACTTTCTT 30360 

CAAAGCAAGT AAAAGAACTA GAGGATGAGA TGGAACGTTT CTTTTCATGG AGAATAGAAG 30420 

AAATAAAAAA AGAAAATTTA AGTGAATAAT TACAAGGTGG TTTTCAGGAT GACAGTAGAA 30480 

GTAGACGAGT ACCTTAACTT TTGCGGTACC ATTAACGAAT TTATTTCAGC ACACATGTTA 30540 

TGTTTGGGAA TGAGTGTTAA TAATTATGAG CATAGATATA TTTGGGATGA AATATTAACA 30600 

TCTAAATCAA TTAAAATAAG GCCCTTTCCA TTTGAAAAAA CAGCACGTAG ATCTATATCA 30660 

GGTATGATTA TTAAAGATGA TTACGAGACT ACGCTAGCTT ATAATTCTAA CATGGGCGAA 30720 

AAAAGAAAAA ATTTCACGAT ATCTCATGAA CTTATTCACG CCATGTATCA TTTAGATAGC 30780 

GAAAATAAGG TATTCACCGA CACAAAAGAT ACTTTATCCT ATTCACTGGC AGATATCTTA 30840 

CCAGAATTTC AAGCGAACAT TGGAGCATCT TCTATACTAC TACCCGAACC TGTTCTTATT 30900 

AATGAGCTGA AAAAAGGAAC TCCTCCTTAT TTTATCTCAA ACCGTTACGG TATATCTGAA 30960 

CAAGCTATTT TTATGAGATT ACTTCAGCAA ATGCAGGCTA GTTTTGAAGC ATCTTACGTT 31020 

GCCGCCTATG ATACGGCCAA TAAGATAATG AACGGCAACT CTAAAAATTT GGCAATTGAA 31080 

TTAGGACGAA ACTTAGAAAG AAAAATTTTA TACAGCAACC CTTTTTATGA AGCAATTACG 31140 

CTATAATGAA TTTATATTAA ATCGAAAGAA GGATGATAGA AATGAAAAAA ATGATTAAAT 31200 

TTGCAGGCAT TGCTCTTATT TTTGCAGCTC TTCTCTCTGC CTGTAGCAAC GCAAAAAATA 31260 
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ATACACAAAA GAAAGCCGAA ACTGCTGCCC AGTCAAGCAC 
ACGAAAACGA GCCTAATACA GAAAACATAA CCCAAGCAGT 
TTAACTCT6A CGAGAAATTA GTAAAAATAG ATGTTAAAAA 
CAGATAACCC TCACGCTGTC ATTACGGTTA AGGTAATTAA 
TGGAAGAAAT GCAGACTGCG ATAGATTCCA ACTCAGGTAC 
TATACGGAAT TCAATTAAAT GTTGAAGAAG TAGCCAAAAC 
TTATTTCTTT CATCACACCT TACACGAATG GGAACGACAG 
AAAATGAAAA TATTATTCCG TTAGTAAAAT AAAAAAGTGC 
TGGATGCACC TTATCAGAAA ATAAACCAAA GGCTTATTTA 
AAGCCTCATT ACGAGGTTTT TATACACCTC CAAAAGAACG 
GTTGAATATG TGGGTAGAAA AGAAAAAGGA TAAAAAAGGA 
CGAAAGATAC ACCGACCCTC GGACCGGTAA AACACAAAAA 
AAATACAAAA CAAGCTGAGA aTGCAGCACG TCGTGAGTTA 
ACTGACCGAC TACGAAGAAG AGAAACGTCT AGAAAATTTA 
ACCAAATGAT AAAACGTTAG AAGATATGAT AAAAGAATGG 
AAAATGAAAA GAACGTAAAA AAGGTTCCAC GCTCTCGGGA 
ACTCTTAAAT GAGTTCTTAG TAATAGAAAA AGATAAACGA 

TGATATACAA GCATTTTATG ACCGTATGAT TTTTGACTTT 
AAAAAGATTC CGTGCGATTA TCCGTGGTGC ATTTACTCTC 

TAAAAATCTT GATGCCATTA ATCTTTCTAA AATAGTAACG 

TCTTATTAAT GAGCAAGTAC CTAGATACTT AGAAAAAGCC 

AGTTTTAAAT GGATACAACC CACTTTATAC GCAAATACTC 

GATGCGTTTT GGTGAATTGA TTGCTTTAGA ATCAAAAAAC 

AGATATTCAT GGAACTTACG ATCATGCAAC ACGTAGTCAC 

GCCAAAAACA AGAAAATCAT TTCGTACTAT T6AGTTAGGT 

AAAAGACC 

(2) INFORMATION FOR SEQ ID NO: 101; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7964 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
AGTGAGCGGG TCACTCTAGA GAAAAGGAGT AGAAAAAAAT GGCGGATTTA AAACGACTGT 



TATTGAAGCT 
TAAACAGTTA 
TAATGTTAAA 
TGATGAAGCA 
AGAGGCACAA 
ATTAGAAAAT 
AACCATAGCA 
ATCCCCCGCC 
ATGCTATTAT 
TAAGTTCCCA 
AATATCACAT 
GTCACCACGA 
GAAATAAAAA 
ATC/^AAAACG 
TTTGATTATA 
TATAGCTATG 
ATCCAGGATT 
GAATATCAAA 
GCCTATAAGA 
CCAGCAAAAA 
GAGGCTGAAC 
GAACTACAAT 
TTTCACGACG 
ACCCGAGGAC 
GATGCTGCAG 



TCAGACAGTA 
GAAGAAAAAT 
GATGACACAT 
AAAAAAAATA 
AAGACTGCCA 
GATAACGATG 
AAATCAACTA 
GGCAAGCTAG 
ATCAAAAAAT 
TAAAAGGAGA 
ACCAATATCG 
CTTTAGCAAA 
TCCAAAAGAT 
AGGGCGTAAT 
TTCAGGATCC 
GCATTGATTC 
TAACTTTTGA 
AGTCCTATAT 
AAAGATACAT 
CGGTGGaTGA 
AAATATTTCA 
TTCACACTGG 
GATTTTTGTT 
AAAAAGTCCC 
TACAGATTAT 



31320 

31380 

31440 

31500 

31560 

31620 

31680 

31740 

31800 

31860 

31920 

31980 

32040 

32100 

32160 

32220 

32280 

32340 

32400 

32460 

32520 

32580 

32640 

32700 

32760 

32768 



60 
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TCATTGGGAA ACCATTGAAA TCAGCCGAAA ATGATGAACA TAAATTAAGC AGATTTGCAG 


120 


CGCTTGCGTT 


GTTATCTTCT GATGCACTTT CATCCATTGC 


GTATGGTACA GAACAAATAG 


180 


TGGTGGTTCT 


GGTTACTTTG 


TCGGCGGCTG CGATTTGGTA 


TTCATTGCCA ATTGCTGCAT 


240 


TTGTTATTAT 


TTTACTTATT 


TCTTTAACAT TGTCGTACCG 


ACAAATTATT CATGCGTATC 


300 


CGCATGGCGG 


CGGCGCATAC 


GTTGTTAGTA GTGAAAACCT 


CGGCAGAAAT GCAGCTTTAT 


360 


TGGCTGGTGG 


TTCGTTGTTA 


GTCGATTATA TGCTAACAGT 


AGCCGTTTCA GTTTCAGCGG 


420 


GTGCAGAGGC 


GATTATATCG 


GCGATTCCAG CGTTATAtGG 


TCATCAAGTA GCAATTTCTA 


480 


TTGGGATTGT 


GATATTGATT 


ACCTTAATGA ACTTGAGGGG 


CCTGAGAGAA TCAGCTTCGT 


540 


TCTTAATGTT 


GCCAGTGTAT 


AGTTTTATCG CAATTATTAC 


CTTACTGATT GTTGTTGGGC 


600 


TATTTAAAAT 


TGTGACTGGC GCACAGCCAT TGAATGCGAC AgcTTTACCA GGAGCCGTGG 


660 


TTCCAGGAAT 


TTCTATTGCC 


TTGGTTTTAC GGGCTTTTTC 


ATCTGGTTCT TCTTCCTTAA 




CAGGGGTCGA AGCAATCAGT 


AATGCAGTGC CATTCTTTAA 


GAAACCACGA GCAAAAAATG 


780 


CTQCTGGGAC 


GTTAGCGTTA ATGGCAACGA TTTTAGGGTT 


CTTCTTCGTG GGGATTACGT 


840 


TTATTAACTA TTGGTATGGG ATTGTGCCAA AAGAAGAAGT GACAGTTCTT TCACAAATTG 


-7 V \J 


GGAAAGCGGT 


CTTTGGTCAA AATATTTTAT ATTATTTATT 


ACAGTTTGCG ACAGCCTTAA 


960 


TTTTAGCGGT 


TGCCGCTAAC ACAGGATTCT CAGCCTTTCC 


AGTTTTGGCT TATAACTTAG 


1020 


CCAAAGATAA 


GTTTATGCCG 


CATATGTATA TGGATCGTGG 


CGACCGTTTG GGCTATTCAA 


1080 


ATGGTATTTT 


AACTTTAGCT 


GCTGGTTCCA TTGTTTTATT 


ATTGATTTTC CAAGGTTCCA 


1140 


CAGAACGTTT 


AATCCCATTG 


TATTCCATTG GTGTGTTTAT 


TCCGTTTGCT TTGTCTCAAT 


1200 


cGGGGATGGT 


TGTAAAaTGG 


CGAAAAGAAA CTAAAAATTG 


GTTGCCAAAA TCAATTGCTA 


1260 


ATATTGTCGG 


CGCCTTTATC 


TCTTTCGCGA TTATCGCCAT 


TTTGTTTATT TATCGATTAG 


1320 


GGGACATCTG 


GCCGTTCTTT 


ATTATTATGC CAGTGTTGAT 


TTATGCATTT TATCGGGTCA 


1380 


ATACTCACTA 


TAAAAATGTG 


GCGGAACAGC TACGTTTAGA AGATGGGGCA CAGTTACATG 


1440 


AGTTTGACGG 


GAATACAGTT 


ATTGTTTTAG TAGGGAACGT 


TUVCCAAGGCA AATGTCGGCG 


1500 


CCTTGAATTA 


TGCGCGTTCC 


ATTGGCGATT ATGTCGTTGC 


CATGCATGTC TCAATGGATG 


1560 


AAAATGTTGA 


GAAGGAAAAA 


GAAATTCAGG AAGAGTTTAA 


AAAACACTTC CCGGACGTTC 


1620 


GTCTATCAAT 


TGTCCATTCT 


TCGTATCGTT CACTGCAAAA 


TCCTATCTTA CGTTATGTTG 


1680 


ATTTGGTTAG TAAAAATGCC ACGAAGCATA ACTACAGTAC AACGGTTTTA GTGCCACAAT 


1740 


TTGTACCAAA 


TAAACGTTGG 


CAAAATATTT TGCATAACCA 


AACAAGTTTA CGCTTGCGGA 


1800 


TTCGTCTAGC 


GTGGCGGGAA AATATCATCG TGGCTACCTA 


TA6TTATCAC TTGAAAAAAT 


1860 


AAATAAAAAA ATCTGCCATC TTTTGATGGC AGATTTTTTT ATTGACGTAA TTGTTTTTGA 


1920 


TAAATTCTAG CGCAAGACTG CTCTTCGGTG GTTACCATTG TCAAAAATTC CCAGTCATAT 


1980 


CTTTCATAAA AAGTAGTATG GTCTGTGAGC AAATAAAGCT TTTTAAATCC AGAATTAGCT 


2040 
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AGAAAATCAC 
TCGACATACA 
ATTACGCCCA 
ACGTGGTTTT 
TGCCACTTTT 
TCGATAATAG 
AAGAAACAAT 
CTAAACGCCG 
GTAAGGCACC 
TATCCCCTAA 
CTAACGCATT 
AAAGAAGTTG 
CTTGAATCAT 
TTACAACCAC 
AAGCTGGTAC 
TACGTGAAAT 
TCGAAATTaA 
AGTAGATAAA 
ATTTtCTTTT 
ATAGTTCAAT 
TTTAATTTCC 
CGGGAAGAAG 
TGTTGAGTAA 
AATTTGCATT 
ACCGTCGCCT 
TTTCCCTAAA 
GAATACATAA 
AAATTCTGAT 
CGTAGTAGTC 
CATTGCTTGG 
TGTGTAATGC 
CCCACCTGTA 
ACCTGCCAAT 



CAACTGTTGT 
ATGCACAAAG 
CGCCTGCAAC 
CGCTAGTTAG 
GGCTAAACCA 
TATAGTTAGA 
GTTGGATCTG 
TTCAATAATC 
GACTAAAATG 
ACCACCAGCA 
TCGAATACCT 
TACTTTTGTC 
GCCAGTGTAC 
CGTGTTGACG 
TGTTTGGATA 
CATAATCCCA 
AAAGTGACGT 
TTGTTGGAAT 
AAGAAaCGTT 
TCTTGCATTT 
GGCGTTTCTT 
CGCTTGTCAT 
CCTAAGATGA 
GGTAAAATAG 
TCACGTGTGA 
TCACTGACTG 
GTATTTTCAA 
TGAACTAGAT 
ACGTCAGTAC 
TGATTGATGG 
TCCACCATAT 
ATGGCAACAG 
AAAAGTAAAC 



TAAAAGAATT 
GTTTGGCGTA 
GATGTCCTGC 
CATTTCCTGC 
TTGAGCAGCT 
CATTATTGAT 
CTCTGTATTT 
CCTAGAACCC 
ATAGAAGCAC 
CCAACGAAAG 
GCCATGATTA 
ATACCCATTC 
GTATTCTTGA 
CCTAAGCCCA 
ATGTTCGCTA 
ATAGGaATCC 
AAAAATTGTT 
AAATCATATT 
CGGCTACGAC 
TTTTGGTAGA 
TTAATAATTT 
CTTTTAACAT 
CGTCCATTTT 
ATTTAAATTG 
TCCACGCAGT 
TTTTTAAATT 
AACCATATGA 
TGAACGCTTT 
CAGTATAACG 
TAGTAGTGGC 
CCGCTACAAT 
TGTTATCATC 
TGCTTAAGAA 



663 
CGTGCTAAGC 
AGGTCTGCAC- 
TGTTTATTCA 
ATACTTGCTT 
GACGGAATTA 
CTCCTAGATA 
TTTATTAATT 
AATCTGTGAT 
CATCTGTGGC 
CACCAATCGC 
CTGAAACAGA 
CTTTACCAAC 
TGATTGGTAG 
TTCCCAACAT 
AACGGATAAT 
CGaCAATTGC 
GaAAGACGTA 
ATTCAtCTGk 
AGCAGGTTCA 
AATTTTACCA 
GTCACTAACC 
GACTAAATCG 
TCCAGCGTTG 
GAAGTTGTAT 
ATCGACACCA 
GTATTTCTCA 
AGGGAACCAA 
TTTAGGGTCT 
AGCTGCAGAA 
TAAGTTGTTA 
GCTCCCTAAA 
AGAAGTTGAA 
GAGTAAGCTT 
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CTTGGTGACG AAATGTTGGT 2100 

GATTGTGGAA -GTCATTATCA 2160 

AGCAGACCCA CCAATGTGGT 2220 

GATACGCTTC AGCGGGGATT 2280 

AGTGTGGTTG TTGGTTTAAA 2340 

AAAAAACAGA TCCCATTGAA 2400 

GGAATGTTTG CTAGCTGGGT 2460 

AATCGCCATA AAGGCCGTTG 2520 

ATTGGTTCCA CGAATAATAA 2580 

AGTAATCCCG ATTGCGACGA 2640 

T7U\AGGTAAC TCGACCATAT 2700 

ATCTAAAATA TTGCGATCCA 2760 

GAGAGAATAA AGGAAGACCG 2820 

TAAAATCGAA AGCATTGCCA 2880 

CCAGTTGGCT AAGCTACGTT 2940 

AGCAAAGATT ACGCCATAAA 3000 

GCCGCCATTT tGTTGGaAAT 3060 

TACTCCCCCT CAAAATAGTT 3120 

ATTAAATCAT TATCTGCTTG 3180 

TCTAAGCGAT TT7UVGACATC 3240 

ACAGGAGCGG CATCATAAGG 3300 

TAACTACCAA TCCGGCCATC 3360 

ACTGCATCGT ACACCAAGCC 3420 

TCCTTTTTAA AGCCTTCATA 3480 

GCAACTAATT TGTCTGCGAC 3540 

GGCGTTTCTT TTGTAACTAA 3600 

GTTTGTTGAT AACGTTTTTC 3660 

TTTTCCGCTG GTAAGCCAAG 3720 

ATTGAGGCAT CTCCGTTTAA 3780 

ATCACATTTG TTTTTTTATC 3840 

ATCTGCGCTT CTGAAGTAAT 3900 

GCGAGACCAG GTAAAGAACA 3960 
AACTTGAGTT TCTTTTTCAT 4020 
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CATTATTCGC 


CCTCCCTTAA AGCTTTTGGC 


GTTAAACGAT 


TXTCTAATAA 


TCCAAGAAGA 


4080 


ACATCTGCTA 


AAAGCGCTAA 


AATAGTTACC 


GGAATAGTCC 


CGCCAAAGAT 


TAATTCTGGC 


4HQ 


TGATAGTTAT 


TTAATCCACT 


GAAAATCATA 


TCACCAAGAC 


CACCTGCACC 


AATATAAGAA 


4200 


GCCAGTGTAG 


CCCAAGCAAT 


GACATAGACA 


GCAGCTAAAC 


GAATGCCGGC 


CATAATTGTT 


4260 


GGCATGGCAA TGGGTAATTC AACACTAAAA ATAGACTGTA AATTGGTCAT GCCCATACCT 


4320 


TTAGCAACAT 


CTCGGTAATT 


TTCATCGACA 


TTTTTCATTC 


CGATATACGT 


GTTTCTTAAA 


4380 


ATAGGTAGAA GGGAATAGAT GAATAGCGCG ATAATAGCGG 


GTGTTTTGCC 


GACACCAAAA 


4440 


ATTGGAATCA TCAAAGCTAA 


TAAAGCCAAC 


GAAGGAACCG 


TTTGTAAGGC 


ACTTGTTAAA 


4500 


CCAATGACAA 


TATTGGCTAC 


TTTGG6CAAA CGTGTTAACC 


AAACGCCAAC 


GGGCACAGCG 


4560 


ATAGCGACCC 


CTAATAAAAG 


TGCAAAGAAT 


GAAATATAAA 


TATGTTCAAT 


ACTTTTTGAA 


4620 


ATCATTTCTG ATCCATATTC TTGAAAGAAG CTACTCATTG yTaCGCCTCC 


TCTTTATCAG 


4680 


TTTCGGACTC 


GCTTT6TTTT 


GCTTCTACAG 


CTTCACTAAT 


CGTTGTTTCA 


TCGCCCCAGA 


4740 


TAACATCATA AACAATGTCA ACAAGGGAAG 


CACGTGTTAA 


GATGCCCACC 


ACTCGTTTTT 


4800 


GTTCATCAAC 


GACTGGCACA 


TATTTTAAGC 


CGCGTTTTAA 


GATTCTTTGT 


AAAGCATCAC 


4860 


GTAAAAGAGC 


GGTTTTTTGA 


ACAAAGAAGA 


CATCTTTATT 


CAAGATATCA 


CCGACACTAG 


4920 


AAGCTTTGCC 


ACGTTGCTGA 


TCTAATGTTT 


CAACATCGAT 


AAAGCCTTTA 


AGCACATGGG 


4980 


AATTATCAAC 


AAGTAATAAG 


GTATCAACCC 


GTTTTTCACG 


CATCAATTTA 


ATCGCTTCTT 


5040 


GTAAAGATTT 


TTCCGGCGTG 


ATAGTAATAG 


CTGAATTTAA 


CATGACTTCA 


TCAACAGTGG 


5100 


TGAAATCAGG 


TTTGGCTTGT 


AATAGGCGAT 


CTTCTCCGAT 


GAGTTCTTCT 


ACAAATTCAT 


5160 


TGGCTGGATG 


ACGTAAGATA 


TTATCTGGTG 


TGTCAAATTG 


AATTACTTTG 


CCTTCGCTCA 


5220 


TAATGGCAAT 


TTTATTTGCT 


AATTTCAACG 


CTTCATCCAT 


ATCATGGGTA 


ACAAAGACAA 


5280 


TAGTTTTTCC 


TAAACGTTCT 


TGCAAATCTT 


TCACTAAATC 


TTGTAATGAA 


TCACGGGTGA 


5340 


TAGGATCTAA 


AGCACCAAAA 


GGTTCATCCA 


TTAGAATAAT 


ATCTTGATTG 


GCAGCTAAAG 


5400 


CCCGCACAAC 


GCCGATTCTT 


TGTTGTTGTC 


CGCCAGAAAG 


TTCATTGGGA 


TAGCGGTCCA 


5460 


ACATTTCTCG 


TGGTAACTCA 


ACCAAATCAA 


TCATTTTTTC 


GGCTATCTTA 


TTGCGTTCTT 


5520 


CCACAGGTAC 


TTTTAATAAT 


TTAGGCACTA AAACGATATT 


TTCACGAATG 


GTCATGTGAG 


5580 


GCATCAAGCC 


GATATTTTGA ATGACATAGC 


CAATTTGGCG 


ACGTAGCTCT 


ACAGGATTGA 


5640 


TTTTTTGAAT 


ATCTkGGCCa 


TCaATTAAAA 


TTTTTCCTTT 


TGAAGGGTCG 


GTCATGCGGT 


5700 


TAAGCATCCG 


CATTGATGTT 


GTTTTTCCAC 


TACCACTTGT 


TCCGATGAAG 


CAGATAAATT 


5760 


CACCTTTATC 


GAAAGACAAA 


TTAATGTCAT 


CAACGGCGAC 


TTTTCCGCCT 


TTATAAAATT 


5820 


TTGATACATG 


CTGAAATTCT 


ATCAACTTAC 


TCACCTCAAC 


TAAATATTTT 


GAAAATGTAT 


5880 


CTTATTACTA 


GTATGCTATA AAAATGCGAT 


TTCGACAAAT 


GTAATCTATT 


TTTAATGGGA 


5940 


AAAAGAAGGC GGTAAAATTT TTTTAATTCG TTCTAAAAAA AAGAAAAGAT AACTATTGTT 


^000 
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TAGTAAAAGT TATCTGAACG 


AAACAAGATG 


TTAAAAAGAG TATTCTGAGA AGGAAGAAAA 


6060 


GCCGCTATAA CAAAGAT^G 


TGTCCTTTGT 


AGCAAGTTGA CGGGAGCTGA GTTTTTCATT 


6120 


AGAATGGAAA GAGAACAAGG GAATAAAATA AAAAAAGGCG CTTCATGATT TAGCGGCAAC 


6180 


TAAATCACGA AGCCTTGAAT AGTCACCTGT GCAAGTGGAC CATCAAGTTG CCCGTTCAAT 


6240 


CAATGGAAAC CATTGATCAC 


TTTTATTTTA 


CTGAATTTTT GAAAAAATTA CCAGTTTTTT 


6300 


TAAGAAGTAA GAAGTGGAAG AACGAAGGAT GCTTTTTATG GAATGTGCAA TGACGTTTAG 


6360 


AAAAATCTGA AGGTCATTTT 


TTATTCTCTT 


GTTCAAACAT TCT^CATAG GGGAAGAGAG 


6420 


ACGGACGTAA TTTTGTCACC GATTATTACG TCCGTCTTCC GTTGTTTTCT AAAGCGTTGG 


6480 


TTACATGTCT TTGTAGAGGT 


TGGCTTCAAT 


TGCAGTCCGT TTTGGTTCTA AAAAAGGCGG 


6540 


TAAAGCCAAC GTTTCCCCTA AATGTTCCAG 


AGGCTCATCC AATGTAAAAC CAGGAGTCAA 


6600 


AGTAGCAAAT TCAACCCGAT 


TCCCGCCTGG 


TTCACGAATG TATAAACTTT TAAAATATCC 


6660 


ACGACTTACA TACATTTCAA 


TTTCCCAGCC 


TTGTGCTAAA GCGCGCGTGT GTAGTTCTTG 


6720 


TAATGCTTCA TCATCAGCCA 


CAGAAAAAGC 


AATATGATCC ATACTGCCAC GCCCCATTCG 


6780 


GCTTGTTTTA TCCGACGTTG 


TTTGATGAAC 


AGAAAAGCTT TCCGTCTCAC TTAAAATAAT 


6840 


CGTATTGCCA ACTGCTTGCC 


AACCAAGAAA 


TGTTTCAAAA AAGGCCAAGG TTTTTTCAGA 


6900 


ATCTGGCACA TGAAATTCAG 


TAGAGGCCAA 


ACCGATAATT TGAACCGCTG CTGGAATGTC 


6960 


GTTAGAAACA ACGAGTGTTT 


CTTTTAATGG 


ATTTTCAGGA ACTTCTGTCA GGATAATATG 


7020 


CGTATCATCT GGATCACTGA 


AATGTAGGCT 


CTTTTCTTTT TTGTT7\AATG GAATTTTGTT 


7080 


TTCTGTTAAA CGTTGTTCCC 


AAAAGGAAAG 


GCTGTTTTGG GGAATTTTTA AGCCAATCGT 


7140 


GGCTAAATAA tTTTTTTCTT 


CATAGCGGTT 


GCCTAATAAG GGCACGATAA AAAAAGTTAC 


7200 


TAAGCTTCCC GGCGTGCCTT 


GATAATCGCC 


ATAAAAATAA TGAGGCATGC GATTGTTGTC 


7260 


TTGATTTACG GTATTTTTGA 


CAAAACGTAA 


GCCTAACAAT TTTGTATAGA ATGATTGGTT 


7320 


TTCTTGACCA TGACGCGCTA ATAGTGAAAC ATGATGAAAT TGAGCATGCA TTTTGAGTTC 


7380 


CTCCTCTAAA TAACTATGTA 


TACTTACTTA AAGTAAGTAT AGTGCTTCTG ACATTTTTTG 


7440 


ACAACTATTT A6TTTGATTC AGAAGGCTGA CATAGCCAGA AACmAGAAAA AGTGTTAGAA 


7500 


TAACCAAGAT wAAAACTAGT AGAAAGAGTG ACGGAAATGC TAAGTACaGA AGATTTTGAT 


7560 


TTTGATTTAC CAGAAGAATT 


AATTGCTCAA ACCCCACTAA AAGATmGCGC TAGTTCACGA 


7620 


TTATTAGTTG TAAATAAAGA 


AaCAGGGGAC 


ATGGAGGATA AACATTtCCA TGATATTCTT 


7680 


GATGAATTAC AGCCTGGGGA 


TGCATTAGTC 


ATGAATAATA CACGGGTACT GCCGGCCCGT 


7740 


TTATATGGAG AAAAACCTGA 


AACAGGCGGT 


CATTTGGAAG TTTTACTTTT AACAAATACA 


7800 


GAAGGCGATA CATGGGAAAC 


CTTGATTAAA 


CCCGCGAAAC GTGCAAAAGT CGGCACGGAA 


7860 


ATTCAATTTG GCGATGGTCG 


CCTAAAAGCG 


GTGGTTAAAG AAGAGTTGGA ACATGGTGGT 


7920 


CGAATCATTG AATTTAAATA 


TGATGGTATT 


TTCTTAGAAA TTTT 


7964 
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(2) INF0R^4ATI0N FOR SEQ ID NO: 102.: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7215 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

AGAAGTGAAA AGAGCTTTTG ACTTACGCTn nTATTATGAG GAACGAACCT TTCAGGCTGG 60 

GAAAATAAGG AGGTATCAAA AGAATGAATT TGTTCAAaAA GArACAATCA AAGArACAGA 120 

CGTTATCTAG TAACGTTGAA AAAGTAGATA AAAAAGTAAA AAAACGTGTA CAGCAACAAG 180 

CAAATCCTAC CGTACAGGAG AGTATTTTTT ATACGAGTCA GTTTGAAGAA GnGCTCATGC 240 

ACATTGTAGA GGATGAGTTT TCTAAATGCT ATAAACTtGG AGAaGTGGAC TACGAAATTG " 300 

CTGTAGAAGA AGaTCmATTA AACaCGGTAA TGGGGTATGC AGAAGGGTTA AATACACTAG 360 

ACAAAAAATC TCGGTATCaG CTGCTGGTTG TTAATAAAAG AGAAGAAGAG TCGTTGATGG 420 

AAGACGTTTT ATTGGCGTAT CaGGGAGATC ACTTAGATAA cTACCGTCAA GAAATCAATG 480 

AAATTATAAC GAAGCAGTAT . GAACAAGACG AACGAAACTT TAAGATTGAA AAATATGCTA 540 

TTTTCACCAC AAAAAGTTCG TCCATGAAAC AAGCTAACAA AGAGTTAGAA GGAATCATGC 600 

AGAACTTTGC CAATCGTTTT GAAGCCAACC AAATTAAATT AGAGGTAACC CCAATGACTG 660 

GCTTAGAGCG ATTGTCTTTG TTCAGCTATT TTTTGCGCCA CGGACAATAT TTTGGCACAA 720 

GCTATCAAGA TATTGCCGTA TCAGGGCTAA CGTCTAAAGC GTTTATTGTT CCAAGTAAAA 780 

TCAAGTTTCC AAACAATAAA GCTTATTTTC GGTTAGGAGA 7UUVCTACGCA GCTGTTTTAA 840 

CCATTCAACA ATACCCAAAA TACCTAGAAG ATAAGTTAAT CAAAGAATTA TGCGCTACAG 900 

GAAGAGAGTT AGCGATTAGC ATTCATGCCA GACCTTATGA CATGATGGAA GCCAGAAAGG 960 

CAATACAGGG AACCAAGACG TTAAATGATG TTGCCATTCA AAAGCAACAG AAGCAAAACT 1020 

TCAGAAGTGG TGTTTCAGAG GATGCGATTT CAGGAGAAGC AAAGGAAATC CAAGAAACAA 1080 

CGAACGCGTT AATGGATGAA ATAAAAAATA ATGGCCAAAA GTTATTTAAT GGCCTATTTT 1140 

CAGTTTTCCT AATTGCAGAA ACAGAAAAGG AACTTTTAGA AGCTGTTGAA TCGGTGAAAA 1200 

ACGTAGGAGC AACCTGGCAA GTTGTGTTTG ATACGGTAGA CAATTACAAA GAAGAAGCAC 1260 

TCAATACTAT CTTACCGATT GGAAAACCTT ACTTAGACGT TGAAATGAGT TATATGCGtG 1320 

ATCTAACmAC mACmAATATt GCTACACAaG TACCTTTtAC CAATGTGGrA TTACAAgCCC 1380 

AcAGGTCAGT TTtATGGCCG GAaTCmGTTG ACGAAGAaCA TGATtACTAT tGrCCGAAAA 1440 

AAaG%^TAAT TwCACCAgTG GGCTTATTTc GGAcGcyGGT tCTGGTAAAG GAtGGCGACC 1500 

AAATGGGAAa TTCTTtCAaC GCGTTtAAAA TtTCCGAAGa TCGCTTTTTA ATTGTTGATC 1560 

CAGAAAGTGA ATACTTACCG ATTGGGAAAG AATTGGATGC AGAAATACTC GATATCTCaA 1620 

CTGGTACGAA AAACCACTTG AATATTTTAG GAATGGTAGA CAAGCAGTTA TTGGATGAAG 1680 
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AG6ACCAGGA 
CGCTACTAAA 
CGTATGAACG 
TAAAAGAACA 
TGGGTTCACA 
TCTTTAATAC 
ATCAGTTGTG 
ATGAATTACA 
CGCGTATTCG 
TAGACAGTAA 
AAAAaCCAAT 
AGTATaTCmA 
TTCCGTTTGA 
CTTAGGATGT 
ATCTCAAGTA 
tTGCAAAAAA 
AAGCAATCAC 
AACCATTTAC 
ATTGCTTACA 
GAAACCAAGG 
GAAGATACAC 
ATGCGTACAG 
GATACGTACG 
ACACCAAAAG 
CGCTTTAAAG 
TTTTTTAATG 
TCTTGGGTTG 
ACACAAAAAC 
ACTAAATTAG 
GTTCTTTTCT 
GGTACAAAGA 
CCTGATTATC 
ATCGAaGAGG 



ACTTGATTTA 
AAGCTATACG 
CTTTGAAAAC 
AACGGATGAA 
AGATATCTTT 
AAAAAAATTA 
GAAACAAGTA 
AGTCAATTTC 
TAAGTATGGC 
AGCGGGGCAA 
TGATTCACAA 
TGATCGAGCC 
AAATCCTATT 
GAGGACATG6 
GAAGAGCGTC 
AA6CATTAAA 
TAAAATU^CA 
TGGATAAAAA 
AACGAACGAA 
AAGGGTTGAA 
TGAATGAAGC 
CTTTGAAAAC 
GATTGGGGAA 
ACTTTACGTT 
CAGCAAAAGA 
GTCAGAA6AC 
TTTTATTGGT 
CAGCGATTGT 
ATGCGCAACA 
ACATGAATTA 
ATTTTGAAAC 
AGTTGAAAAC 
AACAAGCAAA 



GTTAAAGAAA 
GACGTGGATG 
AAAGAGACAG 
GGTGCCAAAG 
GCGCACCAAA 
GATGATCGCT 
GTATCCGGAC 
ACAACGGaAG 
GCCGTGACAA 
AAAATGATTG 
CGATTAGTAG 
CCACAAGGrA 
CCAAAAGAAA 
TAGAAAGTAA 
TTAGAAAAAA 
GACGACTAAA 
TAAACTTACC 
GAAATTTGTA 
AAAAGTAGAT 
AAGAGAAGCG 
AATGACCCTT 
AGGAAAAACG 
TCGCTTATTT 
AAAAAAACAA 
AGCGAAGAAA 
ATTAGGGAAA 
TCTTTTTCTT 
TCAAGAAGAG 
TACAGACGAT 
TCGGTACGAT 
GATTCTATCT 
AATGCAAAGT 
GCATTATCAA 



AATCTAATTT 
CGGGACTTGT 
AGCCAACATT 
AATTAGCGAA 
CGAATATCGA 
TGaAAAAGTT 
AAGGAAAAGT 
AAGAAGCAGA 
CAGGTATCAC 
CCAATAGTGA 
rAATTTTAAG 
CGGGGcTAAT 
CACATCTATT 
GCGTACTTGG 
TTTGACCTTA 
AAGGAGTTTA 
TTGAAAACAG 
GAGCAAAAGA 
GAAACGCGTG 
ACGCAAAAAG 
TATGAAAAAA 
GTTAAGAATC 
AACTTTTCAA 
CTTATGAAAC 
GCCGAACAAG 
GTCGCTGCAT 
GTTTTTTTAT 
GAAGATTTAA 
AACAACCTGT 
GATTTTAAAT 
GAACTTTGGA 
TTAGAGACAG 
GAGaTAAAAA 



GTTATCCACT 
TGATCGTGTG 
AGTCGATTGG 
TAAAGTGGAA 
CTTAACGGCA 
TGCCATGAAA 
AACGTCaCGT 
ATGGTTCTTA 
GCAAAACCCT 
ATTTTTGATT 
TCTGACACCT 
TtCTGCAGGT 
TGAAATTATG 
AAAGAAAAAG 
AAATCAAGAG 
AACTTTCTAA 
GGAAGGAACC 
AACGCAAAAA 
TCGTTTVACCA 
TACGAACCAC 
CACAACAAGC 
TTTCGGTGAA 
GAGGACGTGG 
AACTACGGAA 
GTATTGGCTT 
TGATTCTAAA 
TGAGTGGCGT 
CCGCTTCATG 
TTTATAGCAA 
TATTAGATAT 
CGGCACTAAA 
ATAAGAAATC 
AAGAGTTAGG 



ATGTTTGAAT 
ACGTTGTTGA 
TTTAGAATCT 
ACCTATTGTG 
CCCTTTATTG 
GTGATTTTAG 
TTGTATTTTG 
AATTTATGGG 
ACGACGTTAT 
TTATTACGGC 
GCATTgAAGA 
GGGGTAGTTG 
AATACCGACG 
TACATGTTGA 
AAGCGACGAA 
GAAAAATTAC 
ATTAAGTATT 
AGCACAGAAA 
AGTGAAAAGG 
GTTAACGCAG 
TAAATTTAAC 
AACTGCAAAA 
GTTTCAGCGC 
TCGCGCTATG 
GATTCGTTCT 
AAATCCAATT 
TGCAAGTAGT 
GACGTATTTT 
TATCGATGAT 
GGATTCCACT 
CGGAAAGAAA 
GAGTTATTTT 
CTACCAAACA 



1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 
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TTGGATGATC TCCTTTCTTT TCCAGTAAAA ACTGACGCTT TAATTGTCaA TaAGCGTTAT . 3720 

GGCTATGACA AtCmAAAGAG aAGCTCACCT TATATAAAGg CATTGATGTA TTGATTGAAG 3780 

ATAATCAGCC GTTTCATTCA CCAATGAaCG GCCAAaTCGT TTCAGTGCCT GATACaGAAA 3840 

CGCTGGTTAT CGAAAAaGAG AAAGTAGCTA GGTTAACGAT TCGTGGTGTG AaTACCCTTC 3900 

GACTAACAAA AGGTATrGAT GTTGAAGAAG GAACATTTCT TGGGAATACA AAGAATTCAA 3960 

CAGTTACGTT CCAATATGAA AAATATAAAA AAGAAACGAA AGACTGGTTT TTTGTAAATC 4020 

CAGCCTTTTA TTTTCCACGT GTAACCTACA CGCAAACGAC TCTTTTAGGA AGTGCAGAAT 4080 

TTTCTCCTGG AGCAAGTGTC GAAAAAAGAG CGCAAGCTGT TTATGATTAT TTATCAAAAA 4140 

AGGGATACAC CAAAGAGGGA ATTAGCGCGA TTTTAGGCAA TTTTTCGGTA GAAAGTGGCA 4200 

TTAATCCTiAA ACGTGCAGAA GGGGATTATT TGAATCCGCC TGTAGGTGCG CATGGGAATT 4260 

CATGGGACGA ACCGAGTTGG CTTGCTATGG GAGGACCGCA AATTTATGGA GGACGTTTTC 4320 

CAAATATTCT CCATCGAGGA CTAGGTTTAG GTCAGTGGAC GGATACAGCT GATGGTGGTC 4380 

GTAGGCATAC TTTGTTATTA GACTATGCCA AAGGTAAAAA TAAAAAATGG TACGATCTGC 4440 

AATTGCAACT GGACTTTATT TTTGACGGCG ATGCACCAGG TTCACGAACA GCAGCAGACA 4500 

ATGTAGCAAG AAGT7\AGGTG GCTGCAACTA TTCCAGAACT TACTACTTAT TATCTAACTG 4560 

TTTGGGAAGG AAATCCCGGA GACAAGTTAG GAGAACGAAT TCAAGCTGCA CAAAATTGGT 4620 

TTACATTCTT TTCTCGATCG GGTACACCAA TCGGTGGTTC TGGGAGTGAG GTCTTTGCGC 4680 

AGTATAAAGA AAAAATGCAA CCATTACCCA CAGACAGAGA AmCAAAaGCA GGACaAGGCT 4740 

GGCCAGGCmA TGCGTATGCG CCAGGGAATT GTACTTGGTA TGTGTTTAAT CGTTTgCACA 4800 

GTTAGGAAAA AGCATTTACC CTACTATGGG AAACGCCAAT CAATGGGTGC ATAACTACAG 4860 

TCmAACGCCC GGAGCsACAC TTGAATCGGC CCCTAAAAAA GGGGATGCTG TTATTTTTAC 4920 

AAATGGTGTA GCTGGTTCTA GTACGCAATA TGGACACGTG GCTTATGTGG AGCATGTAAA 4980 

CTCTGATGGT AGTTTTGTCA TTTCTGAAAT GAATGTCAGT GGTGAGTATT CTATGAATTG 5040 
GCGTGTCTTA AAAAAAGAAG CAGGCGAATA CTTTATGCGC CTAAACTAAA AAGAAAGGAA 5100 
CCAATAAGAA CAATGTATAA TATGG7VAAAA TTAAGTAGAT ACAAGAAAAT TGGCTTTATT 5160 
GGTGGCAGCT TACTGCTTAT TGTGCTTATT TTTTTGATTG GGCTTAGTGT TGGGCAAAGA 5220 
AAACAAGTAA ACACCAATGA AAAACAAGTA AAAGTCGAGA AAAAAGAGGA ATTAACAACA 5280 
AGTACGGTAA AAAAATTTTT GATTGCGTAT TATACAAAAA AAGATCTTGG TGAAAATAGA 5340 
AATCGCTATG AACCTTTGGT GACAAGCGCC ATGTATAACG AATTAGTGAA TGTAGAAAAA 5400 
CAGCCTGTTA ATCAAGCGTA TAAAGGCTAT GTAGTCAATC AAGTACTTGA TACGTATAAA 5460 
ATCTATATTG ATACTGAATA CAATGAAGTC ATTGTGGACG TCACGTATAA GAATACACAA 5520 
CGAACCAAAC GAAATAATGA T6AAGGAGCG TTGAAAAATC AGAGCAATCA AGAAGCGATG 5580 
AAATTAACCT TTGTAAAACA gGCGCTAATT TTTTAGTGGA TAAAtGGCAC CAGTAACACT 5640 
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TACCAATGAA TTmCAAGAGG AGCCAAATTC ' TTATAATACa CATGTAGTAA CGACAGAAGA . 5700 

ATCAACAAAG GAGAGTGCAA ACAGTGGGGA AAAGTAAAGT AAGTGAATTA GAAAAAAAGA 5760 

AGGATCGAGT GGAGAAAGAG CTTAATAAAG CCAATGAACG TCTCGATACA TTGAAACAGT 5820 

CGATTGCTAA AAAAGAGAAT GAATTGAAAC AAATAGAAGC GGAAATTGTC AGtGCTCTAT 5880 

TAACAGT^AAA TAATTTATCT TTTAGTGAAT TGACTGCGTT GTTGAGTGAA AAAGAaTCAG 5940 

GTGGTGAAGC mAGTGTTTAT AGGGAaTAAA ACAACCTATC GCTATGAACT TCTTCATTCA 6000 

AAAAATATTA AAGGCGAGGC GATCGACTCT ACCGCTACAC TTGTTCGTTA TGTGATTGGC 6060 

GTAAmCAaCG ATCTTATTAA GAAGGAAGAA GAAGGATGGC TTTATGTCGA AAAAATAAAA 6120 

GAAGGAAAAC AaGAGGACCA AATCCTTTAT GGCAAGCGTA TAGATTTACC AATTTTAGAA 6180 

GACAATCtTT ATTTTCATGA TCTTTTAGAT CCGTTTGATA CAAAAAAAGT CATTCCTTTT 6240 

GTAGAGCTAT TGCCTTCAGC TGTAGAAAAT CAGGTAAAGC CAGAAGAAAA AATTAAAGAA 6300 

CCACAGGATA TTGTAGAGTT AGAGAAAAAT AACAAAACCA CTATTACTAC TGAAGAAAAG 6360 

GAAATCACTG AATTAAAAGA GCAACCAGAT GAAATGGCCG ATAAAGAAAC ACAGAAGAAA 6420 

ATTCTGGAAC AAAAGATTCA GTTTAAAAAA GACGAGTTAG CCACATTGAC CGAGGAGATT 6480 

GCGACTAAGC AGGAAGAAAT CAATGACTTA GAACAAAAAT TAAGTACATT TTCAAAGAAG 6540 

CCAGAGGGAC TTCATTTATA AAAAAGAGTT TTTAATCAGC TAGTAAAAAA GCyTCTTCTA 6600 

GCTAAGAAAy AAGTATTATC ACTTtmCCCT TACAGTTGTT TATGGCCTAT CAmAAAAGTA 6660 

GAGAAATAAA TVAAGACTaGC AAAAGTTGCT AGTATTTTTT TTGTAAAGGA GGAGAATTTT 6720 

GGTGGAACAA AGAGAAATTG TACACCGCTA TTATGTTTTA GGTACTAGTA CTTTAGATAA 6780 

AGTATTAAAA GTGCTGCTTA ATTTATCCAG TGATGGTGTA TTGGCGCTAA AAGACAAGTT 6840 

TCTTCCTTTA AAAGGGGAAA ATAATTTGTA TAAGCTAATG AATCGTGAGG ATCCACTGAC 6900 

TTCGGCGCAA CTTCATGAAA AAGTGAATTT AAATAAGTTA AAAGAACAGT TGGAAGCCCA 6960 

AGGGCTTCCT TTTGCGTTTA AAGAAACCAA AGAAGGGACT AACTTCTATT TTCGAGTAAA 7020 

AGATACGGAA CTCGCAAAGA AAGCCTTAGA ACGCGTATTA ACGGATATTA AGAAGAACCC 7080 

ACAAATGATT TTGCGTAACC AAATACCATG ACTTTTGATG AACGGTTAGC CTACACTGCA 7140 

GCCAATAAAA AATACGTGGG tAAAAtCGaT CAAACCAAAA GCcAtCcCcA AAGGGAAGGA 7200 

AACTGTTGAT TGCnG "^215 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5504 base pairs 

(B) TYPE: nucleic acid 
ic) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
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TGTTCGCCTA AGAGATTCAC CATAACCACT GATTGAAGTA ATTGGGGTAA CCCAATGCCC. 60 

CAATTGCTAA TCCCTCGAAT ATGCGCATCA AATTGACTAA ATGAGCAAGC TTCAATAGAG 120 

TAATGACCTG AGTTATGAGG CCGTGGCGCT AATTCATTGA CGTACAACTC TTCTGAATCC 180 

GTTAAAAACA TTTCGATAGC TAAAACCCCA GATAGATTTA ATTCCTCAGC AATGTATTGC 240 

GCCAATTCCG CTGCTTTTTG TTGCACATTC TCAGAAATTC GTGCTGGCAC AATTGATTCA 300 

TGCAAGATAT TCTTTCGATG AATGTTTTCA GATACAGGAA AGCTTTGGTA ACCATGCTTA 360 

TTCCCTGCAA CGATGATTGA AAGTTCTTTA GTGAAAGGCA CCCACTTTTC TAATTCACAA 420 

GTGGCGTTAG cTAATAATTT AGTCGCTTGG GAGAAATCCT CTTCGCATTT TAAGACGACT 480 

TGTCCTTTTC CATCGTATCC ACCTTGAATA GTTTTTAAGA CACATGGAAA ACCAATCTTT 540 

TCTATTGCTT GTGGCAGCTC TTTCTGGCTT TTTACTTCCG CAAAAGGGGC AATCTTACAG 600 

CCAATCGATT GAAGAAACGT TTTTTCAAGC AATCGATTTT GAGTAATTTT TAACAATTCT 660 

GTTCCTTGTG GCAACTTTTC TAGTGGAATG GCCTTTTCTA GGGAGGCCAC TTCAATATTT 720 

TCAAATTCAT AGGTGATCAC ATCGCTTAAC TCTGCTAGTT CAGTTAAAGC AGATATCTGA 780 

TCAAAGGCTG CCACAATTTG ACGGTCAGCT ACTTGACCAG CTGGACTATT GAGCGTTGGG 840 

TCCAACACAA TCATTTGGAA ACCCATTTTT TTACCACTCA TGACTAGCAT TCGCCCTAAC 900 

TGTCCCCCAC CTATAATGCC AATAGTTGCT CCTACTTTTA AAGGTTTATT CAAGTTGTTC 960 

CCCACTTTCC ATAACCGCTT TTTCCAATGA TTGTCGTCTT GCTTGGATTC TTTCTGCTAA 1020 

CTGTAAATCA TCAATACTTA GTATTTCTGC AGCTAATAAC CCAGCATTGG TAGCACCTGC 1080 

TGAGCCAATC GCAGTTGTTG CTACAGGAAC TCCCCCTGGC ATCTGCACAA TAGACAACAG 1140 

TGAATCAATA CCGTTTAACG TTCGTGTTTT AACTGGAACA CCAATGACAG GTAAGGTTGT 1200 

CTTTGCAGCT ACCATCCCTG GTAAATGGGC CGCTCCACCA GCGCCAGcgA nTAATAACCT 1260 

TCAATCCTTT TGTTCGTGAT TCCTCTGCAA AACGAAACAT CAAGTCAGGG GTCCTATGTG 1320 

CTGATATCAC TCGTTTTTCA TAAGGAATTT TCAACTCTTC CAACACAAGA CAAGCAGCTT 1380 

TCATCGTTGC CCaGTCTGAG GTACTTCCCa TAATCACAGC GACTAATGGT TtCATGAATG 1440 

TAATAACTCC TTCCGCCTTT TTTCACTGTT ATTCTAACAA TGAGAAAAAG AGATGTCAAr 1500 

GAAAAaCCGA AATATTATAG ATTAAAATTA TATAATGTTC GTGTTTATCA AAAAAAGAGT 1560 

CATCGGAAAG ATgcTTTTTT GCtaTTTTTA GCGTTGCTTC CTCTTTAAGA AGAAACGTCG 1620 

CCTGTATTAA TAACTGGTTG TGTGCCCTTA kCAGTAATGA TTGAGACTAG TAAATTATTA 1680 

ATAAGTTGTA CTTTTCGTTC ATCGGTAAAA TTAATTTCTT GCCCTTCTTC AATCTGTTCT 1740 

AAGGCCATTT GTGTCATGCT TACCGCACCT TCAACAATTG TTTGTCGAGC CGCTAAGATT 1800 

GGTTTTGCTT GCTGTCTTTG CAACATGGAG CTAGCTATTT CAGTTGCATA AGCTAAATGA 1860 

TTCaACCGTG TTTCGATAAC CTCTACACCA GCCACTGCTA AGCGTTCTTG CAATTCTTTG 1920 

GTTAACTCCT CGGAAATTTG TTCGGTATTT CCTCGCAGAG TTACATCATT ATCTGAAAAA 1980 
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GTGTCATAAG 


GATATTGTGT 


TGCTACATGC 


CGGATTGCCG 


TTTCACTTTG 


TU^TTTCCACA 


2040 


AAGTCTTGaT 


AATAATCGAC 


ATTAAATAAC 


GCCTTAGCTG 


tATCGACyAC CCGAAAAACG 


2100 


ATGACCGCAG 


AGATTTCGAT 


TGGATTCCCG 


TCAGAGTCAT 


iUAul i 1 1 AA 


1 AlaAi^AAC TA 


2160 


TTGAAATTTC 


GCACTTTCAA 


AGAAATGTTC 


ATTTTTTGCG 


TAAATGGAAT 


GGTAATAAAC 


2220 


AACCCATTTT 


CTTTAATCGT 


ACCTAAGTAG 


CGACCAAAAA 




TTTCGCCTGG 


2280 


TTGGGACTAA 


CAATGGTTAA 


GGAACTTAAA 


AATAAAATAG 


CCCCAACTAA 


CAAAATAATA 


2340 


CCTAAAACAA 


CAAGAACACC 


ATTGGTATGA 


CTCGCTCCAT 


AAATAAATAA 


ACCAATTCCC 


2400 


GCAATTAAAA 


TAACTAATAA 


GCTCGCAATT 


CCTACGTAGC 


CACTCATATA 


AAATGTTTTT 


2460 


TTCTCTTCCA 


TCTGTTTCAC 


TCTCCTAACT 


CATTTATCTG 


GTCCTTATTA 


TACACCTCAT 


2520 


TGCCTTCAAT 


AGAGTAACTT 


TTAACATTAA 


ATAAACTGAT 


TTTTTCGGGG 


CTCTTGTAGT 


- 2580 


ATACTGAGAA 


CT^GAAATC 


CCAAGAAAAG 


GAGTGATAGA 


ATGCGCTATT 


TAAGCAAGGA 


2640 


CATTTTAGAG 


GAAGTCATCA 


CTCAACGCCC 


CTCTGACTCT 


TATAAAAGTA ATTTTGGGCG 


2700 


TGTTGTATTA 


ATCGGAGGAA 


ACCGACAATA 


TGGCGGAGCa 


TCATCATGAG 


TACCGAAGCG 


2760 


TGTATCAATA 


GTGGCGCTGG 


TCTCACCACT 


GTGATTACTG 


ATGTTAAAAA 


TCACGGGCCT 


2820 


TTACATGCAA 


GATGCCCAGA 


AGCGAtGGTC 


GTGGGCTTTG 


AAGAAACAGT 


CCTTCTGACG 


2880 


AACGTCGTAG 


AGCAAGCGGA 


TGTTATTTTA 


ATTGGTCCAG 


GTTTAGGCTT 


AGATGCTACT 


2940 


GCACAACAAA 


TATTAAAGAT 


GGTACTCGCC 


CAACATCAAA 


AACAACAATG 


GTTAATTATC 


3000 


GATGGCTCAG 


CAATTACCCT 


GTTTAGCCAA 


GGCAATTTTT 


CCCTTACCTA 


TCCTGAAAAA 


3060 


GTTGTTTTCA 


CACCTCATCA 


AATGGAATGG 


CAACGTTTGA 


GTCACTTACC 


TATCGAGCAA 


3120 


CAAACACTGG 


CAAATAACCA 


GCGCCAACAA 


GCAAAATTAG 


GCAGTACGAT 


TGTTTTAAAA 


3180 


AGTCATCGCA 


CCACAATTTT 


CCACGCAGGA 


GAACCTTTTC 


AAAATACAGG 


TGGCAACCCT 


- 3240 


GGGATGGCTA 


CTGGCGGAAC 


TGGGGATACC 


TTGGCTGGCA 


TCATTGCTGG 


TTTTTTAGCT 


3300 


CAGTTTAAGC 


CGACAATTGA 


AACCATTGCT 


GGCGCCGTCT 


ACTTACACAG 


TCTCATTGGA 


3360 


GATGATTTGG 


CCAAAACTGA 


CTATGTTGTT 


TTACCAACGA 


AGATTAGTCA 


AGCCTTGCCG 


3420 


ACATATATGA 


AAAAATATGC 


CCAGCCCCAC 


ACTGCACCTG 


ATTCTGAACT 


ATTAGAACAG 


3480 


AAGCGTTCGA 


GATAAAACCC 


GAAAAGGCTA 


AATGGTGACA 


AATTTTGGGT 


TAAAAAGCAG 


3540 


GATTTAAGCC 


TGGGCCGAAA 


ATCAAAAGCG 


ATTTTCAGCC 


CAGGCTCTCT 


TTTTATTTTT 


3600 


CCTTATAAAC 


ATTTTTTAAC 


GTTAATGGAG 


CGCCAAAGGT 


ATGAAAATTC 


AAGCCTTTGA 


3660 


CTTTATCATT 


TTGCAGGACT 


GTTTGTTCAT 


TTtGGCTAAG 


TGG7UVTCATG 


CCAGCAGTCG 


3720 


TTTCAATCAC 


TTCTTTTTCA 


GCTGCAATCA 


GTGTCGCCCA 


TCTTTTTTCT 


GGCTCTAAGG 


3780 


CATAGGTTGT 


GGCTGCTTCA 


TCTAATAATT 


TGTCATAGAC 


AGGGTTCTGA 


TAATTGCGAT 


3840 


CATTGCCCTT 


GTATAAAGTC 


ATCAGGGTA6 


AAATAGGGTC 


TTGATAGTCT 


GGTGTCCAGT 


3900 


AAATTAAGAA 


TAAATCATAG 


TCACTTTCTC 


GCCCAAAGTT 


CAATGCAGCT 


TCAGTCGGCA 


3960 
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ATGCGGTTAG CTCTATTGTC AAACCAGGAA ACAATTCTTG TAGCGAGCCT TGCAAACTTT . 4.020 

CACCAATTTT TTTGTAAGAA CCATCATCTG TTACCATCAA TTCAACGTTA ACCTTTTCTC 4080 

CTAATTCTGC TTGTGCTTTT TTCCAATAAC TTTGCGCTTT TtCTTTGTTA TAAACCATTA 4140 

AATTACCTGC TTCTTGACGA AAATCGAGAC CCGTTGTGGG ATTCGCCACA AAGCCTTCCG 4200 

TAATCGCACC ATGTAGCGCT TTAGAACCAT CTGCAATAAT ATTATtGACT AGATTTTCTT 4260 

TATCTATTCC TAAAGCCAAT GCTTTACGCA GGTTTTCGTT TGCAAGCGGC GTkGCTTGCC 4320 

CTTyCCGTTT TTGATTTAAG CGCAAATAGT TCATTGTCGC TGTTGGATAC GAATGATACA 4 380 

ACGTATTATT TTTATTCTGT TGCGCCAGTT CACCACTTAG TGTAGCCACA TCTAATTGTC 4 4 40 

CATCTTCAAA AAGATTCAAT GCGGTAGATG TTTCTTTGAT AACTGTATAA TTGATAATGT 4500 

CTGAGCGCAC GTTCTGGTGA TCCCAATAGC GATTATTTTT TGCTAGTTGC CAGTCCATCT 4560 

TTGTTTGCTG CCAATCTTTT ACCACGAACG GACCATTATA GACGACTTTA TCACtAGCAG 4620 

TTCCATAGTC CGCACCAAAT TGTTCGACTA CTTTTkGATT TtGCGGGaAA AATGTCGGAA 4680 

AaGCTAACAA GGACGTAAAG TACGGTTTTG GCTCTTTGAG CGTCACCTTT AATGTATAAT 4740 

CATCCACAGC TGTGACACCT AGTTCATTGG GTGCTAATTT CCCCGCTGAG ATTTCTGCAC 4800 

CATTTTGAAT TGTTTCAACG ATGAGGAAGC TATAAACAAA GCCGTTTTTA GGATCAATCA 4860 

TTTTTTTCCA AGCATATTCA AAATCATGTG CTGTGACAGG ATCATCGTTA CTCCAAACCG 4920 

CTTCTTTTCT CAAAGAAATC GTGTAGGtTT TTCCATCTTC TGaAATCATC GGCAATGCTT 4980 

TTGCTACGGC TGGTACCAAT TGGTCTTGTT CATCTAAACT ATATAACCCT TCAAACGCTG 5040 

CAGTTTGGAC AATAGCATCT GGAAAATCCA ATAATACAGA GGTGTTGAGC GTTGTTAATT 5100 

CCGAGGGTGA CATCAGTTGA ACGGCTTCTT TGGTATTTTT CTGTTTCTTT TTATTTTCAT 5160 

TTTTATTGGT ACACCCAGCT AAAAATCCTA CCAATAAAAT ACAAACCATT GTGATTTTTA 5220 

AATACTTTTT CATCTTCTTT CTCCTATAAC TCTTCATTTT TTTATGCTCT TACTTACAGA 5280 

AAGACAGCTC CCCTTTTATT TTGTCCTTTC CACAAGCATC CAGACAACGA CCGCTGGCTC 5340 

CAACCCATTA TTATAATTAA TGTGCCCCTC ACCAGCTTCA ATTAATTGGG CTTGTCCAGC 5400 

TTTTCCAACT AAGTCTTGAC CATCTTCTAA AATCGTATGT GTcGCCAGAA ATCACATAAG 5460 

AATATTCATC CTGTTCATGG GGAGCATATC CAGCTAAAGG CCGT 5504 
(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

GTTGCGACTA AACTATAGCC AATTGCTGAA AAACGCTTAG CAATTTCTAA CGCCTCTTCT 60 
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TTTGTTTCAT 


CAGCAATGGT 


GAAAAGTACA 


GCCCCATAAC 


TTGGCAAATG 


CAAGCCAGAT 


120 


GCTTCAAAGG 


CTTTATACAA 


GGCTTTTTCT 


AAATAATAAT. CACTACCCAT, 


.AACTTCCCCA 


180 


GTTGATTTCA 


TTTCAGGCCC 


TAAGTACGTA 


TCTACTTTTT 


GTAGTTTCGT 


AAAGGAGAAT 


240 


ACAGGGGCTT 


TCACATGAAC 


TTGTTTACTT 


TCAGGATATA 


ATCCATCTTG 


ATATCCTAGG 


300 


TCAGTCAGTT 


TCTCACCAAG 


AATCGCTTTT 


GTAGCAACTT 


GGGCCATCGG 


AATCCCTGTG 


360 


ATTTTACTTA 


AGAAAGGAAC 


TGTTCGACTC 


GCACGAGGAT 


TCACTTCAAT 


CACATAAACA 


420 


CGATTTTCAT 


GAATAACAAA 


TTGAATATTC 


ATCATACCTA 


CACAGTTTAA 


ACCAAGTGCC 


480 


AGTTTTTTTG 


TATAATCGGC 


AATTGTTGCT 


TGAATTTCTT 


GAGAAAGATA 


CTGCGGTGGA 


540 


TAAACCGCCA 


TTGAATCACC 


AGAATGCACG 


CCAGCTCGTT 


CAATATGTTC 


CATAATCCCT 


600 


GGAATTAGTA 


CTGTTTCACC 


ATCACAAATG 


GCATCAACTT 


CACATTCTTG 


TCCCAATAAA 


660 


TAGCTATCAA 


CCAAGACAGG 


ATGTTCTGGA 


GAAGCTTTGA 


CTGCATGTCG 


CATATAATCT 


720 


TCCAAATCCT 


TTTGATTTTC 


GACAATTTCC 


ATTGCACGAC 


CACCTAAAAC 


ATAACTTGGA 


780 


CGAACTAAAA 


CGGGATAGCC 


GATACGATCA 


GCAATCACGA 


CTGCTTCTTC 


TGCACTTGTG 


840 


GCCGTATCTC 


CTGGTGGTTG 


AGGAATCGCT 


AACTCTTGCA 


AAGCTTGTTC 


AAATAAATCA 


900 


CGATTTTCCG 


CACGATCTAA 


ATCTTCAATC 


GTTGTTCCAA 


GAATTTTAAC 


GCCTTGTTTT 


960 


GTGAGCGGCT 


CAGCTAAATT 


AATCGCTGTT 


TGTCCACCAA 


ATTGGACGAT 


CACGCCAATC 


1020 


GGATTTTCTA 


AATCAATCAC 


ATTCATGACG 


TCTTCTAAAG 


TTAATGGCTC 


AAAGTATAAT 


1080 


TTATCAGAAA 


CAGAGAAATC 


TGTTGAAACT 


GTTTCAGGAT 


TACTATTCAT 


AATAATTGCT 


1140 


TCATAACCAG 


CCGCTTGGAT 


CGCTTTGACA 


GAATGAACCG 


TTGCATAATC 


AAACTCGACC 


1200 


CCTTGACCAA 


TGCGAATGGG 


ACCAGAACCT 


AATACTAGTA 


CAGAAGGCTT 


TTTCGAAACA 


1260 


TTGCTTTCGT 


TTTCCACTTC 


ATACGTGCTA 


TAAAAATAAG 


GCGTGTGTGA 


TTCAAACTCT 


1320 


GCAGCACACG 


TATCCACCAT 


TTTATAGACG 


GGCACAATTT 


GATTGGCTCT 


GCGGAAGTCA 


1380 


GCGATGGCTT 


GTTCTGTTTG 


TCCCCAAAGT 


GCTGCGATTT 


TCCGATCAGA 


AAAACCATTT 


1440 


TGTTTGGCTT 


CTTTCAAAAC 


AGCTACGTTG 


TCCACGTGAC 


TTTCAAGGGC 


TGTTTCAATT 


1500 


TCGATAATGT 


GTAAAAGTTT 


ATCCAAGAAG 


AATAAATCAA 


TTTTCGTTAA 


ACTTTGTAAT 


1560 


TCTTCGATGC 


TATATCCTCT 


GCGAATGGCT 


TCTGATAAGT 


AGAACAATCG 


ATCATCTTGC 


1620 


GCATGGACCA 


TTTTCTTCGT 


TAATTCCAAA 


TCACTGACAT 


GACTTAGTTC 


TGCTAACTCG 


1680 


TTGTGATAAG 


CACCAATTTC 


TAAAGAACGA 


ACTGCTTTTA 


ATAAGGATTC 


TTCAATATTG 


1740 


CGGCCAATTG 


CCATGACTTC 


CCCGGTTGCT 


TTCATTTGCG 


TTCCTAGTTC 


TCGAGCACCT 


1800 


TTTTCAAATT 


TGTCAAAAGG 


CCATCGAGGA 


ATTTTGGAAA 


CAACATAGTC 


CAAAGCTGGT 


1860 


TCAAATTCGG 


CATAAGTGGT 


TCCAGTAACT 


GGATTTTTCA 


TTTCATCTAA 


TGTTAAACCA 


1920 


ACAGCAATTT 


TTGCAGCTAA 


CTTCGCAATG 


GGATACCCTG 


TGGCTTTACT 


AGCAAGTGCA 


1980 


GATGAGCGAG 


AAACACGAGG 


ATTCACTTCA 


ATcACaTAAT 


AwTTAAAGcT 


ATGCGGGtCC 


2040 
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AArGyTAATk 


GAACwTTACa 


GCCACCTTCA 


AT,TTTTAATG 


CACpAATaAT 


TTTCAArGAC 


2100 


GCATCTCGCA 


ACATTTGGTA 


TTCATAATCT 


GATAAGGTTT 


GACTCGGAGC 


AAAAACAATC 


2160 


GAATCCCCAG 


TGTGAATGCC 


GACAGGATCA 


AAGTTTTCCA 


TGTTACAAAC 


CACAATCGCA 


2220 


TTATCAGCCG 


AATCACGCAT 


CACTTCATAT 


TCAATTTCTT 


TAAAACCAGC 


AATACTTTTT 


2280 


TCAATTAAAC 


ATTGTGTAAC 


TGGTGATAAT 


TTCAGACCGT 


TTTCAGCAAT 


TTGGCGGAGT 


2340 


TCTTCTTCCG 


TATCACACAT 


GCCGCCGCcT 


GTTCCACCTA 


AAGTAAATGC 


GGGACGAACG 


2400 


ATAATTGGAT 


AGCCAATTCG 


TTTGGCAAAA 


GCCACGGCTT 


GTTCCACCGT 


ATTGACAATT 


2460 


TCACTTTCTG 


GAATCGGTTG 


TTCTAATTCT 


TCCATCAATT 


GTTTAAATAA 


ATCACGGTCT 


2520 


TCCGCTTGGT 


CAATCGCACT 


GAGCTTTGTT 


CCTAGCAATT 


CTACATTTAG 


CTCATCTAAA 


2580 


ATTCCTGATT 


CCGATAATTC 


CATCGCCATA 


TTTAATCCAG 


TTTGACCACC 


TAATGTTGGT 


2640 


AGAAGAGCAT 


CAGGACGTTC 


TTTTCGCAAA 


ATTCTTGAAA 


CAAATTCTAA 


AGTAATCGGT 


2700 


TCGATATAAA 


CATGATCCGC 


GATTTCCTTA 


TCGGTCATGA 


TAGTCGCTGG 


ATTTGAGTTT 


2760 


ACTAAGACAA 


CTTCATACCC 


TTCTTCTTTT 


AGCGCTAAAC 


AGGCTTGTGT 


GCCTGCATAG 


2820 


TCAAATTCTG 


CTGCTTGTCC 


AATAATGATT 


GGTCCTGAAC 


CAATCACCAT 


GATTTTCTTG 


2880 


ATGTCCGTTC 


GTTTTGGCAT 


TAGTTTTGCT 


CCTTCCATGC 


ATCCATTAAT 


TCCATAAATT 


2940 


CGTCAAATAA 


ATGAAGACCA 


TCGTGCGGTC 


CTGGAGCCGC 


ATCTGGGTGA 


TATTGCACAG 


3000 


TAAAAGCAGG 


ATAATCTCGA 


TGACGCACGC 


CTTCAACCGT 


TCCATCATTG 


ACTTCCACGT 


3060 


GGGTCACAAG 


TAATTTTTCT 


GGATCAATTG 


TTGCTTCATC 


TACGGCATAT 


CCGTGGTTTT 


3120 


GTGAAGTGAA 


ATCAATGCGT 


CCTGTTGCAA 


TTTCTCGTAC 


AGGATGATTC 


AAACCGCGAT 


3180 


GTCCAAACTT 


CATCTTATAC 


GTATCCGCAC 


CATTCGCTAA 


TGAAAATAAC 


TGATGACCTA 


3240 


AGCAGATGCC 


GAAAATCGGA 


ACTTTTCCTT 


GGATTGCTTG 


AATCATTTCA 


ATAGCTTCAG 


3300 


GCACATCTTT 


GGGATCTCCA 


GGCCCATTTG 


TCAACATGAC 


ACCATCGGGA 


CTTAATTCTA 


3360 


AAATCTCTTC 


GGCCGTTGTA 


TTATAAGGCA 


ACACGGTTAA 


GTTGCATTGC 


CGTTTTGATA 


3420 


ATTCTCGCAA 


AATACTATGT 


TTCAACCCAA 


AATCAACGAC 


GACCACATTA 


CGCCCAATAC 


3480 


CTGGACTCGG 


ATAAGGCTTA 


GTGGTGGAGA 


CTTGTGCCAC 


TTGATTTTTC 


GGCATGACTG 


3540 


TCGCTTTTAA 


TTGATCAAAC 


GCATGCGGCA 


AATCATCCAC 


GGCATCAATG 


ATGCTCCCTT 


3600 


TCAJcTGTTCC 


AGCTGATCGT 


AACTTACGAG 


TCAGCGCACG 


CGTATCAATT 


CCTGAAATTC 


3660 


CCGGAATCCC 


TTTGCGCTTT 


AAAAATTCAT 


CCAAGGTCAT 


TTGTTGACGC 


CAATTTGAAG 


3720 


CAACCCGCGC 


ATGCTCTTTA 


ACAATGACAC 


CTTTACAGGT 


TGGTGCaATG 


GATTCATaAT 


3780 


CaTCTCGGTT 


GACCCCGTAA 


TTCCCaACTA 


ATGGATACGT 


AAAAGTAATC 


ATTTGTCCAT 


3840 


TAAAACTTTG 


GTCAGTAATT 


GCTTCTTGGT 


AACCTGTCAT 


GCCAGTAGTA 


AAAACAACTT 


3900 


CTCCTACTAC 


ATTGCCtTCT 


GCaCCAAACG 


CTTTTCCTTC 


AAATACTGTT 


CCGTCTTCTA 


3960 


AAATCAATAA 


CCTTTTCAAA 


CCTACTGCGC 


CTCCTCCGAC 


CAAGCTAATG 


CTCCGTCAAC 


4020 
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AAATGTCATA AGTGTCTTTC 


CTTTGACTGT 


CCACCCTGTA 


AAXGGTGTAT TGACAGCCAT 


4080 


TGATTCAAAT 


GCTTCTGCAT 


CAATTGGTGC 


AGCTGTTGCT 


AAGTCAAAAA 


CTGCAATATC 


■ 4140 


TGCTGGTGCA 


CCAATTsTTA 


GTGTTCCTGC 


GTTTAATCCA 


AAAATTTCTG 


CCGGTTTCAC 


4200 


TGCCATCCAA 


TCAATCACTT 


GTTCTAATGT 


GAAAATGCCT 


GTTTCGACAA 


AGTTCGTATA 


4260 


AATCAATTGG AAGGCCGTTT 


CACTACCGAC 


AATACCAAAA 


GGTGCGTTTA AAAATGATTG 


4320 


TTGTTTTTCT 


TCTAAACC7VT 


GGGGCGCATG 


GTCGGTAGCA 


ATACAGTCAA 


TCGTGCCATC 


4380 


TAACAAGCCG 


TCAATCAATG 


CTTGTCGATC 


TGCTAAACCA 


CGTAATGGCG 


GGTTCATTTT 


4440 


CCAAAACCCT 


TCGTCACCAG 


GAATATCCTC 


ATCGACTAAA 


ATTAAATGGT 


GTGGCGAAAC 


4500 


TTCGGCTGTT 


ACGTGAATCC 


CTGCTTTTTT 


AGCATCTCGA 


ATAACGCGCA 


CGCTTTCTTC 


4560 


TGTTGAAACA 


TGACAAACAT 


GATAATGAAC 


GCCTGTTTCT 


TTCGCTAAAG 


TAATATCCCG 


4620 


AGCAATTTGT 


GAAGCTTcCG 


TTGCACTTAA 


AATCCCTGGT 


AGGCCTAATT 


TCTTAGAAAC 


4680 


TTCTCCTTCA 


TGCATCACAC 


CACCAAATAA 


TAAAGATTCA 


TCTTCCGTGT 


GCGCAACAAG 


4740 


TGCCATATTT 


AATGCAGCCG 


CTTCTTTCAT 


GGCTAAGTAC 


ATCGTTCCTG 


CCGTTTGGAC 


4800 


ACCAACACCA 


TCATTCGTAA 


ATGCAAAGGC 


GCCCGCTTCT 


TTTAAGGCCT 


TCTGATTGGT 


4860 


TAAAACTTCA 


CTCCGTAGCT 


CTTCTGTAAT 


GGGGGCATAT 


TGTAAAACTT 


TCACGACTGC 


4920 


ATCTTTTTGA 


ATTAAGTCGT 


AAACCTCACT 


TAATTTTTCA 


GCCGTATCTG 


GTACTGGATT 


4 980 


CAAATTGGGC 


ATTGCGCAAA 


CCGTAGTAAA 


ACCACCGCGA 


GCAGCAGCTT 


TACTACCCGT 


5040 


TTTAATCGTT 


TCTTTATACG 


TAAAGCCTGG 


TTCTCTAAAG 


TGAACATGCA 


CATCCACCAA 


5100 


GCCTGGCGTG 


aTTAGTTGCC 


CGTTCGCATC 


AAACACTTGC 


TCAAAAtCTG 


CTTCGTCAAA 


5160 


CGATTCCCCG 


ATTGCGTGAA 


TCACACCATT 


TTCTAGCCAA 


AGTGCTGCCT 


CAATCAATTG 


5220 


ATTCTCTTTT 


TTGATGATTT 


TTCCATTCTT 


AATTAACGTC 


TTCATTTCTC 


CACCTAGGCC 


5280 


TTTCCATGTA AAATTGCTTC 


CAAAATTGCC 


ATGCGCATAA 


AGACTCCGTT 


ACTCATTTGC 


5340 


GCGACGATTC 


TTGATTGCAG 


ACTTTCCACT 


AATTCATCTG 


CCAGTTCAAC 


ATCCCGATTG 


5400 


ACTGGCGCTG 


GATGCATAAT 


AATTGCGTGT 


TTCTGTAAAC 


GAGTTGCCCG 


TTCGTTGGTT 


5460 


AAACCATATT 


CCAAATGATA 


GCCTTCTTTT 


GAAAAACTTT 


CCTTACCATC 


ATGACGTTCG 


5520 


TGCTGCACGC 


GAAcAACATC 


ATCACATCGA 


CTTTTTCAAC 


AATTTCATCT 


AGCGGCACAT 


5580 


ATTGTCCATA 


GACATCAAAC 


TGATGATCGT 


ACCACTCTTC 


TGGACCTGAA AAATAAATTT 


5640 


CAGCGCCTAA 


ACGGTTTAAT 


AACTGCATAT 


TTGATTTAGC 


AACGCGAGAA 


TGTGTAATAT 


5700 


CTCCCACAAT 


CGCCACTTTC 


AAGCCTTCAA 


ATCCACCAAA 


TTCTTCATAA 


ATAGTCATCA 


5760 


GATCCAATAA 


ACATTGTGTA 


GGGTGTTGTC 


CACTCCCGTC 


CCCACCGTTA 


ATAATAGAAC 


5820 


ATTGAATCGT 


TTTACTTTGA 


ATCAATTCAT 


CGTAATAATT 


TTCTTTACCA 


TGACGAATGA 


5880 


CTGCTACATC 


TACACCAATT 


GCCGACATCG 


TTAGTACTGT 


GTCGTATAAC 


GTTTCTCCTT 


5940 


TTTGAACAGA ACTTCTACTT 


GCTTCAAACT 


CAATCACTTC 


TAAGCCTAAT 


TTTTTCTCAG 


6000 
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CTACCTCAAA ACTTTTATGT GTTCGAGTAC TATTTTCAAA AA^CAAGTTT GTTGCAAAAT 6060 

JVTTGACGTTC TTCTGGGTGC CATTTTGCCC CCTGTTTAAA TTCTCCTGCG CGACGGATTA 6120 

ACCCCATGAC TTCACGGTCG GTTAAAGCCT CAGCAGTTAA AAGATGTTTT AAACTAATTC 6180 

GTTCAGATGT AATAATCATT CGTCCAGTCC TCCTACTGTT CTTCGCTACG TGCGTTTTCC 6240 

GGTAAAATTA AATTCAAAAC AATTCCTAAG ACAGTTGCTA ATGCCATTGA AGACAATTCA 6300 

AT^GTTCCTA CTTTGAACAC TAAGCCACCA ATGCCGATGA CTAAAATTAC TGAAGCAATC 6360 

AATAAATTCT TTTTCTTATC AAAATTAATC TTATTATCGA TTAAGATTTT CAACCACTGG 6420 

GGnAACAAAC CGA 6433 
(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5855 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 



ATAGAAACCG 


CCTCTCTCTC 


GACACGCAAC 


TTGCGTGTTA 


TCTGTACGAT 


TTGTTTACAT 


60 


GTTGCATTGT 


AAACAGTTTT 


TAGGCAATCG 


TCAAGCAGAA 


AATAATTAAC gTTGCAtGAA 


120 


GAAACtAATA 


ATTGCAAAAA 


TAGCAATGTG 


AGCTGGATAG 


AAAATATAAA 


AGAAATTCCG 


180 


CATGCCACGG 


CCTTTTTCAC 


CATTATAAAG 


AACAATTGGA 


ATCGCTGCTA 


AAATCATCAT 


240 


CCATTGGTGG 


TTCACOVvSA 


ATAGTTCTTG 


GAAATTATAG 


CTGGCTGCTG 


AAATCAACGC 


300 


AACAACGCCA 


ATTGCTGAAA 


TTTGTAACCA 


CTTTTTGCCA 


TGGAATAAAT 


AAAAAGCTAC 


360 


GGCCAAGAGA 


ACAAATAAAA 


ACCCACCTTC 


TGTCATCATT 


AGTGATGGCA 


CAATCAGATT 


420 


GAAAATTTGG 


AAACCAATTA 


AGATAGCAGG 


GTTGCTTAAT 


ATGTCTGGAG 


AACTAAAAAC 


-480 


AACGAGAGGT 


AGGGCACTGA 


TTAATAAAGG 


CACAATAAAC 


CAAAGAAGCC 


CTTTACCAAT 


540 


TTGTTTTTGT 


TTAAAATAGT 


CCATAGACTG 


CATGTAGACC 


GTACCTAAAA 


ATAACGTCCC 


600 


AA/^TATTA 


TTGATAATTA 


AGCCACCTGT 


TGAAAAATAC 


GCATTTAAAA 


AGCCGTTCAG 


660 


AATCCCCATA 


ATCCAGAAGC 


CTAACAATAA 


CCGGAACATG 


TATTTTCGCC 


GATTGCTTGT 


720 


GTGAATGAAG 


CCTTCAGAAC 


TTTCAAATAA 


GAAAATAGGT 


GCAGCAATTC 


GTCC7UVTCCA 


780 


TCCGAACCAA 


CCAGGCACGC 


CTAAAAAACT 


AAAAAATTGC 


AATAGATGGT 


CAAAGACCAT 


840 


TGAAATAACC 


CCAATGACTT 


TCAAATGAAA 


GCCAGTTAAT 


GTTTTACCCA 


TAGTATAAAC 


900 


GCCTCTCTTT 


CATAAAGAAC 


AATATAACAA 


AGTCCAGAAA 


AACCAACCAT 


TTATTTGCCT 


960 


AACAGTTATC 


TTACAAAATT 


GTTTGATTTC 


TTCAGCTTTT 


GATGAAAGAA AAAAGTGCCA 


1020 


AGACTAGTTT 


TTATAATTAT 


TCTATGCTAT 


AGTAAAAGTA 


CATTTTAATA AAAAACATTT 


1080 


GGGGTGCTGT 


TATGGCTGAG 


ATGATACCCA 


TTGAACCTGA 


TGCAGTTAGT 


ACTGTCGCAG 


1140 


GGAAATGCCG 


ATTTACTTTG 


ATTTTTCACG 


CACTCATTCT 


GTCCTAGGAT 


GAGTGTGTTT 


1200 
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TCTTTTATCA 


GGGTTTATTT 


AAGTCACTCC 


TTTTGCTTTT 


TGTTAAGAGA 


AAGGAGTTTT 


1260 


TTTATTGAAA 


AAATGGTCAT 


TGCAACAAGT 


CGTCTTACTC 


GCTTTTTTAG 


CCTTTTTGTT 


1320 


TGGCGGTGTT 


TTTATGGGGG 


CAGGATTCCT 


CTATGCGTTA 


TTGAAtGCTG 


CGCTGTTACC 


1380 


ATTAGGACTG 


AGTCCATTTG 


CCAATGAGCT 


TTTATTTGGA 


ATGTGGACCA 


TGGTGGCACC 


1440 


AATTGCAGCG 


ATGCTGATTC 


CACGAGCGGG 


AAGTGCTGTT 


TTGGCGGAAG 


TTTTAGCCGC 


1500 


TTTGGCAGAA 


ATGCTTTATG 


GTTCGTATTT 


CGGTCCTAGT 


GTATTAATTT 


CTGGTGTGAT 


1560 


TCAAGGTCTA 


GGTAGTGAAA 


GTGGCTTTTT 


GGTGACACGC 


TACAAACGGT 


ACGATACACT 


1620 


AACGTTATTT 


TATAGTGCGA 


TTGGGACGAC 


TATTTTTAGT 


TATGTGTATG 


AATACTTTAA 


1680 


GTTTGGTTAT 


GGC7\ATTACG 


GTCTGGGGAT 


GAACATTGCT 


TTAATTAGCG 


TTCGGTTCGT 


1740 


TTCTATTTGT 


TTCTTTGGTA 


TTTTCTTAAC 


GAAAGTTATT 


TTACGAATGT 


ATCAGTCTGC 


1800 


GCAAGGGTTA 


GCTGTGAAGG 


CTW^TGAAT 


AGTATCCAAT 


TAAAAGCTGT 


GTCCTTTTCA 


1860 


CGAGAACAAC 


CCTTGTTTGA 


AAAGACAAAT 


TTAGCAATTC 


CTCAAGGAAC 


GTTTTCCTTA 


1920 


TTAATTGGCG 


ATAGTGGTAG 


TGGAAAATCA 


ACGCTCTTAC 


GATTGATTGC 


CGGATTTGCT 


1980 


CCGCTTGATT 


ACCAAGGAGA 


AATTTTGATT 


GAAGGAATGG 


AACGTCGACA 


ATTATCAACT 


2040 


CGTGAAAAGG 


CGCAAAAAAT 


TGGCATGCTG 


TTTCAAAACC 


CCAGTCAACA 


ATTTACAATG 


2100 


AAAACCTTG6 


AACGAGAACT 


GATTTTTGCT 


TTAGAAAATT 


TAGGAATCCC 


GCCTGAAGAA 


2160 


ATGAACCGGA 


AAATCCAAAC 


AGCGCTTCAA 


CTTGTTCAGA 


CTCAAACCTT 


ATTCACACGA 


2220 


6AATTAGCGA 


CGTTATCTGG 


TGGTGAAAAA 


CAAAAAGCGG 


CGTTGACAGT 


TCTGCTAGCA 


2280 


ATGAACCCAG 


ATATTCTTTT 


ATTAGATGAA 


CCTTTTGCAA 


GTATTGATCC 


AACCTCTCGA 


2340 


AAACAATTTA 


TTCAAATATT 


AGCAAGACTC 


CATCAAGCAG 


GCAAAACGAT 


TTTGGTGTGC 


2400 


GACCACGATT 


TCAGCGATTA 


CGCTGATGTT 


GTGGACCAAG 


TCGTCACGTT 


GAAAAATGGC 


2460 


CAGTTTGAAA 


AGCAACCGTT 


GACGTTTATC 


AAAACAAAAC 


CACAGACATT 


TCAGCTTACA 


2520 


ACTTCTGTGG 


TAAAACAACC 


AATGCTCCAA 


TTGAAAAATT 


TTCGTTTGTC 


TCAGGGAAAA 


2580 


CGAGTGCTAC 


TTGAAGAAAA 


AGAaGCGCTT 


TTGTTTAAAG 


GAATTACCAC 


GTTGACGGGG , 


2640 


CCAAATGGGG 


CGGGAAAATC 


CACCTTATTA 


AAAGCCATCG 


TTCAGAGACA 


AAAATATCAA 


2700 


GGGAAAATGT 


TCTTGGCAGG 


CCGCCGCTTG 


CGGGCTTCAA 


AAAAATTGTA 


TCAACACATG 


2760 


ACATTAGCGG 


TGCAACAAGC 


CAATCGTCAG 


TT.TGTCACGT 


TGACTTTACG 


AGAAGAGCTT 


2820 


TTGTTTGGTC 


AGAACATGAC 


AGCAGAAAAA 


AGAAGAAAAC 


AAGAGGAAGC 


CTTAACTTTT 


2880 


TTAGGTTTAA 


AAGAAAAACT 


AGAGCACAGT 


GTTTTTC7\AC 


TAAGTGAAGG 


GCAAAAAAAG 


2940 


ATGGTGCAAT 


TAATTAGTTT 


GCTGAGCTTA 


GATTTG6ACT 


GTTTACTTTT 


AGATGAACCT 


3000 


TTTGCTGGAT 


TAGATGAGCG 


AGCATGCAAT 


TATTTTGTTG 


AATGGATCAA 


AGAAAAAAGT 


3060 


GCGCAACAAG 


ATTTTCTAAT 


CGTCACGCAT 


CGGCTAGAAC 


CTCTCTCTGG 


CGTGAGCAAT 


3120 


TACCGAATAG 


AATTGCTGCA 


GCAACAACTG 


ATTATCTGGC 


AGGAGGGAAC 


CACATGCAAA 


3180 
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TAAAGCAAAC 


AAATGCAGCA 


ATTTATGCTT 


CTTTGATTCT 


m Ik fiiniwtiii^ >k 

TATTTTGACC 


TTTGAATTAT 


3240 


CTTTTTCTCA 


GTa\ATTTTG 


^^ik Ik Ik fnmm^^ 

GCAAATTTGG 


CGGTTTTTAT 


GGGTTGTGTC 


ATTTTTTTAA 


3300 


TAGGCCAGCG 


AAAATCCCGC 


TTACTGCTTT 


GGTTATTTTT 


CTTGCCACTT 


TTACCAGCAA 


3360 


TCGGGACGTT 


TTGGTCTATT 


TATTTGCATG 


GTACGAGTAG 


TCAACAAGCC 


TGGCTTTTGT 


3420 


TTAGCCGAAC 


GTATGCCTTT 


GCCGGTTTAG 


GACTAGCTTT 


TGCTGTAGGT 


GTTGATTTTG 


3480 


AAGAATTATT 


GTTGTTATTG 


GAACAAAAGG 


GGCTAGCACC 


GAACTTTGTT 


TATGGAATTT 


3540 


TAGTGGTGGT 


TCACGCCTTA 


CCGGAAGTGA 


AAAGAGAAAT 


TAATGATTTA 


AAAGAAGCAA 


3600 


GCCTACTCCG 


TGGCAAAACG 


TTTCATTTTT 


GGTCGCCGAT 


GCTTTATGTT 


AAAACACTGC 


3660 


TCGTCGCTGT 


TTCTTGGCGA 


GATAAATATA 


CTGAAGCAAT 


GTCTGCCCAT 


GGCTATGAAG 


3720 


AAGGAGCCAC 


CAGAAGTCGA 


AAAGAGCATT 


TTGTGTcAGC 


AAAAGTCAGT 


CTGGGGCTTG 


3780 


CTATTTTGAT 


CATCATTTTT 


ACAAATTTGT 


TTATTTTTCT 


AACGTAAAGG 


AGAAACCTAT 


3840 


GACATTTACA 


GAACAAGCCA 


AAGATCaAGC 


GCAAGCTTCT 


TGGCAAGGGA 


GTTTTCAGCA 


3900 


TCCTTTTATC 


ACAGAATTAC 


ATGAAGGAAC 


CTTGAGTCCA 


ACGATTTTTC 


GCTACTATTT 


3960 


AATTCAAGAT 


CATTACTATT 


TAAAACACTT 


TAGTCAACTT 


TATCGCTTGA 


TTGCGGCACA 


4020 


AACGCAACAA 


CCTCGATTGA 


AAAAATTACT 


CTTAACAAAT 


GCGGAAAATC 


TCGCGTTGGG 


4080 


CGAATTAGCT 


ATTCGTGAAA 


CTTTTTTTGA 


AGAACTAGCG 


ATTACGGAAG 


AAGAAGTTGC 


4140 


AGCGACACCG 


ATTGCACCAA 


CCGCGTATCA 


CTATGTTTCT 


CATATGTATC 


GCCAACTTAT 


4200 


TGAAGGAACG 


CCTAAGACGG 


CCGCAGCAAG 


TATGTTGCCT 


TGTTCTTGGC 


TCTATCAAGA 


4260 


AATTGGTGCA 


CAGTTAGTCA 


AACAACACTC 


ACCAGAGCCA 


CTGTACCAAC 


GTTGGATTGA 


4320 


AACCTATGCT 


GGGGAGGAAG 


CGTACCAACA 


CGTCCAAGAG 


GAACGCCAAC 


TGTTAGACCA 


4380 


ATTATATGAA 


GAAAGTTCAC 


CACAAGAACA 


AGCAGCTATG 


ATCaCCGCTT 


TTGTTATCAG. 


4440 


CaGTGAGATG 


GAATACGCTT 


TTTGGGAAAT 


GGCCTATACA 


CACGAAACCT 


GGATTGGTTA 


4500 


ATACAAAAGC 


AAGTCGTTTA 


GTGCTCGTAA 


CTAACGACTT 


GTTTTTGTAT 


TAGAAAGCAG 


4560 


TAATTGTCAG 


TTGCTTTTTT 


GACAGGGCGT 


GATAAGATAT 


TCAGTGACGC 


AAAATCTGTT 


4620 


GTAACTCGCT 


CAGTAATAAC 


AGAAACGCCT 


TGCTAAGAAA 


GGACTGTAGA 


AGCCTCATGA 


4680 


ATTATCCAGA 


ACCGATTGCA 


AAATTGATTG 


AAAGTTATAT 


GAAATTACCA 


GGAATTGGTC 


4740 


AAAAAACAGC 


GACACGGCTT 


GCTTTTTATA 


CAATTGATAT 


GAAAGAAGAA 


GATGCTAATG 


4800 


CTTTTGCAAA 


AGCGCTAATT 


AGTgTCAAGA 


GAGaTCtCCA 


TTTCTGTAGT 


aTTTGCGGAA 


4860 


ATATTACTGA 


AGAAGATCCT 


TGTGAAATTT 


GCCAAGATAA 


AAATCGTGAC 


CGTAGTATTA 


4920 


TTTTAGTAGT 


GGAAGAACCA 


AAAGACGTTA 


TGTCAATGGA 


AAAAATGCGT 


GAATATCAAG 


4980 


GCTTGTATCA 


CGTACTCCAC 


GGGGTGCTTT 


CACCGATGGA 


AGGAACAGGA 


CCAGAAGATA 


5040 


TTAATATTGC 


CTCTTTGATC 


AAACGATTGC 


ACGATGATGA 


AGTGAAAGAG 


GTCATTATAG 


5100 


CAACCAACGC 


TACCACTGAA 


GGAGAAGCGA 


CGGCGATGTA 


TTTATCACGG 


CTCATTAAAC 


5160 
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CAGCTGGAAT 


CACCGTTACT 


CGTTTGGCAC 


-ACGGCTTGTC 


TGTCGGCAGC 


GATATTGAAT 


5220 


ATGCaGACgA 


AATAACCTTG 


CTAAAAgCAG 


TCGAAGGACG 


TCGAGAAATT 


TAAAAGAAAC 


5280 


ATCGTTGGAA 


AAAGAGGTTG 


AAATCGAAGT 


AGGTCGTTTC 


TGTGAGAAAA 


TGTCGCAGTT 


5340 


CAAGCTTTwC 


CGTACTTCTT 


TTTCAGTTTC 


TTTTTTTCAG 


CGGATGCATC 


ACTTTAAATA 


5400 


GGACAT7UVGG 


ATTTCAATTA 


GACTGATAGA 


GCAAGATGAA 


GTATAATAGA 


TACAAGTAAA 


5460 


AATTGATTGG 


AGTTTTCGCA 


CGTGCAGGgT 


ATTTTTATCA 


CATTTGAAGG 


TCCAGATGGT 


5520 


GCTGGtAAAA 


CCAGTGTTTT 


GAAGGAAGTT 


TCTGAAnATT 


AGCTAAAGAG 


TCAAAACGAA 


5580 


AAATTGTCAC 


TACAAGAGAA 


CCAGGCGGCA 


TACCGATTGC 


CGAAAAGATT 


CGAACAGTTA 


5640 


TTTTAGATCC 


AAGAnATGAC 


AGAATGGATG 


AGCGAACAGA 


AGCTTTACTT 


TACGCAGCTG 


5700 


cTCGTCGTCA 


ACATTTAGTG 


GAAAAAATAT 


TGCCAGcTTT 


GGAAGCGGGT 


CATTTAGTCC 


5760 


TATGTGATCG 


TTTTGTTGAC 


AGCTCATTAG 


CCTACCAAGG 


CGCTGGACGT 


nGAATTGGGG 


5820 


ATGGCCCCCA 


TGCCTCAATT 


AATGCGGTTG 


CGGCA 






5855 



(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 



GTTTCATAAC 


TGTACCAATT 


GGAAAAACTG 


ACATGGAACG 


GCGTAACTCT 


TTTTCTCTCA 


60 


TCTCCATCCT 


CCTCTCTTAG 


TTATTTCTTA 


GAATAACAAT 


AGAATGACAA ATTGTCAAGA 


120 


ACTTATGTTA 


GATTTTCTAA 


CATAAAACAA 


AAAACCTGAT 


AAATATGGAC 


TTTTCCATCT 


180 


TTTCACGGCT 


GATTTTCTCC 


TACTAATCAA 


AAAAATAGAA 


ATGATTTGTT 


TTTACAAATC 


240 


ATTTCTAATC 


AACTTACCAT 


TTTTTATTCT 


TTTTCGGATT 


CAGCCAACCA 


TTTAGACTTT 


300 


TTTTTCGCAA 


AACCTTTCAC 


GATATACCGA 


TTTTCATTTT 


CAGCATATTC 


TTCTGAAATC 


360 


ATTAGTGTCA 


TCCGGCGAAG 


TTCACTTAAT 


AGTTGTCCGT 


CAGTTGGGCT 


AATTTCTAAT 


420 


TGATAAGGCT 


CGAGTAATTC 


CATCATTTGG 


GCACGGATAG 


CTTGAACCAG 


TCGTTCTTTA 


480 


TCTTCAGAAA 


TTTTAGTTGA 


GATCAATACA 


TTGGGAAATA 


AGGTTGGGAC 


AAATTCTTTA 


540 


CTGTCCACTT 


GATCTCTTTT 


ATTATAAACC 


GTCAAGCACG 


GAATATTTTC 


CAACGCTAAT 


600 


TCTTTCATCA 


ATGTTTGAAC 


TGTGCGTTCA 


TGTTGTAAAC 


GATCTGGCGC 


ACTTGCATCC 


660 


ACCACATGGA 


GCAGTAAATC 


CATCGTCCGA 


CTTTCTTCCA 


ATGTCGACTG 


AAAGGCTTCA 


720 


ATCAGCTGTG 


TTGGTAAATC 


TTGAATGAAG 


CCAACCGTAT 


CGGTCAGCGT 


AACTTCCATG 


780 


CCTTGAGGCA 


ATTGCCATTT 


TTTGGTTAAA 


GGATCTAACG 


TTGCAAACAA 


TTGATCCTCT 


840 


GAATAAGTTC 


CTGCTGTTGT 


CAACATGTTG 


AGAATAGTTG 


ATTTGCCAGC 


ATTGGTATAA 


900 
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CCAATCAAGC 


CAATTTGAAA 


TAGTTCTGAA 


T.TTTGTCTTT 


TCTGGCGGCT 


TCGTTCACGA 


960 


TGAGCAGTGA 


CTTCTTTTAA 


CTCACGACGG 


ATGCCTAAGA 


TTTTATTCCG 


AATGTGGCGG 


1020 


CGGTCCGACT 


CTAATTTCGT 


TTCACCAGGC 


CCGCGTGTCC 


CAATACCTCC 


TCCAAGTCGG 


1080 


GACAACTGTT 


TTCCTTGGCC 


AGCCAAGCGA 


GGCAATAAAT 


AATCCAATTG 


TGCTAATTCA 


1140 


ACTTGTAACT 


TTCCTTCTTT 


GGAGCGCGCA 


CGCAAAGCAA 


AGATATCCAA 


AATTAATTGT 


1200 


ACACGATCAA 


TTACTCGTAC 


AGCTAACGCT 


TCAGATAACA 


GTTGAtTTTG 


ACGCGGTGTG 


1260 


AGCTCATGAT 


TAAAAATCAC 


AAGATCCGCT 


TCATAGGCAT 


CTGTTAAtTG 


AATTAATTGT 


1320 


TCCAATTTCC 


CCTTACCGAT 


TACGGTTTGT 


CGATCGACTT 


GTGGTCGTTT 


TTGCGTTAAA 


1380 


ACAAACACCA 


CTTCTCCATT 


AGCGGTTTCC 


GTTAAATTCG 


CCAGTTCTTC 


CATTGATGCA 


1440 


GAAAATGTCT 


GATAATTTTn 


TTCCGTTTCA 


ACTCCTACTA 


GAATCACTTT 


TTCATGAGTG 


1500 


GTCATCGCCT 


ACTCACTCTC 


CTTCTGTTGT 


AGCCATTCCG 


CTACTTCTTT 


TTCCAACTTG 


1560 


GGcAAATCTG 


TTGGcTGCTG 


AACCAAATTC 


CACCAATGCG 


CAGCCATTCG 


ATTTCTAAAC 


1620 


CAAGTCAATT 


GACGTTTGGC 


ATATCGCCGC 


GATTGCTGTT 


TGACTGTTTC 


AACCGCCATC 


1680 


TCTAATGATT 


GTTCACCAGA 


AAAATACGGG 


AAAAATTCTT 


TATAGCCAAT 


ACCTTGAGCT 


1740 


GCTTGTGCAT 


GTGGTTGCTG 


AAACATCTGT 


TTTGCTTCTT 


CCAACAGCCC 


TTCTGTCATC 


1800 


ATTTGATCCA 


CCCGCTGATT 


AATACGCTCA 


TACAATAAAG 


CGCGATCTGT 


TTCCAAACCT 


18 60 


AACAAATAAT 


AATCATAGAG 


ACGAGCTGGT 


TTTTCCTTCG 


GGGTCAAAAT 


ACTATAACCG 


1920 


GTTTTATCAA 


AGACCTCTAA 


CGCACGAATC 


ACTTTTTTGC 


GATTATTAAA 


ATGGATGCTG 


1980 


TTAGCCGCTA 


GAGGATCCTT 


TTGTTGTAAT 


AACAGCCATA 


AGGCTTGATT 


CCCTTTTTCT 


2040 


TCCGCAAATA 


AATTGTACGT 


TTCTCTAATT 


TCAGGCGAGT 


CATCTATCTC 


ACGACTTCCT 


2100 


AGTTGAAAAT 


CATAAAGGAG 


TGATTGAATA 


TACAAACCTG 


TTCCTCCAAC 


GATAATTGGT 


2160 


AGTTTCCCTT 


TTTCAGTAAT 


TTCTTTGATT 


TTTTGACGAC 


CCTCTTTCTG 


AAAATCAGCG 


2220 


GCAGAATAT6 


TTTCGCTGAC 


TTCACGGCAA 


TCAATTAAAT 


AGTGAGGAAT 


TCCTTCTTTT 


2280 


TCTGTTTCTG 


TGACTTTAGC 


GGTGCCAATG 


TCTAACGAGC 


GATAAACTTG 


CATGGAGTCA 


2340 


CCGCTAATAA 


TTTCACCATT 


AAATTTTTTT 


GCTAAAGCAA 


TACTTAATGC 


TGTTTTCCCC 


2400 


ACCGCAGTGG 


GTCCTACAAT 


AACTAATACT 


TTCTCCAATT 


ATGTTTCTCC 


TTCTTCTTGC 


2460 


CAATTTTGAC 


GAGCAAATTT 


TGCTTCCTCA 


GGAAAATCAG 


TATGAACACC 


CGCCAATTTT 


2520 


AGCTGAAAAC 


AATTTTCCAT 


TTCTTCCATC 


CGATTAACAG 


TCCACGGCCG 


CAGCGGTTTG 


2580 


CC7UVTTGTGA 


X nn.\3 X X ^ X x x 


TTCATGAGTP 


AACACCCAAG 


nK^n X X X X X oo 


ATG T AAArT A 


2640 


TCGACAAGCG 


CGAATGCCGG 


TCCCGCTTGA 


CTTAAATCCT 


CTGCAGATTC 


ATAAATAAAG 


2700 


GCAATTTCTG- 


TCTTAGGATC 


AGCTTTTTTT 


AATTTAACTA 


AACTTGGAAA 


ATAAAAACTG 


2760 


GAATATAAAT 


AACGAAACGG 


CCAGTTTTTT 


TGCTTCATTT 


GTTGGACGAT 


TTTTTTCTCA 


2820 


ATTCCTTCAT 


AATGGATAAT 


ATCGGTTTTA 


AGTTCAATAT 


TCAAAAAACC 


ATTGAATTGT 


2880 
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TCTTCTACAA 


GACAATTTAA 


AACATCTTCC 


AATGTTGGAA 


C^CATTGAAC 


AGAAGGATTT 


2940 


TTCTGAAACC 


AGCTTCCAGC 


ATCTAGTTGT 


TTTAATTCGT 


TTAATGTTAA 


CTGTTGGATT 


3000 


TCACCATGAC 


CATCTGTCGT 


ACGATCCACT 


GTTTCATCAT 


GCATAACGAT 


TAAATAACCA 


3060 


TCTTTAGATA 


GGTGAACGTC 


TAATTCAATT 


CCTTCAGCAC 


CAACACGAAC 


AGCCTCTCTG 


3120 


AAAGCAATAC 


ACGTATTTTC 


AGGATGGGTG 


CCTTTACTTC 


CTCGATGTGC 


AATAATATCT 


3180 


GTCATGCTCT 


CACTCCAATC 


TTGTTTATTG 


TACCATTGGA 


GTGAGGTGGG 


TTCAAATAAT 


3240 


GTAGAAAAAA 


GAAGCCTCTC 


ACAAATGAGG 


GGCTTCTTTT 


TAAGATATAA 


CAACTAACGC 


3300 


ATTCCTGGAG 


CATTTTTTTC 


TGTAATATCA 


AAATTTACCG 


AATCAATAAC 


TTTGTTAGTA 


3360 


TTACTATCflA 


TAATCACAAA 


GTTCAAACCT 


TTTTTATTTC 


TAGACATTTC 


TGTATCATTT 


3420 


ATTTTTACAG 


AAGCAAATGT 


TTGATTTTCA 


TAATTAGTAT 


AACTTGAGAA 


TACATCATAA 


3480 


TTTAAAGCTT 


CAATTTTACC 


AATTAGTCTC 


GATAATTCTG 


CATTTGTTTC 


AAATGTTTTA 


3540 


TTATCTTTTG 


AAATCACTGC 


ACTTAAAGTC 


CACGCATTGT 


TACCtTCGAA 


TTTTACGTCT 


3600 


ACCCCTAAGT 


CTTTTAAAAG 


CGACTCTTCC 


GCTTGTGAAA 


GATAGACTTG 


ATTACCATAA 


3660 


CCACTTAATA 


AAATAGTATA 


TTGCGTATCA 


ATTAATTTTT 


GTAAATAATC 


ATTAAGATTA 


3720 


AAAATATTCA 


TCAACTCTGG 


GTTACCATTT 


CTATATATAT 


TGTCCCAATC 


TTTTTCTGAA 


3780 


ACACTTCGCA 


CAGACTCCAA 


GCCGTAATAT 


TTTGCAACAG 


AACGATTAAC 


TTGTGCATCT 


3840 


GAATTAGTTT 


TTACATGCGA 


TAAATGATTA 


GAATAAGCTG 


AATATTTTGT 


TGTATTACTA 


3900 


TATGATATAT 


CTGGAAACAC 


AGGATTGATA 


TTTCCTTTTT 


TCTCTTGCTC 


AACTAAATAA 


3960 


CTATATTGTT 


TTGTCAAATC 


TAATTTTGAC 


AAGAATACAT 


CACTTACACC 


AACCATTACG 


4020 


TTCATGAAGA 


ATGTACATAC 


CAAAATCACA 


TACAAAACAC 


TGTAAATTGG 


AGAAAACCTG 


4080 


ATATTTACTT 


TATAATTTGG 


CAGTGACATA 


GAAAAAGCGA 


TAATCAAGAA 


TAATACGGCT 


4140 


CCATAATATG 


AACGTCCGTA 


ATCTAATGCT 


GCTGGAGAGA 


TGGCTAAAAC 


TAGCATAGTA 


4200 


GCCATTCCAG 


AAATAAAATA 


TACAACACTT 


AGGTATACTC 


TTTTTTTATC 


TTGACTAATA 


4260 


TAAGCATATA 


AAACAA6CAA 


CAAGATCAGT 


ATCAATAAAA 


ATAATGATAA 


GTGTTCTTTC 


4320 


ATTTiSCTCAA 


ATATTGTAAT 


AATTCCTGTC 


ATTAATTTTC 


TTGGTAAAGA 


CCAAGTACTC 


4380 


CTAGCAAAAT 


AAGTAGCGCG 


AATCTTATTT 


CCTGGTGCTA 


AAGCCATTAA 


CGCTAAACCT 


4440 


ATAGTATTAC 


CAGCAACACC 


CGTAAACATC 


CATAAAGAAA 


GTTTTCGTTT 


TGTATAAAAC 


4500 


TGCCAGCCAA 


GATACCCTAA 


AACGATAAGC 


AAAGCTCCTC 


CaGAGGTATT 


TTCATTACAC 


4560 


CAACCAGCCA 


AAATTCCTAA 


AGCAAACATC 


AAAAAAATCA 


TTAATTTTGT 


TTTCGTAAAG 


4620 


ACGTGTTCTT 


CAATGATTTC 


TCTATGATAC 


ACATAAAGGA 


ACGAAAGAAT 


AATTAAACTT 


4680 


CCCCAAAGAT- 


AATTAGCTGA 


GCCTGTCTCC 


CACAAAAACA 


CTTCTCCAAA 


GGCTGGGGTA 


4740 


AACAACCAAA 


ATAGCACTTG 


AATAATAATA 


AATTTAAATG 


AATTATACTT 


TTTATCTTGC 


4800 


TTTTGTAAAG 


TAAGTCTATA 


AACTAAATAT 


GTTACTATTG 


TGTATGCCAA 


AGCATTAAAA 


4860 
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ACATTAAAAA 


TCCCTTTTGG 


TAACAATAAA . AATATTCTTG 


QAATTACATG 


GACCACTGAA 


4920 


CGTCCTGTCC 


ATGTCATGTA 


TTGATJGTAT 


TCATCTATAA 


GTATCGTCAT 


CCAACTTTTC 


4980 


GTTTTATAAA 


GGTATTCATA 


GTCATCTGCA 


AGTAATGGTG 


TTAAATTATT 


TAAAGTAAGC 


5040 


ATCACCAAAA 


ATGTAATAAT 


AATTGAGCAT 


ATAAATATCT 


TCTTTTTATT 


CAATTTAACT 


5100 


TtCATCTCTG 


CTTACCTTCC 


GCTTTTAGTT 


CATACTTAAC 


ATATTTATCC 


AAACCAGATT 


5160 


GAATACTATA 


TTTCGGTTCA 


TAACCAATGG 


CACGTAATTT 


rGATATATCC 


GCTAAGGAAT 


5220 


CTTJcGATGTC 


CCCAGCTCGC 


GCTTCTTTGT 


ATTCAACTGG 


CAAAGTGACT 


TTCATAATAT 


5280 


CATTTAAAGA 


AGAAATTAAC 


TCATTTAAAT 


CGGTTGCTTT 


CCCAGTCCCG 


ACATTATACA 


5340 


CTTCCCCAAG 


TGATTGGTCA 


GAATGAGCGA 


CAAGATTCAA 


TGCTTGAACC 


ACGTCCTCAA 


5400 


CAAAGACAAA 


ATCACGAGAT 


TGTTTTCCAT 


CGCCAAAGAT 


ATTGAATGTC 


ACTTCTTGAT 


5460 


TGGCTAATAA 


GCGTTTGTAA 


GAGTCCATCA 


CAATCGAAAT 


AACGCCAGAA 


TAAGGCGAAT 


5520 


TAGGGTTTTG 


GTTTGGCCCA 


TAAACGTTAA 


AGAAGCGAAC 


AGCGCTTGTT 


GGAACATCAT 


5580 


ATAAATGACA 


ATAGTTTAAC 


ACATATTTTT 


CAGAAGCAAA 


TTTATCCACA 


GCATAAGGCG 


5640 


TTAACGGACG 


AATTACTGAT 


TCTTCCTGTT 


TTGGTAAAGT 


TGGCTCATCG 


CCGTAAACCG 


5700 


CTGCAGAAGA 


AGCAAACACC 


AAACGTTTTA 


AATCTTTCTG 


GTATTTCCGA 


ATTAATTCCA 


5760 


GTAATTGTAA 


CACACTCTCA 


AAATTCACTT 


GGTGTGTTTC 


CACTGGACGT 


GCTACTGAGT 


5820 


CTGCTACACT 


AGCAATCGCG 


GCTAAGTGAA 


AAATGTAATC 


GAACTGATAC 


TCTTGTAATA 


5880 


CTTTTTCCAT 


TAATTGTTGA 


TCAGTCACGC 


TACCTTCAAT 


GAACGTGATG 


TTTTCAGATG 


5940 


CGTTTAAATT 


TTCAGTTTGG 


CCCATTGATA 


AATCATCAAT 


GACCACAACT 


TGATTGTCTT 


6000 


TACTATaATa 


ATTTGCTAAG 


GTTGAGCCGA 


TAAAGCCGGC 


GCCGCCTGTG 


ATTAAAAAGT 


6060 


TCCTCAAGTT 


TCCCACTCCA 


ATTAAAATAA 


TACATTAATT 


TTACCATTTT 


TAGGTGTCAA 


6120 


AGACTACAGT 


TTGTTTTTTC 


AATAAAAAAG 


GAGAGAAAAT 


AACTTTCTCA 


ACCTTTTACT 


6180 


AAATATTTTA 


AATTCTTTGC 


GTTATTTACA 


AAGCTCTTAT 


GAATGAATAA 


GAAATAGATA 


6240 


ATATACACAA 


TTGCATAAGA 


AATGAAACTA 


TAAATACTAC 


CGCCAAACCA 


AAGATTACTA 


6300 


AAGATAAACA 


ATAATGTAAC 


CAACGTTTCC 


AATGCAGTGC 


TATTTCCTAT 


TTTAATATTC 


6360 


ATATCTTTGC 


ATAGAAAAAT 


TTCCGCTAAA 


TTACAACGAA 


ATGCTAAACT 


TACTAATATC 


6420 


AATCCAACAG 


TTAAATTCAA 


GTTCCCAACA 


ACGAAAATCA 


CAAACAACGA 


TAAGATTAAC 


6480 


GACAAGGCAA 


GTGTCmATaC 


GtTAACAAAT 


AAAATTGTTT 


TTTCTTTTCT 


CAACGTTTTT 


6540 


AAATACGTGT 


TAATTA6TAA 


AGACATCCGG 


CCTTCATAAA 


TAACTATAGG 


AAATAGAATT 


6600 


CCCATAAATT 


TCAAACTCTC 


GCTATATTCG 


GGTAACCACA 


TACCTAAAAC 


AAATTTGACG 


6660 


GGTACATATA 


ACAGCAGAAT 


TGCATACGTC 


AACGGAACAA 


AAACACCTCT 


TAATGTGACA 


6720 


AATAAAGACG 


GcAATCGCTC 


TTTATTTGTC 


CGTCTTAACA 


ACGGAAACAT 


TACAATTCCA 


6780 


ACAGCATTAA 


TAAAAGTTAA 


AAACATATTT 


GATAAaCTTA 


AAGTAAAgGm 


CAaTTTACCA 


6840 
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AAGGTTTCGA TTGTCCAACG CTGTTGAACA AAAAASCGAA 
CTTGCTATAC TACTGAGCAT TAnTTnACTA CC2AT3TTAA 
GTAACrrCCT TCAAOTCAT CATGr7CACA CGCaACATGT 
CAAACCGTAA TAATCAGTTT A3AA;^^ATA TCCATAATSA 
CCTCCTAATG CAAAGTAAAT rSCA-VTCAAC AGGACATA7A 
AATTGAGCAT ATTCCTTAAT ACGATTTSTT Si^a-AAAA 
GCAATTGTAA TTACTGAAAC AATGGCCSTC AATATCAAAA 
GGCATGAAAA AGTAAGCCCA AAGAArGACA AAAGCftGAAA 
AAATACCAGA ATTGAGAACC GAGATTTCTT TTATC^AAAT 
ATTTTTAAAT AGATACCATC AATCCAGCCT AASTGGAGAA 
AAAACATAAA GTTGCCAATA ACTGTA7TCT GTAACGCCTA 
TTTAAGAGAA CCGAAATACC CAACGTTGCA AAATT7GCAG 
TTAGCCACCA TTTTTGCTTT TGAATTCATA SaATTATGCT 
CATGATATGG TAGA7ATCTG TSCTA-^UiATA AAACTZGTGA 
TAGAATCGGG ATTATTTTTT TTCTTTCTAA TAAAG-^-CT 
CAAATTT6AA AGAACTTTGT ClTTArATTC AAATTCTTCA 
ATAATTCTTT GCAGCCTCTG TACGC7GTAA rCTCCGGCGT 
AAAGTGATTT TGTTTC7TAG CTAAArCAAT TTCTKTTTA 
ACGCCATCTA ATTAATTCTT CCCTAACAAT ATAGAGCCCA 
CCACATCGCC ATATCATGTA GAGGArrTTC CATTTCAGAT 
AATGAAGCTC TTTCTAATTG CATATACACA TCCCGSGCGA 
ATTAAATATA ATCTGTTCTG CTTTTrGATT AGSCA^JTTTA 
TTCCTCTGCT AATCCCCCAG GTTCTACTAA rTCAGTATAA 
ATCAGTGTTT GCAAAAGCTT CACACA7CAC TTCAATrTTA 
TT6ATCACAC GTAAaAACAA TTTCTTTAGT AGCAGCATTC 
GCGCCAACCT AA AT TTTC CG kGTTGATTtG CAAaTSCcAC 
AGAAaTAAAG TtCTCAATtA TTTtAACTGT rGAaTCyGTA 
TACTTCATCA ATTTTTTTCG ACTGArTTTT ?AAGG?ATCC 
(2) INFORMATION FOR S2Q ID ysot 107: 

(i) seqoei«:e CHMACTEHISTICS: 

(A) L2NGTH: 1508 base pairs 

(B) * TYPE: nucleic acid 

(C) S?RANDE3»2SS: double 
CD) TOPOLOGY: linear 



rGGTTCCCAT TATTnGCnTA 6900 

TASTATCAaC, ::AmCAGGA 6960 

CTTTAACTCT AAACATTCCC 7020 

TTASCCAATA AAAATCTCGT 7080 

AATAGCGATC ATTTCTGGAT 7140 

CATACAGAAT AAACGTTTTT 7200 

TAATCCCTTG ATATTCTTTC 7260 

TACAACATTC AAAAATCGCT 7320 

TTTCGTATTC TTCTCCCCCA 7380 

ATCCAACATA TGAAGAATAA 7440 

ACAACTTAGG GACAAATAAA 7500 

CCAXAGTATA ATAAAGGTTT 7560 

CACTTTCTAA TTTTATATTT 7620 

TGATACTTAA AAAAGGCACA 7680 

CGCATATTCA ACTCTTTAAT 7740 

ACATGTGGTT GATTATTTAA 7800 

AAAGTATTCA ATCTTTCCTC 7860 

AAGGAACTTT GTCCATGTTT 7920 

TCiwTTAATA AAGAACTTCC 7980 

GCGTATAGAT TAACATTGGC 8040 

CGTAAAAATA AATTGTTCTC 8100 

ACnrSAATCAA TTTTTTTTAA 8160 

TCACTTACTA AAACTTTAAC 8220 

GATGCATCCC ftAATATCGTC 8280 

AATAGATTAA AGAAATTTTT 8340 

GTTtCACCTA AwTTAwTTTC 8400 

GrGCAGTCAT CCCAAATTAA 8460 

^JVCTGTTCTA AGATATTTT 8519 



(xi) SEQUEKCS DESCRIPTION: SZQ ID NO: 1C7 : 
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TTTTGGGGGG ATAATCAAAT tCCCCTGnCT TCCGCCACAA GCCACACAAT GATGCCcATT 60 

CTGGTCGTAC ACCAAACCGT TCTAACAATT GAATTTCAAA AATATTTGTA ATGATTTCCg 120 

GATCTTTTTG CTCaTTTAAA GCGGATAATG CCATGTGGGT AAATTGAAAT AAGTTCGGAT 180 

CATAGTGCTG ATCTTCAATT GCTGCATCGA CTAAGTTTAA AATATATGTT CCATATGCGT 240 

TAATAAATAT GTCTTGTTGA ATGTTTCGAA AAGGTTGAAC TTCTTTGCTG CTATTGAGAA 300 

AAGATAAGCC TTCTTCTCTA AAGTTTCCGA TGTAGACGGC TTCTGTAAAT GGCAGAATCG 360 

CTGGCAAAAG AGGATTATTT TTTCGATGCG CACCTTTCAC AAAAAACATC AATTTTCCAT 420 

AGGATTCTGT AAAAATTTTG ACGAGTTTAT CCTTCTCTTT AAAATCCTTC GTAAATAAAA 480 

TAATGCCTTT GGTCTCCCCT AATGCCATTC AATTCACTTC CATTTCTTAT GTAAATTAGT 540 

GTAGTGTTTG -GAAAGTTCGT TTATACACTC GCTTCCCAAA CACTACTTTC TTAATATTCT 600 

TCTTTGCGAT AACCAAAATC TTGTAAATAG ATTTTTTTGT CACGCCAGTC TTTTTGGACT 660 

TTTACCCACA GCTCCAAATA CACTTTGTCA TCTAATAAAT ATTCAATATC TTGACGCGCC 720 

TTGGTACCAA TTTGTTTCAG CATTTTTCCG CCTTTGCCAA TAATGATACC TTTTTGACTG 780 

TCCCGTTCCA CAATAATGGT TGCTTGGACA TGGACTTTAT CGTTTTCATC CCGTTTCATT 840 

GAGTCCACAA CAACAGCGAC AGAATGAGGa ATCTCATCTC TTGTTAAAAG TAACACCTTT 900 

TCACGAATTA GTTCAGAAAC GATAAAATAT tCTGGATGAT CCGTAATTTG GTCaTCaGGG 960 

AAATATTGTG GTCCTTCTGG CATTTGCtCA ACGAGAACAT CCATCAAATG CTsCAAAATT 1020 

ATTTCCTTCA GTGGCAGAAA TGGGAATAAT TTCTTTAAAG TCCATTTGTT TCGAATAGTC 1080 

TTCAATAATA CTCAATAAAT CATCTGGATG GACTTTGTCA ATTTTATTAA TAATCAAGTA 1140 

GACTGGACTA TTATTATTTT TCAATCGTTC AATAATAAAG TCGTCACCTT TCCCGCGTTT 1200 

TTGATCAGCA CTTACCATAA ATAAAGTAGC ATCTACTTCA CGCATCGCAT TGTAGGCACT 1260 

TTCCACCATG AAATCGCCTA AGCGGT6TTT AGGTTTGTGA ATCCCTGGAG TATCGATAAA 1320 

AACAATTTGT GCTTCAGGAA TTGTATAAAT TCCTTGAATC TTATTACGAG TTGTTTGTGC 1380 

TTTGTCACTC AtAATTGCAA TTTTTTGGCC GACAATTCGA TTTAACAATG TTGATTTnCC 14 40 

AACAnTTGGT CTGCCGACAA TGGGCTACGA AACCAGATTT AAGGTTCAGT TGTCATGGAA 1500 
TnCGCCGC 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5918 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



1508 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
ATGAAGTATT CAAAAGGACG GTACGTCCTT GTAGTCTTAG TAATGGTCTT GATTGCTTGG 



60 
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TTAATTTTTA 


TTTTGTCAGG 


TTTAGCGAAC 


GGTTTAGCTC 


AAGGAAATCG 


TTTAGCCGTA 


120 


GATCAATGGC 


AAGCGAATCA 


AGTGGTTTTA 


TCAAAAGAGG 


CCAATAGTAA 


TTTAAATGTA 


180 


TCAGTGTTAG 


ATGAAAACGT 


GAAAGAAACG 


ATTTCAGGAG 


GCAAAATTGC 


ACCGATTGGT 


240 


CAACAATCTT 


TAGCCATCCG 


CCCAGCAGAT 


GATAAAAAGG 


CTGAATTAAC 


GAATGTTAGC 


300 


TTATTTGGGA 


TTGAAAAAGA 


AAGTTTTTTA 


ATGCCAAAAG 


TGATTGAAGG 


AAACGCATTT 


360 


ACTGATAAAA 


ATCAAGTGAT 


TGCTTCaGAA 


ACATTGAAAA 


ATCAAGGATT 


TAAAATTGGC 


420 


GACAAATTAA 


CTGCAGGGAA 


ATATGATGAG 


CAATTAGAAA 


TTGTCGGCTT 


TATTTCTAAA 


480 


AGTAGCTATA 


ACATTGTGCC 


AGTTATTTAC 


ACTTCTCTGG 


ATACTTGGCG 


GTCAATTAAA 


540 


TATGGTGACA 


ATCCAGCAAT 


GGCCAAAATG 


GTTAATGGTT 


TCATTGTTCG 


CAGCAAGGAC 


600 


AACACGGAAG 


TTAAAACGAC 


TAATAAAGAC 


AGCCAAGTTC 


TTTCAATTTC 


AGATTTTATT 


660 


GAAAAACTGC 


CAGGATACAG 


CGCTCAAAAC 


TTGACCTTGG 


ATGGCATGAT 


TTATTTCTTG 


720 


ATTGTGATTG 


CAGCGTTCAT 


TATCGGTATT 


TTTATCTTTG 


TGATGACTTT 


ACAGAAAACA 


780 


GCAATGTTCG 


GCGTCTTAAA 


AGTTCAAGGG 


GTGCCAACTA 


GTTTCTTAGC 


AAAGGCAGTC 


840 


ATGCTACAAA 


CCGCTTTATT 


GGCTGTTCTA 


GGAGTGGCGA 


TTGGTCTTGC 


TTTAACGGGA 


900 


ATTACCGTGC 


TCTTTTTACC 


AGAAGCGATG 


CCTTATGCAA 


CTAATGGACC 


ACGCATGATT 


960 


TTGTTTAGTG 


TACTATTGAT 


TTTATCTGCA 


TTAATTGGCG 


GGGCATTTTC 


AATTCGAACG 


1020 


ATTGCTAAAA 


TTGATCCGTT 


AATCGCGATT 


GGAGGTTAGA 


GACATGGCAA 


ATGTTTTAGA 


1080 


AATGAAAAAT 


ATTTATAAAA 


AATATGGTGA 


AAAACATACA 


GAAGTGATCG 


CGTTAAAAGA 


1140 


ATTATCTTTT 


GCGGTTCAGC 


CAGGTGAATT 


TGTCGCAGTA 


ATTGGTCCTT 


CTGGTTCTGG 


1200 


TAAAAGTACC 


TTTTTAACGA 


TTGCAGCTGG 


TTTACAGGCA 


CCGACAAGTG 


GCGAAGTCAT 


1260 


TGTTGGCGGG 


CAATCACTAA 


ATAAGTTAAC 


GAAGAAGCAA 


CGTTTGGCGC 


MCGTTTTCA 


1320 


AAAAATCGGC 


TTTATTTTAC 


AAAGCTCTAA 


TTTGGTGCCA 


TTTTTAACAG 


TGGAAGATCA 


1380 


ATTTCACTTA 


ATTGAAAAAG 


TTGATAAGTC 


ACGTAAAAAT 


AGTGAATTAA 


AAGAGCAATT 


1440 


6TTAGAGACG 


TTGGGTTTAA 


AAGAATTACG 


AAATAGTTAT 


CCTCGTGATC 


TCTCTGGTGG 


1500 


GGaAAGACAA 


CGGGTAGCCA 


TTGCGTGTGC 


ATTGTATCaT 


GAGCCAGACG 


TAATTTTGGC 


1560 


AGATGAACCA 


ACGGCTAGTT 


TAGATACAGA 


AAAGGCGTTT 


GATGTTGTCA 


AATTACTAGC 


1620 


GAAAGAGGCC 


AAAGAGAAAG 


ATAAAGGGAT 


TATCATGGTG 


ACACATGATG 


AGCGTTTGTT 


1680 


GAAATACTGT 


GATCGAGTGG 


TTCGTATTCG 


TGATGGTGAA 


TTGACAGAGT 


AACAAAACAA 


1740 


ATAAAAGaGA 


ACATGATCGG 


ATTGCTGGAG 


CAATCCGATC 


ATGTTCTCTT 


ATTTCTTTTC 


1800 


AATACAGAAA 


AGAACAGGTG 


GCTGATTTTT 


TTGATTGATA 


AACTCATAAC 


GTAAGACACT 


1860 


GTAAACATCT 


TGTGGTAAGG 


CTTGCGTGTA 


ATTCGTGACC 


AAATCAAGTT 


CCTCTGAACC 


1920 


ACCTTCGTGG 


CCATAATAAA 


CGACTAAAAT 


AATGCGACTT 


CCTTTTACCA 


AATGAGGCAG 


1980 


TAACCCATCT 


AACGCCTGTT 


TAGTCGTAGT 


GGGTTTTGTA 


ATAATTTCCT 


TATTACTTTT 


2040 
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AGGTAAATAG 


CCTAAGTTAA 


AAATAGCGGC 


AGTGATTTCT 


GTTTCTTCAT 


CTAGGACAGC 


2100 


ATGAAGATGT 


TCATGTCCTA 


ATGGAAATAA 


CGTAGTTTGT 


GGGAGTAAAT 


TAAGTTCTGT 


2160- 


TAACTTTTGT 


TCTGTATTGA 


TCAACGCTTC 


TTTTTGGATA 


TCAAAGGCGA 


AAACTTCACC 


2220 


AGAAGGGCCA 


ACCAGCTCTG 


CTAAGAATGC 


TGTATCATGA 


CCATTTCCCA 


TTGTTGCATC 


2280 


AATGACGAAG 


tCGCCAGGTT 


CCACCACTTC 


TTTTAAAAGT 


TGGTGACTAA 


AATGTAGGGC 


2340 


TGTTTGTAGC 


ATTAATTGAA 


AACTTCCTTT 


CCTGAAATGA 


CAGTATATTT 


TCCTTGATAA 


2400 


CTGTTGCGGC 


GTTTCATTTC 


CGCATCAATC 


GCATTGAGAA 


CTTCCCATTT 


TTTCAAGCTC 


24 60 


CACATCGGTC 


CGATAATTGT 


TTCAAAG6GA 


GCATCGCCTG 


TCAAGCGATG 


AATAACAATT 


2520 


TCGGGTGGAA 


TCATTTCTAA 


CTGATCACAA 


ATCACTGAAA 


CATAGGCTTC 


TTTGCTCATC 


2580 


AATTGTAAGC 


GTCCTTCATT 


ATAATCCCGC 


ATCATTTTGG 


TATTTGTCAT 


TAAATGAAGT 


2640 


AAGTGCAGTT 


TAATCCCTTG 


GATATCTGAA 


TCCTGAATAG 


TTCGCCGCAC 


ATTTTCGCGC 


2700 


ATCATGGCAG 


GTGTTTCCCC 


AGGTAGCCCG 


TTAATTAAAT 


GTGTACAGAC 


CCGAATCCCG 


2760 


TGTTTCCGTA 


ATTTAGCCAC 


ACCATCTAGA 


TATGTTTGAT 


AATCATGAGC 


ACGATTAATG 


2820 


GCAGCGCTAG 


TTTCCTCAAA 


CGTGGTTTGC 


AAACCCAACT 


CTACCCATAA 


ATAAAAGCGT 


2880 


TCATTTAATT 


CGGCTAAATA 


GTTCACTACC 


TCATCAGGCA 


AACAATCAGG 


ACGGGTACCA 


2940 


ATGGATAAAC 


CAACGACACC 


TTTTTCATTG 


ACTACTTGTT 


CAAACCGATG 


ACGAATAACG 


3000 


TCAACGGGCG 


CATGTGTATT 


GGTGAAATTT 


TGGAAGTAAA 


CAATGTATTG 


ATCGACAGTT 


3060 


GGCCATTTTT 


GATGCATCAA 


TTGGATTTCT 


TTTTGAAATT 


GAAGCGGCAG 


GGGATCGCTA 


3120 


GGTGCAACAA 


TCATGTCGCC 


TGAGCCAGAA 


ACACTACAGA 


ACGTACAACC 


ACCTTTGGCA 


3180 


ACCGTCCCAT 


CACGATTGGG 


GCAATCAAAG 


CCACCATCAA 


TTGGCACTTT 


AAAAATCTTT 


3240 


TCACCAAACT 


GTTGACGAAG 


TGCATAGTTC 


CAAGTGTGAT 


AGCGTTTATT 


TGGATCATCT 


3300 


GAATAGGGAA 


AAATTTGCAT 


AATCAAGACT 


TCTTTCTAGC 


GTTGTTTGCT 


TTGAAATCGC 


3360 


CAAACAAAGC 


GTTTTTCTAA 


ATAAATAGGG 


TAACTTAATA 


ACAACCAAGC 


TAAACCAAGA 


3420 


CAAAAACCAC 


CTAAGATATC 


GCTAGGGAAA 


TGGACACCTA 


AATAGATTCG 


ACTAACCCCA 


3480 


ATCATGAAAA 


TCCCAATACC 


TAGTAAAATT 


TGCACACAAA 


GGCGGACCGT 


TTTATTTTTA 


3540 


ATGAAAATTG 


GCAGAATCAA 


TAAAATAGTG 


CCGTAGAACA 


AGGTACTTCC 


TACAGCATGT 


3600 


CCGCTAGGAA 


AACTATACGT 


ATGTTCAGTT 


ACTAAATGCT 


CCAAAGTAGG 


ACGTTGGCGC 


3660 


ATAAAGAATA 


ATTTTAATAA 


AGAATTACCT 


ACGCCAGCAA 


TTAAGGCAGT 


ATTAATAAAG 


3720 


AGCCATAATG 


CTTCAGCATA 


ATATTTCCAA 


ACAATAAACA 


GAGCAATGAC 


AACGATTGCT 


3780 


AAGATGATAA 


TCGTCAATGA 


ATTGGCAAAT 


TTGGTGTACC 


AAATAAAAAA 


GTTATTTAGC 


3840 


TGTGGAAAAG- 


GACGACGAAC 


AAAAGAAGTA 


ATCGTTTGAT 


CAAAGCCTTT 


GAGCCAGGTT 


3900 


GGATAAAAAC 


GAACAACATA 


GCCTAAAAAC 


ATAAATACTA 


AAAGAAAACA 


ACTACCTGCA 


3960 


AATTGATAAT 


AAAGTTTGTT 


TTTCATAAAA 


AATCCTTTCA 


AAATTTAATC 


GCAACAATCC 


4020 
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GTATTATACC 


ATCAAAAGGG 


TAAAAAATTA 


AAAGATAATT 


TC^TTGCCTG 


ATTTGGA6GG 


4080 


GATACAGATA 


TTAAAAAAGt. 


TCIGGCACCA 


CAATCTCTCA 


TTGTGGTGCC 


AGmACAGATG 


4140 


TTTTAACTGG 


CACATGTTTA 


TTGATTATCA 


TCATCAGGAG 


TATCAACGGT 


AATATCTCCG 


4200 


AATTCGCTAG 


AAAGGTTAAA 


CTCTGTATCC 


GCACGACCAT 


TCTTATAAGA 


TGTTCGCTCT 


4260 


TTGTTTAAAA 


GTTGAACATC 


GCCTAATTTT 


GATTCCCCTT 


TGATTGAAAT 


GTTACGCAAA 


4320 


ATTTCTCCAG 


AATTTACATA 


AATATCTCCA 


TGGACAAGAG 


CGACAGTTAA 


TTTTTTAGGC 


4380 


ACAGCAGGCA 


TGTCTAAAGA 


AAAATCACCA 


TTTTCACTTG 


AAAAGTTAGC 


TTCTCCTTTG 


4440 


AAATCATAGA 


CGCTAATATC 


GCCATCTGTC 


GTTGTTGCTT 


GGATGTTTCC 


ACGAGCATTT 


4500 


TGCACTCGAA 


TATCACCAGA 


AGTAGTCGTA 


TTTTTGATTT 


CATCGAAATA 


GGCTGAATCC 


4560 


AACGTTATAT 


CGCCGTCAGT 


GGTTTCAACT 


GTTGCTGTTT 


TAGCGGAAGT 


ATCCGCTGAA 


4620 


AGGTAGATAC 


CATCTGAAGA 


GTTAAGCGTC 


ATTTTTTTGG 


CTTTAACACC 


CGATAGACTA 


4680 


ACATAAGAAT 


TTGGtAGATT 


AGTTGTTAAT 


TCGTCTACGT 


TAAAAGCATT 


CAATGAAACT 


4740 


GGTGAATGGG 


TTGAGCCATC 


TU^TCACTAAC 


TTTTTCACAT 


TATTAGGAAT 


CTGGACACTC 


4800 


ACGTTCGGTG 


TGAAGTCACT 


AAAAATACCA 


ATATTAAAGA 


ATGACAATTC 


GGATTGGTTT 


4860 


TGTGTACGAG 


TGGTATTGAT 


GGTTGCTGTT 


ATTTTATTTT 


TTTCTTCTTT 


GACATCTAAG 


4920 


GTGCCTTCAG 


CTTTTGAATA 


AAAGGCATTA 


TTCGGTGATC 


GTAACCCCAT 


TTTGTTACTA 


4980 


TCAGTTTTAT 


AAATGGTATA 


AGGTCCTTTG 


CCTTTTAAGG 


ATAAGTGAAG 


TTCTTCGCCT 


5040 


GCTTGTTTAT 


TTTTTAGTGT 


ATAACTTTCT 


TTTAGCGGTG 


TGACTGATTT 


TTCCGCTTGC 


5100 


TTGTAATAAT 


AGACGGCGCC 


AGATCCACCA 


ATTAGCATCG 


CTAAGACAGC 


AATGGATAAG 


5160 


AGAATCCCTG 


TTGTTTTTTT 


CaTTAGATAT 


CACCTCGTAA 


AACAGCAATG 


TTCCATTGGA 


5220 


AATAACGCTT 


AAAGACATGT 


GCTGAAAATT 


TTAAAATTGG 


TAGAAACAGT 


AACCAACCAA 


5280 


TGATTGATGC 


ACCAAATAAA 


AAGAGACTCA 


TAGATAATTG 


GAAAAAGGAA 


GCGGCATTTG 


5340 


CTTGGGTAAT 


AACACTATAA 


ATaCCATAAA 


TTGGTGAAAA 


TAAAGTTGCA 


GCTCCGGCAA 


5400 


TGCroACAACC 


GACGTACATT 


AAGGCAATCC 


CTAAAATAAC 


CCAAAACATG 


AAGrAAAAAT 


5460 


TTAAACAGAA 


AATACCAAGC 


ACTTGGAAAA 


ATCGGGTTAG 


TGGCCGATGC 


TGTGGTCGTT 


5520 


GATAGTATGG 


AGAATCATTA 


TCATAGTAAA 


ATTCAGATTC 


TTCTTCATAG 


GGATGATCAT 


5580 


AAGCAGGTGT 


AGCCGCTGGA 


ATTTCTTGCC 


AGCCATTATT 


TTCAATTTTT 


CGTTCCGTCA 


5640 


ATTCAATGCC 


TTGTTCTTTC 


AAAATTTCTT 


CTGCGATAAC 


CCGTGGTTTC 


CCC/^TGATT 


5700 


TTGCCACTTG 


TTCTTCTGTT 


TCGCCTTCCG 


CAACACGAGC 


ATCAAATAAC 


GCCTTGTATT 


5760 


TCGCAAGAAT 


AATAGCTTGA 


TCTTTTGGAT 


TTAATGCTTT 


TAAATAAATT 


TTCAGCTCAA 


5820 


TAATAAAATG 


TTCTTTATTC 


ATTCCGCTCC 


CTCTTTCTAC 


ACGATTCATT 


TCTCATCTTA 


5880 


AACTATAATT 


TCTTTTTGAG 


CGAATAACAA 


GAGGAATT 






5918 



(2) INFORMATION FOR SEQ ID NO: 109: 
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(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 2350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



PCTAJS98/08985 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

CATACCCGCT CAATCATGAG GAGACTGGCT AAAAGTAATC CTAGCGTTTG CATCTTTTTA 60 

TTTTTATTAA AGAAGTGCCC TGCTAACCAA CCTTGTCCGC CATGAATCAA TAACGAAAAA 120 

ACGAACCATT GTGGATAGCC TGACAATAAA TCAATTAAGC CACCGCTTAA AGCTCCGACT 180 

GTCAAGCCAC CGATCGGACC AAAAAGTAAA CTAGCAATGT AAATCCCTGC ATCCGCCAAT 240 

GTAACAAAGC CATTTGTTGC TGGGACGGGG ATAATAAAGA CCATTGACAC AGCCACTGTT 300 

AAAGCGGTAA TTACTGCATA TAGACTAATT TTTCTTGTCA TTATCCGCCA CACCTCCTGC 360 

TAGTAATCCT TCATGCCACA CACTTCCAAA AGCTGCATTT AGCTGGACCC CAGAAACAAT 420 

GGCTTCATGA ACGTATGTTT TTGCTAAATC AACTGCTTCA ATTAACGGAT ATCCTTTAAC 480 

TAAATTCGCA CCAATTGCAG AGGCAAAGGA ACAACCAGCG CCATCAATGG TGTGGGCCAT 540 

TACTTTTTCT TTGACCAATT GAATATGTTG TTGTCCATCA TAAAAAAAGT CACGTGCTTC 600 

TTGACCAAGC CAACGATTAC CACCTTTAAT CACCACCGTT GCCGCTCCTA AAGAGTTTAA 660 

CTCTTCCGCC GCTTGTTGGA CTTCTTCGAC GGTTGTTAAT TTTCTGCCAA TGAGTAATTC 720 

TGCTTCTTTT AAATTAGGCG TTATAATCGT TGCGAATGGA AAAAGTTTTT CAATCAATAA 780 

CTGCATGTAT CTTTTTTCAT AAACTGAAGT CGTTTCTTTA AAAGCCAAGA CAGGATCCAA 840 

CACCACCGGC AGACCTGCGT GCTTTTTCAA AAAGGTTACA ATTTGTAACA TACTTTCTTC 900 

ACTGTGAATC AAGCCAATCT TGATGCCGTC TAAACTCCCG GcAGGTATGG AAGCGAGTTG 960 

CTCTTTccAC CAAAGAAgTC TCaAGGTCAT GAATCTGAAA ATGATTCTTG GGGGCTGTAG 1020 

CAAGGCAAGT TAAAGTTGAT AAGCCAAAGA CTTGATGATT TTCAAACGTT TTTAAATCTG 1080 

TTTGAATCCC GCCACCTGAA AAAGGATCGG AACCGCCAAT TGTTAAAATT ATTTTTGTCA 1140 

TTTGTTTCGC TCCTTTATAC ACTCAGTTAT CATACAAAGT TTTCTAAAAA GAAAAAACAG 1200 

CCAATTGGCA GATTTTTAGC TCAGCCACTT TTTAAGTCTT TCGTTATCTT TTTCCTTTCC 1260 

AAAAAAAGAG TGACCTCATC CGTCAATGAA GTCACTCTTT CTTTGGATTT AACACTTATT 1320 

CGTCAATGGC TGTTACATCA ATGTTTAATT CAAACAATTG TAATGGAGAA ACTAAACTTG 1380 

GTGCGTTGTT CATTGGGTCA ATTGCTTTGC CGTTTTTAGG GAAGGCAATC ACTTCACGAA 14 40 

TATTTTCTTC CCCAGCCAAT AGCATTGCTA AGCGGTCTAG CCCTAAAGCA ATCCCTCCAT 1500 

GTGGCGGGAA GCCGTAATCT AATGCGTCTA GCAAGAAACC AAATTGGTCT TGTGCTTCTT 1560 

CTTTTGTAAA ACCTAACGTT TCAAACATTT TTTCTTGTAA TTCTCGTGTA TGGATCCGTA 1620 

ATGAGCCGCC GCCTAATTCG TAGCCGTTTA AGACAACATC GTAAGCTTCA GCATACACAG 1680 

AGGCTGGGTC TGTTGCTAAG CGGTCCACAT CCTCTGCTTT TGGTTGTGTA AATGGATGAT 1740 
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GAGCAGAAAC 


ATAGCGACCA 


GCTTCTTCAT 


CATATTCAAA 


CAACGGCCAA 


TCCACAATCC 


1800 


ACAAGAAATT 


GAATTTCGAT 


TCATCAATTA 


AACCTAATTC 


TTTACCTAAA CGTGTACGTA 


1860 


CCGCACCTAA 


AGCAGCGGCT 


ACGATTTCAG 


GTTTATCGGC 


ACCAAACATT 


AAAATATCAC 


1920 


CGACTTCTGC 


ATTGGTAGCA 


GCAATTAATT 


CATCACTAAC 


TTCTGTTAAG 




1980 


TCGGTCCTTT 


TAAGCCATCT 


TCTTCAACTT 


TTAACCAAGC 


TAAGCCTTTT 


GCGCCAAATT 


2040 


GACTCACATA 


CTTACCTAAG 


TTATCCATAT 


CTTTACGAGA ATACTTATCT 


GCGGCGCyTT 


2100 


TTGCATTCAA 


AGCCTTCACA 


TGTCCACCAT 


TTTCCAACGC 


TGCTTGGaAA 


ACTTTGAAGT 


2160 


CTACGTCTtT 


CACTACCTCT 


GCAACATCAA 


TTAATTCCAT 


AGCAAAACGA 


GTGTCGGGTT 


2220 


TATCGyTACC 


ATAGCGTGCC 


ATTGCTTCGT 


CATAGCTCAT 


TCGTGGAAAC 


GGAACAGArA 


2280 


TCTCAATTCC 


TTTTGTCTCT 


TTCATCACTT 


CAGCCAACAT 


ATTTTCTGTA 


TATGTTTGAA 


2340 


TTTCTTCTGG 












2350 



(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21591 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 



TGAATTGGCC 


TCGATGACAG 


GTCTTCTACG 


AGGATACGTG 


CAACAATTAG 


ACGTCCCTGA 


60 


ACAGCGTGCA 


CTAACAGAAG 


AACTAGCCAA 


AATTTGCGAG 


TTAATCTACC 


ATGTCAATCC 


120 


AACGACCCGA 


ACAAAATTAA 


CAGTCACAGA 


GGACGAAATA 


GCCTGGCTTT 


TGGAACGTGT 


180 


CAATGCTATG 


AATGAACTGA 


CATACGAAGA 


AAATCGTCCT 


TTTGTTTTAC 


CGATGGGCAC 


240 


AATCTGTTCC 


TCTTATGCAC 


ACATTTTAAG 


AGCCAAAGCC 


AAAGATATTG 


TTCGTTTACT 


300 


TTATCGCATG 


GACTATGGTG 


GTAAAAAAAT 


TGATCCGCAG 


CTGTATGATG 


TAGTCAACTT 


360 


ACTCTCAGGT 


TACTTTTTCA 


TGTTAGCCTT 


GTACCTAAAT 


CAGCTCGAAA ATGGTGAAGA 


420 


AGTTCCCTTT 


GTTTCTCGCA 


ACTATTCTAT 


TTAAAAAGTA 


CACCGACCGC 


TAACGACGAA 


480 


TTGCTAGCGG 


TCGGTGTACT 


TTTATACAGG 


TAAGTACGCT 


TCTCCTTTTG 


CAAATAGTTT 


540 


AGCAGTACCA 


CCAATAATGA 


CTCGATTTTC 


TTTCACTTCA 


CAAGTTAAAA AGCCGCCTCT 


600 


TGGGGACACT 


TGATAGGCCG 


AAAGGGTCGT 


TTGATTGAGC 


CGTTTTCCCC 


AATAAGGAAT 


660 


TAAATTGGCG 


TGCGCAGAAC 


CACAAACTGG 


GTCTTCGTTA 


ATTCTTAGTT 


TCGGGAAAAA 


720 


AGTTCGTGAA 


ACAAAATCCA 


CCGAGTCTCC 


TGATGCCGTT 


ACAATGACTC 


CTACTCCTAA 


780 


ATCCAATGCT 


TTTAATGCAG 


AAAAATCTGG 


TGTGATTTTT 


GCCACTGTCT 


CTTCGTCTTT 


840 


TAAGACAAAA 


AATAAATCGC 


GACCCAAATA 


AGCTTCATAA 


ATTTTCGTGC 


CTAAAGCGGC 


900 


TTCATACTCT 


GGCAAAATTG 


GAATCCTTTC 


CGGTAGGATA 


TAAGGAAAAT 


CTAGATAATA 


960 
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GTATTCCTCT 


TTTTTGGTCA 


CTGCGAGCGG TgCACTTTGA CTAGTAAAAT GGAGTGTTTC 


1020 


TTCCGCAACG 


GAATAATAGT 


TAAACAACAC 


AAAAGCCGTC 


GCTAATGTAG 


CATGTCCACA 


1080 


GAGATCGATT 


TCTCGTTCCG 


GCGTAAACCA 


GCGCAGGGCG 


TAACTTTGTC 


CCTCCTTGAC 


1140 


CGTAAAAGCA 


GTCTCTGACA 


AATTATTTTC 


AATCGCAATG 


TTTTGCATCA 


CCGCTTCTGG 


1200 


CAACCATTTT 


TCCAAAACAT 


AAACAGCGGC 


GGGATTGCCT 


TTAAAAACTT 


CCTCCGCGAA 


1260 


AGCATCGACA ATATAATAGG 


GATAACTCAT 


TTTAATTCTC 


TCCTTGTATT 


TTTTGATAAA 


1320 


TGTATTCCGC GCAACACAAA 


TCCTCAACTG 


CTAAGCCAAC 


CGCATCAAAA 


ATCGTGCGGC 


1380 


CTTTCTGCTC 


CGCCTTTAGT 


TTAACTTTGC 


CAGTGACTAA 


TTCGCCGAGA 


GAACCTACAA 


1440 


TTTTTTCAGC 


TGAAAA6GCA 


CCTTCTGAAA 


TCGGAATTAA AAGATCCCCG 


CTTTCTTTTA 


1500 


ATGCAGCCGC 


ATACTCATCT 


ACAAACACTT 


GACTGTTTTT 


AACAAGGTCG 


CTTGTTAATT 


1560 


CTCTGGAAGT 


CGGCGTGAAT 


GTACCAATTG 


CATTGATATG 


GGCATTCGGC 


AAAACGTCTT 


1620 


TTTCTTCTAA AAATGCGTCC 


TGACTGGAAG 


TTAGTGTACA 


GATAATTTCC 


GCTTCCTGGA 


1680 


CGGCTTCTTG 


GACAGAAGAA 


CAACCAATAA 


AGGTACATTC 


GGGGTAATTG 


TGCTGTTGCT 


1740 


TAGCGATTAA 


TTTTCGTGCA 


CGTTCTGCTG 


TGCGGTCATA 


TATAAAAACC 


TTTTCAATCG 


1800 


CCCGTUVCGCT 


TTGTAGCGCT 


GCGAGATGAG 


ATGAAGCTTG 


TTGACCTGCA 


CCAATCAAAG 


1860 


CGATACTTTT 


TGCCTTTGGA 


GCTGCTAAAT 


AATCGGTTGC 


CAAAGCAGAA 


ACAGCTGCCG 


1920 


TGCGAAsCCA 


CGTAATCGCA 


TTGGCGTCCA 


TCATCGCGAC 


TGGTCTTCCA 


GTAGAACTAT 


1980 


CAAACAATAA 


AATTTGACCA 


AGATGCGAGG 


CCAAATTTTT 


TTGTGGATTT 


TCTGGAAAAG 


2040 


CTGTAATGAT 


TTTTGCCCCA 


AAAAAGCGAT 


TTTTTCCTAA 


GTAAGCAGGC 


ATTAATGCAA 


2100 


AAAGATTTTG 


ATTTTTCCCA 


TCTGGTAAAA 


CCTCTACATG 


GCGCTCACTT 


TGCGAAATTT 


2160 


TTCCTTTTTG 


CAACTCTGCA 


AAGCAACGTT 


TCATCACTTG 


AATTGCTTCG 


GAAAACGACA 


2220 


ATTGTTGAAT 


AACCCTCTCA 


TAAGAAATAA 


TGTCCATCTG 


AATTCCTCCC 


TTTGCGCTAT 


2280 


CATAAACCAG ACAAACAATA 


CACTCAATGC 


TATAATTGTA 


TGGAATACAA 


TTTTATAGAA 


2340 


AGGAAGTTCC 


CCTCATGCCG 


ATTAATTCCT 


ATGAGAACTA 


TGTGTTGTCA 


TGGCGCCCTG 


2400 


TTAAAGAACG 


TTTAACAA6A 


CCTTATTACC 


AATCATTGGT 


TCAACAATTA 


GAAGCGGATA 


2460 


TCTTATCTGG 


TGCGCTTCAA 


AAAAATGTGA AATTGCCTTC 


GCAAAGAGAA TTAGCTGATT 


2520 


ACTTGGATCT 


GAATTTCACG 


ACGATTGGTC 


AAGCCTATAA ACACGCTATG 


GAAAAAGGCT 


2580 


TACTTTATAC 


CAACATCGGT 


TGCGGCACTT 


TCGTTTCTCC CAACGCGTTT CACTCAATTA 


2640 


CAATCTCTAC 


CAACCAAGTA 


GCGGACCATC 


TAATTGATTT 


AGGGCTGGTT 


AGTTCCTTTG 


2700 


ATATGTGTAA 


TCAGCGGATT 


TTACCTTTTA 


TTGAATCCGT 


TAGTAAAAAT 


GCGGCCCTTA 


2760 


ACAGTTTGCT 


AAATTATCGC 


GATCCTTTAG 


GTACGCACTT 


TCAACGAGCA 


ACCGCTGCCG 


2820 


AATGGCTTCA 


GACACAAGGC 


GTTCGGACCA ATGCCGAAGA 


AGTTGCCATT 


GTATCTGGTG 


2880 


TCCAGAATGG 


ACTGGCCGTG 


ACGTTAGCCG 


CCGCTTTTTC 


TCCAGGTCAG 


CGGATTGCCG 


2940 
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TAGATCGATA 


CACGTATTCA AATTTTATTG AACTCGCCCA GCTTTATCAT TTAGAAATCG 


3000 


TCCCGATTGG 


CTATGATTCC GAGGGGATGG 


ACCCAGAGCA TTTACTGCAA 


GAATGCAAAA 


3060 


AGAAAAAAAT 


ACATGGTATT TTCTTAATGC 


CCGCATGTAA TAACCCGATT 


GGTTTCCAAA 


3120 


TGTCTAGCGC CCGCCGAATG ACTTTAGCTG AAATCATTCA GCAAGAACAC 


TTATGGGTAA 


3180 


TTGAAGATGA CATTCATTCT TTTTTAACAA CTTATGCACA GCAAGCAGTT 


TTACCAACGT 


3240 


TTCAAGAACT 


CTTACCTCAA CAAACGATTT 


ACTTAGCGGG GATGAC/^AAA 


TTTGTCTGTA 


3300 


CAGGcTTACG CATTGCGTAC CTTGTTTTCC CTCCGCTTCT TCGCCAAGAA 


ATAGAGCGTG 


3360 


CGATTTTTAA CATTAACGTT AAAACTTCTG GCTTTGATGC TGAAATTGTC 


ACCCAAGTCT 


3420 


TACGCTCACC AGTTGCAAAA GAATTAGTGA TAGAAAAATT GGCTTTAACA 


AAACAAGCCA 


3480 


ATGACTTATT 


TGATACCATT TTTGGGTTAG 


CTAGACCTAG CAATCCCTTG 


CCGTACTACC 


3540 


GAACTATTCC 


CCTTTCTACC GAAAAAACGG 


CGCCGCAAAT TGAACAAGAA 


TTTCTCCAAA 


3600 


ATGGGGTACG 


TCTTTTTCAC TCGAGCCGCT 


TTACTGTGCA AAATCAGCCA GATGCTTTTT 


3660 


TACGAATTTC 


CTTGGCCTCC AATCAGCTTG 


AGGTATTGGC G7UVAGGATTA 


ACCATTATCC 


3720 


AAGAACTCTT 


ACCGACATTA AACGAAAAGA 


AAGGACATTC CCTATGAATT 


CATTGCCAAT 


3780 


ATTTGAAAAA 


GCAGTGGGCA CGATTACCCT 


GCTTTCCGAA GAAAAACAAA 


ATGCCGAATA 


3840 


TGAAGGCTTT 


TTATGTCAAG TTAATGAATC 


GCAACAACTT ATCCGCAGTC 


GCTTAGCCAA 


3900 


AAAAACGCCG 


AAAAAAGAAG GCTATTTTGT 


AGCCTTCTGG GAAAAAAATC 


AACAAAATCA 


3960 


GAACGAAGCC 


TTTGATGCAA CGGAAGCCCC 


TGAAATGTTA GCAATTGTCA 


TTGCTGATCA 


4020 


AGAAAAGCAA GGTCTCTTTC TTCTGCCAAA AGAATGTTTA ATCCAACAAA AAATTCTCAA 


4080 


AACGCATCAA CAAAAAGGCA AAATGGCGGC 


CCGTTTTTAT CCTTCTTGGT 


GCCAAAACCT 


4140 


TAATCAAACC GCCAAAAAAA CACAAAAATG 


GCAACTAACT TACTTTACTG 


ATTTATCTAA 


4200 


ATACTAAAAA 


CGTCAGCTGG CTTCAGAAGC 


CAGCTGACGT TTGTTTCTTC 


TACTGTTTAA 


4260 


CCAATTGGTG 


GCGTACCTTC TGCGTTCCCA 


GCAATCGCTT CGATTTGAAT 


CTGAGCACCT 


4320 


TTTGGTAAAT 


CCGTCACACC AATCACTCTT 


CTTGCAGGCG TGCCTTCAGG 


GAAGTACGTT 


4380 


TGGTAGACGT 


CATCTACTGC CGCCAGCTCT 


TCAATCTCTT TCACAAAAAT 


GTTCACTTTC 


4440 


ACCAGATCAG 


CTAAGCTGTG ATCCACACTT 


TCAATAATCG CTTTAATATT 


TTCTAAACAT 


4500 


TGCGTCGTTT 


GTTCCTTGAT TCCGCCAGCG 


ACTAACGCAT TTGTTTTAGG 


GTCCAATGGT 


4560 


AGCTGAGCTG 


ATAGATGGTT GTAATGAGAA AAAGCAACAG TTTGTGTCGA 


AAATGGACAC 


4620 


TTCGGTGCGT 


GCTCTGTATT ATTTGCTTCA ATAATTAATC CATGTCGATC 


TTCAATCGCT 


4680 


TGTGGTGGCG 


TACCATCCCC ATGAGAAACC 


ACCGCTTCTA CTTGCACCGC 


CGCGCCCATT 


4740 


GGTAAATCAG 


CAACAGCAAT CACTGTTCGT 


GCTGGCATAT AATTGACCGT 


GCGCGCAATT 


4800 


CCTGAATCTG 


GGAAAAAAGC GGCATGTACT 


TGATTGACAG CTTCCAGCTT 


AGATAAATCT 


4860 


TTTAAATAAA TGTTGATCTT CACAATATCA 


TCAAAAGGCA CATCAATACT 


GGTTAAAATG 


4 920 



WO9JW50S55 PCr/US98/08985 

692 



GCTTTTATGT 


TTTTCAAACA 


TTGCTTTGTT 


TGCGTTTTAA 


CACAGCCAGC 


CACAACTCTG 


4980 


CCCGTTTGAG 


GATCTATAGG 


CAATTGTGCA 


GACAGATTGT 


TATAGTGAGA 


AAACGCGACC 


5040 


GTTTGACTGG 


CTAGAGCATC 


AATTGGTGCC 


TGAACAGTCT 


GATTTGTTAA 


TTTAATTAAA 


5100 


TCCCCAGCTT 


GAGGTGCATT 


AGGGATAGTT 


CCTTCCCCAT 


TTGAAACAAG 


TGCCTCCATT 


5160 


TGAACAGAAG 


CgCcTAACGG CAAATCTGCG ACGGCGACCA 


CTGTTTTAGC 


TGGATAATAA 


5220 


TGGGTAAAAA 


AGGCCGCATA. GACCTCCTCA ACTGCGGCAG CATCCCGCAT ATCTTTGACA 


5280 


AAAATGGTCA 


TTCTGACAAT 


GTCACTCATC 


ACATGGTTGA 


TACTTTCAAC 


AACCGCTTGG 


5340 


ATATTTTTCA 


AACATTGTGT 


CGCTTGGGCC 


CGAATGCCAC 


CTTCTACCAA 


TTGTCCAGAG 


5400 


GTTGGCTCAA 


TAGGTAATTG 


TGCCGACAAA 


TGATTATAGT 


GGGAAAATGC 


GACTGTTTGG 


5460 


GCTGCGAGAT 


GATGCTTTGG 


TGCACGCTCT 


GTATTTCTTG 


CTAAGACGAT 


ATTTTCATTA 


5520 


CTCATAACTC 


TACTCCCTTT 


TTTAGCATTC 


CAACAAAAAT 


GATTCCGCTT 


ACATTTTTAA 


5580 


GCATTAAACT 


TCCAACAAGA 


TCCTTTTTAT 


TTTCCTGCTT 


TTCTATTAAT 


GGTGATAGCT 


5640 


TTCCACCACC 


ACTTCTTTCA 


GTCTACAATA 


AGAACAGGAA 


CATTTCAATA 


AGGCATTCCT 


5700 


AAATTGATTG 


TTGGCCCGCA 


AACAAATCTT 


CTTCAGTTAA 


AGAAGACTGG 


CACGAATCAA 


5760 


GCTTGATTCG 


TGCCAGTCTT 


TATAGaTAAC 


TCGTTAGCTT 


TAACTAGTTT 


TATttAaCCA 


5820 


CCATATACTC 


ACGATAATTT 


TCCGAGAACG 


TATCCATTAG 


CGTATAAATT 


CGTTCAAAAT 


5880 


CATAGTCACG 


ACGAGGGTCA 


TCATACCAAA 


CACATTGAAT 


ACTTCCAATA 


ATTGGAAGAT 


5940 


TCGGGCTACC 


TGCTGGCACA TCTGTAAATG 


CTTTTTTCGA 


CATATTTGCT 


AAAGCATCAT 


6000 


CATATTGATA 


AATGCCAGAA 


TCAACATTGC 


CAGCGACCCA ATACCACGCA 


TCATTGGTAT 


6060 


TAAAAATTTT 


GTGTCCTTTT 


TGAACAAAGT 


ACTCAGGTTT 


GGCTACGTCA 


TATCCGCTCC 


6120 


AACCGGCTGT 


CCAATAAGAA ATAATAATCT 


CTGGGTCAAA 


TGTACCGAAA 


GAATCGTCGC 


6180 


TGTTGTAATA 


AATGCCATCA 


TTGAAGCTCA 


TTGGCTGCAT 


GCCCGCATCT 


TTAATTATTT 


6240 


TAGCTAAATC 


ATTCGCATAA GCGACAAAAT 


CTTTGTAGCG 


CCCAGAAGAT 


TGCAGTTTCG 


6300 


CCCAACCGCC 


TGTATCGACA TCATTtGCAT 


ACTCATCGCC 


GCCAAAATTG 


AAAATTTCAC 


6360 


TATGCGCGGA 


AAAATAAGCC ACGTACTTAC 


TGATAATCGC 


CTTTGTGAAA 


TTCACTGCCT 


6420 


TTTGGTTCCC 


TAAATCTACT 


GrrCGTTTTG AGCCATCAAA AGCGGGATTT 


TTAATCGCTA 


6480 


GTTTTTCCAT 


GGCCACTAAC AAGGCATCCA TATGTCCTGG 


ACTGTTAATC 


ACGGGAATGA 


6540 


TGTTGATGTT 


GCGCGCTTTC 


GCAAAAGCCA 


ACAACCGATC 


CATTTCTTTT 


TGTGTTAACG 


6600 




AGGATCATTG 


TAATATGCGT 


TGTTGCCTCG 


TTGGATTGCT 


TTTGAAACCC 


6660 


GGTTGTGGTT 


GTATTTTTTA 


CCATTCACAT 


TGACCGACAT 


ATCATCCAAG 


ATAAACCGTA 


6720 


AGCCATCATT 


TCCTAAAATT 


AATTGGACAT 


CTGTGTACCC 


ATTTTGACTA 


GCTTTGGCCA 


6780 


CTAATTCTTC 


CAGTTGCTCC ACGGAAAAAT ATTTTCTTCC CGCATCAATG GAAAAGACAC 


6840 


TTTTTAAAGG 


TTGTGTTTGT TTCTCTTTTA AATTATCCAA AGCTGTTTGG ACCTCCGTTT 


6900 
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GCCAAGTAGT CACTTGTGCG GCTGTATACG TJTTTCCCGC AAQTTCTGTG GCGACTGTCT 6960 

CTTTTGTTGT TTGTAGTGCT TGAATCGTTT CTGGTGTATA GCTTTCT.G.CC. TTXXCAATGG 7020 

CTGTAACTGC TTGATCCAAC GCACGATATA AAGTGGTTAA GTCAATGGCT AACAGGCCTT 7080 

TTTCTAATTC AGCATCATAG GTGTTTGAAA TCCCTTGGCC ATAATCGCCA CCTAAAATAG 7140 

CTTTATTCAC ATCATAAAGC GTTGTTGCTT GTGTCACTGC TTCTGCGGCT TTATTTTGTG 7200 

CTTCAAGGTC TGTGTTGGCG TAgCTAATTC GTTTTAAATA GTGTTGCGCA GCcGCTTTCA 7260 

TCTCTGCAAG AGAAACGCCT TTACTCTCTG CAATGGCCGT TTTTGTCCAA AGTAAATTAG 7320 

AAGGTTTTAT CTGATTTAAA TCGTCTTCAT TATAGGTGCG ACCATCGCGA TCTAAGGCAT 7380 

AGAGGAACAT GCCCCCTAAA TTATTTTCAT AAGAATAACG AGCTACTTTA TACATGTTAC 7440 

TTTCCATATA CGGCTCTTTT GTGTCATACC AGCGGTTGCG ATCTTGCTCT TCTGGGAAAG 7500 

TTAATCCTGG AACAAATCGT TCCCCATTGA GAACCGGACT TAAATTATTT AATGCTCGTT 7560 

GCGTGCGTTG GTCATCGCTG CCATACTGTT GATACGCGAG AAAATCGAAA CAGTCACTGA 7620 

CATCTTGTAA AGGTGGTAAA TATTCTGCAT TGGTATCATA CAAAAATGGA CGATCCGTTC 7680 

CCGACTTCGG TCCAATGTAT TTTGATAATG CACGAATGAC ACCATTAGAT AGAACAATAT 7740 

CTTTTTCACT TGGACGAGTT TCCATGTCAA TATCTAACCC ATCAATTCCT AAATCATCGA 7800 

CAAATTTGGT TAACAACTCT TTCGCATAAG CATCAAATTC TGCTTCTGTA GGCGTTGTTC 7860 

CTGCATAAGG AACTTTTAAT AGCTCGCTGT AATCAATCCC ACGAACTAAA CGAACACCTC 7920 

GTGCATGCAG TGctGGCGCA TACTCATTTT TTAACGTATC ATAAAATGGC TGTGCAAGTG 7980 

CTTCTTGTCC TTTTGGCACA TAACTAAAGA CATTCACAAT GTCAATACCA TAAGGAATAT 8040 

CGTGCATCGT TAGCCAATTT TCATCTGGCA ATGTTGTATT AACTCCTTGC ATCGTTTTGT 8100 

CACGCCAAGC CCGATAATAA ACCATAAAAT GTTTCTCTTG GGTCGGCTCA TTGGCTTGGC 8160 

TGATCATTGG TTGATGAAAC AGACTAAATA AGATAAACGC CAAGCCAACC AATAACGAAA 8220 

GACCCAGCCA ACTGACAATT GTTTCTCTGT TTTCTCCTTT TCTTGTCGAT AAATTGTTCC 8280 

CCACTTTGAA CACCATTCCT TTCTGCACTC CGTTCATTTT TGTGTATCCA CCTAAAAAGA 8340 

AAGCGCATAC ACAACCATCA ATAGAATAAC AAAAAATAAC GAATTTGCAA AATAGAAACT 8400 

GGAAAAATGA GTATTTTCTA TAAAAATCTT TTCCCGCCTA AAATCACTTT TGTTAGACGG 8460 

CGTTTAAATG GTAAATTTTG AGACTTGCAA GAAAAACAGG CCTATTTTTC TTGCAAGTCc 8520 

CCTTTACTTA GTCGTGATCA ACCAATAAAT AATCACTGTT TGAATTTTTT GGGATCTACT 8580 

CGTTTGAAAT TTATCTCATC ATCAATAACT GCTCTATGCT TTAAATcTAA ATGATATCGC 8640 

ATATCATGAT TTATATTTGA ATTTTTTTTC AGATTATCAT TATATGTTTC TAAAAATAAT 8700 

TTTAAAACTT-TTTCAGGTTG CCTTATACCT AATACTTTTC TTGATATTTC TAAACCATTT 8760 

ACCTTTTTTA TAAACCATTT TTGATTGTAT TTCATTATAG ATGACAAATT AAAATAAGCT 8820 

TGGAATACAG CTCCCGCATC TTTAATTTGA GTATTTTTTT TCAATTCATT GTATTCATTC 8880 
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AATAAATTCT TTAGCTCTTC ATAATTGTAA A^ACTGAACA TTCCATTCTT AACTTCTGTC 8940 
ATCATTTTAT TCTGCCTAGT CCAAGATCCC ATGCATACAG AAACTAACTC AACTAAGGGA * 9000 

ATTTTGTTAT TCTCTATATG TAAATCTTTT AACTTTTTGT ATTCTTCATT TCCTAAAATT 9060 

ACGTATTTAT TAATATAGTC TATTACCCGC CACTTACATG CTGTATTATT TATTTTAATA 9120 

ATGTCATTCA TTTCTTTGTT TACGGAAACA TAGTATGGCA CGGGTAAGCC ATATTTTCTT 9180 

GCAATACTAT ATCTATGTTG CCCATCGATG ATCTGCATTG TGGAGTTTAC TATAATGGGT 9240 

CGAATAATTC CTTTTTGTAG AATAGAATTT TCCAGTTTCT TGTTCAGTGT AACTTCACGA 9300 

TTACTTCTTA AAAAGCTGAA CATTTCATAT TCTTCCGTTA TATAAATATC ACTCATTTCT 9360 

CTATCCATTT TTTATCTCCT TTATTAAAAA CATACTTTGA ATATCTCTTA TTACTGGCTT 9420 

CCCATCAATT TCGATTTCAT ATTTAATTGC TTTGTTTGAA TTTCTATGCA ATTTATAATT 9480 

ATATGCTTTC AAAAACTCTT CCAAAACGAC TCCTTGGTGA AAAATTCCTT CTATAACATT 9540 

TTTAGTTTTT ATTTTAAATC CATTAATTAA CCTATAATAA TCGAAATCTT TTACAATAAA 9600 

TAATTCTACA TAAGCAAAAA ATGTTTGCTG TGAACTTCTC T^GTTCAACT CTTCACAAAA 9660 

GTTTCCGTAA GAATCTAATA ATCTTATAAA TTCCTTGTAG TTATAAAATG AAAATTGACC 9720 

TGTTTTTACT AATTCTTGTG AATTACCATC ATAGTGTTTA TTTCCCATAG CCAAAGATAT 9780 

TAAAGAAGAG GGTGAGATAT TTTTATACTT CTTTACAATA AACTGTAGTT TTTTATATTC 9840 

GTTCATACCA TCCATCGAAT ATTTCGTATG AAATCTCCAA TTTCCAGTCT TAATGTCGAA 9900 

TTAAGCTCAA TTACATCATC TAAGTTCAGA CCTTTCGAAA TAATATAAGG AATGGACTTA 9960 

TTCAATTTTC TGGCTATCAA AAAACGAGTT TGTCCATCAA TAATTTCAAA TTTCTCATTT 10020 

ACATCTATAG GAACTAAAAT ACCTGATTTT TCTATAGATT CTATTAAACT ACGCTTTTCT 10080 

TTAATTGCTC TGTTTCCCTT TAAAAATTTA AATTtTGAAT AATCCTTAGT TTCATACAAT 10140 

TTACCAATCA ACTTTCCCAT ATTCTTCACT CCTTATAAAA TATATTTTTA ATTTATATAT 10200 

TTTATAATAT CGTCTAAAAA AAAGAAATGC AATTCTTTTT TAATCAATAT AACACCAACC 10260 

ACACATTAAA ATAACATTTT GGTTAACCAT CCATGTTTAT TAATTCGGCG ATTTTCACAA . 10320 

CAATTTTCCC TTGCGCTCGT CCTGTCAATG AATAATCTAC CGCTTCTTGA ATTTGGCTAA 10380 

AAGGAACGAC TCGGTCAATT ATCGGCTGTA ATTTTCCCTG TTCAATAAAC TCAGTGAGTA 10440 

AAGCTAATTG CTCGCCATCT GGACGCATAA ACAAAAAATG ATAACTAACA TCTGTGGCTT 10500 

GCTCCAATCG ATGAATCTTG CGGGTAGCTA TTTTAAAGGC CCATTGTTTC CAAAGCGGCA 10560 

AGCCATACTC TTTAGCAAAA CGTTCGTTGG GAATGCCTGA CAATGTAACA ACTTTTCCCT 10620 

GAGGTTTAAC CACTGAGAAA GCTTTTTCTA AGATTGTCCC CCCCATTGTA TCAAACACAT 10680 

AATCATAGTC GGATAAAACT TCTTCAAAAT TTTGTGTCCG ATAGTCAATC ACTTCATCTG 10740 

CTCCCAACGC TTGAACCCAT TCTTTATTTT TACTACTCGT TGTGGTGGCA ACGTAAGCGC 10800 

CTGCTAGTTT TGCT7UVTTGA ATCGCAATGG TTCCAATCCC TCCTGAACCT GCTTGAATCA 10860 
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GGACTTTCTG GCCTGGCTGT ACATTCATAA TATCATGTAG CG^TTGATAA CTTGTCAAAC 10920 

CGACTAACGG AATGGCCGCA GCTTCTTCAA ACGTTAAATT TTTCGGCTTC ATAGCTACAG 10980 

CCGCTTGATC CACTGCAATA TATTCAGCAA AAGTTCCCAC GCGATTTTTG GGTACTCGCC 11040 

CGTAAACTGC ATCGCCCAGC CTAAAATTTT GAACATTTTT TCCAACAGAG ACAACAATCC 11100 

CAGCAAAATC ACTGCCTAAA ATCAGTGGCA TTTGATAATC TAAAAGCATC TTTACTTTGC 11160 

CATCTTTTGT TTTTAAATCA ATCGGGTTGA TACTGGCAGC GATAATTTTG ACCAAAACAT 11220 

CATTGTCATG AATGGTGGGT AAnGGCACCT CCTCGATTGC TAATTCtTTT TGTCCGTATT 11280 

TATGAATTAA AGCGGCTTTC ATCATTGAAT CTTTCCTCTC TTTCTATCCC TTTTATCGGT 11340 

TTAAAAGTTT AAACCTTTAA TCAATTAGTA TCTTAACGAA GAATGGGATC CACGTCAAGT 11400 

TTCAGCCCTC AGCGTATTTT ATTGACAAAA AACACGAAAA ATGGTTTAAT CGTTTAAAAA 11460 

TACGAACCAA AGAGGCGAAA GAATATGAAA GAATTAGAAG AAATTCAAAC AGAATTTACT 11520 

GCGCTTAGTG ATTTTTTAAT CGCATTGGGT GATGAAAAAC GCCAAGCAAT TATTATTGCA 11580 

CTGTTAGAAG ACCAAGCCTG TGTTGGCCGT CGTGTCACTG AGTTAACAGA GGCGACACAG 11640 

CTCTCACGCC CCGCTGTTTC TCACCACTTA AAAATTTTGA AACAAGCCAA ATTAATTGAT 11700 

TGTCGCAGTG AAGGAACTAA AAATTATTAC TCACTTTCTC ACGATACCAC CAAAATTGAG 11760 

CAACTCCAAC AACTATTGAC CCACATTACG TCGATTATGC CAGGGAGGAA GCAAGCATGA 11820 

AAAAAATTCT CGTGGTCCTC ACAAACGTCT CCCGTTATCA CGGCACCGAA GAACCAACGG 11880 

GCCTCTGGCT AGGTGAAGCC ACGGAGTTCG TGGAAGAAGT AACCAAAGCA GGATTTTCAG 11940 

TTGACTATGT TAGTCCCCAA GGAGGCTATG TCCCTTTGGA TCCCCGGAGT ATGAAATATG 12000 

TAGATTCATC CATTATGGCA GTTTATGAAA GTGCTGATTT TCAAGAACGA GCGCTAGCAC 12060 

ATTCCCTCTC TCTTGAAGAA ATTGAGCCAG ATGAATACGC AGCAATTTAC TTTACAGGAG 12120 

GCCATGGCGT TATGTGGGAC TTTCCTGATA ATCCTCAGTT GCAAGCAATC ACTCGTAGCC 12180 

TTTACGAACA GGGCAAATAC CTAACTTCTG TTTGTCACGG AATTGCTGGC CTTTTAAATG 12240 

TCAAAGATGC GTCAGGAAAT TATTTGATTG CAGATAAAAC CATTACTGGC TTCACTACTG 12300 
CAGAAGAAAT TTTAGCCATG AAAAAAGCAG TTGTTCCTTT CTTAAACGAA ACAATTGCCA 12360 
AAGAACATGG TGCTACTTTT GCGAAAAAGC GTCCCTATAA AGAATTTGCG TATCAAGATG 12420 
ACCGTATAAT CACTGGGCAA AATCCTTTTT CTGTCCGCGC TGTCGCAAGA TTGTTAATTC 12480 
AACAACTATC AAAATGAAAC AACAGCGACT ACTTTTAGGT AGTCGCTGTT GTTTCATTGC 12540 
TTCTTCTTTT TTCCGAAAGC CGAAGCCCTA ACATAAAAAA TTTTTGATTA GCTAAAGAGT 12600 
AACATCGCGC CAATTAAGAT AATTAGGTTA ATTAAGCCTA AGACAATCAT TGGTTTGGCC 12660 
ATGAATTTTG-TCCAAGTGGC ATATTGAACT TTGGAAATTT CCAAACCACC CATTACGACA 12720 
CCAGATGTTG GCGTGATCAA ATTAATCAAG CCACAACCTG AAGCAAAAAT CATAACCATC 12780 
ACTTCTGGAC TTAAACCAAT GTTTTTAGcT AACGCACCCA TTACAGGAAT AGAAACGTAC 12840 
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GCTAAACCAG AAGTTGAAGG AATTAGGAAA GAXAAACCTA AGTATAAAAC AAAGGCACCG 12900 

ATGACAAATA AGACTGGGGA GAGTCCTTGT AATAAGCCTG CCGCTTTATC AAGGATAAAT 12960 

GAATCTAAAT ACGTTTTAGA CATCAAAACA GATGCCCCGC GAGCAACTAC GATAATTAAA 13020 

ACAACGGATA ACATATCTTT GGCACCGTCG ATAAATGCTT CAATGGTTTC TTTTTCAGTA 13080 

AAGCGACCAA TCAAGGCAGC AATGATCCCT AAAAGGAAGA ACCACATCGC TAAGTCGCCA 13140 

AAGTACCAGT CACCGAAACT GTTGCCTGTT AAAACAGACG TCCAGCCTTT GAAAATCGTC 13200 

ACTCCAAAGT CGTTCCATGG AATTAAGGAA ACAATCATAA TGACAAAACA GAAAGCGAAT 13260 

ACCATTAAGA TTTTTTTATG ACGATTCGTA AATGGCATTT CTTTCGACGC TGCTTGTCCA 13320 

TACGTTTTTT CCATATCCTC TTGTTCTTGT AAAGATAAAA TTGTTGAACC TTTGTCAGCC 13380 

TTCACTTTTT TCGCATAACG CATCACAATG AAAATAGAAT AGCTAGTAGA GGCCGCCCAC 13440 

AAAATAGCAC CGACAATTAA AATAATTCCT GTATTCGGCT GAATGCCAAT ACCTCGCAAC 13500 

GCATCCATCG CTACCCCTGT GGA6AAAGGG TTAACCGTTG AACCGATAAC ACCTGATCCT 13560 

GCACCCAATA GCACAGTACC CACTGCGACA AACGTATCAA ACCCAGCGGC GACCATGGTA 13620 

GCACTCAATA AACCATAAAA AGGAATTGTT TCTTCTGCCA TTCCATAAGT CGACCCACCA 13680 

ATAGAAAATA AAATCATTAA AATCGGAATG ATTTTCAATT CGTTCCCTTT TAACTTTTTA 13740 

ACGACTGATT GAATCCCTGC TTCAAGAGCC CCAGTTTGCG TCACAATATT TAAAAAGCCA 13800 

CCGAGTACTA AAATAAAAAT ATTAATTTCA ATGGCATCTT TAAACCCATT GAATGGAGCC 13860* 

ATGATAATAT CCGATAATTT GGCCCCTACC ACGTGGTCAA CCATTTTGCC ATCAATTTCT 13920 

CTTGGGGTAA ATGGCTGGCC ATTTAGCAAG ACCGTTACGA TACCTAAAAC AATTAAAATA 13980 

ATAAATAAAA TACTAAAAGA CGAAAGCTGT TTTTTGTTTT TCTTCTTTTT TTCTGCCATA 14040 

TCCTTTTATA CAGGAACCTA CTTTCCTGTA TCTCCCTTCT CTATGATTAA TTACTCAATC 14100 

GATCAATAAA GAATGGCGTA TCTAGAAAAG TTAAGAATTT ATGACTGTAG CTAATACGGT 14160 

TCATTTGGAC TAATTTTTTC ATAACACTGC TTGCTGTTTC CCGCGTAGTT CCACTCATTC 14220 

GTGCAATGTC ATTAATGGTA ATTGGACAAG TAATTGCAAC GCGTTCTCTG TTGTCAATTC 14280 

GTTClrCCTAA ATCATTTAAT AACATCGCCA GTGATGTCAC AATCCGTTCA CATGCATCAT 14340 

TGGTTGTTCC TTTTTGGATT TTGATCATAC TTTCTTTTAA 7VAGGACCGAT TGTTCTTGAA 14400 

TCCAATGAAT CAATTGCTGG GGATTATTTT GGAGAAAACG CTCAAACAGT TTAACTGGCA 14460 

CAAACACAAC CTCAATATCC GTATAGGCAA TCGCGGAGTC GTAATGAACC TCATCTTGAA 14520 

ACAGTCCGAT TCTCGGAAAA AGAGCCTGCT TTTTCATAAA ATGCAGATAG GAAAAATTAC 14580 

CACTGCGATC ATTTTTTTCA ATTTTAATGA CACCTGAAAC CAATAAAATT AAATGATTTC 14640 

TCGGATCACC TGGATAAAAA AGGACTTGGC CTTTTTTATA GGAGCGAACG AAACTCTCGT 14700 

TGCGCAATTC TGTTAGTTCT GATTCAGAAA AGTATTGAAA ATCTTGATTA TTTTTAGTGA 14760 

TTATTTTCTC TATCATCATC TTAATTCTCC TAAATACTAT ACTTTATTTT GTCACAACCG 14820 
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TTCCTACGAT 


TTCATCGCCG 


CTCATACTTC 


GTAAGTTTTC 


CAAAGAAGTG 


ATAATTGCTT 


14880 


GTTTATTGGG 


TTGGCTTTCT - ACAAACTGAA 


TGGCGGCTTC 


AATTTTCGGT 


AACATACTAC 


14940 


CTGGTGCAAA ATGACCCGCT 


TGTTTGTATT 


CTTCCAACTC 


TGCTACAGTG 


ACATTGGTTA 


15000 


ATTGTTTTTC 


ATCTGGTTTG 


CCATAATTAA 


TACACACATA 


ATCCACGCCA 


GTTAAAATAA 


15060 


CTAGTGCATC 


CGCATCCACT 


AATTCTGCCA GTTTTTCAGA 


AGCAAAATCT 


TTATCGATTA 


15120 


CTGCTTCGAC ACCTTTTAAT TCTTGTCCTA CGACAGGGAT 


GCCGCCACCA 


CCACATGAAA 


15180 


TGGTAATTAT 


ATCATTTTTT 


ATTAAGGTAT 


TAATAGTCTC 


AGCCTCGTGG 


ATGTCAATTG 


15240 


GCTTAGGACT 


TGGAACGACT 


TTGCGCCAGC 


CACGTCCTGC 


ATCTTCTTTA 


AAAATAGCAC 


15300 


CTGCTTGCAT 


TGCTTCTTTG 


GCTTCAGCTT 


CTGTTAAA/^ 


TGGACCGATC 


GGTTTTGTTG 


15360 


GATTTTTGAA 


TGCCTCATCT 


GCTGGATCTA 


CGACCACCTG 


TGTTAAAACA 


GTAGCCACTT 


15420 


GTTTCTTGAT 


TCCTGCTTTG 


TTTAATTCTT 


GATTAAGCGC 


ATTGGACAAC 


CAGTAGCCGA 


15480 


TACTTCCTTG 


TGTCATAGCA 


ACACACGTAT 


CTAGCGGCAT 


CGCTGGATTT 


TTTTCAGAAT 


15540 


CAGCTGCTTG 


TTGTTGCAAT 


AATAGATTTC 


CAACTTGTGG 


ACCATTGCCA 


TGTGAAACAA 


15600 


TCAACCGATG 


CCCTTGTTTA 


ATTAAATGAA 


CTAAATAGGC 


AGATGTTTGA ACTAATGCTT 


15660 


GCTGTTGTGC 


ATGTGCGCTA 


GCATCATTTG 


AAAGGATGGC 


ATTTCCGCCT 


AGTGCAACGA 


15720 


CCATTTTTTT 


CCCCATTACA 


AATCCTTCTT 


TCCTATCATA 


ATAAAGAATA 


GGCCGCCCTT 


15780 


CAAAAGAAAC 


TCTTAAAGGG 


CGAAACAACT 


ATTCTTTTTT 


CAATGTTTTT 


TCAGTTGCTT 


15840 


CCTACACACG AG(3AATGAAT AAGTTGCCTA AAGTAGCTGC CATGATCGCT TTAATAGAAT 


15900 


GCATTCTGTT 


TTCTGCTTCT 


TCAAATTGAC 


GACCATATTT 


ACTGCGGAAG 


ACTTCATCCG 


15960 


TAATTTCCAT 


TTCAGTAATG 


CCAAATTTTT 


CAGCAACCAT 


TTCACCGTAT 


TGTGTTTTCG 


16020 


TATCATGAAA 


GGCTGGCAAA 


CAGTGTAATA CGATTAAATC 


GCCATCCATG 


TTTTCGGTTT 


16080 


TTTCTAACAT 


AGCCATATTA ATTTGATAAG 


GTTTCAACAG ATTGACACGT 


TCTTCAAATT 


16140 


TATCTTCTTC 


ACCCATGGAC 


ACCCAAACGT 


CTGTATATAA 


CACGTTGGCT 


CCTTTAACGC 


16200 


CTTTAGCAAC 


GTCGTCTGTA 


ATCATTAATT 


CTGCACCAGA TTCTTTCGCA AATTCTTTTG 


16260 


CATAATTAAC 


GACTTCATCA GAAGGGAACA ATTCTTTTGG TGCACAAATG CGAACATTTA 


16320 


CGCCTAAAAT 


AGCACCTGTC 


ACTAACAAGC 


TGTTTGCCAT 


GTTGTTACGG 


CCATCGCCGA 


16380 


CATATACTAA 


AGTGATTCCT 


TCTAAACGAC 


CAAAGTTTTC 


TTTCACTGTC 


ATAAAGTCCG 


16440 


CAATCATTTG 


TGTTGGATGC 


CATTGATCCG 


TTAAACCATT 


CCAAACAGGA 


ACACCAGAAT 


16500 


ATTTCGCTAA 


ATCTTCCACT 


ACTTCTTGAC 


TAAACCCACG 


GAACTCAATC 


CCGTCAAACA 


16560 


TACTACCTAA 


AACAATCGCT 


GTATCTTCAA CAGATTCTTT 


TTTACCTAAT 


TGAATATCGT 


16620 


TGGCGCCTAA* ATACTCAGGA 


TGTGCCCCTA AATCAATCGC 


TGCTGTAGTA 


T^TGCAGAGC 


16680 


GTGTCCGTGT 


CGAAGTTTTT 


TCAAATAAAA 


GAGCGATATT 


TTTTCCTTCT 


AAATAGTGAT 


16740 


GTGGAATCCC 


TTTTTTCTTC AATTCTTTTA AGTGAATACT GAAGTCTACC AAATACTCTA 


16800 
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ATTCTGCTCT 


TGAAAAATCT TTTTCTGCTA 


ATAAACTACG 


TCCTTGAAAT 


ACTGAATTCA 


16860 


TGATGATTCC 


TCCTATTTTT GGGTGTTTTT 


TTAAAGATCT 


TCACGGTAAA.. GTGGCATACT 


16920 


CATACAACGA GGGCCACCAC GGCCACGTAC 


TAGTTCACTT 


CCTGGAATGT 


AATTCAATTT 


16980 


CACGCCTGCT 


TCTTCTAGCG CTTTATTCGT 


AATCGTATTC 


CGATCGTAAA 


CAACAACTTC 


17040 


CCCAGGGGCA ATTGCTAATG TATTTGAACC gTCGTTCCAT TGTTCACGGG CTGCTGCTGT 


17100 


TAAATTTCCA 


GCGCCGCAAC GGATTAGTTG 


AACATTGTCT 


AAATGCAAGT 


ATTTGCATAA 


17160 


AATGTTATCT 


AATGTATCTT TTTCTTTTGT 


AATTTGGATG 


TCTCCATCTG 


CTTTTTCAGT 


17220 


GATCGAGTAA 


ACAACCAAAC CACCTTCGAT 


TTCTGGATGA ATCGTAAATT 


TATCATAGTC 


17280 


AATCATGGTA AAAACGGTAT CTAAATGCAT 


GAATTTACGG 


TGTTCACCGA 


TATCAAAAGC 


17340 


CAAGATATTT 


TTAAACCCTA ATTTTTGTTC 


AAAAATATTT 


CTCGCAATTT 


TTTCAATTGA 


17400 


CGCGGCATCC 


GTTCTTTGAG AAATCCCAAT 


GGCCACTACT 


TCTTTTGAAA 


GAATTAATTC 


17460 


ATCGCCACCT 


TCAATTCTGG TTGATTCTGA 


ACGATCATAG 


ACTCTAGGAA 


CCTCTTTTCC 


17520 


AGCAAAACGA 


GGATGATAAT CAAAAATGTA 


TTGCCCAAAA 


ATGGTTTCCC 


GTTGTCGAGT 


17580 


TACTGAATAC 


ATATGATTTA AAGAAATCCC 


GTGCCCCATT 


GTCGCAAAGT 


TATCACGCGT 


17640 


GAAGTATAAG 


TTAGGCATTG GATCAATAAT 


GAATGGATAA 


TCCGCTTCAA 


CCATATCTGT 


17700 


TAAAAATTCT 


TGCTCATATT TAGGTAATTC 


TTGTTTTTGA 


ATGCCTGCAA 


TCGCTTTTTG 


17760 


AATCAGTTCT 


TCGTTGTCGT CAATTTCTAA CATTAACTCT 


CTAACTTTTT 


CTTTTGCTGA 


17820 


TTCGCTGCGA ATATTGGCTT CTTCTAAGAA TTGGTCAATA AATTGTCGGC GGACCTCTTC 


17880 


ATTAATCAAC 


GCTTCAGCAG CTAAGTCCTC 


TAAATAGACC 


ACTTCGATAT 


CTTTTGATCG 


17940 


TAACAACTCT 


GCAAATGCAT CATGTTCTGC 


TTGTGCTTTT 


TCTAAAAACG 


GAATATCATC 


18000 


AAACAACAGT 


CTCTCCAGAT AATCTGGCAT 


TAAATTTTCT 


AATTCCTTAC 


CTGGACGATG 


18060 


TAACATCACT 


GTTTTCAATT TCCCGATTTC AGAAAATACA TTAATTGGAT 


GACTCATGTG 


18120 


AAATAACCTC 


CTCAACTTAA TTACAAACAC 


AGAAT7ACAG 


ACCTGATTAA 


CCTAGGACGT 


18180 


GAGCCATTTG 


CCAAACAAAA TACCAAATAT 


ACAACGGTAT 


TCTTTCTTAT 


GCATATTTCT 


18240 


AGCCTAAAAG 


AATGCGCTTT CATAGATATT 


CTAAAAAAAT 


AATGCATAAT 


TAAATAAAAA" 


18300 


AAAATGCATT 


TATTCAAAAA AAATTATTTC 


TCCACACACT 


TCATAAAGTT 


ATAATCTTAT 


18360 


AGAGGkAAAA 


GGAAGTGAGC CAGAAGTCAA 


AACGACTTCT 


GGCTCACTTC 


CTTTTATGCG 


18420 


TTACTTTTTG 


ACAAAGGTGT TGGCTGAATA 


AATTGTTGAA 


AATACTGCTC 


TAGTTCTTGT 


18480 


GCTTTTTCTT 


CGGATGATGA AACAATAATA ACAGTGTCGA 


AGcCAGCGAC 


AGAACATTTC 


18540 


TTTTCACACA 


AAGACACTTC ATCAAGTACA GCTGAAAAcA 


ATGTTGCGTT 


GTCTGGTAAT 


18600 


GTATGAACCA 


CTAGTAAAAA GGCGACACGT 


TCAACTTTGG 


TCACTACCTC 


ACGAATTGCT 


18660 


TGTATCAACT 


GTTCTTTTTT TCGTTCTTCT AAAGAAGTGA ATTGTTCAAA AATCGTTAAC 


18720 


CGCATCTCGC CAGACTCATC TAATGCTTTG ACGACTTTTA ATTCCCATAA ATCCCGTGTUV 


18780 
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ATCGTTGCTT 


GTGCGACATT 


TATCTCGCGC TTTTTTAATT 


CTCTTAACAA CTCATTTTGA 


18840 


CTATGAATTA 


CGGTTGTTTG 


AATGATTTCT TTAATCAAAC 


TTTGGCGTGT 


TTGCTTATTC 


18900 


ACAAATAACA 


CCCTTCCTCC 


AGACATTTAC AAAGCAATAC 


TAGCAGAATC 


TCTCTAATTA 


18960 


GAATAGAAAA AGGGTCGTTT AGCTAAAAGG TTCTCATTGA 


TTTTGTCGGA 


TATTTCGCAT 


19020 


TTAAAAAAGT 


GAGACTTGCA 


CAATTTAGTA AAACGTTGTA 


CAGTGATTTT 


aGGGAATTAC 


19080 


TGGCACCGTT 


TTAAATAACA 


GCAATGATGT TTTTTTGCAT 


GATTAACTGA ACTGGAGGAA 


19140 


AGTAATGAAC 


CGATTAAAAC 


GCCAAACGAT GATTAAGCGA 


TTGGTCACAG 


AAAATGAAAT 


19200 


CCATACACAA AGTGAATTAC 


TAGCATTATT GGAAAAAGAA AATATTCACA 


CCACACAAGC 


19260 


AACTATTTCT 


CGTGACATTC 


GTGAGCTGAA TTTAAACAAA 


GTAGACAATG 


GTCAAGGAGA 


19320 


AGCCTATTAT 


CGTATCTTGA ATAATTCTGC CCTTCGTCCA AAATATAAAA CAGATGAAGA 


19380 


ACGTCTTGTT 


GACATTATTA 


TTGAAACTGG TGTTTCTTTA ACACAAATAG 


AGTTCACAAA 


19440 


TCTTTTAACC 


GTCTTACCTG 


GCAACGGGCA AGCCATTGGG 


GTTTTAATTG ATCGCATACG 


19500 


TATCCACAAC 


CCTCATATCT 


TGGGCTGCAT CGCTGGGGAT 


GATACCATCT 


TAATTCTTTC 


19560 


AAAAAATAAA 


GAAGATGCCC 


TTGAAGTTAA CAACTATTTC 


CAGCAGTATT 


TGTACCACCC 


19620 


ATAATTCATC 


ACTTAGTAAA 


ATCGACTATT TTTTGACATC 


ATtGCTATTT 


TTATTAAAGA 


19680 


TAAATAAAAA 


AAGGCGTCTC 


AGTGaAAGCT GAGGCGTTTT 


CTCATGAGTT 


TTTCTTTTGA 


19740 


ACACCGTCTT TTTTTTGTAT AATJUVTCAAA AAGCAAAGAA AgGGkTTCTT ATGACTGAAC 


19800 


AATTCATACT 


CACTTCAAAC 


GATCAACAAA CTCAGCTCAA 


TGTACGTCAT 


TGGCCCTGTC 


19860 


CTTCACCCAA 


AGCCGTAGTT 


CAATTGATTC ACGGCATGGC AGAGCATATC 


CAACGTTATG 


19920 


ATGAATTTGC 


TCGTTTTTTA AATCAATTAG GATTTACCGT 


AATTGGTCAT 


GATCATCTAG 


19980 


GTCATGGCGA ATCTGTAsaA 


CCAAACGCAC CGATCTATGG 


TTTTTTTGGA GAACAAGGGC 


20040 


CTGAAAATGT 


CGTTACAGAT 


ATTCATCAAG TGAAACAATG 


GGCTGTGAAT 


CGATATCCAC 


20100 


AATTGCCCTA 


TTTTATGATG 


GGTCACAGCA TGGGGTCTTT 


CGCCTTGCGC 


AACTATCTGC 


20160 


AAGACTATCC 


TGTGACTGTG 


CAAGGAGTCA TTTTCATGGG 


CACTGGAACA 


AGTCCGTTAC 


20220 


CTTTAACCGC 


AGCATTGCCT 


TTTATTAAAA AAATGGCCGA 


GAAACAGCCG 


AAAAAACCAG 


20280 


CTCCGTTCAT 


TGATAAGCTG 


GCATTTGGTT CTTTTAGCAA AAAATTTCCT 


GAAGCAAGTT 


20340 


CCTTTAATTG 


GCTTTCTAAA AATCAAGCCA ATGTGGCTGA CTATGAAAAT GACCCATTAA 


20400 


TGGGCTTCGT 


ATTTACCAAC 


AATGGTTTTG CTACCCTTTT 


TTCGCTTGTT 


AAACGAGCCA 


20460 


ATCAAAGGAA 


TTGGTACCAA GCGATCCCAA AAGAATTACC 


TATTTTAATC 


ATTAGTGGCG 


20520 


CTGAAGACCC 


AGTTGGTGAT 


TTCAGCAAGG GGCCAGCGAA 


AATTCAAAAG 


CAATTAAAGC 


20580 


ATGCAGGTTT 


TCAGCACGTG 


ACGTTACGAC TATTTCCCAC 


ATTGCGTCAT 


GAAATTTTAT 


20640 


TAGAAACGGA AAAAGCTACA 


GTTTTTCAAG AAATTGGGCA 


TTGGTTAACG 


GATTTAACGA 


20700 


ATTAAAAAAG 


AAGGTTGATG 


TTGTGACATC AACCTTCTTT 


TTTAATCTAT 


TTTCGTTAAA 


20760 
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CCGCCCATAT 


AAGGACGTAG 


AACTTCTGGA ATCGTTACGG 


TGCCATCTTC 


ATTTTGATAA 


20820 


TTTTCC7\AAA 


TGGCAGCCAC 


TGTTCGACCA ACAGCAAGTC 


CTGACCCATT 


AAGAGTATGG 


20880 


GCATATTGAA 


CATGACCCGT 


TTTATCACGA 


TAACGAATCA 


ACGCCCGACG 


TGCTTGGAAG 


20940 


TCTTCACAGT 


TTGAACAAGA 


ACTAATTTCT 


CGATACGTTT 


TCTGTGCCGG 


AATCCAAACT 


21000 


TCTAAATCAT 


AGGTTTTCGC 


TGCCGAGAAG 


CCCATGTCAC 


CTGTCGAAAG 


AGTAATCACT 


21060 


CGATAAGGGA 


GGCCCAATTT 


TTCTAAAATG 


TCTCCAGCAT 


TGTTGGTCAT 


TTTTTCTAAT 


21120 


TCTTCATAGG 


AGTGTTCTGC 


ATCACTAAAT 


TTAACCATTT 


CTACTTTATG 


AAACTGGTGT 


21180 


AAACGT^TTA 


ATCCTCGGGT 


ATCACGACCT 


GCACTACCCG 


CTTCCGAGCG 


AAAAGAAGGA 


21240 


CTTAAAGCAG 


TAAAATAAAT 


CGGTAACTCT 


TGCGCATCTA 


AAATTTCATT 


ATTATAGTAG 


21300 


TTAGTTAACG 


GGACTTCCGC 


TGTGGGGATC 


AGCGTCAAAT 


CACTTTCTGC 


AACTTGGAAA 


21360 


ACATCTTCTT 


TGAACTTAGG 


AAACTGACCG 


GTGCCAAACA 


TTGCTTTGCT 


ATTGACCAAA 


21420 


TAAGGAGGAA 


TCACTTCGGT 


ATAACCATGT 


TCATTAACAT 


GTTGATCTAA 


CATAAAATTA 


21480 


TAGACCGCTC 


GTTCTAAACG 


AGCCCCTAAG 


CCTTTGTAAT 


AAAGAAAACG 


GCTACCAGAT 


21540 


ACTTTCGCGC 


CTCTTTCAAA 


ATCTAAAATT 


CCTAAGGCGG 


GGATCCTCTA 


G 


21591 



(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6590 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 



CAATTGCCCC 


GATTGGAATA 


AATGAAGGGC 


CTTGCGCAAC 


TGGTAATCTC 


ACACAAAAAT 


60 


GAGATTGAAT 


MTCGTGGCA 


AGTCCTGCTG 


CTATAAATGT 


TGCTTGGATT 


AAAGAAGACG 


120 


TTTGTGCTGT 


GCTAATCCCA 


ATAATTGAAG 


CAATGATCAC 


AGGTACGACG 


TACACGTCCA 


180 


TTGCTAAGAC 


ATGCTGAATC 


CCTAACATAG 


TGGCCTGCCC 


AAAAGATACT 


GAATCTTCTG 


240 


GCCCCACTGT 


TAAATGAAGA 


TTATTTTGTT 


TCTTTTCACT 


CATTTTTTAT 


ACTCCTTTAG 


300 


TTGCTGACAT 


TAAATTTTTA 


TCATGTACTT 


TTTCACCTTG 


TACCCAGACT 


TCTCGAATAT 


360 


TTTCTGGTCG 


TGCTAAATAT 


AAGATTTTTT 


GGAAAATATC 


AAGCAATGAT 


TCCCCTGGAT 


420 


TTTTAGGAAT 


TCGTGCTTGT 


GACAAGCTTG 


TATTAATAAT 


TTGAACATCC 


CAAGCATACC 


480 


CTTTTTCTAA 


TTTTCCAATT 


GGTAAACTTA 


GCGCTTGACC 


ACCACCAGCA 


GTAGCTAAAT 


540 


AAAACGCTTC 


ATTTACGGTA 


ATCCGTGCTG 


TTTTAACACC 


TCGTACTTCT 


GGTTTTTTTG 


600 


TAGCATCGAC 


GCCATCTTCT 


AACATTCTAG 


AAGAAAT7UVC 


CGCTTGCTTT 


AAATTGTCAA 


660 


AAAGACTTGG 


TGAAAAACCA 


CCTGAAAGAT 


CACTACCTAA 


ACCAACTTCT 


ACTTGATGTT 


720 


GATGTACTAA 


TTTTGCAATC 


GGTGTTACTG 


CATTACCAAA 


ATAAGCATTA 


GAGATTGGGC 


780 


AGTGCGCCAC 


AGCCGTCCCT 


GTTTCATGGA 


ATAAATTCAT 


ATCTGCTTCT 


TCTAAAAAGC 


840 
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CTGCATGTGC 


CATCAcTbCT 


TTTTCTGtCA ATAAACCAAA ATCATTCAAT 


GCtTGAGCAT 




CATTTTTTTC 


AAAACGTTCT 


TGaACAAATT 


GATGTTCCCA ATCACTCTCA CTACAATGAC 




TTTGAACAJcG 


GACTTGATAT 


TTCGCTGCTA ATTCTCCTAA 


TCCTTTAAsT 


GCTTCCTCTs 


1 non 


TACAACTTGG 


AaTGAAAcGC 


GGaGTTACAa 


CGGGrTACAC 


GCCTTGTTTT 


GTTTGCTGCG 


xuou 


CTAATGTTTG 


AATTTCTTGA 


ATAAACCGTT 


CCGTTTCTTC 


TAATGCTTGT 


TGCGTTGTTT 


114U 


GATCACGATA 


GTATTCTGGA 


TTCTCATTAA 


GATCATCCAT 


TACAACTTTA 


CCAATTAAGC 




CACGCTGCCC 


TTTTTCCGCA 


CAAATCTCTG 


CCAAACGACG 


GCTAGCTTCA 


TAATGAATGG 


XZdU 


TTGCAAAATA 


AAGGGCTGTT 


GTTGTCCCAT 


TAGCTAACAA 


TTGGGTTACC 


AATTCATCAT 


X j^U 


AGACTTGTTG 


GGCAAATGCT 


AAATCTGCAA 


ACTTGGACTC 


CAAAGGAAAC 


GTGTACGTAT 


1 OOA 

XisoO 


TCAGCCATTC 


ATACAATGGA 


ATATCCAAGG 


CGGTCCCTGA 


TTGTGCCCAC 


TGAGGTGCAT 


1440 


GGACATGTAA 


ATCAACAAAG 


CCGGGTAAAA 


GATAAGAATC 


TTGGGGCAAT 


TCTTGAAAAC 


1500 


GTTGTGCCTC 


TTTTTGTTCT 


GCTAGCACAG 


TTTCATACGC 


TTCAGCATCT 


GGTTCAATTA 


1560 


TTTCCTGAAT 


TACTCCGGCC 


TCATCTATAC 


AAAAAAGTGT 


CTCTTCATAA 


ACTTTAATCT 


1620 


CTTGCAAATT 


TTTACTAGAA 


AAACCTGTTC 


CTTTTATTAC 


AAAATGAAAC 


ATTTAATACT 


1680 


CCTCCTTACT 


ATCACATGAA 


GATTATACAT 


AAACTTTCGC 


TTAATAACTA 


TAAAAATGTG 


1740 


ATTTTTTTAA 


AAAAAATCCG 


TGTTAAAACA AGTAAAAACC 


GAACATTCTA AGaATGrTTT 


1 AAA 

loOO 


ATTTTTTCTT 


TGGATAAACA 


AGAAAACTTT 


CTATTCTTAA 


GATTCTTCGG 


TATACTTGAA 


n 0£A 


GAG7WVTCGTT 


AGAACTTTAG 


GAGGTCCTGA 


GATGATACAG 


ATCGaACAAA 


TTAGAACAGG 




TGTTATTCAA 


GAAAACTGTT 


ATCTTGTTTA 


TAATGAAGAA 


GCGTTATTAA 


TTATTGATCC 


1 QQ A 


AGGTGCAGAA 


GGAGCAAAAC 


TAATTCAAGA 


AATTACCCGT 


TTAGGGAAGA AACCAGCCGC 


ZU4U 


TATTTTATTA 


ACACATACTC 


ACTATGATCA 


TATTGGTGCG 


GTAGAAGAAT 


TGCGCCATCA 


oi An 
^XUU 


TTATCAAATC 


CCTGTTTATG 


TTAGTCCATT 


AGAACAAGCG 


TGGCTTTCCA 


ATCCTATTTT 


'^l £A 


GAATCTTTCA 


GGATTAGGCA 


GACATGATGA 


TATAGCAAAT 


ATTATTGTTC 


AACCCGCCGA 


OOO A 


ACATGAATTT 


AAACTAACCG 


ATTATGAAAT 


TGGCGGCATG 


AAATTTTCCG 


TAGTCCCAAC 




ACCTGGTCAT 


TCAATTGGTA 


GCGTCAgcTT TGTCTTTGAT GATTTTGTTG 


TTGTTGGGGA 




TGCATTGTTT 


AAAGGTAGTA 


TTGGTCGTAC 


CGATTTACAC 


ACTGGTGATA 


TGCAACAATT 


^ *i VI V 




Ai iACGAL.Oi 


ATCTATTCAC 


TTTACCAGAA 


GAATTTCCTG 


TTTATCCAGG 


2460 


CCA i (sV? 1 viAI 




TTCAACATGA AAAAGCCACA 


AACCCATTTT 


TTAATTAGAG 


2520 


ATTTATTCAC 


TATTTAAAGA 


AACTTAGGAG 


GAGTAGCAAT 


GACTGAAGCA ACGACACTTT 


2580 


ATATTATTCG 


TCACGGAAAA 


ACGATGTTTA ATACGATTGG 


CCGTACCCAA GGCTGGTCAG 


2640 


ATACCCCCTT 


AACCAAACAA 


GGAGAAGAAG 


GCATTTATCA 


TTTAGGCTTA 


GGTCTTAGGG 


2700 


ACATAGACTT 


TAAGGAAGCC 


TATAGTAGTG 


ATAGTGGTCG 


TGCCATGCAA ACCGCTCAGA 


2760 


TTATTTTACA 


AGAACATCAA 


AACCACCAAA 


AGATTCCTTA 


CCTAACCGAT 


AAACGGATTC 


2820 
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111 l\3i3l Iwl 


•PT a^; a t/2/;t^ 

1 InlaAlvKallv 




1 UAAL 111 uVs 


l3u i l3 1 GuXGC 






appn*'p'p'pa a a 


a a •pr* a pp 
Abi 1 AiGAGCa 


a p a TP a Tr* ap 
AUAl uAl VaAv.* 


pappaapai»T 
LALGAAGAX T 


ALLIATCGTG 




AGCTAGCAAA 


TGCTATTATT 


GAAGCCGATA 


C#ov> Ui GAL» T G 


GGCGGAGCnT 


TATGAAGTTA 


3000 


TCCGTGATCC* 


TGTTTGGTCT 


GGi 1 1 UbAAu 


aTaTPPPTPa 
Ai Ai UoU 1 CA 


1 LA i LGAGAA 


Ik jk Ik jk 9k fn^f^m^ 

AAAAATGGTG 


'3 Af A 

oOoO 


GCGGCAAAGT 


CATGGTGGTT 


TGCCATGGAC 


TAALAATTtlC 


CTTnTTACTT 


tCCTTAATgA 


3120 


TGCAAr TTGC 


CTATGCAGAT 


GGCGCTGGAG 


AATGGTAGCG 


TGACGACCTT 


AACTTACGAA 


3180 


% i& ^ ^^^^ V ^ ffn 

AAAGGAACAT 


TTACCATnCA 


Ik ^^^*ik ^ ffitni ^ ^ m 

AGGAATTAAT 


GATATTTCTT 


ATATTGAAAA 


GGGAAAAAAG 


3240 


ATCGCTGAAA 


AGCgCCGTTT 


s wmmtitui ik ik nk ^ 

GTTGT7y\AAA 


AAGAGAAGGT 


TATCTGTCAA 


AATGACAGAT 


3300 


AACCTTCTCT 


TTTTATTTAT 


TCTTTTAAAC 


GTTCTACGTT 


TTTTCTTGAA 


ATTTGAATTT 


3360 


TTTCTAAGGT 


TTCCTTATAG 


TTTTTTGCTG 


CATAAGCAAA 


AAATAAAACA 


TCGACTACAT 


3420 


ACAATTGTGC 


CACCAAGGAA 


ACAGTAGCCG 


CACTTCTCAA 


TGGCACGTCT 


TCCCCACTAG 


3480 


AGGTTTGGAG 


GACGATATCA 


CTTTTTTCAC 


CAATCACTGA 


TTTTTCATCT 


TGTGTGATAC 


3540 


TAACAATAGG 


AATCCCTTGC 


TGctTCGCTA 


AATCAGCTAA 


AGTACTTGCT 


TCTTTATTCG 


3600 


TTCCAGAATT 


AGAAATAACA 


ATAAAAACAC 


TAGGCTTTTC 


TGTTGAGCCT 


AACATGGAAG 


3660 


CAAATAAATG 


ATGATCCAAT 


GTTGTAAATA 


CCGTTCGGCC 


TAATCGCGTA 


AATTTTTGAT 


3720 


AAATATCTTG 


AGCAACCAAC 


GAAGAGGCGC 


CCAACCCATA 


GACAAA7WVTT 


ACTTCTGTTT 


3780 


TTTCCAACAA 


ATCAACGGCT 


TCATTCACTG 


CGACATCCTC 


TAATGTCTGC 


aCgctTCGCT 


3840 


CAACCACATG 


AACAAAGCGC 


GCCTGTAACT 


TTTGTTTAAT 


GTCAGCTGTT 


GATTCCCCTT 


3900 


TAGTAACTTC 


AGTATACATT 


TGCTGCTTCA 


TTTGGCCAAG 


ATTTGCAGAT 


AATAAAAGTT 


3960 


TCAACTCTGT 


GAAGCCATTC 


ACATCCaTAG 


AATGACAAAA 


GCGAATAATT 


GCCGCTGGGC 


4020 


TTGATCCTGC 


TTGTTTCGCT 


AGCTCTTGCG 


CATTCATTGA 


AATAACTGTT 


TCTGTATTTT 


4080 


TTAAAATATA 


TTCTGCGATT 


TTTCGTTCTG 


ATTTTGGTAA 


TTGTTGTAGA 


CGATCTTGGA 


4140 


TAGATAAGAC 


JUVTATTTTGT 


TGCATAAACA 


ACCCCTCCTG 


Ik tIfWk ^ Ik Ik tlkfUt 

ATACACCATT 


TCCTACGCAA 


4200 


TTAAATTTTA 


ACAAAGAAAA 


M ^^/^ Ik ^1k ^ 

AGGGAGAAGC 


^ik Ik / »m»w/ »f ii^y^ 

CAAGTTCTCC 


TTTTTCTATT 


TATTATTTAG 


4260 




AiCTTTAGGA 


ATCCCAAAGA 


AI AACGT T GC 


rnik /**ik 7k Ik Ik ^^ik 

TACAAAACCA 


ClTGCGTAAG 


4 JZU 




AUU i AAI ACA 


1 ATbCAAGCC 


ATTGGTTGTT 


GGLAAT T AAW 


GCaAATTAAAG 


A *)<) n 
4 joU 




i Uva 1 UL.AA1 A 


GCAAl CbCAC 


oi\ a/^ai*»P7i 
CAAL^r r ALL. 


GAAgGLLLLA 


A i LALLGCAL 


4 4 4 U 






wiAoUAla 1 AA 


loAAKaGALG 


ILLlAAlGGi 


AAAG i AAL i L 


43UU 


CGTAGATTAA 


TGGTTCACCA 


ATACCTAAAA 


TCCCAACTGG 


CAAGGCACCT 


TTAATCATTT 


4560 


CAACTAATTT 


TTTATCTGAT 


TTACAACGAA 


TCCATAATGC 


TAAAGCTGCA 


CCGACTTGTC 


4620 


CCGCACCAGC 


CATGGCCAGA 


ATTGGTAATA 


ACAAAGTCAT 


CCCTGTTTGA 


TTAATCATTT 


4680 


CAATGTGGAT 


TGGCGTCAAA 


ATTTGATGTA 


AACCAAACAT 


TACCATTGGC 


AAsAAGGTTA 


4740 


AACCAAGAGT 


AAAGCCAGCA 


ACCATGCCAC 


CTTTTTCTAA 


GACAACATTG 


ATAATACCAA 


4800 
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CTAAGCCATT 


AGAAATAGCT 


CCGGCTACAG 


r^f* jk Twtr* Ik mm ik 7k 

GCATGATTAA 


GAAAATAGTT 


GUI AAGC wAA 


4860 


TCACTAAAAG 


TGAAATGGTT 


GGCGTTACGA 


TAATATCAAT 


TGATTC rUisG 


M TV M •IHIVIIimik fit 

AC AATTT T AT 


4920 


GCAATTGTTT 


TTCTAACAAT 


GATAATAACC 


ATACGGCAAA 


AATAACACCG 


ATAATTCCGC 


4980 


CTTGTCCTGC 


AGATAATGTT 


CCCCCAGTGA 


AAATATTACT 


GATTGGAGCA 


TCGGGATTCA 


5040 


TTCCTGTTAA 


CATTGTGACT 


GCCCCAAT/yV 


CACCACCCAA 


CGCTGGCGTT 


GCACCAAAAA 


5100 


CGCTGGCACT 


GTTAATCCCA 


GTGT7WUVGAG 


CTAAATAAGC 


AAAAATCCCA 


TTTTTAATGA 


5160 


TATTTAAGAC 


ATCAATATAT 


TGTTGCCATG 


AAGCTGAAAT 


ATCACCAGCA 


ACAACTAAGT 


5220 


TAGACATAAC 


AGCTGCAATC 


CCGCCAATAA 


TACCAGCCCC 


AACAAAGGCA 


GGAATCATCG 


5280 


GAACAAAGAT 


ACTAGAAATG 


GCTTTTAACA 


CTTTTTTAAA 


TGGTGAAGTG 


TTATTTTGTT 


5340 


TCGCTTT'TTG 


TTGTGCTTTC 


ATTTGAGCGG 


CTTTTTCTTC 


TACTAAATCT 


TTTCCAGATT 


5400 


TGCCTGCTGA 


AGCATCAGTT 


GTGGTGCCAT 


GATTAAATGG 


TTCTCCCAAT 


TTGACACCTG 


5460 


CTTGATCTAC 


CATTTCTTGT 


GCAACCTTGT 


TAACAGTCCC 


TGGACCAACA 


ACTACTTGTA 


5520 


ACGTATCATC 


TTCTACGACG 


CCCATAACGC 


CATCAATTTT 


TTTCAATCCT 


TCAATATCCA 


5580 


CTTTGTCATA 


ATCGCGGATA 


TCCATACGAA 


CACGAGTCAT 


ACAATGAATG 


ACTTTGTCCA 


5640 


CGTTTTCTTG 


ACCACCAACT 


TCTTTATAAA 


TCCCATCAGC 


AATGCGTTTA 


ATTCTTTCTT 


5700 


CTGCCATGTT 


AAAACACCCT 


CCTAATTCCT 


TAATTTAATG 


TATTCTTGAT 


AAAtTGaTTC 


5760 


CCGTCAATTA 


ATTTTTGTGT 


TGCTTCTTCT 


TTAGTACTAT 


CAGTTAAAAT 


CATTACAATT 


5820 


GCTAACTTCA 


CATTTTCTTC 


TGCTTCTACA 


AATTTTAGTT 


CGGCAACTTC 


ATAACTACAA 


5880 


CCCGTTGCTT 


CCATAATAAT 


CCGTTTGGAA 


CGCTCCACTA 


ATTTTTCATT 


GGTTGGTTTT 


5940 


ACATCAACCA 


TTAAGTTATT 


ATAGACTTTC 


CCAATACCAA 


TCATGGAAAT 


AGTTGATAAC 


6000 


ATATTTAAAA 


TTAATTTTTG 


AGCTGTCCCA 


GATTTCAAGC 


GAGTCG7UVCC 


TGTTAAAAAT 


6060 


TCAG6TCCTG 


CATCTACTTC 


AATCGGCATT 


TGCGCATATT 


TACTAATTTC 


GGCATTTTTA 


6120 


TTACATGCCA 


CAGTTGCGGT 


AGCTGCTCCC 


ACCGTTGTAG 


CATATTCTAA 


ACCGCCAATT 


6180 


ACATAAGGCG 


TCCGACCACT 


GGCTGCAATC 


CCAACAACAA 


TGTCATTTGC 


TGATAATTTT 


6240 


AGATCAACTA 


AATCTTGACG 


ACCTAGTTCT 


TTTGAATCTT 


CGGCACCCTC 


TACTGCGACA 


6300 


GTCATTGCTT 


TTTGACCACC 


AGCGATGAGG 


CCTTGAACCA 


TCTCTGGCTC 


GACGCCAAAA 


6360 


GTTGGTACGC 


ATTCGGCTGC 


ATCTAAGACA 


CCTAACCGAC 


CACTCGTACC 


AGCGCCCATG 


6420 


TAGATCAAGC 


GGCCGCCTTG 


AwTGAAGCyT 


TTGaTGAkGG 


CTTCTACCAC 


TGGTTCAATG 


6480 


GCTGCTAATT 


CTTGACCCAC 


TGCCaTTGCA 


ACTTTTTGGT 


CTTCTTGATT 


CATTTTTTGT 


6540 


AAAGCTTCTT 


TGACACTCAT 


CTCATCTAAG 


CCCATTGTaT 


TTTCATTTCT 




6590 



(2) INFORMATION EX)R SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3660 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 



GACATCTGTC 


AAAACGACCG 


ATGTCCTACT 


CTCTAACGAT 


TAAGATAGTC 


CGCTAAATGC 


60 


GTTTGTTTTA 


GTCTTTTATA 


AAAGTTAAAT 


ACTGAGAATT 


TATTGTGTAA 


TTCGTGTACC 


120 


ACTTTCTCCT 


TTGATTGCTT 


CCGAAGCTAA 


TTCTAAAAGC 


CCGATAATCG 


CCTGGCGGTT 


180 


TGGTTTACTT 


TCTGCAAAAC 


TGATGGCCGC 


TTCCACCTTA 


GGTAACATAC 


TACCTGGCGC 


240 


AAATTGATTT 


TCTTCTATAT 


ATTGTTGCAT 


TGCCGTAACC 


GTAACATCGG 


TCAAAGCTTG 


300 


TTGATTTGCT 


TGACCAAAGT 


TAACAAAAAC 


ATGTTCCACT 


GCAGTTAATA 


CCACAAAAAC 


360 


ATCTGCTTCA 


ATGAGTTCAG 


CCATTTTTGC 


TGCCGCAAAG 


TCTTTATCAA 


TGACTGCATC 


420 


TACACCATGA 


TAGACTGGaC 


CATCATAGGc 


AaTAGGrACC 


CCGCCACCAC 


CACAAGCAAT 


480 


TACGATTTGT 


CCTGgCATCA 


ACAAGTGTTC 


GCAGCCTAAC 


TTTTTCGTAA 


ATATCAATGG 


540 


TTTTGGnTGA 


AGGAACTACC 


CGTCGcCAGC 


CTCGaCCCGc 


ATCTTCyTGa 


TAGACGTTCC 


600 


CAGTTTCCGc 


CATCmACTGt 


TTGGckGTTT 


CyTCGtCATA 


ATAACCACCA 


ATTGGTTTTG 


660 


TTGGTTTTGT 


AAAcGCTGGG 


tCTTTTGGaT 


CCcACCACGA 


CTTGTGtAAT 


TAACGTAACA 


720 


ACTGGTTGTT 


CCCCTAAATT 


TCTGGCTACT 


AGTTCTTCAT 


CAATTGCTTG 


TTGTAAATGA 


780 


TAGCCAATAT 


AGCCTTGGCt 


CATTGCTGTG 


CATTCaGGAA 


ACGGCACGAC 


ATCTTCTGGA 


840 


GATTGACTCG 


TTTCCTCAAA 


AGCTAAGCGA 


ATTTTGCCAA 


CTTGTGGCCC 


GTTGCCATGA 


900 


GCAATCACCA 


CTTGATGGCC 


TTGTTCAATT 


AAATCAACAA 


TGGATTTTGC 


AGCGGTTTGT 


960 


GCTTTTGCCA 


ATTGTTCTTT 


CGAAGAATTT 


CCTAACGCAT 


TTCCACCTAA 


TGCAATCACT 


1020 


ATTTTTGACA 


TCTGTTTCCC 


TCCTAGCGCT 


TTGGTTCTTG 


TTGTGTGATA 


CAGTGAATAT 


1080 


TGCCGCCCCC 


ATAAACAACT 


TCTACCGTAT 


TAACGCCGAC 


AATTTTTTTA 


TCTGGGAACA 


1140 


TCGTTTGAAC 


TTGCTCT7U\A 


GCCAAATGAT 


CATTTTCATC 


ACCATACTGT 


GGCACAATCA 


1200 


CGCCATCATT 


TGTAATTAAA 


AAGTTCATAT 


AGGATGCGAT 


ACAAATATCG 


CCGTCTTCTC 


1260 


GTGGCATGGT 


TCCTTCCACG 


AAATCAATTT 


TGAAGGAGCC 


TTTAATGGTT 


ACATTTTTGA 


1320 


CAGGACAGCA 


TAATTTGTGG 


X X xnnx x 


CGCCTPPTTT 

X \^\^ XXX 


nSo\^i\x l.«nu X 


nl X X xl\iiXj^,^i\ 




AACGTTGATA 


CGCATCTTGT 


GCCGCTTCAT 


AAAATGGGCT 


ATTTTGATCT 


TCAGTGTAAA 


1440 


TACAAGCCAC 


TTCACCAGGT 


GCAATAAAAC 


AArCTACATC 


ATCCACATGA 


CCATTGGTTT 


1500 


CTTCTGGGTC 


AATTCCATCA 


CCTAGCCAAr 


GAaCTTTTTC 


AACATTTAAA 


TAGTCGCATA 


1560 


ATTTTtGTTC 


GATTGCTTCT 


TTTGAAAGTT 


GTGGATTTCG 


ACCTTCACTT 


AATAAGCACA 


1620 


TTTCTGTAGT 


CAAGACTGTT 


CCTTGTCCAT 


CTACGTGGAA 


TGAACCGCCT 


TCTAAAACAA 


1680 


AATCATCTGT 


ACGATAAGAA 


TCTACATGCT 


CAATTTCACA 


AATTTTTTGT 


GCAACCAAAT 


1740 


CGTCTTGGTC 


CCAAGGGAAA 


TATAAACCAT 


CTACTAATCC 


CCCCCAAGCA 


TTGAATGTCC 


1800 
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AATCCACACC 


ACGGATTTCT 


CCATGGTCGT 


TAATGACAAA 


TGATGGTCCA 


CAATCACGAA 


1860 


CCCAGGCATC 


ATTGTTTGAC 


ATTTCGTAAA 


CCGTAATTTC 


TGGCGGCAAT 


TGGCGACGGC 


1920 


AATTTTGAAA 


TTGTTGTTGA 


GAAACCACCA 


CATTCATCGG 


TGTAAATTGA 


CTAATTGCTT 


1980 


TGGCCACATT 


AGTAAAAGCT 


TCTTGGACAG 


GTTTACCACC 


ATCACGCCAA 


TTATCCGGTC 


2040 


TTTCTGGCCA 


AATCATCCAA 


ACTTTCTCTT 


GTGGTTCAAA 


TTCTCCTGGC 


ATTCTAAAGC 


2100 


CATCTTGtTT 


AGGGGTACTT 


CCTACAATTC 


GTTTAGCCAT 


TTTCTTTCGT 


TCCCTTCTTA 


2160 


ATTTTTTTCA 


CGTAAATAAT 


TACTTCACCA 


AACAGAATAA 


AAATGATTGC 


CCCAATTGTG 


2220 


ATTGGTAATT 


TTTCATTTAA 


TGCTGCGGGA 


CTAAAATCTA 


AAGGcACTGC 


TGTGAAAATC 


2280 


AGTGAAATAA 


TAATCATAAT 


CATT6GCAAA 


ACCACTAAAA 


CTTTTAAAAA 


TCCTGGTTTT 


2340 


CCGTTAACTT 


TAAACGGACG 


TGGTGTATCG 


GGATCAATTT 


TTCTAAGCTT 


ATAAAAAGCT 


2400 


GGGAAAACT6 


GGATATACGA 


TAACAAGAAT 


AGGACGACAT 


TCAAGGAGAA 


AAAGGCCCA6 


2460 


AATAAATCTT 


GATTAGGTAA 


AATTGGTGCT 


AATACAATGA 


CCACCGATGC 


CACAATGCCA 


2520 


TTCATCAAAG 


CAGCCCCAAT 


TGGCATGTCA 


TTTTTCTTAC 


TACGTTTTTC 


AAAAAATTTA 


2580 


GGCATGTCAC 


CATTTTCAGC 


AGCATAACAA 


GCTGTATTAT 


TGACTCCTAA 


TGACCAAGAA 


2640 


ATCATGTTTC 


CGAACAACGT 


AAGAAGGAAT 


AGGAAAGCCA 


TCAACATAAT 


GAACCAACCT 


2700 


TCTGTCGAGC 


CTGTCAATAG 


TTTAAAGCTA 


TCCATCATAC 


CACTGCCGGT 


ACTAATTTGA 


2760 


TCGGTGGGAA 


TCGCTACCCC 


AATTCCAAAG 


GCTGAAAAAA 


TATAAATGGC 


CGCAATGACT 


2820 


AACCCCGCTG 


CTACAATAGC 


TTGTGGAATT 


TGCTTCTTAG 


GATTTTCCAT 


ATCATCTGCA 


2880 


AACGTACAGA 


TGACTTCAAA 


CCCTAACAAA 


TTAAAAATGA 


TTACAGAAAT 


AAAAGATAAA 


2940 


CTCCTTAAGT 


CAAACGTTG6 


CAGCATTGAT 


TTCAAGGTAA 


ATTCATTGGC 


AACACCTTTC 


3000 


GTCAAGGCTA 


CATACAATCC 


TAGCCCACCA 


ATTAAGACAG 


CTAAAACCAT 


CTTGATTACG 


3060 


GCAGCTCCAT 


TTAGAATCCA 


GATACTGTCG 


CTGACTGGAT 


AAAAACTAAT 


CCAAACCACA 


3120 


ATCCAAGTAA 


AAATTAATTC 


AACAATAATC 


ATCATCGGTG 


TTGAAAACTT 


AATACCTGTA 


3180 


ATTGTCGTTA 


ATA6TTCTGG 


CGTCATAACG 


GCTAATGAAG 


CTAGCCACAA 


TGGATAATTA 


3240 


ATCCAATAAT 


ACCAAGAAAC 


TCTGGCACCC 


CAACGATGAC 


CATATGCTTT -GGTCACCCAA 


3300 


TCATAGATGC 


CCCCTTCACC 


AATATAAGTT 


GTTCCTAATT 


CTGATGAAAT 


TAACCCATAT 


3360 


GGCAATAGGA 


ATGCAATTAA 


TAAGAAAATC 


CACCAAAAAA 


ACTGAGAATT 


ACCAATTGCA 


3420 










TCACTGATAA 


AATGGCACTC 




GTTAAACTAA 


ATTTCTTTTT 


CCCCTCCATG 


AAGACTCTTC 


CTCTCTTTAT 


AAACTGTCCA 


3540 


CTGATTCATT 


TTTTCAATCA 


GTGGACAGTT 


GGCCGTTAAA 


TGCTTTGAGT 


GTCTAAGAAA 


3600 


ACTTCTAGTT • 


CTTTTTTCGC 


TTCTGcTTGC 


TTGATTAAAT 


CATAAGGATT 


TTTgCTTCAT 


3660 



(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

AATGACAACA TTTACAAAAA AGTTTGAGGC AAAAATTAAA ACGGCGGACA wTAAaCTTaA 60 

AGTAAAAaTT TTGTTTTTAA AAaTTGCGAA CTTAATAAGT GGGCTTTTTC TTTTTCGTTC 120 

AACCATTATA AATAATCCTA ATGCGATTAC TGCAATAATG AACAaTAAaT AGGATTGTAA 180 

CGAGCCaAAC CCGCTCTCTT GTCCGAGGAA AATCCCGCCG AAGAACGTCA TAATCGCAAT 240 

GGCAATACAA GCAAATCCTG AAAAATCGAT TTTTTCTTTT GTTTTTGTAA TATCTTTTGG 300 

TAGGAACTTT TCGCCAATTA AAATTGTAAC TAGCCCCACG GGAACATTAA TCCAAAAAAT 360 

ATACGACCAA GAAAAGTTTG ATAAGATCAA TCCACCAATC CCAGGACCAG CAATCGCGCC 420 

TAAAGAGACA AAGGCACCTA CTGCACCAAG CGCCCGCCCT CTTTCATTTA AAGGGAATAC 480 

TTCCGTAATA ATTCCTGAAT TGGTCGCCAT GGTCATGCTA GCACCAATAC TTTGAACGAC 540 

TCGCGCAAAT AATAGAAAGC TCAGCGATTG ATT7UUVGCCA CAAAGTAACG AACCAATCGT 600 

AAAAATAACC GTGCCAATTC GATAAACTTT TATTTTTCCA AAACTGTCGC CAATTTTTCC 660 

AAATAATAAT AAACAAGCAC AGACAACCAT TAAATAAATA GATACGACCC ATTCCGCTTG 720 

ATTCATGGGT ACGGACATTT CTTtTGAAAT AGTCGGTAAa GCAATATTTA CAATGCTGGA 780 

ATCTAATGTA GACATGAAGG TAAACATAGC CACAGAAACT AAGATCC 827 
(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: dovible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

AAGGCATGAA GTAATnGGGA TGGATTTATT AATCGrATGA AGyCAtATCa GGAAGAAGAA 60 

GCAGTTTGCT TTACAGATCG TTTGAAAGAG CAGGTGGAGA AAAATGGCTA GAAAAAGTCG 120 

TATAAAAACA CCGATTGcGG AACAAGCACC TACTATCAGA AAGAATAATT TTGAAGAG6T 180 

TTGCTTAGGC TATACATTAG CAGAAGGGCA AGAAGAAGCG ATTCGCTGTT TACAATGTAA 240 

AAATGCACCT TGTATTGCTA AATGCCCCGT GATGATTGAT ATTCCTGGAT TTATTTTAGC 300 

TATTCGTGAA GGAAACATGG AACAAGCAGC CGATATTTTA AGTAAATATA CGAATTTACC 360 

TGCTATTTGT GGACGCGTTT GTCCGCAAGA GAAACAATGT GAACAAGTTT GTAAATTAGG 420 

GAAAGCCAAA- AACTTTGAAC CAGTGGCAAT TGGCAAATTA GAACGTTTAG TTGCCGATTG 480 

GGCATTAGAA AATCACAAGT TTCCTAAGAA AACAGCGGAA CCAACAAAAG GGAAAATCGC 540 

AGTGGTCGGT TCTGGTCCTT CTGGGTTAAC AGTTGCGGGG GATTTGTCAA AGCTAGGCTA 600 
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CGAAGTAATT 


ATTTTTGAAG 


CGCTTCATGA 


AGCTGGTGGC 


GXATTAACCT 


ATGGCATACC 


660 


TGAATTTCGA 


TTACCGAAAA 


AAATTGTTAA 


ACAAGAAATT 


GCGAGTATTG 


AAGCCTTAGG 


720 


TGTAACAATA 


GAAACCAATG 


TGGTAGTAGG 


TAAAACGATT 


ACGATGGAAG 


AAATAATGTC 


780 


AGAGTTTGAT 


GCCTGTTATC 


TATCGGTTGG 


TGCAGGGGCA 


CCAAATTTTA 


TGGGAATCCC 


840 


AGGAACTAGT 


TTAAATGGGG 


TTTATTCTTC 


TAGTGAATTT 


TTAACACGAA 


TCAATTTAAT 


900 


GCATAGCTAT 


GAGTTTCCAG 


AATATGATAC 


ACCGATTAAA 


CGTGCTAAAA 


ATGTTGTTGT 


960 


GATTGGCGGC 


GGGAATGTTG 


CAATGGATGC 


CGCTCGTTCA 


GCCAAACGAT 


TAGGCGCTGA 


1020 


AAATGTGAAC 


ATTGTGTATC 


GTCGTTCGTT 


AGAAGAATTG 


CCAGCAAGAA 


TT6AAGAATA 


1080 


TCACCATTCC 


CTTGAAGAAG 


GAATTAACTA 


TTATTGGTTA 


ACGAATCCAA 


TTGCTTATTT 


1140 


AGATGACCAA 


CAAGGAAATC 


TAGCCGGTGT 


TGAATGTGTC 


AAAATGGTTT 


TAGGTGAACC 


1200 


AGATGCGTCT 


GGAA6ACGAC 


GGCCAGAACC 


AATTCCAAAC 


AGCACGTTTA 


CCATTCCTGC 


1260 


TGATGCAGTA 


ATCGAGGCAA 


TTGGCCAAGG 


GGCAAATCGA 


GTCTTGTTAT 


CAACATATCC 


1320 


AGAAATAGAA 


CTAAACCAAT 


GGGGTTACAT 


TCAAGCTGAT 


CCTAAAACAG 


GAGCAACCTC 


1380 


AATACCTGGC 


GTTTTTGCTG 


GGGGCGATAT 


AGTCACAGGT 


GCTGCCACCG 


TTATTTTAGC 


1440 


GATGGGCGCT 


GGGAAAATCG 


CAGCTAATGC 


AATTGATCAG 


TAT6TCAAAA 


CACAAAAAAC 


1500 


aACAGTCACA 


AGCAATGTTT 


AGGAAAGATA 


AAAGGTGAAA 


AAATGAAAAA 


AACAATGGAT 


1560 


GGAAACACCG 


CAGCGGCTTA 


TATTTCATAT 


GCGTTTACAG 


AAGTTGCCGC 


TATTTATCCT 


1620 


ATCACACC7VA 


GTTCAACAAT 


GGCTGAGTTG 


GTAGATGAAT 


GGGCGGAAAG 


TGGTTTAAAA 


1680 


AATATTTATG 


GTCAAAAAGT 


TCAAGTTATT 


GAAATGCAAT 


CAGAAGCTGG 


GGCAGCGGGA 


1740 


GTAGTTCATG 


GTTCGTTAAA 


AACAGGTGCG 


TTAACCACTA 


CTTATACTGC 


TTCACAAGGT 


1800 


TTACTGTTGA 


TGATTCCCAA 


TATGTATAAA 


ATCGCAGGAG 


AATTGCTGCC 


TTCTGTGTTC 


1860 


CACGTGGCGG 


CTCGGGCAGT 


AACGACAAGT 


GCGTTGAGTA 


TTTTTGGTGA 


TCATGGCGAT 


1920 


GTgATGGCTA 


CCCGCCAAAC 


AGGTTTTTGT 


ATGTTAGCAG 


AATCGAGTGT 


TCAAGAAGTG 


1980 


ATGGATTTGT 


CGGCAGTTGC 


TCATTTAGCT 


AGCTTAGAAG 


GTAGTTTACC 


TTTTGTAAAC 


2040 


TTTTTTGATG 


6TTTTAGAAC 


GAGCCATGAA 


CTACAAAAAA 


TCGAAGTGAT 


TGATTATGAT 


2100 


GACCTGAAAG 


AAATGGTGAA 


TCAGGAAGCT 


GTAACTCGTT 


TTAGAATGGG 


AGGCATGAAT 


2160 


CCTAATCATC 


CCACTGTTTC 


AGGGACCGCT 


CAAAATCCTG 


ACATTCATTT 


TCAACAAAGA 


2220 


GAAACAGTCA 


ATAAAAATTA 


CGAAGAAATG 


CCAAAAATTG 


TTCAAAAATA 


CATGAAAAAA 


2280 


ATTAACAATC 


TTCGTGGAAC 


AACGTATGAT 


TTAACCGATT 


ATTACGGGGT 


TGAAGATGCA 


2340 


ACAGAAGTCA 


TTATTTCAAT 


GGGGtCGgct 


TCGCCAGTAA 


TCAAACAAAC 


AGTCGACTAT 


2400 


CTCAATCAAC 


AAGGACGCAA 


AGTTGGTTTT 


ATCAATATTC 


ATTTATATCG 


TCCATTTCCT 


2460 


ACCGAAAACC 


TTCTTGAAAA 


ATTACCAGCT 


ACAGTAGAAA 


AGGTTGCGGT 


TTTAGATCGA 


2520 


ACAAAAGAAT 


CGGGAGCCGA 


AGGGGAACCG 


TTATTGCTAG 


ATGTTCAAAG 


CGCTCTCTAT 


2580 
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CAACATGAAA 


ATCGACCAAT 


TGTGATTGGT 


GGACGTTATG 


GGXTAGGCTC 


TAAAGACGTA 


2640 


GCGCCTAATC 


AAATTAAAGC 


GATCTATGAT 


CATTTGTTAT 


TGCCATTTAA 


AGAATTAAAA 


2700 


CAACGTTTTA 


CGGTAGGGAT 


CGTTGATGAT 


GTAACCTATC 


AATCATTGCC 


AGAAGAACCT 


2760 


ACTTTAGATT 


TAACGCCAGA 


TACTACCTTC 


CAAGCAAAAT 


TTTGGGGCTT 


TGGTTCTGAC 


2820 


GGAACTGTGG 


GAGCAAACAA 


GCAAGCGATT 


AAAATTATCG 


GAGATCATAC 


CGACCTTTAT 


2880 


GCACAAGCTT 


ACTTTAGTTA 


TGATTCTAAA 


AAATCAGGCG 


GTTTAACCGT 


TTCGCATCTG 


2940 


CGTTTTGGTA 


AAGAGCCAAT 


TACTTCCACG 


TATTTAATTG 


AACAAGCAGA 


TTTCATCGCT 


3000 


TGTCATAATG 


CTTCCTATTT 


GCATAAATAC 


GACCTGTTAA 


AAGGTTTAAA 


AGATGGCGGA 


3060 


ACGTTTTTAC 


TGAATACTAT 


TTGCGATCAA 


GAAAAGATTC 


ATCGTTTATT 


ACCGGCTAAG 


3120 


TTAAAAAAAT 


ATATTGGGGA 


GCACCAAATC 


AAATTTTATA 


TTATCAATGC 


GGTTGAGTTA 


3180 


GCTCGCAAAG 


TAGGCTTAGG 


CCGTCGGATC 


AATACAGTGA 


TGTCTACTGC 


CTTTTTTGAA 


3240 


GTAACAGATA 


TTTTAACGAA 


AGAGCAATAT 


TTACCGTTAC 


TTAAAGAAGA 


AGTGAAAAAG 


3300 


ACTTATGGAA 


AAAAATCAAT 


GGAAATTGTT 


GAAAAAAATT 


ACCAAGCGAT 


TGATGCTACC 


3360 


TTTGAATCGC 


TGCAAGAAAT 


TGTAGTTCCA 


GAAGATTGGG 


CAACCATTGT 


TTTAGAGCCA 


3420 


GAAAATCAAG 


AGTCGGAaGC 


TCCAGCTTTT 


GTGACAAATA 


TTTTAGAACC 


AATCAATCGT 


3480 


CaAGAAGGGG 


ATCAGCTGAC 


CGTTTCAGAT 


TTGATTGAT^ 


ACGGAATGAA 


GGGCGGTGCC 


3540 


ATTCCTATGG 


GCACTGCGGC 


TTATGAAAAA 


CGTGGCATTG 


CGTTAGAAGT 


GCCTGAGTGG 


3600 


CAAATGGATA 


AGTGTACGAT 


GTGTAATGAA 


TGTGCCTTTG 


TTTGTCCACA 


TGCGGCTATT 


3660 


CGACCGTTTT 


TAGCGGATGA 


AGAAGAACTG 


AAAGAAGCAC 


CTGCAGGTTT 


TGCCATGCGC 


3720 


GAAATGCGTG 


GCACAGATGG 


GTTGATGTAT 


CGTATCCAAG 


TTTCTTTAGA 


AGACTGTACA 


3780 


GGTTGTGGTC 


TTTGTGTTCA 


GGCATGTCCC 


GTGAAAGACA 


AGGCGATTTT 


AATGAAACCT 


3840 


TATGAAACTC 


AGAAAGAGCA 


AGCGATGAAT 


TGGGCCTTTG 


CGATGACTCT 


TAAACAAAAA 


3900 


GCCAACCCAG 


TCAAAAAACT 


TTCTGTTAAA 


GGTTCCCAAT 


TTAACCAGCC 


ATTGATGGAA 


3960 


TTTTCAGGTG 


CTTGTGAAGG 


CTGTGGCGAA 


ACACCTTATA 


TTAAGTTGTT 


GACGCAATTA 


4020 


TTTGGTGATC 


GAATGATGAT 


TGCTAACGCA 


ACAGGCTGTT 


CATCAATTTG 


GGGTGGTTCG 


4080 


TCACCGGTGA 


CGCCTTATAC 


AACGAACGAA 


TGTGGCCAAG 


GACCAGCTTG 


GAGTAATTCG 


4140 


CTGTTTGAAG 


ATAATGCGGA 


ATATGGTTAT 


GGTATGTATA 


TTGCCAATCG 


AACGAAACGT 


4200 


CAACATTTAG 


CTAGTCTGGT 


TGAAGAAAGT 


CTTGCTAAAA 


ACGTCGGATC 


AGATTCACTT 


4260 


CAAGCCTTGT 


TAAATGATTG 


GTTGGAACAC 


ATGGCTGAAG 


GGGAAGGAAC 


ACAACAGCGA 


4320 


GCCACCA/WVC 


TAGCGGCAGC 


ATTAAGTGAA 


GAAGCCGATG 


AAGATCCACT 


TTTAACAAAA 


4380 


ATTTACGAAC 


AAAAAGATTT 


ACTAGTGAAA 


ACGAGTCAAT 


GGATTGTCGG 


CGGCGATGGT 


4440 


TGGGCCTATG 


ATATCGGGTT 


TAGCGGCATT 


GATCATGTTC 


TTGCCAGTGG 


GGAAGATGTT 


4500 


AATATTTTTG 


TTATGGATAA 


CGAAGTGTAT 


GCCAATACAG 


GCGGACAAAC 


ATCTAAAGCT 


4560 
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ACACCAGCTT 


CAGCCATTGC 


CAAATTCTCA 


GCGGGTGGGA 


AACATACCTC 


TAAAAAAGAT 


4620 


TTAGGGATGA 


TGGCAATCAC 


.T.TAXGGGAAT 


GTCTATGTTG 


CACAAGTGGC 


TTTAGGCGCA 


4680 


AACTCAATGC 


AAACGATTAA 


AGCAATTGAC 


GAAGCGGAGC 


GGTACCCAGG 


CCCATCAATT 


4740 


ATCATCGGTT 


ATACCCCATG 


TATTAATCAC 


GGGGTCAAAG 


GCGGAATGGT 


TCGGACGTTA 


4800 


GATCAAGCCA 


AAGAAGCCGT 


TGAATCTGGC 


TATTGGCCAC 


TTTATCGCTA 


TAATCCAGAG 


4860 


TTAGTTAAGA 


AAGGCAAAGA 


TCCGATGGTC 


ATTGACTACA 


AGAAAACAGA 


TTTTGACAAA 


4920 


ATGCGTGATT 


TCCTTGAAAA 


ACAAACACGT 


TACTCTGCGT 


TGCATACAAT 


TAAACAGAAT 


4980 


CAAGAAGTAG 


TCGAACATTT 


ACTTGATAAA 


ACTAAAGAAG 


ATGCAATTGA 


ACGCTCTGAA 


5040 


AGTTATAACA 


AACTAGTCAC 


ACACATGAAA 


AAATAAAAAA 


ACAAAGCCTC 


TTACTTGAGA 


5100 


GGCTTTGTTT 


TTTTATCCAA 


TAATAATTTT 


TTCTGAAGGG 


TATTCGTAGC 


CTAAGTCACG 


5160 


TGTTTCTTTT 


GGTACAAAAA 


TCATTAAGGT 


ATAAAGCATA 


CCAATACGTC 


CAATAAACAT 


5220 


TAATAAGGCA 


ATCATTAATT 


TACCTACTGT 


TGTTAAATCG 


TCTGTAATAC 


CCAAAGAAAG 


5280 


ACCGGTTGTC 


CCAAAGGCAG 


AAGCGACTTC 


CACAATGATT 


GAAATCAAGG 


GCAGATTTTC 


5340 


AGTGGCTGAC 


AAAAAGACGA 


CAGCGAAAAA 


GCACATAATC 


AAAGATAACA 


TGAAGACGAC 


5400 


AATCGATTTT 


TTGACATCAT 


CGTCATCTAT 


TTTACGGCTA 


AAGACGCTAA 


TATCATCTTC 


5460 


ACTTTTTAAA 


AAGGCTAACA 


AATACAAGCC 


AATAATCGCA 


ACAGTCGTTG 


TTCGAACACC 


5520 


ACCACCAACA 


GAGCTAGGAC 


TACAACCAAT 


AAACATTAAC 


ACAGAGAAAA 


TAATCAACGT 


5580 


TGTAATTTGG 


AAGTCCCCTA 


AATCATTAAT 


TTGGAGGCCA 


GCGTTTCGTG 


TAGTCATCGA 


5640 


ATAAAACATC 


GAACTTACCC 


AACGGCCACT 


TTCATTCATT 


GTCAAAAATA 


GGTGATCTTT 


5700 


TTCTAATAAG 


TAAATCAAAA 


TAGTCCCACC 


AATAAACAGA 


ACAATAAACG 


CTAGCAAAGC 


5760 


AATTTTACTA 


AACAATGAGA 


AGCGGAAAGG 


CAAGCCTCGT 


TTCTTCTTAC 


GAAAGTTCTT 


5820 


TTTCTTAAAA 


TGTAACCACT 


CATGAATTTC 


CAGTAAAACG 


GGAAATCCAA 


TCCCGCCGAT 


5880 


AAAAATCAAG 


AACATAATTG 


CCAGTAAAAA 


GAGATAATCA 


TGAGCAAAAG 


GAATAATTGA 


5940 


ATCACCTGAA 


ATATCAAAGC 


CTGAATTAGT 


CACTGCAGAA 


ATAGCTTGAT 


AAAATCCATA 


6000 


AAATATCGCA 


TCTCGCCAGT 


TCGTATAATA 


ACCACTTATA 


TAAAAATAAA 


GCGAAAAGAG 


6060 


CGAGCCGAAG 


ATTACTTGGA 


ACCAAATTAA 


AATCAAAA/^ 


GTGGTGCGGA 


TCAGTCGAAC 


6120 


AATTCCGCTC 


AGTCGCGGCT 


GATTCATGTC 


AGTCATAATT 


AAcTGACGTT 


GCTTTAATGT 


6180 


GATTCGCCGT 


TTTGATAAAA 


TAAAAAAGAC 


CGTTGAAATC 


ATCATAATCC 


CTAAGCCACC 


6240 


AACTTGGAAA 


AGAACCTCCA 


ATAAGACGAT 


CCCACGATCA 


TTAAAAACCG 


AATTAATATC 


6300 


GACTGTGGTG 


AGTCCaGTAA 


CACTGACTGT 


ACTGATTGCT 


AAGAAAAATA 


AATCAATAAA 


6360 


AGGAACATGC- 


GAGTCAGGTT 


CTCTGAAAAT 


AGGGAGGCAA 


AACAATAAAT 


AAGAAATAAC 


6420 


CGTCATTAAT 


ATGTAATAAG 


CGACGATGAT 


TTGAATAGAT 


GAAAAATTTT 


TGCGAATAAA 


6480 


TCGATCGCTt 


cTctTTCGCT 


GTGCCAGCCA 


CAGCAAACGT 


GTATGGTTCA 


CTAGATaACy 


6540 
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TCCCaATTCT 


AATTAAACTA 


mGTATACCAA 


AACTTTTCGC 


CTCAAACGAA 


TAATTCTGCC 


6600 


AGAAAATCCT 


TGAAATGACA 


AATAAAAATA 


AGAAATTCGT. 


.TAGAAATAAT. .TGGAAAACAT 


6660 


CCACAAACAA 


GCGAGAACAT 


GATATTTTAT 


GAAAGAATAG 


AAAAATAGGT 


TGTTCCCCAA 


6720 


ACAACTTATT 


AATAATTGGT 


CAGTTTGTCT 


AACGATACAA 


CTTTAAAAAG 


TAGTGGTAGC 


6780 


TTTTTTATTT 


CGGTTGATAT 


GTGAATGTGT 


TCACTTTATG 


TAACAAACTG 


TTATTTTTAT 


6840 


TCAMTGAAA 


AATAAGAAAT 


TGTGTTGGTA 


GAAATCAATA 


GTTTATAAGG 


TGGGAAAACA 


6900 


ATG7VAAAAAG 


TGTTAATGGG 


AGTTCTTTCC 


CTAGGGTTAT 


TATTAGGTGC 


AGCAACAGGT 


6960 


TGTACAAGCG 


ATCAAGAAAA 


AGCCGCTGGT 


AAAACAAAAG 


CGTCTTCTGA 


AAAAACAGAA 


7020 


GCAACATCTG 


GTGCCTCTGC 


TAATGGTTAT 


ACAGATCCAA 


GTGAACTAAA 


AGACAGTTAT 


7080 


GATGTTGTTA 


TTGTTGGTTC 


AGGTGGTGCG 


GGGATGACCG 


CTGCATTACA 


AGCCAAAGAA 


7140 


GCTGGCATGA 


ATCCAGTTAT 


TTTAGAAAAA 


ATGCCTGTTG 


CAGGTGGGAA 


TACAATTAAA 


7200 


TCGTCTTCAG 


GAATGAATGC 


TTCACAAACA 


AAATTCCAAG 


AAAAAGAAGG 


CATTAAAGAT 


7260 


AGCAACGATA 


AATTCTTTGA 


AGAAACACTA 


AAAGGCGGTA 


AAGGAACCAA 


CGATCAAGAA 


7320 


TTACTACGTT 


ATTTTGTTGA 


TCATTCAGCT 


GAAGCAATTG 


ATTGGTTAGA 


TACAAAAGGr 


7380 


ATTACATTGA 


GCAATTTAAC 


GATTACAGGT 


GGAATGAGCG 


AAAAACGCAC 


ACACCGACCT 


7440 


GCTGACGGAT 


CAGCAATTGG 


GGGCTATCTA 


GTTGACGGTT 


TAGTACGTAA 


CGTTCGCGAA 


7500 


GAAAAAATTC 


CATTGTTTGT 


TGATGCTGAC 


GTCACAGACT 


TAGTTGAAGa 


AAACGGTCaA 


7560 


ATCGATGGC6 


TAAAAGTGAA 


AATGAAAGAT 


GATAAAGAAA 


AAACAGTTAA 


AGCAAAAGCA 


7620 


GTCGTTGTCA 


CAACAGGTGG 


TTTCGGTGCC 


AcGAAAAATT 


AATTACACAA 


TACAAACCAG 


7680 


AATTGAAAAA 


CTATGTTACA 


ACGAACCAAG 


AAGGCACAAC 


AGGTGACGGT 


ATCCAAATGA 


7740 


TTCAAAAAGT 


TGGcGGTGCG 


TTagTGGATA 


TGAAAGAAAT 


TCAAATTCAC 


CCAACTGTTC 


7800 


AACAAAGCGA 


TGCTTTCTTA 


ATTGGTGAAG 


CAGTTCGTGG 


AGAAGGCGCA 


ATCTTGGCTT 


7860 


CTCAAAAAGG 


AGAACGTTTT 


GTTAATGAGT 


TAGATACACG 


CGACAAAGTT 


TCTGCAGCAA 


7920 


TCAATGCCTT 


ACCAGAAAAA 


TCAGCTTATT 


TAGTCTTTGA 


CCAAGGCGTT 


CGCGACCGTG 


7980 


CAAAAGCCAT 


TGATTTCTAT 


GATCAAAAAG 


GTTTCGTGGA 


AAAAGGTGAA 


ACGATTGAAG 


8040 


AATTAGCTGA 


AAAAATTGGC 


ATGCCTGCCG 


ATACATTGAA 


AGCAACGATT 


GATACTTGGA 


8100 


ACCAAGATGT 


TAATGCCAAA 


GATGACAAAC 


AATTTGGCCG 


GACAACTGGA 


ATGGAAGCTG 


8160 


ACTTAAGCAC 


AGCCCCTTAC 


TACGCAATTA 


AAATTGCACC 


AGGGATTCAC 


CACACAATGG 


8220 


o I uo^u i V3AA 




AAAACACxAACs 




AGATGGCACA 


CCAATTAAAG 


o o o n 


GTCTATATGC 


TGCTGGTGAA 


TTAACAGGTG 


GCTTACATGG 


TCaAAACCGT 


ATCGGAGGCA 


8340 


ATGCAATCGC 


CGATATTATC 


ATTTATGGTC 


GTCAAGcAGG 


TACGCAATCA 


GCAGAATTTG 


8400 


CCTCTGCTCA 


AAAATnACTA 


GCATTAAAAA 


GTCTGAAATT 


GTAGTCAAAT 


AACTACAATT 


8460 


TCGGGCTTTT 


TTATTGCATA 


CTAGCAGGAT 


CATTAGTTGG 


AGTATAAATA 


GCCCTACTAA 


8520 
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AATGACTTTG 


GCGTAGATTC 


TGTTTTTTTC 


AGAAAAATGC 


ACGACCATTT 


TAGCTAAAAA 


8580 


TACTGCTTTT 


TTGCTGAAAA 


TATAGTAAAA 


TACGTAAGGA 


ACTAACGGAA 


AGGTGTAGTG 


86dn 


AACATGGCTC 


AATTATTTTT 


CAAGTATGGC 


GCAATGAACA 


GTGGTAAAAC 


CATTGAAATC 


8700 


TTAAAAGTAG 


CACACAATTA 


CGAAGAACAA 


GACAAACCAG 


TTGTTTTGAT 


GACCAGTGGA 


8760 


CTAGATACAC 


GTGATGGTGT 


TGGGAAAGTA 


TCCAGTCGCA 


TCGGATTACG 


CAGAGATGCT 


8820 


ATTCCAATTT 


TTGAAGAAAC 


CAATGTCTTT 


GACTTGATTA 


ATGATTTGTC 


TTACAAACCT 


8880 


TTTTGCGTAT 


TAGTCGATGA 


ATGTCAATTT 


TTAAACAAGC 


ATCATGTCAT 


TGAATTTGCA 


8940 


CGAATTGTTG 


ACGAATTAGA 


TATTCCAGTA 


AtGGCATTTG 


GGcTAAAGAA 


TGATTTTAGA 


9000 


AATGAATTAT 


TTGAAGGCTC 


AAAGTATTTA 


CTTTTATATG 


CAGACAAATT 


AGAaGAaTTA 


9060 


AAAACGATTT 


GCTGGTTCTG 


TCATAAAAAA 


GCAACTATGA 


ACTTGCACTA 


TATTGATGGC 


9120 


AACCCCGTTT 


ATGAAGGGGA 


TCAAGTCCAA 


ATTGGTGGGA 


ATGAAGCCTA 


CTATCCTGTT 


9180 


TGTCGCAAAC 


ATTATTTTCA 


TCCAAAAACA 


GTGAATGAAG 


AACAGTAAGG 


AGAGACACTA 


9240 


TGTACGATCA 


ATTwCAATCT 


ATTGAAGATC 


GCTATGAGGA 


ATTGGGCGAA 


TTATTAAGTG 


9300 


ACCCAGCAGT 


CATTAGTGAT 


ACAAAACGTT 


TCATGGAATT 


ATCGAAGGAA 


GAAGCTAATA 


9360 


CACGTGAGAC 


GGTTGAGGTT 


TATCGCCGAT 


ATAAACAAGT 


TGTTGAAGGG 


ATTAGCGATG 


9420 


CCGAAGAACT 


TTTAAGTGAA 


AACTTAGATG 


CGGAAATGGC 


CGAAATGGCG 


AAAGAAGAAC 


9480 


TTTCTGATTT 


GAAAAAAGAA 


AAAGAAGTGT 


TAGAAGACCG 


CATTAAAATT 


TTATTATTAC 


9540 


CAAAAGATCC 


AAATGATGAT 


AAAAACATCA 


TCATGGAAAT 


CCGTGGTGCA 


GCCGGTGGCG 


9600 


ATGAAGCCGC 


ACTTTTTGCT 


GGCGACTTAT 


TTAATATGTA 


CCAAAAATAT 


GCAGAAGCAC 


9660 


AAG6GTGGAA 


AGCGGAAGTT 


TTAGAAGCGA 


ACGTCACTGG 


AATCGGTGGA 


TATAAAGAAG 


9720 


TAATTATGAT 


GATTTCAGGA 


GATAACGTCT 


TTTCAAAATT 


AAAATATGAA 


AGTGGCGCTG 


9780 


ACCGCGTGCA 


ACGAGTTCCT 


TCAACAGAAT 


CACroAGGACG 


GATTCATACA 


TCGACAGCAA 


9840 


CCGTTGTAGT 


TATGCCGGAA 


GCGGAAGAAG 


TCGAAATCGA 


ATTAGCTGAT 


AAAGATATCC 


9900 


GTGTCGATAT 


TTATCATGCA 


AGTGGCGCTG 


GTGGACAGCA 


CGTCAATAAA 


ACCGCTTCGG 


9960 


CTGTTCGATT 


AACCCATTTA 


CCAACAGGAA 


TCGTAGTCGC 


AATGCAAGAT 


GAACGTTCAC 


10020 


AGCTAAAAAA 


CCGTGAAAAA 


GCAATGAAAG 


TTTTGCGTGC 


TCGTGTGTAT 


GACCAAATTC 


10080 


AACAAGAAGC 


ACAAAGTGAA 


TATGATGCTA 


ACCGTAAATC 


GGCGGTGGGG 


ACCGGAGATC 


10140 


GTTCAGAACG 


AATCAGAACG 


TATAACTTCC 


CGCAAAATCG 


AGTAACGGAT 


CATCGTATCG 


10200 


GTTTAACCAT 


CCAAAAACTA 


GACCAGATTC 


TTGCTGGCAA 


ATTAGATGAA 


ATCATTGATG 


10260 


CATTAGTCTT 


ATATGATCAA 


ACATCGAAAT 


TAGAAGAGAT 


GCAAAATGGC 


TAAACGTTAC 


10320 


TTTGAAGTCC- 


TTAACTGGGC 


TTCTTCTTTT 


TTAGAGGCGC 


AAGGCAAAGA 


AGGCTACGCT 


10380 


ATTCACTATG 


TTTTTTTAGA 


ACGTAAAGGT 


TGGGATAAAA 


CGCAGTGgCT 


CCTGCATATG 


10440 


CAAGAAGAAA 


TGCCCCAAGA 


AGAAGAGGAA 


CAATTAAAAA 


CGGATTTAGC 


CCAATTATTG 


10500 
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ACAGATTACC 


CAGCACAATA 


TTTACTAGGA 


CAAGCTGAAT 


TTTATGGACA 


TTCCTTTATT 


10560 


GTTAATGAAC 


ATACTTTGAT 


TCCTAGGCCA 


GAAACAGAAG 


AATTAGTCGA 


GCGTTGCTTA 


10620 


AAAGCCAATC 


CAGATACTCC 


TTTGACGGTC 


GTGGATGTCG 


GAACAGGGAC 


AGGTGCCATT 


10680 


GCGGTTAGCT 


TGAAACTGGC 


ACGGCCCAAC 


TGGCGAGTTA 


TTGCCATTGA 


TCTTTCAGAG 


10740 


GAAGCGTTGA 


CTGTCGCTAA 


ACAAAACGCG 


CAAGCATTAG 


GTGCTGGCAT 


CGAATTTTAT 


10800 


CATGGGAATG 


GCTTACAGCC 


AGTTGCTTCT 


GAAAAAATTG 


ATCTGCTTAT 


TTCTAATCCG 


10860 


CCCTATATTA 


GTGAACAAGA 


ATGGTATTTA 


ATGGATGCAA 


GTGTCCGAAC 


GTATGAACCT 


10920 


AAAACCGCTT 


TATTCGCTGA 


AAATAACGGT 


TTGGCTTTAT 


ACCAACAATT 


AATTCATGAA 


10980 


AGTCAAACGA 


TGCTTAAAGC 


AGACGGTAAA 


ATTTATTTTG 


AAATAGGGTT 


TCAACAAGGC 


11040 


GCGGCCCTCC 


AAGAATTATT 


GAGCGCCGCC 


TATCCACAAA 


AAACAATTAA 


AATTGAAAAA 


11100 


GATTTATCTG 


GCAACGATCG 


CCTCGCGATA 


GCCGAGTAAC 


TAAGGTTCTT 


CGTCGCCACT 


11160 


TTTCTAGAAG 


GAGCGATTTG 


CAATGGAAAC 


AAAAAAATTA 


ACCAAACAAG 


AAATAGCCCA 


11220 


AGCAGCAGCA 


GCCATTCAAG 


CAGGCGAGCT 


AGTGGCTTTT 


CCAACAGAAA 


CTGTTTATGG 


11280 


CTTAGGTGCC 


GATGCCTTAA 


ATGAAGCGGC 


AGTAAAAAAA 


GTTTACCAAG 


TAAAAGGCCG 


11340 


CCCCAGTGAT 


AATCCATTGA 


TTGTTCATGT 


GAACAACGTT 


GAGATGGTCA 


AAAAATACGT 


11400 


GGCGACCTTA 


CCAGAAGTCG 


CTACAAAATT 


GGTGTCTGCC 


TTTTGGCCGG 


GACCATTAAC 


11460 


TTTAATTTTT 


AATGTGCCTA 


GCGATACATT 


TTCTAAAACA 


GTAACAGGTG 


GTTTAGAGAC 


11520 


AGTCGCTTTT 


CGTATGCCAG 


ATAACCAAGC 


GACCTTAGCT 


TTGATTGAAC 


AAGCCAATGT 


11580 


TCCGTTGGTC 


GGACCAAGTG 


CGAACACTTC 


TGGCAAGCCT 


AGTCCAACAT 


CTGCAGAGCA 


11640 


TGTTTATCAT 


GATTTGCAGG 


GGAAAATTGC 


TGCCATATTA 


GATGATGGCC 


CAACGCAGAT 


11700 


TGGGGTGGAA 


TCAACGGTCT 


TGGATTTAAC 


GGCGGAAGAT 


GGTGTGCCAG 


TTATTTTACG 


11760 


ACCAGGAGCA 


ATTACCAAAG 


AGCAGTTGGA 


AAGCATTATT 


CCCCAAGTGA 


AAATCGATAC 


11820 


ACATTTGATT 


AGTGAAACTG 


CAGCACCCAA 


AGCGCCAGGG 


ATGAAATACA 


AACACTATTC 


11880 


TCCTGATGCG 


GAAGTTTGGA 


TTATAAGCGG 


TCAAACAGCA 


GAATGGCAAG 


CGGCGATTCA 


11940 


GCAAGCAAAA 


GAACAACAGG 


AAAAAATCGG 


CTTATTTTTA 


AGTGATGAAC 


AGGCAGCACA 


12000 


ACTAGATACA 


GAAAATGCGT 


TTGTTTATTC 


TTATGGTGCA 


GAAACGGTTG 


AAAATGCCAC 


12060 


7VAAAGAACTA 


TTTGCAGGGT 


TAAGAGCGTT 


AGATGAGCAA 


GGAGCTACCA 


CGATTTTTGC 


12120 


GCAAgGATTT 


GCAGAAACTG 


GTTTAGGCAC 


TGCTTACATG 


AATCGCTTGA 


AAAAaTCAGC 


12180 




TTTTTTGAAA 


AATAAUAbCj 1 




iMk r* T r* r» a r» 
AAbU 1 oAvJAU 






TACAATCAGA 


CAAAAGAAAA 


CAAAATGGAG 


TGTTGTTTGA 


ATGGmTTACA 


AAACGTATGA 


12300 


CCCAGATTTA 


TGGAATGCAA 


TTGCAAGAGA 


AGAAGAGCGC 


CAAGAAAATA 


ACTTGGAACT 


12360 


AATCGCATCT 


GAGAATGTCG 


TGTCAAAAGC 


AGTTATGGCT 


GCCCAAGGAA 


GTATTTTAAC 


12420 


GAATAAATAC 


GCAGAAGGTT 


ACCCTGGCAA 


ACGGTACTAT 


GGTGGTTGTG 


AATTTATCGA 


12480 
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TATCGTAGAA 


AATTTAGCTA 


TCGATCGTGC 


CAAAGAATTA 


TTTGGTGCAA 


AATTCGCGAA 


12540 


TGTACAAGCC 


CATTCAGGTT 


CTCAAGCCAA 


TACAGCGGCA 


TACCTTTCAT 


TGGTTGAACC 


12600 


AGGTGATACC 


ATTTTGGGGA 


TGGATTTATC AGCTGGTGGT 


CACTTAACAC 


ATGGTTCGCC 


12660 


CGTTAACTTT 


AGTGGAAAAA 


CCTATAATTT 


TGTCAGTTAT 


GGAGTGGATC 


CTTCAACAGA 


12720 


AGTAATCGAT 


TACGATGTCG 


TGCGAATTTT 


AGCAAGAGAA 


CATCGTCCAA AACTAATCGT 


12780 


TGCAGGCGCA 


AGTGCGTATT 


CACGAACGAT 


TGACTTCAAA 


CGTTTCCGTG 


AAATCGCTGA 


12840 


TGAAGTAGAT 


GCCAAGTTAA 


TGGTTGATAT 


GGCGCATATT 


GCAGGCTTGG 


TAGCTTCaGG 


12900 


GTTGCaCCCA 


AATCCaGTTC 


CgTATGCTGA 


TATCgTAACA AGTACgACCC ATAAAAcATT 


12960 


GCGTGGTCCc 


TCGTGGCGGT 


TTAATTTTGA 


CAAATAGCGA 


AGAATTGGCG 


AAAAAAGTAA 


13020 


ATAGTAGTAT 


TTTCCCAGGC 


ATTCAAGGTG 


GCCCATTGGA 


ACATGTGATT 


GCCGGAAAAG 


13080 


CCGCAGCTTT 


TAAAGAAGCA 


TTGGATCCAA 


GTTTTGCTGA 


ATATAGTCAG 


CAAGTGATTG 


13140 


CAAATGCGCA 


AGCGATGACG 


AAAGTTTTCA 


ATCAAGCGCC 


AGAAGCTCGT 


TTAATTAGTG 


13200 


GGGCAACAGA 


TAACCACTTA 


TTACTGATTG 


AAGTGACAGG 


TTTTGGTTTA 


AATGGAAAAG 


13260 


AAGCAGAAGC 


TATTTTAGAT 


AGTGTAAATA 


TTACTGTCAA 


TAAAAATTCA 


ATTCCGTTTG 


13320 


AACAATTAAG 


TCCATTTAAA 


ACAAGTGGTA 


TCCGTATTGG 


T^ACGCCTGCA 


ATTACTTCAC 


13380 


GTGGTTTTAA 


AGAAGAAGAT 


GCGGTAGAAG 


TTGCTAAATT 


AATCGTTCAA 


GTCTTAAAAG 


13440 


ATCCAGAAAA 


TACAGCAGTT 


CATGATGAAG 


TCAAAGCAGC 


TGTAGCAGCG 


CTAACTAAAA 


13500 


AATATCCATT 


ATATAACTAA 


CATCAGGGGA 


TAGCTGAAGC 


GCCGATTCTC 


TGGTACAAAT 


13560 


AAGAGTCGGA 


GCTT 










13574 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

ATTTACAGCG TTACTGAnAC AAGCAAAAGA CGCTTTAAGC AAATAAGCTT AAACAATCTT 60 

TCACACCCTA CTCAAGGAGT AGGGTGTTTT TTTAATAACT CCGTTGGTAA CTTTCATCTA 120' 

TAAATGAAAA AACTTGTGAG AGCG6AATTT CTTCAGAAAG ACTTAACCAA TGCTCTTTAT 180 

TCATATGATA GGCAGGAAAA TAGCCAGGTT GGTTCTTTAA TAAGGCAATC AATTCAGGCG 240 

CAAGTTTAAC GGTTAAAATA GTGAGCTGCT TATTCTCGGT AGCGCCGAGC AGTTGTGCAT 300 

CCAGTGGCAT AATTAAAGCA AACCATTTTG CATTTCTAGG ATGACGAAAG ACGAGATAGT 360 

CTGGGTGTTT TTTAAAAGGA AAATCGGGCG TTATCTGATA ACGTTCTTGG ATATAGGCGC 420 

TAATAGTTTC TTGAAAAGTC ATAGGTTTCT TCCTTTCGTT AAGGCTAGGC ATATCTATTT 480 

ACATCATAGC AAAAATTACA TAAAATTACT TCGTTCAAAG GAATGGGAGG AGGGAGCAGA 540 
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TGACATTAAT 


TTCAACAATT 


TTAGTAACCT 


TGGTGGCTGT 


AGAATTTTTC 


TACATTATGT 


600 


ATTTAGAAAC 


GATTGTACCA 


ACATCTGAGA 


CAACAAGCCG 


TGTTTTTAAA 


ATGGATAGGC 


660 


AAGAGTTACA 


AAGArAATCA 


GTCACTACAT 


TGTTTAAAAa 


TCAaGGAATC 


TATAATGGAC 


720 


TAATCGGTGC 


AGGACTTATT 


TATAGTGTCT 


ATTTTGCGCA 


AgCAACGATG 


GAGATGACAA 


780 


AATTTTTATT 


GATTTATATT 


ATTTTAGTGG 


CGGCGTACGG 


TAGCTTAACT 


AGTGATAAAA 


840 


AAATCATTTT 


AACACAAGGC 


GGTTTAGCTA 


TCGTGGCTTT 


GATTAGTTTA 


CTTGTTTAGC 


900 


CATCAGAGTT 


AATAGAGAGA 


GGCAGTCAAA 


TCGACAGAAA 


AAACACGGTT 


GATTTGACTA 


960 


TTTTTTATAG 


AATTGAGAGA 


GACAAAATGT 


GGTATACTTT 


GTTTATTAGA 


TAGGGAGGAA 


1020 


TTAAAAATGC 


TGAACCAAAA 


AGAAGTAAAA 


AAAACCGATT 


ATACTCAATT 


AATTTCTCTA 


1080 


TGGCGTTCTT 


CAGTAGAAGC 


TACCCATTTG 


TTTTTGAGTC 


AGGCAGATAT 


TGATAAAATC 


1140 


GAAGTGGTTT 


TACCTGATTA 


TTTTCAACAA 


GTTCAATTAA 


GTATGTGGTT 


AAATGAAGAA 


1200 


CAGAAaTGTG 


TGGGCTTTAG 


TGGAACAAAT 


CAACAAaCGT 


TGGAAATGTT 


ATTTATTGAC 


1260 


CCAGTCTATT 


TTCGTAAAGG 


CTATGGTGGa 


GAAATTATCC 


mAAAGTTAAT 


AGAGCAAGAA 


1320 


TCAATTATCT 


TTGTTGATGC 


TAATAAACAA 


AATGAAGGGG 


CAGTGAAGTT 


CTATCAATCT 


1380 


CAAGGGTTTC 


AAGTAATTGG 


TGTUITCAAAG 


GAAGATCCTC 


AAGGAAATCC 


GTTTCCTATT 


1440 


TTACATATGA 


AACGGATATA 


GAAAAGAAGG 


GAGAAGCCAT 


GCGAGTAATC 


GTTACAGAAT 


1500 


ACCAACCAGC 


CTGGGTGGAA 


CAGTTTGAAG 


AAGAAGCCCA 


AGCGTTGAAA 


CAGATTCTAA 


1560 


AGGAAAATTG 


TCTTAAAGTT 


GAACATATCG 


GTAGCACATC 


TGTGCcTAAt 


TTAGCAGCCA 


1620 


AACCAATTAT 


TGATTTTTTG 


GTCATTGTTG 


AAGAAATTGA 


AAAAGTAGAC 


CTGTTACAAT 


1680 


GGGAATTTGA 


AAGAATCGGT 


TATGAATATA 


TGGGGGAATT 


TGGGCTATCA 


GGACGTCGTT 


1740 


ATTTACGAAA 


AGGCCCGATA 


AAAAGAACAC 


ATCATGTGCA 


TATTTATCAG 


TTTGACAATA 


1800 


CGCAAGAAAT 


TTTGCGCCAT 


CTTGCTTTTC 


GAAATTATTT 


ACGAGAAAAT 


CCGGCAATTG 


1860 


CTACAACCTA 


TGGTACATTA 


AAAAAGCAAT 


TGGCCCAAGC 


ACACCCCGAT 


AGCATTGATA 


1920 


AATATATGGA 


TGGCAAAGAT 


GCGTTCATTA 


AGAAAATAGA 


AAAAGAAGCG 


TTGAAGAAAT 


1980 


ATTGGGAAAA 


ATAAAAAAAG 


TTGCTAAGGC 


TAGCAACTTT 


TTTTTGGTTC 


TTTTTTGAAA 


2040 


AAATGATAAA 


GAAGACCACT 


TAAAAGAACA 


CCTTAGAATAA 


TTCCAATCGC 


TAAATTATGC 


2100 


GTATAGACGA 


TCACACCCAC 


TGTCACGAGT 


AAAATAACAA 


TTTCAGTAAT 


TTCAAAAGTT 


2160 


CGGAAAAGCT 


GCAAACTTTC 


CCAATCAAAG 


GTGTCGACTG 


CCACTGTCAT 


CATGATCCCA 


2220 


ATCAATGCAG 


CTGTAGGAAT 


GGCTACCATC 


ACGGATTTAA 


AAACAAAAAT 


GAATAGAAGG 


2280 


AGGGTAATAC 


CTGAAATCAA 


AGTAGAAAGC 


C6TTTTCGTC 


CTCCAGATTT 


AACATTAATG 


2340 


ACCGCTTGGC 


CAATCATCGC 


ACAGCCAGCT 


TGGCCACCAA 


AAAAGCCGGT 


AATTATATTA 


2400 


GCGAATCCTT 


GTGCTTTAAC 


TTCTCGTTGG 


CTATCACTTT 


GGCTCGCTGT 


CATTTTGTCC 


2460 


ACAATGGGGA 


TCGTCAATAA 


AGATTCAATT 


AACCCGACCA 


TTGCTAAGGC 


AACAGAGCTA 


2520 
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GGTAAAATAA 


TCCACAACGT 


TTCTAAGTTG' ACGGGTACAT 


TAGGCCAACG 


GATCGCTGGA 


2580 


TGT^AACTGAC 


TCATTTCACC 


AAGATCACCT ACCGTTTGCA 


AATGACCTTT 


TAAAAAATAG 


2640 


GACAGGATAG 


TCATGAATAA 


AATAACAATT AAAGCAGGAG 


GCACAATTGT 


AATAAACTTA 


2700 


GGTAATAGAT 


AAATAAGTAA 


GATACTACAG ATAATCATAC 


CGTAGGACCA 


CAGTGTTTGA 


2760 


TGGGGCAATT 


GTTGTACTTG 


CGCCATGAAA ATTAAT^TGG 


tAACGCATTG 


ACGAATCCGT 


2820 


ACATCACTGT 


CTTGGGAATA 


AAGCGCATCA ATTTGTGGAT 


ACCTAAATAA 


CCTAAAATAA 


2880 


ATTGTAGAAA 


GCCTGTTAAA 


ATAGTCGCAG CAAGCATGTA 


TGACAAACCA 


TGTGCTTGAA 


2940 


TTAAGCCTGC 


aAGGACTAAG 


GCCATTGAGC CAGCAGACGC 


CGAAACCATT 


GCCGgACGGC 


3000 


CGCCCGTAAA 


AGTAATCACA 


AGTAAAGTAA cAGctGAAGC 


AAACACTGCA 


GAAAGCGGGT 


3060 


GAACACCAGC 


AATAATCGCA 


AACCCGATTA CTTCTGGTAA 


AATGGCGACA 


GAAGAAACGA 


3120 


GACCGGCAAA 


GAGATCATTT 


CGTTCATTGC CTAACCATTC 


GCTTTTTTTA 


ATACTAAATA 


3180 


ACATCGAAAA 


ACTCCTTTAA 


GTTAATAAAA CCTTATTCAT 


AATAGCATGA 


ATAAGCTGAA 


3240 


ATGAGCGTGG 


GACAAAAATC 


ACTTTGGATT TTTGCTCCaC 


GCTCaAAAAC 


TGATAAACGG 


3300 


CGGGAACAAA 


AGCAACTCCT 


TCGGAAATAA GCCGGAATTC 


TCCAAAAATT 


ACAGAACAAT 


3360 


TTTCGGAAAT 


TCCTTCTTAT 


TTCTCGGAGT TAAACGCTTC 


TGTCCCGACC 


TCATTAGAAC 


3420 


TGTAGCAGTG 


CCAACAACAA 


AATGGCTAGG ACGACTTGAA 


CAGTTCCTAC 


TGATAAACCA 


3480 


TAGATTAAGA 


AACGTTTGCC 


TTCTTGGAAA AACTTTTTAA 


AATCGAGTCG 


TAACCCGATT 


3540 


GCTGCCAAGG 


CAGTAATTTC 


AAACCAAGAA CTAAAGAAAT 


GAGCAGTCTC 


ACTGATCACG 


3600 


ACGGGGAAAT 


GAATCAAACT 


ATTAAAGACA CAGGCAATGA 


AAAAGCCAAC 


TACATACCAA 


3660 


GGTAGGGCGC 


TGCTTTTTTT 


GGTGACTTCT ACCAACTCAG 


CCTCTGATTC 


TGCCGTCTTA 


3720 


CTTTGCTTGA 


AACGTCCAAA 


TAAATAGACA ACTGCTACGA 


GTAGGACAAT 


CCGCATAATT 


3780 


TTAAAGAGCA 


TCGCAAGTTG 


AAcTGCGTTC TCATTGACCA 


TATTGGCACT 


GGCGACAACC 


3840 


TGACCAACTG 


ATTGTAAAGT 


CCCTCCGATT AGTGCGCTAC 


GAGCCAACAA 


ATTGGTACCA 


3900 


TAGAGAATCC 


CACTTAAAAT 


GGGTAAGGTC AACATTAAAA 


CGGTTCCTAA 


TAAGTTTACT 


3960 


AAAGTAATGA 


TTTGGCCTTT 


TTCTTCCTCA TCTGCTTGAA 


TTGCTGGAGC 


AATCGAAGCA 


4020 


ATAGCAGAAG 


ACCCACAAAC 


GGCATTACCA CCAGCCATCA 


ATAAAGACAT 


ATTATCAGAA 


4080 


AAAGCCAGTT 


TTTTACCGAT 


CAAATAGGCA AAAATAATGG 


TCAGCGACAT 


TTGAATGAGA 


4140 


ATAAATGCGA 


CACCTTGTAA 


GCCAATTTGG GCAATCGTTT 


GAAAGGTCAC 


GGTGGTTCCC 


4200 


AATAAAACAA 


CAGAAAATTC 


TAGTAATTTA CTTTCGGCTA 


CTTTGGTTCC 


GCGATTTAAA 


4260 


TTGGCACCGC 


GAACAAAGGT 


ATTTCCCAAA AAGATTCCTA 


GTAAAATAGC 


TATCGTAGCG 


4320 


GCGCCTAAGC 


TAGGTAACCA 


GATGGCAAGA AACTTGCTAA 


GGCAGGCGAC 


AAGGAAAGCT 


4380 


GTTAGCAAGC 


CGGGAAGAAT 


TTGGAATAAA GATTGGATAT 


AATCATTTTC 


TGAATTTTTC 


4440 


ATTGAAAAAA 


TCTCTCTTTC 


TTGTTCATTA CTTCCAGTAT 


AACGAAATTT 


TTTACATTTG 


4500 
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AGAnATAAAT AGTTAAnATA ATATTTAT . 4528 

(2) INFORMATION FOR SEQ ID NO; 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5147 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 





AAGPCAAATT 


AAAAGAAGCA 


GCCAATCTTA 


TGCCAGAAGA 


GCCGTTAATT 


60 


K^nn X X ^ \3\0 X X 


Tf^GCAfJAACT 


ATACTTTACA 


AATGGTCAAT 


TTGTCGAGGC 


TATTACTCGT 


120 


XnX \^nfv\\9\.»t\ 


TTGTAGAAAG 
X x\3 X n.\3nnn\9 


X V3VJw«Aw><»U^O 


CAAATCTCTG 


CTATTTCCTT 

X XXX X X 


GAATGAACGC 


180 


TTAfZfZAAf^TA 


f^TTAPA^^PAT 


\3X X /\oo X \3r\X 


TTTGAAGAAG 


CGGTTCCTTA 

X X ^« w X X aX 


TTTGGAAGCC 


240 




AAGAAftAAAC 


AGACGATCGT 


TTATTCCAGT 

X X X X \«\M^W X 


TAGCTTTCAC 


TTATTTACAG 

X X XXX 


300 


TTAPATf^ACJA 


ATCAAAAAGC 


PAT TGPGTTG 

V^i^ X X \J\^\S X X W 


TTCCAGCAAT 


TAAAAACACT 


AAACCCACAT 


360 


T ATP AATP AC 


TTTATPTATA 


TTTAGCGGAA 


GCGCTCCAAG 


AAGAAGAGCA 


ATTAGAGGAA 


420 


f^rAAAAGCGG 


TGATCGAAGA 


AGGAATTGCG 


GAAAATCCCT 


TCCAAGTAGA 


GTTATACCAA 


480 


TTTGCCTCTG 


AAAATGCCTA 


TCGCTTACAC 


GATATTGCTG 


GTGCTGAAAA 


CTGGTTGTTA 


540 


AAAGCTTTGG 


AATTAGGTGA 


GAAAACAGAT 


GAAAGTCGTT 


TGACCTTAAG 


TAATCTCTAC 


600 


ATCACGGAAG 


AGCGTTTTGA 


GGAAGTAATT 


ACTATCTTAA 


ATGATTTAGA 


GGAAACGGAT 


660 


CATCCTTATG 


CTGAATGGAA 


CTTAGCAAAA 


GCGTACAACG 


AATTAGAAGA 


CTTTGCCGTC 


720 


GCTAAAGTTC 


ATTATGAACA 


AGCAGCGCAA 


GAATTGTCTC 


ACGAACCAGA 


ATTTCTAAAA 


780 


GAATACGCAT 


TGTTCTTACG 


AGAAGAAGGC 


GAACTTGAAA 


AAGCAAAACA 


CTTGTTATAC 


840 


CACTATTTGG 


AACATGAACC 


TGGTGATGTC 


GACATGCAAT 


CTGTGCTAGC 


AGATATGGAA 




GAAAGATAGG 


AAACGACGAT 


GTTTGTTAGT 


TTATCAGAAA 


AGAAAAAATT 


TTTAACTTGG 


960 


TTAGTGAATA 


CCGCGCCTTT 


TGGAAGAAGA 


GAAGTACTGT 


GGATTTTAAA 


TTACTTATTA 


1020 


ACGCACGATG 


CGATTTTAAA 


TAATGTTCAT 


TTTGTTGAGA 


ACGTAGAAAA 


AACCGATCGG 


1080 


GGGATTCGTG 


TGGTAGCGGA 


TGGTTTAGGT 


AAGGAGCCGT 


TATTGCTTTT 


TATCCAAGCA 


1140 


CAAGAATTTA 


CAGATCCGGA 


ACAAATTTTT 


CACGAGATTC 


GGATGAATTG 


GCGGAAAGCT 


1200 


CTTTATTTAG 


AATGTGTATT 


TCCAGAGGCT 


TGGCAAACGA 


GCCAGTATTT 


ATCTGTTTTA 


1260 


GAAGATAATC 


CGTTTGCGCC 


ATGGAATGAA 


CAAGTGGATC 


AGGAAGTTGC 


CAGAGCGATT 


1320 


GATCAATATT 


TTAAGCAAGA 


AGAACAAACG 


CAGCGAATGG 


cATTACTTAA 


AGCTCAAATT 


1380 


GATGATGCTT 


TAGAAACAGG 


CAACAAAGAA 


GCGTTTTTAG 


AATTATCCGA 


TGAATTAAAT 


1440 


CGTTTGAAAC 


AACAATAAAT 


AAAACCCAGC 


ATTTATTGAA 


AATGCTGGGT 


TTTATTTATT 


1500 


GTTTTAATTT 


AAAGCGCCGT 


TGTTTGCTGG 


GTTTTGCATA 


AGTAAAGCCT 


TCGCCAATTG 


1560 
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CTTCATGTAC 


GTTAATGATG 


GACACAAAGG 


CTTTTTCATC 


TATTTCATTA 


GCAATCCGTT 


1620 


TGACTTCCAT 


TATTTCTCGA 


GGGCTGACTA 


CCGTGTAGAG 


AACCCGTTTT 


TCCACTTGTG 


1680 


AATAAGCACC 


TTCACCATTT 


AAATATGTTA 


CCCCACGTTC 


TAAAGAAGCC 


ATTAAACTTT 


1740 


GACCGATTTC 


TTCGGAATGA 


TTGGAAACAA 


TTAAAATTCC 


TTTCGCAGCA 


TAGGCTCCAT 


1800 


CGAGTACAGA 


ATCAACAACA 


CGACTGAAAA 


CAAAGGACAC 


AATTAAAGTA 


TACATCATTC 


1860 


GTTTGACATC 


AATATAGGTC 


AACGAGAGGA. 


GTAAGACTGC 


AACATCAAAA 


AGTAACAACG 


1920 


AGCGGCCCAT 


GCTAATCCCA 


AAGTATTTTT 


CGAAAATGCG 


GGCAATGACG 


TCGGTCCCAC 


1980 


CAGTGGTTCC 


GCCTACGCGG 


TAAACAAGGC 


CACTACCGAA 


TCCTGCGGCC 


AAACCAGCTA 


2040 


AGAGAGCAGC 


AATCAATAAA 


TCATGATCTA 


AATTAATTTC 


AACAGGAAAT 


TTTTGCCAAA 


2100 


ACCACAAAAA 


TACCGATAAC 


GACACGGTAC 


CAATTAGCGT 


ATAATAGAGG 


GAACGTTTGC 


2160 


CTAAAATTTT 


CCCACCAATT 


AATAAGAGCG 


GAATATTGAT 


TAATAATGTC 


GTATATGCTG 


2220 


GGTTAAAATG 


AAATAAGGCT 


CTTAAAATCA 


AAGTAATCCC 


CGTGACACCA 


CcTTCGGCCA 


2280 


GGTCATTGGC 


AATATTGAAA 


GTAACCAAAC 


CAAAGGCATA 


GATACAGCTT 


CCTAATACAA 


2340 


TAAATAACAC 


ATCTTTAAAA 


TAAAATCTTT 


GTTCTTCCAC 


TTAATCACGT 


CCTTTTACAG 


2400 


TATAAACTGT 


ATCATTCTCT 


ATATAAAACA 


ACAAGTAGAA 


CCTTTCCAGA 


TAGCCAAGTT 


2460 


TTTGATAAGA 


TGTAGGAGGT 


AGAGTAAGAA 


AGGTTGGGGT 


AAAATGTCAG 


AATCAGAACA 


2520 


ACGTAGTTTG 


CAATCGATGC 


AAACAGAAGT 


TGATGATTAT 


ATTCAACAGT 


TTAAAACAGG 


2580 


CTATTTTTCA 


CCATTAGGAC 


AAATGGCTCG 


TTTGACGGAA 


GAGGTCGGAG 


AATTAGCCCG 


2640 


TGAAGTCAAT 


CATTATTATG 


GTGAAAAACA 


GAAAAAAGCA 


GACGAAAAAC 


CAAAGACAGT 


2700 


ATCAGAGGAA 


CTTGGAGATG 


TCTTTTTTGT 


ATTGATTAGC 


ATGGCAAATT 


CTTTAGAGAT 


2760 


TGATTTAACC 


GAAGTTTTTG 


AAGAAAATAT 


GGCCAAATTT 


AACAAGCGGG 


ATCGCTATCG 


2820 


CTTTGAAAGA 


AAGGATGGGA 


AAACACATGA 


TTAAAATAAT 


TGTCGCTGGG 


TTTAAAGGAC 


2880 


GGATGGGCAG 


TACTGCTACA 


CAAATGGTCT 


TAGAAACCGC 


AGACTTTGAA 


TTAGTAGGTG 


2940 


TGTATGATCC 


TCATGAAGCA 


CAAGAAACTG 


TTTCTTTCAA 


TGATGAAACA 


GCAATTCCCG 


3000 


TTTTTCAGCG 


CTTAGAAGAA 


GTCCTAGCTG 


TCAAACCAGA 


TGTTTGGATT 


GATTTTACGG 


3060 


TTCCTGAAGC 


CGCCTATCCA 


AATACGCGTT 


TTGCGTTGGA 


ACACGGTATG 


GCGCCTGTTG 


3120 


TTGGGACGAC 


AGGCTTTACG 


GAGG7VACAAA 


TCAACGAATT 


GACAAACCTA 


TCTCGAGAAA 


3180 


AAGCCATCGG 


CGGCTTGATT 


GCGCCAAATT 


TTGCAATTGG 


GGcAGTTTTA 


ATGATGCAAT 


3240 






I AX I 1 iCCJAU 


ATCjTGGAAAT 


CATTGAATTA 


CATCATGATA 


J30u 


ACAAATTGGA 


TGCACCGAGC 


GGAACGGCTA 


TAAAAACTGC 


AGAAATGATT 


CAAGAAGTTC 


3360 


GGCCAGCAAa* 


AAAACAAGGG 


AAcCGCAAGA 


GGTAGAATCA 


ATACCAGGGG 


CACGCGGGGC 


3420 


TGATTTCGAG 


GGTCTGCGAA 


TCCATAGTGT 


CCGCTTACCT 


GGTTTAGTGG 


CACACCAACA 


3480 


AGTCCAATTT 


GGCAGTGTAG 


GTGAAGGGCT 


CACTATTCGG 


CATGATTCTT 


ATGATCGGCG 


3540 
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TTCATJCATG 


ACAGGAGTAG 


CGTTAGCTTG 


tCGGCAAGTG 


GTACAGAGAA 


CAGAACTACT 


3600 


TTACGGATTG 


GAACAGATGC 


TATGAAATTG 


ACAACTATTC 


CAAACGAATT 


TAAAGAAGCC 


3660 


GCTCCCGTTA 


TTCGTGAAAT 


CAATGCACAG 


GGCTTTGAAG 


CGTATTTTGT 


CGGGGGCAGC 


3720 


GTCCGTGATG 


CTTTATTAAA 


TAAACCAATC 


CATGATGTCG 


ATATTGCGAC 


AAGTGCCTAT 


3780 


CCAGAAGAAA 


TTAAGCAAAT 


TTTTAAACGA 


ACCGTTGATG 


TGGGCATCGA ACATGGCACA 


3840 


GTCCTTGTAT 


TGATGa^lGA 


CCAACAGTAT 


GAAGTAACAA 


CCTTTCGTAC 


CGAGTCAACT 


3900 


TATCAAGACT 


TTCGCCGACC 


AGATGAAGTC 


ACCTTTGTTC 


GCTCCTTGAA AGAAGATTTA 


3960 


AAACGGCGAG 


ATTTTACGAT 


TAATGCCTTG 


GCGCTCGACA 


GCACAGGTGA AATCATTGAT 


4020 


TTATTTGATG 


GTATCGAAGA 


TTTAACTAAT 


CAAACCATTC 


GTGCAGTCGG 


TAATCCCCaT 


4080 


GAGCGGTTTC 


ATGAAGATGC 


TCTGCGAATG 


ATGCGGGGcT 


TACGCTTTGC 


TAGCCAATTA 


4140 


GATTTTAAAA 


TTGAAGAAAA 


AACCTTAGCC 


gCTATCGCTG 


AATTTCATCC 


TTTATTGGAA 


4200 


AAAATTTCAG 


TAGAACGCAT 


TACAATTGAA 


TTTGTAAAAA 


TGCTTTTAGG 


GGTTAATCGA 


4260 


CAAGGTGGCT 


TAGCTCCGTT 


TATTGAAACA 


GAATGCTATC 


AATACTGTCC 


AAAATTACGA 


4320 


GAACAAGGGG 


CTGGACTTTT 


TCGTTT7UVTG 


GACTTGCCTG 


CTCGTCAAAT 


TGAAACAGAA 


4380 


GCAGAAGCGT 


GGACGTTACT 


TATCCAATCG 


TTGAATTTAC 


CAGAAGCAGA 


AATTCGCTCC 


4440 


TTCTTAAAGG 


CTTGGAAACT 


TTCAAATCAA 


TTAATCCAAA 


ACGTTTCACA 


ATTAGTACGC 


4500 


GGTTTGCGTT 


TTCGCTTGAG 


TAACGACTGG 


CAGCCAATGA 


TGCTTTATGA ATTAGGCGAA 


4560 


GAATCAGCAG 


TACTTGTTGA 


AAGGCTTTTA 


TATTATTACC 


AACAAGAGAG 


TCAAGTACAA 


4620 


GTAACAAAAG 


AGTTAGTTAA 


GGCATTACCC 


ATCCATCAAA 


GACATGAATT 


GGCCATAACA 


4680 


GGAAAAGACC 


TTTTAGCCGT 


CTTAGAAGAA 


ACACCAGGCA 


AATGGTTAGG 


GGAACTGATT 


4740 


GCTGAAATCG 


AGCAACACGT 


TGTTGAAGGT 


AGCTTAGAGA 


ATAAACAAGA AGTTCTGCTT 


4800 


TCGTTTGCCA 


AAAAACAACG 


TTCAAAAGGA 


GAGAAAGCAT 


AATGCCAACA GAAATGTTTA 


4860 


CAAAAGAATT 


CTTGGTGAAA 


GAATCACAAA 


CGGCTArAAT 


GCTCGGTTCT 


GGTGACTTAG 


4920 


AAGTTTTAGG 


AACCCCTGCT 


TTAGCTGCCA 


TGATTGAACA 


AaCAGCCAAA 


GAaGCTGTGA 


4980 


AGGACCAwTT 


ATCAGTTGGG 


GAAaCAaCTG 


TCGGAaCGGT 


GTTAGAATtG 


CGCCACTTAT 


5040 


TTCCATCCGC 


GGTAGGGGCG 


ACAATCGTTG 


TTACGATGAC 


CAGCATTGAA CAGACTGCAC 


5100 


ACAAAATACG 


CTATGAATTT 


GTTGCTTATG 


AAGGAGAACG 


ACAAATT 




5147 



(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
XD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
AAGAATTTAG ACAACTATTA GCCGCAAAAG GCATTGAATT GACAGACCAA CAGATGACAC 



60 
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AATTTGACCA 


ATATTTCCAT 


TTACTAGTAG AATGGAATGA AAAAATGAAT TTAACAGCCA 


120 


TTACAGAAGA 


AAAAGAAGTT 


TATTTGAAAC 


ATTTCTATGA 


TTCTGTTTCG 


TTGGCaTTCT 


180 


TTGAAGACTT 


TGCCAGTGAC 


AAAGCCATTT 


GTGATGTTGG 


CGCAGGCGCA 


GGATTTCCAA 


240 


GTATTCCTTT 


GAAAATTGTT 


TTTCCTTCAT 


TAAATGTCAC 


AATTGTTGAC 


TCTTTAAATA 


300 


AACGAATTAC 


TTTCCTAACA 


GAGTTGGTGA 


ATCAACTAGG 


TTTGGCTAAT 


GTTTCTTTGT 


360 


ATCATGATCG 


TGCAGAAACA 


TTTGGTCAAA 


AAGCAGAATT 


TCGTGGCGCG 


TTTGATTACG 


420 


TTACAGCGCG 


AGCAGTTGCA 


CGATTAAATG 


TTTTAAGTGA 


ACTGTGTTTA 


CCCTTAGTTA 


480 


AAAAAGAAGG 


CTATTTTTTA 


GCCTTAAAAG 


CATCGAAAAG TGAAGAAGAA ATTAATGAAG 


540 


CAAAACCTGC 


AATCGCTACA 


TTAGGCGGTC 


AATTTCAAAA 


AGAAGTTGGT 


TTTACTTTGC 


600 


CGATTACGGC 


AGACGAACGC 


CACATTGTCG 


TGATTCAAAA GAAAAAAGAA 


ACCCCTAAAA 


660 


AATATCCCAG 


AAAACCTGGA 


TTACCAAATA 


AACAACCGAT 


TAAATAAGAT 


GGAGGCACAC 


720 


ACATGACACG 


AATTATTTCT 


GTAGCGAACC 


AAAAAGGCGG 


CGTTGGTAAA 


ACAACGACCA 


780 


CTGTAAATTT 


AGGCGCATGT 


TTAGCAAACT 


TAGGCAAAAA 


AGTCTTACTG 


ATCGACATTG 


840 


ATGCACAAGG 


AAATGCTACC 


AGTGGCATGG 


GTGTTCCTAA 


ACCAGATGTC 


GCACACGATG 


900 


TGTATGATGT 


CCTTGTAAAT 


GAAGAACCAA 


TCACTAGCGT 


GGTTCAACAT 


ACCAGCCGTG 


960 


AAAATTTAGA 


TATTGTCCCT 


GCAACTATTC 


AATTAGCAGG 


CGCTGAAATT 


GAATTAACTT 


1020 


CAATGATGGC 


TCGAGAATCG 


CGCTTGAAAT 


TAGCAATTGA 


TGAAGTAAGA 


GACATGTATG 


1080 


ATTTCGTTTT 


AATTGATTGC 


CCGCCTTCTC 


TAGGGCATTT 


AACAATCAAT 


GCTTTTACAG 


1140 


CAAGTGATTC 


GATTCTAATT 


CCTGTTCAAT 


GTGAATATTA 


CGCATTAGAA 


GGCTTGAGTC 


1200 


AATTGCTAAA 


TACdATTCGC 


TTGGTTCAAA 


AACATTTTAA 


CCCTGAATTG 


AAAATTGAAG 


1260 


GGGTCTTGTT 


AACCATGTAC 


GATGCTCGGA 


CCAATTTAGG 


TGCCGAAGTC 


GTTGAAGAAG 


1320 


TCCGTAAATA 


TTTCCGTGAA 


AAAGTCTATG 


ATACGATTAT 


TCCAAGAAAT 


GTCCGTTTAT 


1380 


CCGAAGCACC 


AAGTCATGGT 


TTACCAATCA 


TTGACTATGA 


TATTCGCTCA 


AAAGGTGCCG 


1440 


AAGTGTATCA 


AGCACTAGCA 


AAGGAAGTGT JGGAGAATGA ATAAAGGCAA AGGTTTAGGc 


1500 


AGAGGCATCG 


ATGCCTTGTT 


TCAAGATATC 


GCAAaGCTAG 


AAGATGTCGA 


CGTTAAGAaC 


1560 


GAACAAGTGA 


CAGAAATTCT 


ATTAAACGAA CTTCGTCCCA ATCCTTACCA ACCGCGGAAA 


1620 


ACATTTGATG 


AAACCTCTTT 


ACAAGAACTT 


GCAAACTCGA TTCTACATTC 


CGGTGTGTTT 


1680 


CAACCGATTA 


TTGTTCGTAA 


ATCGGCAGTT 


AAAGGCTATG 


AAATTATTGC 


AGGTGAGCGT 


1740 


CGGTTCCGGG 


CTTCTAAATT 


AGCCGGCAAA 


GAAAAAATTC 


CCGCAATTAT 


TCGCGAATTT 


1800 


GATGAAGAAT 


CCATGATGCA 


AGTTGCTGTT 


TTGGAAAACT 


TACAAAGGGA AGATTTAAAT 


1860 


CCGCTAGAAG 


AAGCAGAAGC 


CTATGAAATG 


CTAATGAAAA 


ATTTAAAATT 


AACACAAGCG 


1920 


GAAGTCGCAG 


AACGTTTGGG 


CAAAAGTCGC 


CCGTATATTG 


CCAATTATTT 


ACGGTTGCTA 


1980 


ACTTTACCAG 


ATGCGGTCAA 


AGCAATGGTT 


CAAAAACAAA 


GCATGTCAAT 


GGGTCAAGCA 


2040 
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CGGACGTTAT 


TAGGCTTGAA 


GAACAAAGAA 


CAGCTATTGC 


CATTAGCTAA 


TCGTTGTATT 


. 2100 


AAAGACAATT 


TAACGGTTCG 


ACAATTAGAA 


CAACTTGTCG 


CTGAATTAAA 


CGAAACACAA 


2160 


GGCAAAAAGG 


GCAAAAAAGC 


GAAAAAAGCC 


ATCAAGGAAA 


AACCCATTTA 


TATTCGAGAA 


2220 


AGTGAAGACC 


GCTTAATGGA 


TAAATTTGGG 


ACAACCGTAG 


CGATTCAAGA 


AAAAGAAGGC 


2280 


AAAGGAAAAA 


TTGAAATTGA 


GTACTTGTCC 


TCTTCGGATT 


TGGCTAGAAT 


TTTAGATATT 


2340 


TTAGATATTC 


ATTTTGACGA 


GGAATAAAAA 


GCAAAGTGAG 


AAAGTAAGAC 


AGAAAGTCAC 


2400 


AAATCCTTTT 


GCTAGTTTCG 


GTAAAAAATG 


GTATTGTAAT 


CATGGTATTT 


TTATATAAGT 


2460 


AAGAGAAGAA 


AGAAGGCTTG 


GAACATGTAT 


GATTTAGGCG 


ATATTGTCGA 


AATGAAAAAG 


2520 


CCGCATGCTT 


GCCAAGCAAA 


TCGTTGGCAA 


ATCATTCGCA 


TGGGCGCTGA 


TATCAAAATC 


2580 


AAGTGCACGA 


ATTGCGGGCA 


TATTGTCATG 


ATGCCGCGTC 


GTGATTTTAC 


TAAAAAATTA 


2640 


AAAAAAGTAA 


TCGAGAAAAA 


AGCAGACTAA 


TGCTTTGATG 


CCTCTAATAG 


AAAAGAGAAT 


2700 


AGACTGAGAG 


AAAGAGTGGA 


AAACAAATGG 


CACTAACAGC 


TGGAATTGTC 


GGCTTACCAA 


2760 


ACGTTGGTAA 


ATCGACCCTA 


TTTAACGCAA 


TTACAAAAGC 


AGGAGCAGAA 


GCTGCAAATT 


2820 


ATCCCTTTGC 


AACGATTGAT 


CCGAACGTCG 


GCATGGTTGA 


AGTACCAGAT 


GCCCGTTTAC 


2880 


AACGCTTGAC 


AGAACTAGTA 


AAACCGAAAA 


AAACAGTTCC 


CACAACGTTT 


GAATTTACAG 


2940 


ACATCGCCGG 


AATTGTTAAA 


GGTGCCAGTA 


AAGGAGAAGG 


GCTAGGAAAT 


CAGTTTTTAA 


3000 


GTCACATTCG 


TCAAGTAGAT 


GCAATCTGTC 


ACGTGGTTCG 


TTGTTTTGAT 


GACGAAAATA 


3060 


TCATGCGTGA 


ACAAAATCGG 


GATGCGGATT 


TCGTTGATCC 


TTTAGCAGAC 


ATTGATACAA 


3120 


TTAATTTAGA 


GCTAATTTTA 


GCAGATTTAG 


ATTCAATCAA 


TAAGCGCTAT 


ACTCGCGTGG 


3180 


CTAAAATGGC 


GAAAGCAAAA 


GATAAAGAAG 


CGGTGGAAGA 


ATTAGCTGTT 


TTAGACAAAA 


3240 


TTAAACCAGT 


TTTAGAAGAA 


GGTTTATCTG 


CTCGTACGAT 


TGAATTTACG 


CCAGAAGAAG 


3300 


AGAAAATTGT 


TAAATCTCTT 


TTCTTATTAA 


CCACAAAGCC 


GGTCCTTTAT 


GTAGCAAATG 


3360 


TTTCAGAAGA 


TGAAGTGGCG 


GATCCAGACA 


ACAACGAGTA 


TGTACAACAA 


GTTCGTAATT 


3420 


TTGCAACCAG 


TGAAAATGCG 


GAAGTAATCG 


TAGTTAGTGC 


GCGAGCAGAA 


GAAGAAATTG 


3480 


CTGAATTGGA 


TTCTGAAGAA 


GACAAGGCTG 


AATTTTTAGA 


AGCAATGGGC 


ATTGAACAAT 


3540 


CTGGCTTAGA 


TCAATTGATT 


CGTGCGGCTT 


ACGATTTGTT 


AGGCTTAGCT 


ACCTACTTTA 


3600 


CAGCTGGTGA 


ACAAGAA6TT 


CGTGCATGGA 


CGTTCCGTAA 


GGGAATCAAG 


GCACCTCAAG 


3660 


CAGCTGGGAT 


TATCCACAGT 


GACTTTGAAC 


GAGGCTTCAT 


TCGTGCGGAA 


ACAGTTTCTT 


3720 


ATGAAGACCT 


AGATAAATAT 


GGCAATATGC 


AAGCTGCCAA 


AGAAGCGGGC 


CGTGTTCGTT 


3780 


TAGAAGGAAA 


AGACTACGTC 


GTTCAAGATG 


6CGATGTTAT 


GTTGTTCCGT 


TTTAACGTTT 


3840 


AAAAAAATGT 


GCCCAACAAG 


CAATTTGTTT 


TGGGATAAGC 


CCACATATTT 


AAGGCCATTC 


3900 


AGCCGTTTAG 


AAAGAGGTTC 


CCTCTACATG 


AGGGTGAGGA 


GGATACAACT 


ATGGAAGCAG 


3960 


AAGCATTACG 


AGCAATCGTG 


GCTGAGAATC 


GTCAATTAGA 


ACAGAATTTG 


ACTAAAAGAA 


4020 
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ATliAACAATA 


CATTTTTGAC 


TTAAAAAAAT 


CATTAAAAGC 


GGCTAATTTA 


TCTGAAGAAG 


4080 


AGCTAGCATT 


GGCTCTACAC 


GGTATTTTGC 


CAGAATTAGT 


AGCCGGTCAA 


AAAACAGGTA 


4140 


AAACAGCCCG 


TCAGTTATTT 


GGCACTGTTT 


CAGAACGGAC 


GGAAGCAATT 


TTAAATAAAC 


4200 


CAGCAGAAGT 


C/WVGAACCA 


GCGGGCTGGA 


TGATTTGGCT 


CGACAATACC 


TTACTATTAC 


4260 


TTGGTTTATT 


AACAATCATG 


TTAGCGGCGA 


TGTCTCTATT 


TTCAAAAGGA 


ACCGCACAAC 


4320 


CTCTCGGATT 


AACCACAIAT 


ATCTTAGGTG 


CGATGGCTGG 


GGGTTATGTT 


TTCTATCTAA 


4380 


TGCACAAATA 


TGTTTACCGT 


TTTGATCGTC 


AAGGTGGCGA 


CAAAAGCAAA 


CGTCCTGGTT 


4440 


GGTTAAAAAC 


GACCGCCATT 


CTATTTGGTG 


GGATGTTCTT 


GTGGATTGCA 


GTTTTCGCTG 


4500 


GTTCAGCGAT 


GCTACCACCA 


GTGATCAATC 


CCATCTTAGA 


TCCGATGATT 


GCTTTAGTAA 


4560 


TCGGTGCfGTT 


AgcCTTTGTA 


GCACGCTATT 


TCTTCAAGAA 


AAAATACAAT 


ATTCAAGGTA 


4620 


GTTTCATGAC 


ACGTCAATAA 


ACGGCGCGCG 


CTCAATTGTG 


AATGCGCTCG 


GcTGACGCAG 


4680 


AAACTGCAAC 


AAGCAAAGTC 


GCTCTCTCAA 


TAGAGCGTTT 


TAACAGTTAA 


AGCCTGGGGG 


4740 


ATTAGTCGAA 


AAAACTAGTG 


TCCCAGGCTT 


CTGTTTTCAG 


AAGTTTGCTA 


AAGAAAGGAG 


4800 


CCTCTTATGG 


GACAACAAGT 


GAAGTCTTGG 


TGCCGTCAAC 


ACGACCGGCT 


ACTCATTTTT 


4860 


ATTAGTTTTC 


TCTTATTAAG 


TGGACTTTCT 


CTCTATTTTG 


TCTATTTTCA 


AGAGAATTTT 


4920 


TTACGAGCAA 


GTGATTTCCG 


CTTTCATCAA 


AATCGGGTGG 


AAGGTCTCGC 


ATTAGCAATT 


4980 


AAAAACAACG 


ACTGGTTTCC 


TAAAATCAAT 


TATTTTTTCT 


TGGGAGGCTA 


TGGCTATGCC 


5040 


TCTAGTTTGT 


TTTATCCAGA 


TGCGTATCTT 


TATTTACCTG 


CGCTTTTGCG 


TGTTCTGGGG 


5100 


ATTTCCTTTG 


TCGCTAGTAT 


GGCGATTTTT 


GTTTTTGCGG 


TAAACTTGGC 


AACATTTAGT 


5160 


CTTACTTATT 


ATGCAGGCCG 


ATTAATGGCG 


CTCTCTAAAA 


AAAGAAGCTA 


TCTCTTTGCC 


5220 


ATTTTGTATG 


GCTTATCTAT 


TTACCGCATG 


CAAGACTTGT 


TTAATCGCCA 


AGCGTTAGGA 


5280 


GAATTTTTGG 


CGCTAAGTTT 


TTTTCCGTTG 


GTCTTAGCGA 


GCCTGTTTTT 


ATTACGAAAG 


5340 


GGAACCACGA 


AATGTTGGCC 


GTTGTTAACG 


CTGGCGATGA 


CTGGGATTGG 


GTTGGCTCAC 


5400 


TTTATTTCTA 


TTGAAATGGT 


TTCTATTTGG 


ATTGGGTTAT 


ACATTCTGTT 


TTATTGGCAG 


5460 


CAGTTTTTTA 


AAAAGGAGGT 


CCTGTTGGCA 


TTAGCGAAGG 


CTGCAGGTTT 


GACGCTGCTT 


5520 


TGGCTGGCCT 


TTTATCTTTT 


ACCAGTTGCT 


GAACAAATGA 


AAAATCAAGT 


GTTTAAAGTT 


5580 


ACGTCCAATC 


CACTCACTTA 


TATTTCCGAA 


AGAAGTTATC 


CTATTGTTTC 


TCTTTTCATT 


5640 


AATAGTTTAA 


AGAGCAGTGT 


TTTTCACGCG 


AAAACAGCGA 


ATCTAGGAAC 


CCTTTTGTTT 


5700 


GTGGGGCTTG 


TTGTAGCAGT 


GGTTTCGCTT 


GCCAGCAAAA 


AAATTCAGAA 


CAAACGGTTT 


5760 


ATTGGTTTGA 


CGCTGGTACT 


CTTATTGATG 


GTTACCACGC 


TTTTTCCTTG 


GCACTGGCTG 


5820 


AACCACACGC 


CTTTAAATAC 


AATTCAGTTT 


CCTTGGCGGT 


TTTTAGGGAT 


TCTTTCTGTA 


5880 


ATGCTCGCTT 


TTTTCATTGC 


ACAAGACGAA 


TGGGGCGTTT 


TCCGTAAATC 


TTGGACCGTG 


5940 


GCTTTGTTGG 


TTTTCCTTGC 


CATCAGCAAT 


TTGGGCATTT 


ATCAATATCA 


AAGTATTCAG 


6000 
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TCACAACAAG 


GCCGCCTGTT 


AACAAAAGCT 


GAGTATGAGC 


AGCCAGCGCC 


TTTTTATATC 


6060 


GGTGCGGGCC 


ATGAATATTT 


GCCAGATGAA ATCAATTATC 


AGGAATTATT 


GAAACAGAAA 


6120 


AAACGACCAC 


TT6ATTATTC 


6GCAGAACAA 


GTAACCATTA 


CAAATGTTAG 


AaTGCCTTAT 


6180 


GGGAAAATTT 


CATTTGACTA 


CCAAGTGGTG 


AtCAATCAGC 


GAAGGTGACC 


GTTCCTTTTA 


6240 


TTTATTACTT 


AGGCTATCAA 


GCCACGATTC 


AAATGAAAAA 


CCAAACGGGC 


GCTAAAAAAA 


6300 


TGAGCCTAAC 


CAATCAAGGA 


GGCTTAACCG 


CTCTTTCTTT 


ATCTGGTACA 


GGACATGTCG 


6360 


ACATCCGTTA CCAAAGAACA AAAGTGCAAA AAATAGGGAC AATGATTACC TTACTTTCTG 


6420 


TTGGTGGTTT 


TGGATTTAGC CGATTTTTAC AACAAAAAAA GAAACACAAG ATAAAAGAAC 


6480 


AACGATGAAA 


GTAGGACCTT 


AATTAGGAAG 


AGTTGAATAA 


AGGACAAACC 


GATTTTGGCG 


6540 


TTTgTCCfTTT ATTCCTCATT GATTATTTAA ACAAATCTAG 


TTGACTTCCT 


TAGTATTATC 


6600 


TGTTATTTTC ATATATAAAA TTAAGTCGAG GAGTGGACAA AAAAATGTCA AAyTGGGT^AA 


6660 


CAAAATTTGC 


6AAAAAAGGT 


CTTACATTTG ATGATGTGCT 


ATTAATCGGT 


GCAGAAAGCC 


6720 


ATGTCTTACC 


AAATGATGTA 


GACATGAGTG 


TACAATTAGC 


TAAAAATATC 


AAGTTAAATA 


6780 


TTCCATTGAT 


6A6TGCCAGT 


ATGGATACTG 


TTACAGATAG 


TAATATGGCA 


ATTGCGATGG 


6840 


CTCGTCAAGG 


TGGACTAGGT 


GTCGTACATA AAAATATGAC 


TGTCGCCCAA 


CAAGCAGATG 


6900 


AAGTACGTAA AGTCAAACGT 


TCTGAAAGCG 


GCGTTATCAT 


TGATCCATTC 


TTTCTAACAC 


6960 


CAACAAACTT 


AGTAGCGGAT 


GCAGAAGAAT 


TAATGAGTCG 


TTACCGTATT 


AGCGGTGTGC 


7020 


CAATTGTAGA 


AACAATGGAA 


AACCGTAAAT 


TAGTCGGAAT 


TATTAC7\AAC 


CGTGATATGC 


7080 


GTTTTGTAAC 


GGACTACCAA 


ATCAAAATCG 


AAGAAGTCAT 


GACAAAAGAT 


CATTTAGTCA 


7140 


CTGCGCCAGT 


TGGCACATCT 


TTAAAAGATG 


CAGAAAAAAT 


CTTACAAAAA 


CATAAAATCG 


7200 


AAAAATTACC 


AATCGTTGAT 


GAAGCAGGTC 


GCTTAAGTGG 


CTTAATTACA 


ATTAAAGACA 


7260 


TTGAAAAAGT 


GATTGAATTC 


CCAAATGCTG 


CCAAAGATGA 


ACACGGTCGT 


TTACTTGTGG 


7320 


CGGCAGCAGT 


CGGTGTCACT 


AGTGATACGT 


TCGAACGTGC 


GGAAGCATTA 


TTAGAAGCAG 


7380 


GTGCGGATGC 


GATTGTCATC 


GATACTGCAC 


ATGGCCATAG 


TGCCGGCGTT 


ATTCGCAAAA 


7440 


TTAAAGAAAT 


TCGTGAAACG 


TTCCCTGAAG 


CAACATTAAT 


TGCTGGTAAT 


GTTGCCACTG 


7500 


CTGAAGCAAC 


CAAAGCATTG 


TATGATGTCG 


GCGTTGACGT 


TGTCAAAGTT 


GGTATT6GAC 


7560 


CTGGTTCAAT 


TTGTACAACA 


CGTGTCGTTG 


CTGGTGTTGG 


TGTTCCTCAA 


TTAACAGCGA 


7620 


TTTATGATGC 


TGCTTCTGTA 


GCTAGAGAAT ATGGTAAAGC 


AATTATCGCT 


GATGGCGGTA 


7680 


TTAAATATTC 


TGGTGATATC 


GTGAAAGCCT 


TAGCTGCTGG 


TGGACATGCT 


GTAATGCTA6 


7740 


GAAGTATGTT 


AGCAGGTACA 


GATGAATCTC 


CAGGCGAATT 


TGAAATTTAT 


CAAGGCCGTC 


7800 


GTTTCAAAAC 


TTACCGTGGC 


ATGGGCTCAT 


TAGGTGCAAT 


GGAAAAAGGA 


TCAAGTGATC 


7860 


GTTACTTCCA 


AGGAAGTGTC 


AATGAAGCAA 


ACAAATTAGT 


GCCAGAAGGA ATCGAAGGAC 


7920 


GTGTTGCTTA CAAAGGAAGC GTTTCTGATA TTGTCTTCCA ATTAATCGGT GGCTTGAAAT 


7980 
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CAGGGATGGG 


TTACGTCGGT 


GCAGCAGACC 


TGAAAGCGTT 


GCGTGAAGAA 


GCACAATTTG 


8040 


TTCAAATGAG 


TGGTAACGGA 


TTGAAAGAAT 


CACATCCACA 


TGACGTTCAA 


ATCACAAAAG 


8100 


AAGCACCAAA 


CTATTCTGTT 


CAATAAAATA 


AAAGCTGATT 


TTTTTGAAAT 


TTAAAAGAGG 


8160 


AAGACGTCAT 


TTCGACGTCT 


TCCTCTTTTT 


CTGTAGAATG 


AAGGATTTGC 


TTCTGTTTCC 


8220 


GCCGTTTATC 


AATTTTTAnG 


CCTGTAACAA 


AAaTCCAAAG 


TGATTTTTGT 


TACAGGCtCC 


8280 


TTTTCCTATA 


GAATAAAAGC 


CGTTATCTGA 


TATCAGACAA 


CGACCTTCAT 


CTTCCCTTTA 


8340 


TTTAATAACT 


GTTAGGTTAC 


CCATGTAAGG 


AACTAGGACT 


TCTGGTACAG 


TTACAGAGCC 


8400 


ATCTTCGTTT 


TGGTAGTTTT 


CTAAAATAGC 


AGCGACTGTC 


CGTCCTACGG 


CTAAACCAGA 


8460 


ACCATTGAGT 


GTATGCGCAT 


ATTGGACTTT 


ATCATTTTCA 


TCACGGTAAC 


GAATCATCGC 


8520 


ACGGCGCGCT 


TGGAAATCTT 


CACAGTTTGA 


GCATGAACTG 


ATTTCACGGT 


ACGTCTCTTG 


8580 


AGCAGGAATC 


CAAACTTCCA 


AGTCATAAGT 


TTTCGCAGCT 


GAGAAGCCCA 


TGTCACCTGT 


8640 


TGAAAGAGCC 


ATGACACGGT 


AAGGTAATCC 


CAGTTTTTGC 


AGAATTTCTT 


CGGCATTATT 


8700 


CGTCATTTTT 


TCTAATTCCT 


CATAAGAATG 


CTCTGCATCA 


CTAAATTTAA 


CCATTTCGAC 


8760 


TTTATTAAAT 


TGGTGTAAAC 


GAATTAATCC 


TCGTGTGTCA 


CGACCAGCGC 


TACCAGCTTC 


8820 


AGAACGGAAA 


GAAGGGCTCA 


aGGCCGTAAa 


GTAAATCGGT 


AAaTCCTTGC 


CATCTAAAaT 


8880 


TTCATTGTTG 


TAGTAATTGG 


TTAAAGGGAC 


TTCCGCAGTT 


GGAATTAACG 


TTAAATCCGT 


8940 


ATCTTGTAAT 


TGGAAGACAT 


CTTCTTTAAA 


TTTAGGAAAT 


TGGCCAGTCC 


CGAACATGGC 


9000 


GGTGTCATTA 


ACGATATAAG 


GCGTAATCAT 


TTCTGTATAG 


CCATGTTCAT 


AAACATGTAA 


9060 


ATCTAACATG 


AAGTTGTATA 


ACGCACGTTC 


TAAGCGTGCA 


CCTAAGCCTT 


TGTAATAAAC 


9120 


AAAACGGCTA 


CCAGCTACTT 


TGGCACCACG 


TTCAAAATCT 


AGAATGCCTA 


AGTTTTCAGC 


9180 


GACTTCCCAG 


TGTGGTTTTG 


GTTCAAAAGC 


AAAAGTTCTT 


GGTTCACTCC 


AACGACGTAC 


9240 


TTCTACATTT 


TCTTCTTCAC 


CAGCACCAAC 


AGGCACAGAA 


TCATCTGGCA 


AATTCGGTAA 


9300 


AGTAGTTGTG 


ATCCCACGTA 


GCTCTTCGTC 


AATAGCGTTA 


ATTTCTGCAT 


CTAATGCTTT 


9360 


AATATTGCCG 


CCGACTTCTT 


TCATTTCCGC 


GATTTTGGCC 


GTTGCATCTT 


CTTTGTTTCG 


9420 


TTTTAATTGA 


GCAATTTCAG 


CAGACACATC 


ATTCCGGTAT 


TTTTTCATTT 


CTTCGACTTT 


9480 


GACTAATAAA 


TCACGGCGAC 


TTTCATCTAA 


ACGCAGAAAT 


TCCACTAAGA 


TTTCTTCTTT 


9540 


CAC6CCACGG 


GTTTGTAGCT 


TTGCTTTTAC 


TTCGTCAAAA 


TTTTGACGCA 


TCATTTTTAC 


9600 


ATCTAACATA 


TTTTTTCCTC 


CTTAAAATTA 


GTACATAATC 


AATAATAACT 


GTTGTAGAAA 


9660 


ACAACAAAAA 


AGCCATCTCA 


CCCCAAAAAT 


AAATATTCTT 


GGGACGAAAT 


GACTGGAAAT 


9720 


CTCGCGGTAC 


CACCCAAGTT 


CAGTAATGTT 


ACTGCCCTTG 


ATTTGCAGAT 


AACGGCTGCA 


9780 


GACCGTTCCA 


ACTTATTTCG 


CTGGACACCT 


GGAAGAGACG 


GACTTCTTTA 


TAGATGAATG 


9840 


TTTGCTCGCA 


CCCCCGCAAA 


CTCTCTGGAA 


TTCAGTGCCA 


. TAAATACTTT 


TTCTCTTAGT 


9900 


TTTCTCTTTT 


CTATAGAATA 


CCGAAAAAGT 


TCAGGGATGT 


CAATTAACTC 


TCTGATTTCT 


9960 
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TGTTGGTACG 


CTGTAAGGGT 


AGATGAATAA' 


TAAAGGAAGT 


CCACTTTTGA 


TCAGACTTCG 


10020 


CATAAATATA 


TCCACCATGT 


AAAGCGACAA 


TACTTTGTGC 


AATAGCCAGA 


CCTAAACCAG 


10080 


TACCCCCCGT 


TTCTTGTGAT 


CGGGATTCTT 


CGACACGATA 


GAAGCGGTCA 


AACAACTGAT 


10140 


CTAAAGAATG 


TTTAGGGATA 


GCTGGTCCAT 


CATTGCGAAC 


AGCAATAATT 


GCTTCGGTGC 


10200 


CGACTTTGTC 


TACTTCCATT 


ACAATATGAT 


GGCCACCTTT 


GCCATATTTC 


AAAGCGTTTG 


10260 


AAAGAAGGTT 


GTTAAAGACT 


CGGACTAGTT 


TTTCCGTATC 


GCCTTCCATC 


ATTAAAGAAG 


10320 


CTGGATTGGC 


TTTTACTTGG 


ATTTGCATAT 


TAATCTTTTT 


CGCTTCCAAT 


TCAAAATCGG 


10380 


CAGCAAGCTG 


TTCAATTAAT 


TGTGCCATAT 


CAAAAGTCGT 


CGTATGAATC 


GGTACACTTG 


10440 


GTTGACGGAC 


TTTTGTATAT 


TCAAAAAGAT 


CATCAACTAA 


TAATTTCATT 


TGTTTCGCTT 


10500 


TGACGTAAGC 


GGTATGCGTA 


TATTTCAGTA 


AATCTTCTTG 


GCTATGAAAT 


TGACGATCTT 


10560 


CAATCAATCC 


CAGATAACCA 


ATAATAGAAG 


TTAACGGTGT 


CCGAATATCA 


TGACTGACAT 


10620 


TCGTAATTAA 


TTCATCTTTT 


GATTTTTCAA 


TTCGCCGTTC 


ATCTTCAATG 


GCTGCGACAG 


10680 


TACTATCCAC 


CAGACCGTTA 


ATACTCGTCA 


CGACTTTTGC 


TAAATCACCA 


CTTAATTCAA 


10740 


ACGGAATGCG 


GTGATTGTAA 


TTACCATCAG 


CGATGTAATG 


TAACTCACTG 


ATAATATGAC 


10800 


GCAATTGCAT 


TTGATGGTAG 


CGACGAATTA 


AGCGCCAATA 


GAGAACAGCT 


GCATCTACTA 


10860 


AAAAGAAAAT 


GGGTAAGACA 


AATTTATTCC 


AGCTCCAAAA 


CAAATCAGTA 


TTTAATTCTT 


10920 


TAGCGACCGC 


ATCCTTAGTG 


GCCCAGATGA 


CACCAGCTAA 


GGGCGAGTTT 


TCACTAAGTA 


10980 


GTTGTCCTAA 


AAGGAACATA 


ATTGAGACAT 


TTAATAGTAG 


TAATAA7\ACA 


ATTGTCACAA 


11040 


TTGCTTCTGC 


TAATAACTCG 


CTGATTTCTT 


TAGAAGTTAA 


GGTAATTCGT 


TTTTGTTTTT 


11100 


CAATCTTTTT 


AGGTTTAACG 


AGCATCGATC 


TTATATCCAA 


CTCCCCACAC 


TGTTTGAATG 


11160 


ACTTTTTCTC 


CACCAGTTGC 


TTCTTCAATC 


TTATCTCTTA 


AATGACTGAC 


GTGAACCATC 


11220 


ACGGTTTTAG 


CAGAAACAAT 


ACTTTCTTGT 


TGCCACACAC 


GCTCAAAAAT 


TTCATCCGCA 


11280 


CTAAACACAC 


GATTTGGATG 


ACTAGCTAAT 


AGATAGAGGA 


TACCAAACTC 


AAGAGCGGTT 


11340 


AATTGAATTT 


CTTTTCCTTC 


TATTGTTTTG 


ACTTCGTGAG 


AGTCCTTATT 


AATCATTAAT 


11400 


GGGCCGACTT 


CTAATTCATC 


GGGTTCTTCT 


TTGGTTAATT 


GCATTTGGCT 


TCTACGTAAA 


11460 


ATAGACTTTA 


CCCGAGCCAT 


TACTTCTAAA 


GGATTAAAAG 


GCTTGGTGAC 


ATAATCGTCT 


11520 


GCACCAGCAA 


CT7VAGCCTTT 


GATTTTATCC 


ATATCGGTTG 


TTTTTGCTGT 


TAACATAATA 


11580 


ATTGGAATTT 


GCGATTCTTT 


TCGAAGTTCT 


TTCACTACTT 


CCATTCCATC 


CATGATTGGC 


11640 


ATCATAATGT 


CCAAGATTAA 


TAAATCGATA 


TCTGGTGTTG 


TGTGCAATTT 


GGATAACGCT 


11700 


TCTTTTCCAT 


CATAAGCTTT 


CACGACTTCG 


TATCCTTCAT 


TATGAATATA 


AATACTCAAT 


11760 


AGTTCTACAA 


TTTCTTTATC 


ATCATCTGCA 


ACTAAAATTT 


TCATAAGCGT 


TCATTTCCTC 


11820 


CATATATAAA 


AAATCTCTTT 


TCCATATTTT 


AACAAATAAT 


TGCTAAAAAG 


GACAAAATCC 


11880 


TTGATATATT 


ACGATaAATT 


TAGAAGATTT 


TGCTAAAAGT 


TAATAAACTA 


TAGCCAAAAA 


11940 
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AGGATGAAAG GGGGTAAAAA ACGAACGAAA ATACCTATAC CAATCATAAT AATCGGTATT . 12000 
GAAAAAAAGT TTAAAAAAAG TG 12022 
(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 556 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 



TAAAATAATC GCAATGACTG CATCTGTAAA TGCTTCTACA CGTGTTTTCG GCATCATTTC 60 

CCCCTACTTT CATAAAAATT AAATAACTTA CCCAAAGTAA GTATAACGCA TTTCTGTAAA 120 

AATTCAATTA TTTATGAAGT CTCTCGGTTG TTGAAGTTCC TAAAAACGAG TGCAAGAGCC 180 

CCACTTAAAA GTAAGCCTCT TGCACTCGTT ATAGAACAAA TCAGTTATTG GAAGACGCCT 240 

GCCTGATAAG GTGCTAACTG CATCCGTTTC TTCACCGTAA AATCTTGATT CCCTAATAAT 300 

TGTTGCCACT TTCTGCCTAG TAATTTATTC GGCAAGTCAA ATTGGCACTT TTCTCCACTT 360 

i\AATTGACAA TCACTAACGC TGTTGCCTCA GTTGTTTTTC TTAAAAAAGC AAAAACAGAC 420 

GGATGATTGG GcGCTAATAG TTCAAACTGT CCAtCTGTAA AGACCGGATT GTCTTtCCGC 480 

AACGCAATTA ACTTTTtATA ATAATcAATA CGGATgCGCA TcCGTTtCTT GGGCAGCCAC 540 

ATTTATnnCT TGAAAT 556 



(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5998 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 



TGTTAAAAAA ACCATTTTTA CTATTTTTTT CACTTTTAGG AGCTATTTTT ATTTTAGCTA 60 

GTTGTGGCAT AGGAAAaGaT GCTGTCACAG ATACTAAGTA CAAAGTTAGT TTGCAGCAAG 120 

CTGCTGAAAT CTATGAAAAA GAAGCTGGCA ACAGCAAACC ATTAGTAAAT GTCCAATTTG 180 

ATACAGAACC AGCAAGTGAC TACAGCTATA TCTTTACTAA CGATACAGAA ACACTTTACG 240 

TGAATCCTGA AACAGGAAAA GTCACCAAAA ATACTGAAGC AAATCAACTT GGCGAAAACG 300 

AGACAGCCTT TTCAGCTGCT GAAGTCAAAG AATTAGGCGC TGTTAACGAC GTTTTAGCCA 360 

AAGCAAAAAA AGAAGTTGGA GGACTTTCTC CAC6TATTTT GACTTGGAAG TTAACCAAAA 420 

ATAACAATAA ACTTGTTTAT ACAGTAGATG TTAAAACGAC TACGGCAGAT GAAAAAGTTA 480 

CCTTAGATGC CAATAAATAA GCACAAGCGT ATAACTTATC ACGAAATAAC CAAATCTTAA 540 

AATACTCTTT GGGACAGAAA CTACGCATTA GGAAACCTTA ACATTTTCCG AAGCCGATCG 600 
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TTCTGTCCCA 


ATTTCATTCT 


TTTAAGTAAT 


AATAATTGTT 


CGTTACCCCT 


GCTACCGTTA . 


.660 


TACTTAGTTT 


AAAAGAAAAG 


ATGGGAGTAT 


CAATTCATGG 


TTTATACAGT 


TGATTTTAAA 


720 


GAAGTAGCCA 


CTATTGGTTT 


AGAAAGTTCA 


CCAGTGGCAC 


CTGCCCTTGC 


TGGACTAAGA 


780 


GCGAATGAAG 


CTCGCTATTT 


TTGGAATAAA 


TACAAACATC 


GCTTTACGAC 


CGTCCCTGCC 


840 


TCTGAAGCTC 


CAGAAACATT 


GGCCTGGATT 


GAAAAAATTT 


TATTAGAACG 


AGAATTACAT 


900 


TTCCCTTACA 


AGCCACTTGA 


AATTTCTTCT 


TTCACTTTAG 


ATGGCATAAA 


AATGGCCTAT 


960 


GTATTTTATG 


AAAACGGCTT 


AGCAGTTAAC 


GTGATGTACA 


CATTAGCTGA 


GGAAGGCAAA 


1020 


CGCGCCGTTG 


GGTTTAAACT 


TTCAGATGGT 


ATGGCAATCC 


CTGCAGAATT 


TGAAGGAAAA 


1080 


TTCAAATTTG 


CTCGTCAAAA 


ATCAAAATTA 


GCTGGAACCA 


TTCGCGGCTC 


TTATTTTGTC 


1140 


ATTAAAGGAA 


eCTATTAAAA 


AAATGCTTTT 


GCCCTTATTA 


TTAAAGGCAA 


AAGCATTTTT 


1200 


TTATTTTAAT 


TCTTCAATAT 


AGTTAAATAC 


TTGTTGTCCA 


GCAATACCGC 


CTTCACCGAC 


1260 


CGCTGTCGTG 


ATTTGACGCA 


ATGTTTTTTC 


ACGGACATCA 


CCAATGGCAT 


AAACACCAGG 


1320 


AATTTTTGTT 


CTCATTTCTT 


GGTCTGTTTC 


AATCCAACCA 


GCCTCATTCG 


TAATACCAGC 


1380 


TTTTTTAAAT 


GGTTCTGTTA 


ATGGATCTAG 


CCCCACGTAA 


ATGAACACGC 


CGTTTGCTTC 


1440 


AATTTCAGAA 


ACTTCATCTG 


TTTTGACATT 


CCGTGCTTTC 


ACACCTGTCA 


CGACCATTTC 


1500 


ATTGCCAACA 


ATTTCTTCGA 


CAACTGTATC 


CCAAACAAAG 


GAAATCTTTT 


CATTTGCAAA 


1560 


GGCACGATCT 


TGAATGATTT 


TTTGCGCACG 


CAATTCATCA 


CGACGGTGAA 


CAATCACTAC 


1620 


TTCAGAAGCA 


AATTGCGTTA 


AATAGATTGC 


TTCTTCAACT 


GCAGAATCGC 


CGCCGCCAAC 


1680 


GACTACTAAG 


CGTTTATTTT 


TAAAGAAAGC 


ACCGTCACAA 


ACAGCACAAT 


AAGAAACACC 


1740 


ACGGCCAGCA 


AATTCTTCTT 


CGCCTTTTAC 


ACCCAATTTT 


CGGTGTTCAC 


AGCCTGTTGC 


1800 


AATAATTACC 


GCTTTAGCTT 


CATAAGATTT 


GTCATCACAA 


ATCACTTCTT 


TATAGGATCC 


1860 


GTGGTCTTCA 


ATGCCCATCA 


CAATACCATA 


CGCATTTTCC 


GTACCAAATT 


TTTCTACATT 


1920 


TTCATACATT 


TTATAGGCCA 


ATTCTGGACC 


CATAATAGAA 


TCAAAGCCTG 


GGTAATTCTC 


1980 


TACCTCAGCA 


GTGTTGTTCA 


TTTGCCCACC 


AGGCGCACCG 


CGTTCAATCA 


TTAATACAGA 


2040 


AAGATTGGAC 


CGTGATGCAT 


AAAGTGCTGC 


CGTCATCCCA 


GCAGGACCGG 


CACCAATGAT 


2100 


AATTACATCA 


TACATTTTAT 


TTCCCCCAAA 


ATTTTAAGTA 


AGTTTACTTT 


ACTCCTTAAA 


2160 


CAATTTCTTG 


TCTACTTTTC 


TGCCCTTTTT 


TGACATTCGG 


AGTAGGTGTG 


TCTATTACGT 


2220 


AAGTTATTGT 


ACCACGGTTT 


T6TGTTTTCT 


AGGAGGGGGA 


AAAGAAAAGT 


TTAAAAATAA 


2280 


CGATCACAAC 


GAATCATTGA 


TTTTTAGAAG 


CCGTTGTACT 


GCCGCATCAG 


AGAATTCTTC 


2340 


TTTATGATTT 


TTTATGCATT 


CTTGTTGATT 


TTGCCCTTGT 


GGAACAAAAC 


GAAATACTGT 


2400 


AAATGATGCA 


GATAACGTAT 


CAGGATTACT 


AATCCGATGA 


AGACTTTTAT 


GCATCCGAAT 


2460 


AAGTTCTCCT 


TCTGCACAAA 


TTCTTGAAAT 


AATTTGCTGA 


TTTTCTAGCT 


CTTCTATTTT 


2520 


CACTTTTCCC 


GACTGCACAA 


GTAAAACTTC 


TTCGATTTCG 


TGATGCTTAT 


GCCAACCTTG 


2580 
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CACAGTTTTT 


CCTGGCAAGC 


TATTTATATG 


TATTTCAAAC 


TCTGGAAATA 


GAAAATAATT 


2640 


TACTTCTGTT 


CCATCTGGCT 


TTTTACGAAA 


GATGACCTCG 


TCTCTTGTTA 


AAATTTCTAT 


2700 


ATATTTCTGT 


AACATGGCAG 


GCTCCTTTTA 


ACTTTTTTTA 


TAGCCGAAAG 


CTATTCTTTC 


2760 


AAACTAAGTT 


TGAAGCTACT 


TTCTGAAAAT 


TCGAGTTCCT 


TTAAATATAC 


TTTTATTACC 


2820 


ATTTGAAAAT 


CACTTTTTTT 


AAAACAATTG 


ACTGTTCAGT 


ATCAATCTTc 


ACTAATCCAT 


2880 


CTGTTCACCA 


GAAAAAATAT 


AGTATTTCCG 


ATTATTCACG 


AACTTATTAT 


TCAACTTCAA 


2940 


AGGAAAATCA 


GTCAACATAC 


AGCCCACATA 


TCTAACATCA 


GAAACACTTC 


TACTTTTCTC 


3000 


AAGTACTGCT 


ATGGTATACA 


TTGACCTTAC 


TAGTAACATT 


ATAGCATCAA 


GTTTTAATCA 


3060 


TATGTAGATT 


TAAAAAAAAT 


GaTAGcACCC 


TAACAGaTGT 


TTTCGTTGCT 


CATTTAATTT 


3120 


ACATGATTAT 


TTTTTAGATG 


CTAATTAAAT 


GTTTGTCAGT 


AACCTGAATT 


TTTGATATTT 


3180 


TTATCCAGAT 


TACTTATGTC 


GGTTTATCAG 


TTGTGCTAAC 


AAAAAATGAT 


TGTTTTTAAA 


3240 


GATAATTTAA 


TTCATAATAT 


ATATAGAATG 


AAAAAAAGGA 


GGTTTTATTG 


TTGAAAGTTG 


3300 


CTATTTGTGA 


TGACAATCCA 


ACGTTaACCG 


AAAAAATAAA 


CACGATGCTA 


TTTAATTATA 


3360 


ACCCAAATAT 


ATTTGAAACC 


TATACCTACT 


ACAATCCTCT 


CAAGTTAGTC 


AATCAACTTG 


3420 


ACCAAGAAAA 


TTTTGACTTT 


TTTATTTTAG 


ATATAGAAAT 


GAGTGAAATG 


AGCGGAATAG 


3480 


ACTTGGCAAA 


AAAAATCCGA 


GAACGTGATA 


TTTTATCACC 


TATCGTTTTT 


TTGACGAGTT 


3540 


ACAAA6AATA 


CATGGAAGAA 


GTCTTCCAAG 


TACAAACTTT 


TGATTATTTA 


TTGAAACCGC 


3600 


tACGGAAGAT 


CGGTTGCACC 


AAGTTTTGGA 


AAAACTGAGA 


CAACAGCTAG 


AAAAAAAAAG 


3660 


AAATTACTTT 


GTATTTTCTA 


GTAACAAAGT 


TACTTATAAA 


ATACCAACCA 


AAGATATCAT 


3720 


TTATTTCGAA 


AAAGATAAAA 


GACAAGTATT 


GATTCATACA 


GTTGGAGAAa 


TCTACAAACC 


3780 


CtATATGTCA 


ACGAATCAGA 


TAAACGAACA 


ACTTGATACG 


AATTTTGTGC 


AAGTCCACTC 


3840 


TTCTTTCATA 


ATAAATTGCG 


CTTATATAAA 


AGAGTTAGGA 


AAAAATTTCC 


TATTAATGGA 


3900 


TTCTAAAGAA 


AAATGTATTG 


AAATACCAAT 


AAGTAGGCGC 


TTCAAAGCCG 


CCGCTCATAA 


3960 


AAGTATAGTT 


ATGTCCATGA 


GAGGAAAGAT 


ATAATGCTTG 


ATATAACATT 


ACGAAGCCTA 


4020 


AGTTTGTTAG 


TTTATTTATT 


TCTTATCTCT 


TATTTTTTTT 


CTAATTGGTC 


AACAGCTATT 


4080 


AAAAATCATA 


GAGAATTTTA 


TAATGTCATC 


TGGATTGTTA 


TAATCATATC 


AGCCATGGTT 


4140 


ACGATTATAT 


TATCCAATGC 


TTACATACCG 


AACATTTATT 


TACCGTATCT 


TAACTTATTA 


4200 


CTAAGTTTCA 


TTTTAACTTA 


CACTATTGCT 


ATGCTACACT 


CGGTCCCTTA 


TCTCGACTCG 


4260 


GCCGTATGGA 


CAATGACACT 


TATATCAATA 


AATTTAATCT 


GTGAAGTATT 


GTCGTTACAC 


4320 


TTTACGAAAA 


TAATTTTAAA 


TGCAGAATTA 


ACATCCTAT6 


GTAGCCCTAC 


TTTTTTCATA 


4380 


ACTTCAATAA 


CAATTACAAC 


TTTGATTGGG 


GTAGCCTGGA 


TAGTCATACT 


TAAATTTTCA 


4440 


ATAGTTAAAG 


AAGCAAAGTC 


TGAATTATCG 


ACAAACTTAT 


CATTTATTGT 


GATTTTATTA 


4500 


CCTATTCCCA 


TACTATCAAT 


AATCATTTTA 


TTTGGGCTTT 


TAATAGGAAA 


CAACAATGAC 


4560 
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AAAATTTCTG 


AGATTACTAT 


TACGATTAGT 


GTCATATTTT 


TAAACATCTG 


TGTAATTTTT • 


4620 


CTTTATAAAA 


TCACTATAGA 


ATATCAAAAA 


AATCAATATA 


ATCTTACTTT 


GAGAAAAAAA 


4680 


AATATCGAAG 


TGGAGTATAG 


AATTTTAAAT 


GAAATAAAGA 


GAAACAGAAC 


TAACGTTTTA 


4740 


AAATTAAAAC 


ATGATTTGAA 


GAATCAATAC 


TTAACAATCC 


TCGGACTTAT 


CGAAAATGAA 


4800 


GAAGTTAATG 


AAGCTATCGA 


TTATATTAAG 


AGTAGTTTTG 


ATATTTTAGA 


ACCACCAACA 


4860 


AAAACTTATG 


CTGCCGATGG 


AGTATTAAAC 


TACCTATTGA 


ATGAAAAGCT 


TGCTGAAGCA 


4920 


AGAAAGAATC 


AAATTAATGT 


TGATCATCAG 


ATATTTGTCT 


CTAAAAATAT 


AAAAATTAAT 


4980 


AATGACGTTC 


TAACTATTGT 


AATTGGAAAT 


ATAATTGATA 


ACGCCATACA 


AGCATCGAAG 


5040 


AGAATCAAGC 


CGATTGACAG 


ATATGTCAAT 


ATAATTATCA 


AGCAAGTCAA 


TAATGATTTA 


5100 


TTTATTGAAG 


TATCtAATAA 


CTATAATTCt 


GaAGAnATTT 


TTaCaAGAAA 


ACaTAGaAAA 


516^0 


AATaAAGGtT 


TAGGaATGaA 


AAACaTTGaT 


GATCTCTTAC 


AACAATTAGG 


TGGTATTCAA 


5220 


CGTCACTGGA 


CGAAAGAACG 


TAGATATTTT 


GTAACTATAG 


TTATATTTAA 


TGTATACAAA 


5280 


GGAAACAAAT 


GCGGTGATAA 


CTAATCAAGT 


AAATCACCGC 


ATTTTTTAGT 


ATACTTCCAT 


5340 


GTCGCATAAT 


TTATCAGAAA 


ATTTCAAAAA 


TAACCAAACA 


AGCATCAATC 


CCATAATCGT 


5400 


TGAAAATAGG 


AACGTTATAG 


TTGGAAAATT 


CATAAGACCC 


ATAGCAGAAA 


AATTCGAATA 


5460 


TATTAGCATT 


AGTACCACTG 


ACGTTATTAT 


TGAAGCAATT 


TCCGATTGTT 


TCCAAATTGA 


5520 


AATTATTTCT 


GACAATATCA 


AAATTACGAA 


TAGTAGACTA 


CAACTTGTAA 


CTATGGTAAT 


5580 


AAGACCAAGG 


TAAATTTCAA 


AAAATAAATT 


TGAACTATGA 


AACTTCAAAA 


TTAAGTTAAG 


5640 


CAAAACCTCA 


GTCACAAATG 


AAATAAGAAG 


CGCACTTCCA 


AAGCTAGATA 


AGATTATCAC 


5700 


GTAAAAAGAA 


CTTGTTTTTT 


TTACGAGAAG 


CTCTTTTCGA 


TCAATCGGAA 


AAAGAAACGT 


5760 


TCTATTCCGG 


TGATTGCCAA 


CATAAAATCG 


TACAAGCTCT 


TTCCTTAGTA 


ATACAGCTCC 


5820 


ATAAATTGAT 


ATGCCGCACA 


TAGAAACAGT 


GTTCGATAAT 


GCTATAATTC 


CTTGAAATGA 


5880 


CGAGATCGTA 


TCAGCACTTT 


CATCAAATAT 


TCCCACAAAT 


GAGAAAAmCA 


TAAAAAATAA 


5940 


CATTTGGATT 


ACCAGTGTAA 


TTAAGAAAAC 


TTTAAACCAA 


CATTnTAATG 


GnnCTAAT 


5998 



(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1641 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

AAATCCGTAG'CAACATTTAT GGAGACAATT ACAGAAAAAT GCGGTGAAGC ACATCAAGAT 60 

TGGGCAAAAA ACTTTCAGGG CGGnCTTTGC CAATACTTTA CTAAC/^CAG TGAAACGTGC 120 

AAGAAGATGG TACTACCTTT TTATTAACAG GCGATATTCC AGCAATGTGG TTGAGAGACT 180 
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CAA.CAGCACA 


AGTTCGTCCC 


TATTTAGTGA- 


TTGCnAAAGA 


AGACGAAGAC 


CTTGCTCAAA 


240 


TGATTGCTGG 


TTTGGTTAAA 


CGCCaTTTCG. -tTATATTTGC 


ATTGATCCTT 


ATGCCAATGC 


300 


GTTTAATGAA 


ACAGATAATC 


ATGCAGGCCA 


TCAAACTGAT 


AAGACAGAAA 


TGAATGGCTG 


360 


GATTTGGGAA 


CGGAAATATG 


AAATTGATTC ATTATGTTAC 


CCGGTCCAAT 


TAGCCTATTT 


420 


ACTTTATAAA 


AATACTGGGA 


TGACCGAACA ATTTAATTCA 


GATTTTGTGG 


AAGGCGTCAA 


480 


AAAAATTCTC 


AACGTTTTTA 


CAACAGAACA 


AGATCACGCA 


CAGTCACCTT 


ATTTATTTGA 


540 


ACGAGATACG 


TGGCGTCAAG 


AAGATACGTT 


GGTTGAAGCA 


GGGAAAGGAA 


CACCAGTTGG 


600 


CAAAACAGGG 


ATGACATGGT 


CAGGCTTCCG 


TCCTAGTGaT 


GATGCCTGCC 


AATATGGTTA 


660 


CTTAGTTCCT 


TCTAATATGT 


TTGCAGTCGT 


AATTTTAGGC 


TATATTCAAG 


AAATTTTCTC 


720 


TGATGTTTTA 


GAAGATGCTA 


CGATTGGGGC 


GATTGCTAAA 


AAATTACAAG 


AAGAGATTGA 


780 


AGAAGGCATC 


CAAACATTTG 


GACGAACAAA 


AAATCAAAAT 


AACGAAACGA 


TTTACGCGTA 


840 


TGAAGTGGaT 


GGCTTAGGCA 


ATGCAAGCGT 


GATGGaTGAC 


AGTAATGTTC 


CCAATTTAGT 


900 


AGCAGCCCCT 


TATCTGGGTT 


ATTGTTCAAC 


AGAGGATGAA 


CAATATTTAA 


CAACGCGTCA 


960 


AACCTTATTA 


AGTAAAGAAA 


ATCCTTACTT 


CTATGAAGGC 


AAATACGCGA 


AAGGGATTGG 


1020 


TAGCTCGCAT 


ACACCAGAAA 


ATTACGTGTG 


GCCGATTGCT 


TTAGCAATGG 


AAGGAATGAC 


1080 


AACCAAAGAT 


AAAGCAGaAA 


AAGAACGAAT 


TTTAGATTTA 


TTAGTTGCCA 


CAGATGCGGG 


1140 


GACTCATTTG 


ATGCATGAAG 


GTTTTGACGT 


AGATAATCCT 


GAAAATTATA 


CAAGAGAATG 


1200 


GTTTTCATGG 


GCTAATATGA 


TGTTTTGTGA 


ATTAGTTATG 


GATTATTTTG 


ACATCCGAGT 


1260 


GGAAAAATAG 


GAGGAA7VAGT 


AATGACAAAG 


AAAAAAGTCT 


ATATTGTTTC 


CCATAGTCAT 


1320 


TGGGATCGTG 


AATGGTATCT 


GCCTTATGAA GrACATCATA 


TGCGCTTAAT 


TGAATTAGTG 


1380 


GATAATGTAT 


TAGATTTAAT 


TGAAAATGAT 


CCAGAGTTTA 


ACAGTTTCCA 


TTTAGATGGA 


1440 


CAAACGATTA 


TCTTAGACGA 


CTATTTACAA 


GTTCGCCCAG 


AGAAAAAAGA 


AGCCGTGAAA 


1500 


AAAGCTGTCC 


AAGCAGGCnA 


ACTAAAAATC 


GGACCATTTA 


GTATTTTACA AGATGCTTTT 


1560 


nAATTAGCTC 


TGAATCCAAT 


GTCGAACATG 


CTGATTGGGC 


TTAGAAAGCn 


AAATnGGGGC 


1620 


ACAGTCAATT 


AGGTACTCCC 


G 








1641 



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8033 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

AGAGAAGGGA TCATATGTCA GCAAAACATA ATCATGAACA ACACCAGGAA AAACAGCCAA 60 

CAATGTCCCA CGATAAAATG AATCACATGA ATCATGATAT GGCGCATGAA CAAATGGCTA 120 

TGTCACACCA ACATGAAGGA ATGAAGGGAA TGGATCATTC AATGCATATG GGGAATTTTA 180 
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AACAAAAATT 


CTGGCTGTCA 


TTGATTTTAG 


CTATTCCGAT 


TATTGTTCTT 


TCACCAATGA . 


.240 


TGGGTTTTCA 


ATTACCTTTT 


CAATTTACTT 


TTCCAGGTTC 


TGATTGGTTG 


GTCTTAATTT 


300 


TAGCAACTGT 


GCTATTTTTT 


TATGGTGGGC 


AACCATTTTT 


AAGTGGCGCT 


AGAATGGAAT 


360 


TACAGCAAAA 


GAGTCCAGCA 


ATGATGACGT 


TAATTGCGAT 


GGGGATTTCG 


GTATCATATT 


420 


TTTACAGCTT 


ATATGCGTTT 


TACATGAATC 


ATTTTACGAA 


TCAGGCCCAC 


GTGATGGACT 


480 


TTTTCTGGGA 


ATTAGCCACT 


CTGATTGTTA 


TTATGCTTTT 


AGGCCATTGG 


ATTGAAATGA 


540 


ATGCTATTTC 


TAATGCTGGT 


GATGCCTTGA 


AAAAAATGGC 


TGAACTGTTG 


CCTGACACAG 


600 


TCAAGCGAAT 


GACTGAACAT 


GGGGAAGAAG 


AAATTCCTTT 


GCAAGATGTC 


CAAGAAGGTG 


660 


ATCGCTTGAT 


TGTTCGTTCG 


GGAGATAAAA 


TTCCGACAGA 


TGGTAAAATT 


TTGAAAGGAA 


720 


GCACAACAGT 


CGACGAATCG 


ATGGTTACAG 


GGGAATCAAA 


AACAGTAGAA 


AAAAACATTG 


780 


GTGACTCCGT 


GATTGGTGGA 


GCTGTCAATG 


GCAACGGAAT 


GATTGaAATT 


TCGGtAACTG 


840 


GCACGGGTGA 


AAATAGTTAC 


CTGTCAAAAG 


TTATGGAGAT 


GGTTAAACxnA 


GcGCAATCGG 


900 


AAAAATCaAA 


ACTAGaATCC 


aTTTCAGACC 


gTGTAGCCaA 


GTGGTTATTC 


TATATTGCCt 


960 


TATTCGTAGG 


TGTGTTAGCA 


TTTATCGGTT 


GGTTAATGGC 


GACGAAAGAT 


TTGTCGCTAG 


1020 


CATTTGAACG 


AATGGTTACA 


GTTTTTATTA 


TTGCCTGTCC 


CCATGCGTTA 


GGCTTAGCAA 


1080 


TTCCTTTAGT 


GATTGCTAGA 


AGTACCTCCA 


TTGCTGCTAA 


AAATGGCTTA 


CTTTTAAAAA 


1140 


ATCGGAATGC 


ATTAGAACAA 


GCCAATAAGG 


TTGAATATGT 


GTTGCTGGAT 


AAAACTGGAA 


1200 


CTTTAACAGA 


AGGACAATTT 


ACAGTTACTG 


GCTTAGAATT 


AATGAGTAAG 


CAGTTTACTA 


1260 


GAGAAGAAGC 


ACTTAAATAT 


ATTGGGGCAT 


TGGAAAAAAA 


TGCAAATCAT 


CCGTTGGCAA 


1320 


TTGGGATCAT 


GAACTATTTA 


AtAAACAAGC 


CgTTCAACCT 


TATGAAGCCC 


ATAATCTGCA 


1380 


AGCTTTATCA 


GGCGTAGGAC 


TGGTTGCTAC 


TGTTCAAAAC 


CAAGAAGTCA 


AAATAGTCAA 


1440 


TGAAAAAGAA 


GTTGCACGGT 


TACAGTTAAC 


CTTCGATGAA 


ACAATTAAAA 


CAAACTATCA 


1500 


AGAGCAAGGA 


AATACCTTGA 


GCTATTTAAT 


CATCGCTGGT 


CAATTAGTCG 


CACTGCTTGC 


1560 


TTTAGGAGAT 


AAAGTCAAAC 


CAGAAGCTAA 


AACATTTATT 


GCAGAGTTAC 


AAGCGCAAGG 


1620 


AATTACGCCA 


GTCATGTTAA 


CAGGGGATAA 


TCAAACAGCC 


GCCAGTGCAG 


TTGCTAATTA 


1680 


TTTAGGAATG 


AAGGAATACT 


ACGCAGAGCT 


CTTACCAGAA 


GATAAAGAAA 


AAATTGTGCA 


1740 


ACAATATCTT 


ACTGAGGGAC 


ATCAGGTCAT 


GATGGTAGGA 


GACGGTATTA 


ATGATGCACC 


1800 


AAGTTTAGCC 


CGTGCTTCAA 


TCGGCATTGC 


GATAGGCGCT 


GGTACAGATG 


TAGCGATTGA 


1860 


TTCGGCAGAT 


GTTGTACTGA 


CGGATAGTGA 


TCCTAAAGAT 


ATTTTAAGGT 


TCTTAGATTT 


1920 


AGCCAAACAA 


ACACGACGTA 


AAATGATCCA 


AAATCTTTGG 


TGGGGAGCTG 


GGTACAATAT 


1980 


TGTCGCCATT 


CCTTTAGCGG 


CTGGGGTTCT 


TGCACCAATT 


GGGATTGTTT 


TAAATCCAGC 


2040 


TGTAGGTGCT 


GTCTTGATGT 


CATTAAGTAC 


AATTATTGTA 


GCAGCAAATG 


CTATGACACT 


2100 


GCaTATTTcA 


AAAAAATAAT 


TCGATaTAAT 


TGTaGAGAGC 


GGAAAGTAAT 


CAATACTTTT 


2160 
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TGTTCTCTAT 


TTTTTTCTTC 


TTAAAAAAAT 


TTTTGCCTTA 


TTTGACGTAT 


TCTTATwACa . 


2220 


AGGGAGTGaA 


AGGGGACGAT 


ATATGGaTTT 


CaAGGaATTA 


TTGgTCTTTk 


GAmCATAAAA 


2280 


ATATGTTGGA 


ATTGTTGGGA 


TTTACTGCCA 


ACAAGAGCAG 


ACATTTTCTG 


TTTTGGATGG 


2340 


CTGCCAGAAA 


ACTAAAGGTG 


AGCGACAAAA 


CCTCCATTGT 


TACATTCAAC 


GTTTTGAAAA 


2400 


AAAGCAGGAA 


ACGTTTAAGA 


TAGAAGGGCT 


TTGTAAGATT 


TATACAACGA 


CCAAAGGAAT 


2460 


TGTAAAATAT 


CGTGAATTAT 


GCGTTTGTGG 


CTTGGCTCGA 


TTTAGACAAA 


AATTTTGTTA 


2520 


TTTTATTCCC 


GAATTTTATA 


TTTTACGTTG 


TCTAATTGAA 


GAGCGTUVTCA 


ATTATGCACA 


2580 


ATTGACCCAG 


GTATTGGAAT 


TACAGGAATC 


AACTCTTCGG 


AAGAAATTTT 


CTAATATTCG 


2640 


ACGTTGGCTA 


GTTAATTTTG 


ATATTATTGT 


ACGTCAAAAA 


CATTACGATT 


TAACTGGTAA 


2700 


TGAATGGCAA 


ATTCGACAGT 


TAATCCTTTG 


TTTTTATTTG 


TTTTTCCAAG 


AAAGCTGTCT 


2760 


GGAAGAAAAC 


AGAGAAATGA 


CTCGGAAAAT 


CATTACCTTT 


TTTGAGTTGG 


ATTTAAATGT 


2820 


AGCACAACAG 


AATCATCTTA 


GTTGGCTAAT 


TTATATTTGG 


GAGAAACGAT 


ATAGAGGCGG 


2880 


GCATGAAATT 


TCAGTGCCAA 


ATGCTAATTT 


ATTTCAACAA 


ACAAGTGCTT 


TTTTTTATTT 


2940 


ATTTCGTGTG 


GAAGTGTTGA 


GCACTTCTTT 


TATGTCTCTT 


AAAGAACAAA 


AAGCGTTATT 


3000 


TGTGATTCTT 


GAAGCACATT 


TTGGTGGTTG 


TTTTGGCAAG 


AGAGCGCGAA 


AGTATTTTAT 


3060 


TCATGAGCAA 


ATGAAAATAG 


AAAGTTTATG 


CTTAAAAACA 


GCTATTTTTA 


TTATGAAAGA 


3120 


AATTAGGAGA 


AATTTTACAC 


AACATCATTT 


TAATTATCAA 


GAAATTCATT 


TATGTCGATT 


3180 


TTTAAGTACG 


CATATGAATA 


GTTTATTAGA 


CGGACAAGCA 


TGGTTGCCAG 


CTCATAAACA 


3240 


GGAACAAACA 


CTCGCTGCTC 


GTTATCAACA 


AACCTGGCAC 


AGATTACAAA 


AACTTATTAG 


3300 


GCTTTTAAAA 


CGCCTATATC 


CAGTCTTTAC 


TTCAGTAAAA 


GAACGGGAAT 


TGACGAGTTG 


3360 


CTATTTTTAT 


CACATCCTAG 


ATTTATTTAA 


CCCGATTTTA 


TATGAAAAAA 


AATATATTAT 


3420 


TTGCCTATTG 


ACGGACTTTC 


CTCCAGAAAA 


GGAACAAGCG 


TTGGGGCAAT 


CTATAAAAAG 


3480 


CTATTTCAGT 


GAAAAAAAGA 


ATATTACAAT 


TATTCATGGA 


AAACCAACGT 


ATCAACTTCA 


3540 


CCAAGTACAC 


CTCTTAATTG 


TCAATCATTT 


GTTTCAGATG 


AACGTAGCGC 


TTTCTTCTAA 


3600 


GACAGTGGTT 


TATTTGCCAG 


AGGAATTATC 


TCCTGCTTTT 


TTTGAAAAAG 


TCGAAGCGAA 


3660 


TCTATCGTAA 


GAAGTCCAAT 


TTTGTCTAAA 


TAGTGTTAAA 


ATGAAGAAGT 


AAGAGGAATG 


3720 


GATGAGGAGG 


ATATTAAAAT 


GAAACAACT6 


cAATTTGGTA 


CAAGTGACGA 


AACAGTTTCT 


3780 


TCGGTTATCT 


TAGGATGTAT 


GCGCTTAAAT 


GGTGCTGAAA 


ATCCTCAACA 


GGTAATTGAA 


3840 


ACTGCTTATG 


ATCATGGCAT 


TACATTTTTT 


GATCACGCCG 


ACATTTATGG 


TGGCGGCGAC 


3900 


TGCGAAACCA 


TTTTTGGTAA 


GGCCTTAAAA 


GAAAGTACCA 


TTCGTCGTGA 


AGATATTTTT 


3960 


ATACAAACGA 


AATGTGGGAT 


TCGTCAGGGC 


TTTTTTGACT 


TTTCAAAAGC 


ACATATTTTA 


4020 


GAAGCGGTCG 


AAGGGAGTTT 


ACAGCGTTTA 


GGTGTAGATT 


CAGTCGATGC 


ATTATTGTTG 


4080 


CATCGTCCAG 


ATACTTTAGT 


GGAACCAGAA 


GAAGTAGCTG 


AAGCTTTTCA 


TTTATTAGAA 


4140 
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AAACAAGGGA AAGTTCGCTA TTTTGGTGTA 


AGTAACCAAA 


CGCCAGGACA 


AATTGAACTT . 


4200 


TTAAAAACAG 


CTGTTAAACA 


ACCGTTATTA 


GCCAATCAAT 


TACAATTTGG 


CATCAAGCAT 


4260 


ACTGGTATGG 


TAGATCAAGG 


CTTACAAACA 


AATATGGAaA 


TATCAGGAAG 


CATTGATTAT 


4320 


GATCATGGTA 


TTTTAGATTA 


TTCACGGTTG 


AAACAAATGA 


CTATTCAAGC 


ATGGTCACCG 


4380 


TACCAATATG 


GTTATTTTGA 


AGGTGTCTTT 


ATTGGAAATG 


AAAAATTCCC 


TGAGTTAAAT 


4440 


CAAAAATTAA 


GCGAATTGGC 


TGAAAAATAT 


CAAACAACGC 


CAACTGGTTT 


AGCCAGCAGC 


4500 


TGGATTCTGC 


GCCACCCTGC 


TAATATGCAA 


GTCATCGCAG 


GAAGTATGAA 


TCTTGGTAGA 


4560 


ATTGAAGAAA 


TTGCTAAAGC 


AGCAGACATT 


GTGATTAGTC 


GCGAAGATTG 


GTACGATATT 


4620 


TATCGTGCAG 


CGGGAAATGT 


TTTGCCGTAA 


AAAGATTCTC 


ATACTTGTGA 


AGATCCAGCG 


4680 


TGTATTGCTG 


ATTGATTTTG 


GTATTTGATA 


TAGTTGTCAC 


TAAGGAACTA 


TGTCAAATAC 


4740 


CTTTTTCTTT 


CATTCTGCAA 


TCGCCGTCTA 


CTGCAAATTC 


TGTAGTTTTT 


TGTTAAAATA 


4800 


GAAAGGAAAC 


TAGTTTGCTA AAGATGAGGG 


GAAAAAAATG 


ACCAAAAAAA 


AATTATTACT 


4860 


CGTTGATGGA AACAGTGTAG 


CCTTTCGTGC 


ATTTTTTGCG 


CTACATAACT 


CATTAGAACG 


4920 


ATTCAAAAAT 


AAGAACGGCT 


TGCATACAAA 


TGCTATTTAT 


GCATTTAACA 


ATATGTTTGa 


4980 


AAATGTAATG 


cAAAAAGAAA 


TGCCTACTCA 


TGTTTTAGTC 


GCTTTTGATG 


nCAGGGAAAn 


5040 


CCACATTCAG AACaGrATTT 


TATTCaGAAT 


ATAAAGCCGG 


ACGTTCAAAA 


ACACCGGGGG 


5100 


AATTTAAAGA ACAAATGCCG 


TATATTCGTG 


AACTTTTAGA AGGTTTAGGC 


GTAAAATATT 


5160 


ATGAACTACC 


TAACTATC4AA 


6CAGATGATA 


TTATTGGTAC 


ATTAGnCCAA 


CAAAGTTGAT 


5220 


AAAGATGAAT 


TTGATGTTGT 


CGTGTTATCA 


GGGGACCGAG 


ATTTGACGCA 


ATTAGCGACG 


5280 


GACACTGTCA 


AAGTAGATAT 


CACTGTTAAA 


GGAGTTAGCG 


ATATTGAATC 


ATATACGCCT 


5340 


GAGCATATTG 


CTGAAAAATA 


TGATGGGTTA 


ACGCCTAAGC 


AAATTATTGA 


TATGAAAGGG 


5400 


TTGGCTGGCG 


ACACGTCGGA 


TAACATTCCA 


GGTGTAACGA 


AAATCGGTGA 


GAAAACCGCG 


5460 


ATTAAGCTGT 


TAAAACAATA 


TGGCTCAGTT 


GAAGGGGTTT 


ATGAGAATAT 


CGATGAAATG 


5520 


AAAAAAAGCA AAATGAAAGA AAACTTGATT 


AATGATAAAG 


AGCAAGCCTT 


TCTTTCTAAA 


5580 


CGTTTAGCGA 


CAATTGAAGT 


CAATGCACCA 


GTCGAAGTCA 


ATGTGGaAGA 


CTTGGcATAT 


5640 


GAAGGGAAAA ACTTAGaAAA ATTGGTTCCT 


TTTTATAAAG 


AAATGGACTT 


TAAACAATTT 


5700 


TTAGCTAAGT 


TGGACATTAC 


AGAAGAACCT 


GTTGAAATGG 


AAGATATTTT 


ATTTGAAGTA 


5760 


GTCGAAGATC 


AGTTAACCAA TGAAATGTTT 


ACAGACGATA 


TGGCTCTTTA 


TGTTGAGATG 


5820 


ATGGAAGATA ACTATCATAC GTCACCGATT 


GTCGGCCTTG 


CTTGGGGCAA 


CAACAAGAAG 


5880 


ATTTATACCA 


CGAATAATTT 


AGCTGTTTTT 


GAAAGTCAGC 


CATTTATTGA 


CTGGTTGATG 


5940 


GATGAGACAC 


GTAAAAAAAA 


TGTCTACGAT 


GCaAAACGGA 


CGTaCGTTGC 


ACTAAATCGC 


6000 


TATGTAGGAA 


AAATGACAGG 


GATTGCCtTT 


GATGTTTTAT 


TGGCTGCTTA 


TTTACTTGAT 


6060 


ACCAATGACA 


ATAACGCTGA 


TATTGAAGGT 


GTAGCACAAC 


ATTACGGATA 


TGATGCGATT 


6120 
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CAATCTGATG AAGCCATTTA TGGAAAAGGG GCCAAAAAAG GCTTGCCAGA GGACGAAGAA 6180 
GTTTTCTTTG GTCATTTAGC ACGTAAGATT AAAGCAATTC AATTTTTAAC AAGCAAATTA 6240 
GATAGCGAAT TAACAGAAAA AAATCAAGCT GATTTATTCT ATAAAATGGA ATTACCACTT 6300 
TCTCGTATTT TAGGCGATAT GGAAATAACT GGTATCCGTG TTGATGCCAC ACGCTTAAAA 6360 
gAAATGCAAG TAGAATTTTC AGAACGCTTG AAAGAAATCG AAGAAAAAAT CTATGCAGAA 6420 
GCTGGCGAAG AATTTAATTT GAATTCACCT AAACAATTAG GCGTTATTTT ATTTGAAAAA 6480 
ATGGGCTTAC CTGTGATCAA AAAGACGAAA ACTGGCTATT CAACAGCGGT AGATGTTTTA 6540 
GAACAACTAA AAGAACAAGC ACCAATTGTG GCCGATATTT TAACTTATCG TCAAATTGCC 6600 

AAAATTCAAT CGACCTATGT TGAAGGACTT TTAAAAGTAA TTCAGCCAGA TAATAAAATT 6660 

CATACTCGGT ATGTACAAAC ACTGACaCAA ACAGGACGAT TAAGCTCAGT AGATCCCAAT 6720 

TTACAAAACA TTCCTATCCG TTTAGAAGAA GGCCGAAAAA TTAGAGAAGC GTTCGTACCA 6780 

AGAGAGGATA ATTGGTTGAT TTTCTCTTCT GATTACTCTC AAATTGAGTT ACGTGTGCTC 6840 

GCACATATTT CTAATGATGA GCATTTAAAA GAAGCATTTG TGGAAGGACA AGATATCCAT 6900 

GCCAGCACTG CAATGCGAGT TTTCGGTGTT GAAAAAGCAG AAGATGTCAC ACCTAACATG 6960 

CGTCGTCAAG CTAAAGCGGT GAATTTTGGA ATTGTCTATG GTATCAGTGA TTATGGCTTG 7020 

TCTCAAAACT TAGGCATTAC TAGA7UVGCAA GCGCAACAAT ATATTGACAC CTATTTTGAA 7080 

AAATATCCAG GCGTGAAACA GTACATGGAA GAGATTGTCC GCGATGCCAA AGACAAAGGC 7140 

TATGTGGAAA CATTGTATCA TCGCCGTCGT TATTTAAGTG ATATTAATTC AAGAAACTTT 7200 

AATCTACGTT CGTTCGCTGA ACGTACAGCG ATTAATACAC CGATTCAGGG AAGTGCCGCA 7260 

GATATTTTAA AAATTGCGAT GATTGAAATG GCGCGTCGTT TAAAAGAAGA AAAATTACAA 7320 

GCGACAATGT TATTACAAGT GCACGATGAA TTAGTTTTTG AAGTACCAGA ATCAGAATTA 7380" 

GAACAATTAA ATCAACTCGT CAAAGAAGTG ATGGAGCACG CAGTTTCCTT ACATGTGCCA 7440 

CTTATCACCG ATAGCAGTTG GGGAAAAACA TGGTATGAAG CAAAATAATA GCTTACCCAT 7500 

TAAGAACGAT GAATAAAATA AGGATAAGTA GGATCTGGAC GTTGATATCT AAGCGTTCAG 7560 

ACCTCTTTAT ATAAAGGAGC AAAAATAATG CCAGAATTAC CAGAAGTAGA AACAGTGCGA 7620 

AAAGGACTTG AAAAACTTGT TGTAGGAAAA ACCATTCAAG AAGTCATTGT TTTTTGGCCA 7680 

AGAATTATTG AATCACCAGA AGTTGATGTA TTTCAAGGTC AATTAGCTGG TCAAACGATT 7740 

GAAGGGATTG AGCGAAGAGG AAAATTTCTG ATTTTTAAAT TGTCTGATAA TGATATGATT 7800 

TCTCACTTAC GGATG6AAG6 AAAATATGAG TTTCATCAGG CTGATGATGA AATTGCCAAA 7860 

CATACCCATG TAATGTTTAC TTTTACAGAT GGCACGCAAT TACGTTATTT AGATGTACGG 7920 

AAATTTGGGC GTATGACATT GGTTCCTAAG AATCAAGGTC ATCAATATAA AGGGATTTTA 7980 

GCTTTGGGCC CAGAGCCAAC ACCAGACGTT TTCCAATTAG CTACTTTTCA ACA 8033 
(2) INFORMATION FOR SEQ ID NO: 122: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:- .9013 has£-pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 



GTTTGGATAA 


TCGGCGTTAC 


TTTAGCAATC AGTGCTTTTC 


TTTTAGCTAG 


CTGTGGGCAA 


60 


ACCACACAAT 


CGCCAAAAGA 


GAAGAAAGAG CTCACGGTAA 


TGACTACTTT 


TTATCCGATG 


120 


TACGACTTCA 


CAAAACAAGT 


CGTTGGTGAT GAGGGAGAGG 


TTGAGCTGTT 


GATTCCAGCT 


180 


GGAACGGaAC 


CACACGATTA 


TGAACCAAGC GCAAAAGACT 


TAGCGAAAAT 


TACTGATGCG 


240 


GATGTCTTTG 


TTTATaATAG 


TAAAGAGCTA GAAACTTGGG 


TTCCGAATGT 


TATTGAAAAC 


300 


TTGGACACGA 


AAAAAGTTTC 


AATCGTGGAA GCCAGTCAAT 


CCATTCAATT 


GATGCAAGGA 


360 


ACAGAAGAGG 


AAGAATCTG6 


CGaAGAAGGT CATGAAGGCC 


AT7UVCCATTC 


CCACGAACTG 


420 


GATCCGCATG 


TCTGGTTAGA 


TCCCGTGTTA GCGCAAAAAG 


AAGTAACAGC 


GATTCGGGAT 


480 


GCGCTTATCA 


AAAAATATCC 


TGAGAAAAAA GCCGTATTTG 


AAAAAAATAC 


CGTTGCTTAT 


540 


TTGGAAAAAT 


TGACTGCGTT 


AGATAAGGAA TACCAAGCTG 


CGTTTGCAGG 


CGCTAAAAAT 


600 


CGGACCTTTG 


TGACGCAACA 


TGCTGCATTT GGCTATTTGG 


CGA7UVCAATA 


TGGCTTGACC 


660 


CAAGAACCGA 


TTGCTGGGAT 


TTCTCCAGAC CAAGAACCTT 


CGCCAAGTCG 


TTTAGCCGAA 


720 


TTAAAAAAAT 


ATATTAAAAC 


AAACAATGTT TCTGTGATCT 


ATTTTGAAGC 


ATCGGCTTCT 


780 


ACAAAAGTAG 


CCAAAACATT 


GGCAGACGAG ACAGGCGTCG 


7UVTTAGCCGT 


ATTGAATCCT 


840 


TTAGAGAGTT 


TAACTCAAAA 


AGAACAAGAA GCAGGCGAGA 


ACTACGTGTC 


TGTCAT6AAA 


900 


GAAAACCTCG 


CTGCTTTACA 


AAAAAGCATT CATTAATCTG 


GCAACAGAAA GGAGCCAACT 


960 


TATGCATTAT 


ATTGAAGTCG 


AAAATTTAAC CTTCTATTAT 


GATGATGAAC 


CTGTTTTAGA 


1020 


AGATGTTTCT 


TATTATGTTG 


ATCCAGGGGA ATTCGTTATT 


TTAACAGGAG 


AAAATGGCGC 


1080 






AAAGTAPTTT AGGGTTATTA 


AAACCAACAA 


GTGGCAAAAT 


1140 


TACGGTCGCT 


AAAAAAAATA 


GTGCTGGTGA AAAAATCA6T 


ATTGGCTACA 


TTCCACAACA 


1200 


AGTCGCTTCT 


TTCAATGCGG 


GCTTTCCTAG CACCGTCATT 


GAATTAGTAC 


GTTCAGGTCG 


1260 


CTTTCCCAGA 


AATCGTTGGT 


TCAAACCGTT GACGAAAAAA 


GATCATCTTC 


ATGT6GAAAA 


1320 


AGCTTTGAAA 


TCAGTGGACA 


TGTGGGAAAT GCGCCACAAA 


CGAATTGGCG 


AGCTTTCTGG 


1380 


AGGGCAAAAG 


CAACGAATTA 


GTTTAGCACG AGTGTTTGCG 


ACCGATCCAG 


ATTTATTTAT 


1440 


TTTAGATGAG 


CCAACAACAG 


GTATGGATGA ACAATCGCGA 


AACGAATTTT 


ATCAATTGTT 


1500 


GCAACACAGT 


GCGCATGAAC 


ATGGAAAAGC TATTTTGATG 


ATTACGCATG 


ATCATGAAGA 


1560 


TATCAAAACC 


TATGTGGATC 


GTCAAATTCG TCTTGTCCGC 


AAAGAAGATT 


CGAAATGGCG 


1620 


TTGTTTCCAT 


ATGAGTGAAG 


AATCGTATAC GTAAATTTTC 


CAATAAAACG 


ttatctggct' 


1680 
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CGATCATAAC 


GAAAGCCTAG 


GAACGGCTTA 


AAGAAATGTG 


GGAAAAATTT 


AAAAAATAAG 


1740 


CAGGAATTAA 


AAAAATAATT 


CGCTTAAACG 


TCGCTACAAT 


GGTAGTGTTC 


ATTTTAGCGG 


1800 


TAAGGAGTGA 


ATCGTATGGC 


AGAAATGCTT 


TCTTATGCAT 


TTATGCAAAA 


GGCCTTTTTA 


1860 


GCAGCACTGT 


TTATCTCAGT 


GATTGCCCCA 


ATGCTCGGCG 


TCTTTCTAGT 


TATTCGCCGA 


1920 


CAATCTTTAA 


TGGCAGATAC 


CCTTTCACAT 


GTGTCATTAG 


CCGGTGTGGC 


ACTAGGCTTC 


1980 


TTTTTTAATT 


GGAATCCTAA 


TTTAATGACC 


TTAATTGTCG 


TGATTGTGGC 


TGCAATCATT 


2040 


CTAGAATATT 


TACGAATOAT 


TTATAGCACC 


TATTCAGAAA 


TTTCGATTGC 


TATTTTAATG 


2100 


TCAGGCGGTT 


TGGCTTTGGC 


GTTAGTTTTG 


ATGAATTTAA 


CAGGAGGCAA 


TTCAGCTGCT 


2160 


AGTATTCAAT 


CGTATTTATT 


TGGTTCCATC 


GTCACGATTA 


CGTGGGATCA 


AGTGGTTATG 


2220 


TTGGCAATTT 


TATTCGTAGT 


TTTAGTTCTA 


TTGTTTATGT 


TATTTAAACG 


TCCAATGTAT 


2280 


GTTTTAACAT 


TTGATGAAGA 


TACTGCTCAT 


GTTGATGGGC 


TACCTATTCA 


TTGGATGTCG 


2340 


ATGCTTTTTA 


ATGTAATTAC 


TGGTGTGGCG 


ATTGCTGTGA 


TGATTCCGAT 


CGCGGGAGCC 


2400 


TTGTTAATTT 


CAGCAATTAT 


GGTCTTACCA 


GCTGCAATAG 


GTATGCGAAT 


TGGTAAAGGC 


2460 


TTTAACACGG 


TGATTATTAT 


CAGTGTGTTT 


ATGGGCTTGA 


TTGGCATGCT 


AACAGGGTTG 


2520 


ACTAGCTCGT 


ATTATTTGGA 


AACACCACCG 


AGTGCAAGTA 


TTACCCTAAT 


TTTTATTGGT 


2580 


TTATTCTTAT 


TAGTCAATAT 


TTATCGCCGA 


GTGGTTGTCA 


TGGTCCAACG 


AAAACAAAAA 


2640 


AXGCAAAGAA 


ACTAAGGAGA 


AAAACGAAAG 


TTTTTCTCCt 


TTATTTCTTT 


TTTTTAAAAG 


2700 


AAATGGCGAA 


CTATTTTCAG 


TTATTTGAAA 


AAAATCCTCG 


AATTTTTCGG 


TTAAACCCCT 


2760 


TATAATAAGG 


AACGATAGAA 


TGAAAAGGAG 


TGACGAATTT 


GAAAATTCGG 


AGAA6TGAAC 


2820 


GATTAATCGA 


TATGACCCAA 


TATCTTTTAG 


ATCATCCCCA 


TACATTGGTT 


TCTTTGACAT 


2880 


ATTTTGCTGA 


CCGATACCAA 


TCTGCCAAGT 


CTTCCATTAG 


TGAAGATTTA 


GCAATTGTTA 


2940 


AAAAAACATT 


TAAAGAACGT 


GGAACAGGTA 


TTTTAGAAAC 


GATTCCAGGC 


GCTGCAGgcG 


3000 


GTgTCCgCTT 


TATCCCTGAA 


ATTCCTTATG 


AGGAAGCGGA 


CAGTTGATTA 


TGGATTTATG 


3060 


TGATCGTTTG 


TCAGAACAAG 


ATCGCTTATT 


ACCTGGGGGC 


TATGTTTATC 


TTTCAGATTT 


3120 


ACTTGGGGAA 


CC6AATTTAT 


TAAGACAAGT 


GGGCCGGATT 


ATTGCTTCTA 


AATATTTAGG 


3180 


CAAACAAATT 


GATGCGGTAA 


TGACCGTTGC 


GACTAAAGGA 


GTGCCAATTG 


CGCAAgCTGT 


3240 


GTCTTATTAC 


TTAAATGTTC 


CGTTTGTGAT 


TGTTCGTCGC 


GATTCTAAGA 


TTACTGAAGG 


3300 


TTCAACGGTG 


AGCGTGAACT 


ATGTATCAGG 


GTCTTCAGAA 


CGTATCG7UVA 


AAATGGAACT 


3360 




AbCTTAAAAC 


GCGGCTCAAA 


AGTTTl Avj 1 C 


GTGGATGAC i 


TCATGAAAGG 




CGGCGGCACC 


GTCAATGGTA 


TGAAGAGTAT 


GATTGAAGAA 


TTTGAAGCAG 


AACTTGTGGG 


3480 


CATTACCGTT 


TTTGCTGAGT 


CAAAATTCAA 


TGGCCGTCGT 


GCAATCGATG 


ACTATACTTC 


3540 


GCTATTGTAC 


GTGGAAGATG 


TGGATACACA 


AACAAAAACG 


ATTAGCGTAG 


TCCCAGGAAA 


3600 


CTATTTCACA 


GAGAAATAAT 


TTGAAAAAGA 


ACCTGAAAAt 


GGTCCATTCC 


ATTTTCAGGT 


3660 
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TCTTTTTTTC 
AATGCCTTAT 
GACGATTTAG 
ATACGCAATT 
ATTGCATCCA 
AGAACCAACA 
AGGCGAACGT 
GCAAGCACAA 
GCCGTTATTA 
AAGTGCGACT 
TGATCATGTG 
ACGTGTTCAA 
GTTAgcGnAA 
AATTTTGAAA 
AATGGGTGTA 
AAATGAAATG 
TGAAGGCGTG 
AACAGTGATT 
CGGCAATCAA 
CGATGTGGGT 
TGGTAACTTC 
AACATACGTT 
TGTTTiATTAT 
CTCTGCAACG 
TTCAACCATC 
GAATAAAGAA 
GAACGCTCTT 
GCTGCTAGTA 
TTAGTTATTT 
TTTTTATGTA 
ATATTGTCCA 
GTTTTTTATT 
TCTTTTTGTT 
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TGTTTTATTC 
TTTGTTGAAA 
AAGAAAATCA 
ATTTTAGCAG 
GTTGCTGGTA 
GAAATCGTGA 
AGTCAATATG 
GAGTTATTAG 
ACTGCGGAAA 
ATTTTAACAG 
GGCATTGTTG 
GAAATTAATA 
ACAGATACAA 
AAAGAAGGCA 
AATGATCGTG 
CATATGCGAA 
GTAATTGGTT 
GGCGAAGATT 
GTGGTTGTTA 
CCGTATGCAC 
GTGGAAGTAA 
GGTGATGCAA 
GATGGCAAAA 
AACATTGTTG 
ACAGAAGATG 
GGCTATGCTA 
TTGCTTAAGC 
TTTGTTTAGA 
TTTTCA7U\AA 
AAGTTCGTTT 
AAAATAAAAA 
TAACGGATAT 
GTTTGTTACT 



CTTTAAAAAA 
AAGACAAAAG 
AACAATTTAG 
CTGGTAAAGG 
AACCAATGGT 
CAATCGTTGG 
CCTTACAAGC 
GTGGTAAACA 
CCTTGAAAAA 
CGCATGCGGa 
AACGAATCGT 
CAGGAACCTT 
ACAATACACA 
AAGCTGTCGC 
TTGCTTTATC 
ATGGTGTTAC 
CAGACACAGT 
GCTTGATTGG 
AACAGTCTGT 
ATTTACGTCC 
AAAATGCAAC 
CATTAGGCAA 
ATAAACACCA 
CGCCAGTCAC 
TCCCTTCAGA 
AAAAGTTACC 
AAAAGAGCGT 
GACGTTCAAG 
AATAGTTTTT 
TTTGGTCAAA 
CAATTTACAG 
ATTATGATAG 
TTGACAATTG 



AGATGAGAAA ATAAATTGGT CTACCTTTTA 3720 

AGTAAGATAA ACTAGAAAAG GTATAATGAG 3780 

TTAATTTGAG GTGGCAAACG TGGAAAATAG 3840 

AACACGCATG AAATCTAAAC TTTATAAAGT 3900 

CGAACATATT TTAGATCAAG TAGAACAAAC 3960 

ACATGGGGCG GAAATGATTA AAAGCCATTT 4020 

TGAACAATTG GGAACTGGGC ATGCAGTCAT 4080 

AGGAACAACA TTAGTTATTA CAGGGGATAC 4140 

TTTATTCGAT TACCATCAAG GTAAGAATGC 4200 

AGACCCAACA GGCTATGGTC GAATTATTCG 4260 

GGAACAAAAA GATGCCAGTG AAGAAGAAGC 4320 

CTGTTTTGAT 7VATGAATCAT TGTTTGAAGC 4380 

AGGGGAATAC TATTTAACAG ATATCATTGA 4440 

TGCTTACCAA ATGGCAGACT TTGACGAAGC 4500 

CACTGCTAAT AAAATTATGC ATCGTCGTTT 4560 

ATTTATTGAT CCAGACACAA CGTATATTGA 4620 

CATTGAAGCG GGAGTCACTA TCAAAGGAAA 4680 

CGCACATTCA GAAATCGTTG ATAGTCACAT 4740 

GATTGAAGAA AGTGTGGTTC ACGAGGGGGC 4800 

TAAAGCAGAT GTGGGGGC7UV ACGTACACAT 4860 

AATCGATGAA GGCACAAAAG TGGGCCATTT 4920 
AGATATTAAT GTCGGTTGCG GCGTTGTTTT 4980 
AACAATCGTG GGTGATCACG CTTTTATTGG 5040 
GATTGGTGAT CATGCGGTGA CTGCTGCTGG 5100 
AGATTTGGCG ATTGCCCGGG CACGTCAAGT 5160 
TTATATGAAG GATTAATCTC AAATAAAAAA 5220 
TCTTTTTCCT CCTTAAAGAC TAGACCAATT 5280 
GACTAGACCA ATGCTGTTTA TTCGGTATTT 5340 
GTTCCGCTTT TCCAATATAA AAAGACTTTC 5400 
ATTACTATTT TTTTCACAAA CAAATCCGCT 54 60 

GAAAAACATC TTATTTTAAC GGAATTTACT 5520 
GAAAAGAAGA GCAGCGTGAT GGGGGAAGGC 5580 
TTTGTTGCAA ATGGGTTGAA ACACAAGGAG 5640 
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GAAATTTGTG AAAAAGTTGA GCTTTAAAAA AGTGAAGTGG GQCATGCATT TTTTAATGGC 5700 

TGTTGCGTTG ATAGCGCCAA GTGTTACTAG TACGGCATAT GCAGTAGAAA CAACGAGTCA 5760 

ACAAAGTTCA GAAGCAGTAA CAAGTACCAC CGATTCAAGT AGAAAACAAG AACCAGTCAT 5820 

TACACAGGAA ACAACAGACA TCAAACAAGA AGCACCAAAT CAGGCTACGA GTGACAGTGT 5880 

CAAGCAGTCA CAAGAAACCA CAGCACCAAC AGAGACGACG AATTTAGAAA CGTCAATCGC 5940 

TGAAAAAGAA GAAACGAGCA CGCCGCAAAA AATAACAATT TTAGGTACGT CAGATGTTCA 6000 

TGGTCAATTA TGGAATTGGT CTTATGAAGA TGATAAAGAA CTACCAGTTG GTTTGTCCCA 6060 

AGTAAGTACA GTCGTTAACC AAGTCCGGGC ACAAAACCCA GCAGGCACCG TTTTAATTGA 6120 

TAATGGCGAC AATATTCAAG GCACTATTTT AACAGATGAC TTGTATAATA AAGCGCCTTT 6180 

AGTGAATGAA AAGACCCATC CAATGATCAC CGCCATGAAT GTGATGAAGT ATGATGCAAT 6240 

GGTTTTGGGA AATCATGAGT TTAATTTTGG TTTACCGTTA ATCAAAAAAA TTCAACAAGA 6300 

AGCCACTTTT CCAATCTTGT CTGCGAATAC CTACAATAAG GAAGATGGTC TTCGTTTTGT 6360 

TGAAGGGACT ACCACGAAGG AACTTGATTT TAATCAAGAT GGGCAGCCAG ATTTAAAAGT 6420 

TGGGATTATC GGCTTAACAA TTCCGCACAT TCCTTTGTGG GATGGCCCTC GTGTTACTTC 6480 

GCTTAATTTT TTACCTTTGA AAGAAGAAGC AGAAAAAGCA GTTACTGAGT TGAAAGCTAA 6540 

CGATCAGGCT GACATTATTG TTGCCTCGAT TCATGCGGGA CAACAAAATA GTGATCCGGC 6600 

TGCCAGTGCC GACCAAGTAA TTGAAAATGT CGCGGGGATT GATGCGTATA TTCTGGGTCA 6660 

TGACCACCTT TCTTTTACCA AGCAAGGAGC AGCGCCGAAT GGAAAAACTG TACCGGTAGG 6720 

GGGACCGAAA GATACGGGGA CAGAAGTTGT CAAAATTGAT CTTTCAGTTG CTAAAAATGC 6780 

CGATAAGTGG GAAGTGCAAG AAGGTACAGC AACGATTGTA CCAACAACGA ATGTTCCAGC 6840 

AGATGAAGCA GTTAAGGCAG CGACAAAAGA ATACCaTGAA AAAACGCGAG CGTTTATTCA 6900 

GGAGGAGATC GGCACAGCAA CAGCTGATTT TTTACCAAAA CAAGAAATTA AAGGAATTCC 6960 

CGAAGCACAA TTACAACCAA CAGCGATGAT TTCTTTAATT AATAACGTTC AAAAAGAAGT 7020 

AACGGGCGCA CAATTAAGTG CGGCAGCGCT GTTTAAATAC GACAGTAAAT TACCTGCGGG 7080 

GAAGATTTCC TATGCCACGA TTTTTGATAT CTACAAATAC CCGAATACCT TAGTGAGTGT 7140 

TCCCATTAAC GGTGAAAACT TACTGAAGTA TTTAGAAAAA CAAGGGGCGT ACTATAACCA 7200 

AACACAGCCA GATGATTTGA CCATTAGTTT TAATCCAAAC ATTCGTGTAT ATAACTATGA 7260 

CATGATTTCT GGAGTGGACT ACAAGATTGA CATTTCAAAA CCAGTGGGTG AACGAATTGT 7320 

AGATGCGAAA ATTGACGGCC AACCGCTGGA TCCTGCCAAA GAATATACGA TTGCTATGAA 7380 

TAATTATCGT TACGGCGGTT TAGCTAGCCA AGGGATTCAA GTAGGGGAAC CTATTAAAAA 7440 

TTCTGATCCA-GAAACCTTAC GAGGAATGAT TGTTGATTAT ATTAAGAAAA AAGGAACTCT 7500 

TGATCCAGAA CAAGAAATC6 AACGAAATTG GTCAATTATT GGGACAAATT TTGATGAAAA 7560 

ATGGCGTGCC AAAGCAATCG AATTAGTGAA TGACGGCACT CTTCAAATTC CGACTTCTCC 7620 
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TGATGGACGT ACACCAAACG CCGCCGCTAT TACGAAACAA GATGTCCGTA ATGCGGGCTT 7680 

ATGATMAGAT AATGCATATA CCATTATGCA CACAAATGAC GTTCATGGCC GACTAGAAGC 7740 

AGGGAAAGgC GAATTAGGTA TGGCGCGTCT AAAAACCTTT AAAGACCAAG AAAACCCAAC 7800 

CTTGATGGTG GATGCAGGGG ATGTTTTCCA AGGATTACCA ATCTCCAATT TCTCCAAAGG 7860 

CGCGGATATG GCCAAAGCAA TGAATGAAGT TGGTTATGAT GCCATGGCGG TGGGAAATCA 7920 

CGAGTTTGAT TTTGGTTTAG AGATTGCACT AGGTTATAAA GACCAACTGA ATTTTCCGAT 7980 

TTTATCTAGT AATACGTATT ACAAAGATGG CAGTGGACGG GTTTTTGATC CGTATACAAT 8040 

CGTAGAAAAA TCCGGGAAAA AGTTTGCCAT TGTAGGTGTG ACGACCCCAG AAACAGCAAC 8100 

GAAAACACAC CCGAAAAACG TAGAGAAGGT GACATTTAAA GACCCGATTC CAGAAGTAGA 8160 

AGCAGTGATT AAGGAAATTA AAGAGAAGTA CGCGGATATk CAAGCTTTCG TGGTTACTGG 8220 

GCATTTAGGC GTAGATGAAA CGACGCCGCA TATCTGGCGT GGTGATACGC TAGCAGAAAC 8280 

CCTTAGTCAA ACATATCCTG AGTTAGATAT CACTGTGATT GATGGACATT CGCATACAGC 8340 

CGTCGAAAGT GGCAAACGTT ATGGCAAAGT GATCTATGCT CAAACAGGTA ATTATTTAAA 8400 

TAATGTTGGG ATCGTCACAG CACCAGAGAG TGAACCAACT AAGAAAACAA CAAAATTGAt 8460 

TTCagcAGCA GAGCTGCTAG AATTGCCAGA AAACCCGGCA GTTAAAGCCA TCGTTGATGA 8520 

AGCACGTACG AATTTTAACG CTGAAAATGA AAAAGTAATT GTCGATTATA TTCCATTCAC 8580 

ATTGGATGGA CAACGAGAAA ATGTGCGCAC ACGAGAGACC AACTTAGGGA ATTTGATTGG 8640 

TGATGCGATT ATGTCATATG GCCAAGACGC GTTTAGCCAA CCTGCTGATT TTGCAGTAAC 8700 

TAATGGTGGC GGCATTCGCG CTGATATTAA ACAAGGGCCA ATTAAAGTTG GGGATGTCAT 8760 

TGCTGTGTTA CCTTTTGGCA ATAGCATTGC GCAAATTCAA GTAACCGGCG CCCAAGTTAA 8820 

AGAAATGTTT GAAATGTCTG TTCGTTCGAT TCCACAAAAA GATGAGAATG GCACAATTTT 8880 

ACTAGATGAT GCTGGCCAAC CAAAACTTGG CGCAAATGGT GGTTTCCTAC ATGTTTCAAG 8940 

CTCCATTCGT ATCCACTATG ATTCCACAAA ACCAGGTACT CGCTTGGCTA GTGACGAAGG 9000 

CAATGAAACA GGACAAACGA TTGTCGGTAG TCGCGTATTA GGAATAGAAA TTAAAAATCG 9060 

GCAAACACAA AAGTTTGAAC CATTGGATGA GAAGAAACAA TACCGGATGG CTACCAATGA 9120 

TTTCTTAGCT 6CTGGTGGTG ATGGTTACGA TATGCTAGGT GGTGAACGAG AAGAAGGGAT 9180 

TTCACTAGAT TCTGTCTTAA TTGAATACTT GAAAAGTGCA ACCAGCTTGC GGTTGTATCG 9240 

TGCAGCAACG ACGATTGATT TAGCACAATA TAAAGAACCA TTCCCAGGCG AACGAATTGT 9300 

TTCTATTTCG GAAGAAGCTT ACAAAGAGTT AATCGGTGGA GGAGAGACGC CAAAACCAGA 9360 

TCCAAAACCA GACCCGAAAC CAACACCAGA AACACCAGTA GCAACCAATA AACAAAACCA 9420 

AGCGGGAGCA- AGACAGAGCA ATCCATCCGT AACAGAGAAG AAAAAGTATG GCGGCTTTTT 9480 

ACCTAAAACG GGTACAGAAA CAGAAACGCT TGCATTATAT GGTTTACTGT TCGTTGGACT 9540 

TTCTTCTTCT GGCTGGTATA TTTATAAACG ACGTAACAAA GCTAGTTAGT CATTTTTTAA 9600 
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GAACCGTCcT TCGGGATTGG TTCTTTTTTT ATTTTAATGC TTGCCATTTG TTTTTAATTT 9660 

TAGTATTATA AGTlATATtCT AAAAATTAAA AAATGAAGAG TTTTTTCTGT GAAGGGAGAA 9720 

GGGCTGAAAG TAAACGTATA AAGGACTTAT CGGGTTAAGT TTACAACGCT GTTAGTTTAG 9780 

CTGCCTGTGG TGGTGATGGG GGACCTGGCG CCG 9813 
(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32768 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

CATTGGG)\AA AGTGGAAAGC CTTAAACCTC GCAATCGCCC AAGGTTTACC AGCTGAACTC 60 

GGCATCACTA AGCCgAAAAT TGAAAGCTGG AcCAACGGCT GGAATTTACG CAgTCATTTC 120 

TGGGCTGCGT ATCGCAGTGA GCAACGTCAA GCGGAAAATG CTTGTTTGGC GTTGTTATTA 180 

AATAAAAAGC AATTTCAAGT GTATTTGATG TTTCAACATT ACAAAAGTGA GGAGCGGACA 240 

GGCTCAGTCG TCGCTTACAA TCAATTGCTT AATCGTTTAG AAGCTTGGAG TCAAACCATC 300 

GACTGCGAAG GCTATTATAT TTGGCCGCAG gAGGAGCATG AACTAGTGGA CCATTTGCCT 360 

TTAAAAGATT ATTTAGCAGA TGAAAAACAA CGGAAAATTC TAGAAGAGCG GATAGGCAAT 420 

CGCTCATTTC AGATCGGTAA GTTGTGGTTT GCGACGGAAA AGCTTGAAAC GATTGAAGAA 480 

AAAAGTTGTC AGGCGCTTCA AGAATTAGCG CCGTTGTATA ACGCATTATC TGAAAAATGA 540 

ACGAAACAAC GTCTCTGACC AATTACCTGG TCAGAGACGT TGTTTGTGTT TTTATTGATT 600 

TAAATCTGTT AGTAGTAGCT CGTTAGAAAC AGCAAAAGTA GTAACTTCTG TTAAAGCTGG 660 

ATCCAAAGTG TATTTTGTAA ATGAGGTAGT ACTTTCTGGT TGGACATACA TGGTTAAATT 720 

AGTTACATCA TATCCTGACC AATAATGTGT GAAATCAGAC AGAGAATTAC TTTGACCTAA 780 

ATGGCCAATT CCCCGAGGAA TCATCACTGT TTGGAAGGCG CGATAGAGTT GCATAATCCC 840 

TTCAGAGCTA TCAAATTCTT GGAGATTGTC AGAATAGTAT AACGCACGAA TGAAGCGTGA 900 

CGGCGAAGTG AAGTCTCCTG GGATTCCTAA TAGTCCGTAA GAAGTCCCAC CTTCAATTGG 960 

GTCAAGTGTT GCGCCTGATT TTAAAGTGTA AGGTTCCTTA ATAGCATAGT TGCGCATCCC 1020 

gATATAGTTT CGTGCATTGG TTAAATGATA ATTGTATTCT GGACTGTTTG TGACTACGCC 1080 

AATCGAATCA TAAATGACGA ACGCTCCGTT ATCTGTTGGT TGCAAAACGA TACTATTATT 1140 

ATTTTTATCA TTAATGGTCC AGTGTAAGGT AAAATGAACT TCCCCCAATG AATCAAGTTG 1200 

GAATTTTTGA TCTAACAGGC CGATTTTTTC AAAAGCTACT TTTACTTCGT CAACGCTTTC 1260 

AAAATTACTC AAAATATAAG CAAGTACTTC TTCTGCAATA TGAGGTGTTA AACCAC6ATC 1320 

TGCTAAACTT TCAGCAGGTG CTGTTGAAGA TTCCATTAAA TATTGAATAT CACCAGATAA 1380 

GCCGGCTTCA TTGATACCAT CTGTGATAGC AAGGCTGACG GTTTTACCAT CGTTCGCTGG 1440 
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TTGATCTGTG 


CCTGACATTG CTAGTCCCAT 


AAAGGCATAT 


TTTGCCGTCC 


AAGGATTTAG 


1500 


TTGGCTGTTT 


AGAACTGTAC CTTTTGGAAA 


TTGAGCAATT 


AGAGTAGGGA 


TTTTTGGTGC 


1560 


AATTGGATCT TCGTTGCCAA AAAAGTCAAA CGTAAAATCC ATCGTACGTC CATAAAAAAT 


1620 


ATCATTTGTT 


TTGGAAATAA TCTTGATGCC 


TGTACACATA ATAGTTTCCC 


CAAATTAATG 


1680 


TAAGTTTGTG 


AAGCCTGTTT ACTAGTTAAG 


CATATCATAC 


TCTAGGAAAC 


AGCGGTGGCT 


1740 


AAATTCCTAA 


AATATTAGAC AATTTTGTCT 


AAATGAGGTG 


AAAAAAATTT 


TTTAGCGATT 


1800 


TCCTTCACAA 


AACACCTTAT TATCTTTCCG 


AAAATATGAT 


ATCTTTAGAA 


TGGAAATTTC 


1860 


ATGAAAAGAG 


GATGCG7UVTG TTTAAAAAGT 


ACACACAAAT 


GATAAGGGAT 


GAATTTAAAG 


1920 


GGTATAATCA AAAAAAAATG AAAAAAGATT TCTTgGCaGt 


TTGACTkTCG 


CAGCCGTCGC 


1980 


GTTGCCATTA 


GCGTTAGCGT TTGGGGTCTC 


TAGCGGTGCT 


ACTGCAGCAG 


CTGGCATGAT 


2040 


CACGGCGATT 


ATCGCTGGAA TTGTGATTGG 


GAGCCTTTCG 


GGTGGGTTTT 


ATCAAATTTC 


2100 


AGGACCAACA 


GGTGCGATGG CAGCTATTTT 


AATGTCAATT 


GCGGCAGTGC 


ATGGCATGCA 


2160 


AGGGATTTTG 


TTGGCCACCT TTTTGGCGGG 


GATTTTCTTG 


CTTTTGGCAG 


GGATTTTGCG 


2220 


CTTAGGGAGC 


TTGACTTCTT TTATTCCGTC 


ACCGGTCATT 


ACCGGGTTTA 


CTTCAGGAAT 


2280 


TGCTATTATT ATTGCGTTGG GGCAGCTTGA TAATTTCTTT GGGGTTCgTT CAGAAGGACA 


2340 


CAATTTAATT 


GAGAAAATCA GCAGTTATCG 


TTCATTGGGC 


TTTGATTTAA 


GTATACCAAC 


2400 


GACGGTTATG 


TCGATTTTAG TGGTGCTTGG 


ATTAGTGTTC 


TTTCCGAAAA AGTGGAATGC 


2460 


GATGATACCA GCTTCATTGG TGATGATTAT 


TTTAGCTACT 


TTAGCAACAG 


TGGTTTTTCA 


2520 


ACTGCCAGTG 


GCCACTGTAG GGAAAATTCC 


GACAAGTATT 


TTTAGTGATA 


CACGTTTGGA 


2580 


ACTTGGTTCA 


CTTCAATTAA GTGCGTTTCA 


GGATATTTTG 


GTGCCTTCGA 


TTAGCATTGC 


2640 


TATGTTAGGT 


ATGATTGAAA GTTTGCTTTG 


TGGCGCTTCG 


GCTGGTCGCA 


TGACAGGGAA 


2700 


ACCATTAGAC 


AGTAATCAGG AACTAGTGGC 


ACAAGGAATT 


GGCAACTTAA 


TTTTGCCATT 


2760 


TTTCGGTGGC 


GTGCCTGCGA CGGCAGCGAT 


TGCCCGGACG 


AGCGTAGTGA 


TTAAATCTGG 


2820 


TGCGCAAACA 


CGTTTAGCGG GTATTTTTCA 


TGCAGTCTTT 


TTATTGTTAT 


CGATGTTGCT 


2880 


TTTATCGCCA 


GTCATGTCTA AAATTCCCAT 


GCCTGCGTTA 


CTGGTGTTTT 


aGTGGTGACA 


2940 


GCGTGGCGAA 


TGAATGAATG GTCGGTGATT 


AAAGAATTCT 


TCCAGAAACG 


TTTCCACACA 


3000 


GCTTTATGTT 


TATtCTTCTT AACGATGGTA GCGACCGTTA 


TTTTTGACTT 


GAGTTTAGCG 


3060 


ATTGTtTTAG GGGTCTTAGC GGGTTGCCTG TTTTyCATTG CGAAAAGTGC CGTGATTACG 


3120 


ATTAATGTCG 


AGGAAGTTGA TTGGTCAAGA ATGGGGCTAG 


TACCGACCGA 


TTGTACGGAA 


3180 


AATTGGGTGG 


TAGTTTATTT GAGTGGCCCG 


TTATTTTTCA 


TGTCTTCTGA ACGATTGAAA 


3240 


CAAACGTTAC 


TGCAATTGGA AACAAAAAGT GGTGTAGTAT 


TTTCAATGCG 


AGGAGTTCCT 


3300 


AGCATTGATA 


GTACGGCGTT AGATATTTTA GAAGAATTTT 


GTCTCGCAGC 


AGAAGGTCGT 


3360 


GGACAACAAG 


TTCAATTTGC CGCTCTGCAA CCAGAAGTTG AAAAAATGAT 


TCGAACCATT 


3420 
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CAAGGAAATC AAGAGAAAGA ATACCATPTC TCAGTCGCAG AATTTTTACA AACGGTGCAT 
GTTGCAGAAC AATAATTCAT TTTCAAATGA AATGTTCTAT CTTCTATCGC GGGCTTAATG 
GGTGAAGAAA TAAGAAAAAT AAGTGGGGAC TAAAATGAGT TTCTAGAGGA GTTCCTAAAA 
AATAATAAAG AGAAAACTTC TCAGTATGAA GTTTTCTCTT TATTATTTTT TTGGTTATTT 
ATTGAATTAC TTCTAGCATA ACGTCTTTTT GATCATCCGT AACATTGCCA GCGTAGTTAG 
GAGTGATTAA AGATATCGAA CCATTTTTAT TGTTGAATAA GTATAAAGTG CCCGATCTAT 
TTTGTTCATC GTTTGAAGAA ATATTAGTTC CTAAAATTAA TTCTGTATTT ACTTTAACGG 
TGCGAATACT GTTATCGGCC GCTGAAAATA TTCTAATCTC TTTTGTAGGA ATTGTATTTA 
TAGATAGCTC ATATTTATTA ATGGCTTCTT TCACTTGACC ATTTTCAGCC CCAAACAATT 
TAGTTCGGAA GGTAGCAGTT GGTGTTGAAT TTAAGTTCTC TAACGTAATT GAAGTTGGCA 
CATTCATTCC TTTAAAATTA AATACGAGTG GATTGTTTAA GCTACTAAGA TCGACAGCAT 
AAGGTGTTGC TTCTAAAGAA TTTGCTTCTT CGACATTTGT AGATACTTCA TTTTTCGACT 
TTGACGAAGT TTGAGTGGTG CTGGCACTTT CTGAAGTTTC TTTTAGTTTA TTATTAGAGC 
TTGTTTCTAT GCTAGAACTG TAATGATTAG TACTGCTTTT TATTGATGTT GTATGATTGG 
TAGAATTTTC TTGTTTTTTA GGTTCATTCG CTTTTTGAGA ACTACAACCT GTTAAAAAAA 
TAACTAAAAT AACACAAACT AATTTTTTCA TACTATTCCT CCTATAGACG CTATAACTAA 
TTTTATCTTA CTTGTTAATA TAACAAAAGA AATAATTATT TTATTTTCAC GAATGCTCAA 
ATTGTTTAAA TCTGTAGTTT TTATACTTAT TTTTGTTGCA TTTGCAACAA AAATAAGTTA 
AAGTAATGAC AATCATAAAA TATTAGCGAA AGGAAGTGTC CTAGTGACAG ATATTTTAAG 
AGAAATTGGC ATGATTGCGC GGGCCTTAGA TTCAATCAGT AATATTGAAT TTAAGGAATT 
ATCTTTAACA CGAGGACAAT ATCTTTATTT GGTCAGGGTC TGTGAAAATC CTGGGATTAT 
CCAAGAAAAG ATCGCCGAAC TGATTAAAGT TGATCGAACC ACAGCGGCGC GGGCGATTAA 
GCGTTTGGAA GAACAAGGCT TTATTTATCG TCAAGAAGAT GCAAGTAACA AAAAAATTAA 
GCGGATTTAT GCCACAGAAA AAGGGAAAAA CGTTTATCCA ATCATTGTGC GCGAAAATCA 
ACACTCGAAT CAAGTTGCTT TGCAAGGCTT GTCAGAAGTG GAAATTTCAC AATTGGCAGA 
TTACTTAGTC CGGATGCGTA AAAATGTCAG CGAAGATTGG GAATTTGTCA AAAAAGGGAA 
CACTCGAAAT TATTAAGGAA AGGACGACTG ATTGATGGGA GAAATAAAAC AATTAACAAT 
AGAAGATGCA CCAGCGTTAC AAGAAATCAG CATTGAAACT TATACAGATA CTTTCGGACC 
ATATAATACC CCAGAAAATA TGCAAGCCTA TTTA6TAGAA GCCTACAATT TAGAAAAATT 
ACAAAAGGAA TTGGCTAATC CCCACTCTGA CTTTTATTTT ATCTTTGTCG ATAATCAATT 
AGCTGGCTAC ATGAAATTAA ATCGTGAGAC GGCACAAACG GAACCGATGG GGCCAGAAAA 
GCTCGAGGTT GAGCGATTGT ATATTTTGCC TGCGTTTAAA GGaAAGAAGC TGGGGACACA 
ATTATTAGAA CTGGCGGAAG AGAAAGCTCG TGAATTTGGT AAAAAAGCAC TGTGGCTAGG 



3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 
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TGTCTGGGAA CACAATGATG 


CGGCGCGGGC- ATTTTATAAA ACGATTGGCT ACCGTTATTA 


5460 


TAGTCAACAT 


TCGTTTTTTA 


TGGGAGACGA 


TGAACAGACG 


GATTTTATTT 


TGTTAAAGGA 


5520 


ATTGTAGCcA AGAAAATTTA 


GAAACAAAGA TTTCTGATAA 


AATCAGAAAT 


CTTTGTTTAT 


5580 


TTTTTTGACT 


AAAATTAAGC 


ATTTTTTTAT 


TTTCAAAATG 


TTATTTTTAA AAACGTAAAA 


5640 


ATTGAAAAAG 


AAGATTTTGA 


ATCTGTGAAA AGTTGTCGAT 


CTTGTGAGAA AACTCTTTTG 


5700 


TCAGTGTCTT 


TTTTCAGGAT 


GGGTCTCTTC 


CTGAAAaTCC 


ACTTCAGTTT 


ATCTGTTTTG 


5760 


TGTGTGTTTG 


ATTAAAAAAT 


GTTGTGTTGC 


ATTTCATTTA 


ACTGTATAAT 


TTaTTTATGT 


5820 


CATTTTTAAT 


AATAAGCGTA 


TTTTAGCATT 


TGamAAGAGA 


ATTTGTACAT 


AATATGTGAA 


5880 


TAAAAAGATT 


TGTTTCTTAC 


TTTCTAGAGT 


GAGAGGACGA 


TATAAAAAAG 


GTTGCTATTT 


5940 


TTTCGTTCAA 


AGTGAACAAA 


TAATCGTTTG 


TTATTTAGTT 


GGGGTAACTA 


ATGATTATTG 


6000 


TTATAAAAAG 


CGCTTACAAT 


TCGTGTGATG AGAGTAGTAC 


GCATGTTTGT 


GCAAAAGTGA 


6060 


CAAAAAAGAG 


AAGGCAGGGT 


GTTTTAATAG AAAAAATTTT 


TTAATAAAAA 


GGAGAGTTTG 


6120 


TCGTGGAGAA 


AAATGTGTCT 


CAAGTAACAG 


CTCAAGAAGA AACAGCGCTG 


AAAGCACGTG 


6180 


GCTACAATGA 


GGATTTGTTG 


CCAAGTAGCC 


CGAAACAACG 


GACGATGGGC 


GCGCGCAATT 


6240 


TTTTTACATT 


ATGGATGGGT 


TCGATTCATA ATATCCCGAA TTATGCAGCA GTGGGCGGCT 


6300 


TTATTTTCTT 


AGGTTTATCG 


CCGCTGCAAG 


TTATGCTGGC 


AGTTGTGTTA 


AGCTCATTCA 


6360 


TTGTAGCAAC 


TTTCATGAAT 


TTGAATGGCG 


TGGCTGGCTC 


TAAATATGGC 


ATTCCTTTTG 


6420 


CGATGCACTT 


ACAGTCTACT 


TACGGCAGTC 


TCGGGGCAAA ACTGCCAGGG 


TTCTTAAGAG 


6480 


GTTGTGTGGC 


CGCGATTGCT 


TGGTTTGGTC 


TGCAAACATT 


TACAGGATCG 


CTGGCTTTAT 


6540 


TAATTATTCT 


CGGGAAATTT 


TGGCCGAACT 


TTTTAG/IAAT 


TGGTGGATCA 


TTCCAGTTTT 


6600 


TTGGTCTCAG 


GCTACCTGAA 


TTAATGGCGT 


TTACTCTTTT 


CTGGCTCTTA AATGTGGCAA 


6660 


TAGGTTTCGG 


TGGTAGCAAA 


ATTTTGAACC 


GGTTTACCGC 


CATTTTAAGT 


CCGTTGATTT 


6720 


ATGTCGTTAT 


TATTGGATTA 


ACGATTTGGG 


CCATTCGAGC 


TGGGGGCGGT 


TTAACACCAA 


6780 


TTTTGTCCTA 


TCAAGTTTCT 


GGTGCGATTC 


GTTCAGTCAA 


TCCACTGGTG 


GCGTATCTGA 


6840 


TTATTTTTAA 


CTCGGTTGTG 


GCCGTTTGGT 


CAGCGCCTGG 


TGCGAGTGTG 


GCGGATTTTA 


6900 


CTAAGAATGC 


GCGCTCGACA 


CGTGCTCAGG 


TTGTGGGGCA AACAACAGGT 


TTAGTTGTTG 


6960 


GTTATGGGAT 


TTTCGCTTTT 


TCCAGTGTGG 


TTATTCTGTT 


AGGTGGTTCG 


CTGTATTTTG 


7020 


GGATTCAAGA 


ATG6AATATT 


TTAAACATTA 


TTGATCGTTT 


AGATAATGTG 


GCAGTGGTGG 


7080 


TCTTAGCCAT 


GTCGGTTTTC 


TTATTAACGA CTATTTCAAC 


GAATGCAACG 


GGGAACATCA 


7140 


TTCCGGCAGG 


TTATCAATTG 


GCGGCGCTAT 


TTCCTAAAAA AATGACGTAT 


AAAAAAGGCG 


7200 


TCATGATTGC 


CAGTGTTATC 


AGCTTTCTCA 


TTATGCCTTG 


GAAACTGATG 


GAAAACGCCG 


7260 


ATAGTATTTT 


TATTTTTCTG 


AACGCAATTG GGGCTGTTTT 


AGGACCAGTC 


GCTGGTGTGA 


7320 


TGATTGCGAA 


TTATTATTTT 


GTTCAAAAGC AACAAATTGA TTTAAATGCT 


TTATACGTAG 


7380 



wo 98/50555 PCT/US98/08985 

743 



ATAAACACAA AAAAGAGGAA GCCAATCCTT' TTTACGGATT AAACAAGCCG GCGTATGTTG 


7440 


CTACTATTCT GGCGTTGGTT CTTTCCTTAA GTGGTCAATT TATTCCACAG GTTAAAATCA 


7500 


TTGCTGATAT 


TTCGTGGTTT GTGGGCTTTG CCACAGGTTT TGTGCTGTAC TTAGTGTTAA 


7560 


AAAAATGGAC 


TTGGGATTCA AAAAAAGTGA AAGAAACAGC TTATCAGGAG GGAAAATAAA 


7620 


TGCAAGCAGA 


GTTAGTGATT 


AAAAACGGGT TAGTTATTTT AGAAACTGGA GAAGTCATTA 


7680 


CGGATGTCGC 


CGTTCAAGGA 


GGAAAAATCG TAGCGATTGG GCAAGACTTG TCAGGAGAAC 


7740 


GTGTGATTGA 


TGCGACGGGT 


TTAGTCGTTA GTCCTGGGAT GGTGGATGCT CATGTTCATA 


7800 


TTACCGATCC 


TGGTGGTGGT 


TACCGCGATG AATGGGAAGG GTATGTCACA GGAACGGCCG 


7860 


CTTGTGCCAA AGGGGGCGTG ACGACCTTTA TGGAAATGCC TTTAAATCAG ATTCCAGCAA 


7920 


CTGT6GATAA 


GACCTCTCTT 


GAGATCAAAT ACAAGGCAGG AGAAAATAAA CTAAAAGTAG 


7980 


ATGTAGGGTC 


ATTTGGGGGC 


GTGGTCCCGA CCAACTTGGC TGATGGGATT CAGGAACTTG 


8040 


ATGAAGGAGG 


CGTTTCTGGC 


TATAAATGCT TTTTGGGAAC GTGCGGCGAT CGTTCAATTG 


8100 


AAGGAGATTT 


TCAAAATGTC GACGATTATT CATTATATGA AGGAATGAAA CAAGTCGCCA 


8160 


AAACTGGCAA AGTGCTAGCA ATCCATGCAG AAAATGCGCC AATTACGGAT AAATTAGGCG 


8220 


CGGTAGCTTA 


TCAAAATGGC 


GAAACGACTT TAGCGGCATA TGTAGCTACA CGTCCTGTGT 


8280 


TTACCGAAGT 


TGAAGCGATT 


CAAAAAGCCA TTTTATTTGC CAAAGAAACA GGTTGTCGGA 


8340 


TTCATATTTG CCATGTGGCT TGTCAAGT^G GCGTGGAAGA AGTTTTGAAA GCCCAAGCAG 


8400 


AAGGGGTAGA 


TGTGACTTGT 


GAAACATGTA CGCATTATTT ATATTTTACC ACGGATGAAC 


8460 


TTGATGCCAT 


TGGTCCAGTT 


GTCAAATGTT CACCGCCGAT TCGTGACGCG GACCAACAAG 


8520 


CCGCCTTATG 


GAACCACGTG 


CAAACAGGCG GCATCGCGTT TGTAACCTCT GACCATTCTC 


8580 


CTTGTACGCC 


AGACTTAAAA 


GATACAACCA ATGCTTTTGA AGCTTGGGGC GGCATTTCTG 


8640 


GTGTTCAAAA 


TAACGTCGAT 


GTGTTATTCG ATGAAGCCGT CCAAAAACGT GGTTTATCCT 


8700 


TGAAACAATT 


CGCGGATATG 


ATTGCAGCAA ACCCAGCTGA CCGTTATCAC TTAGCACAAA 


8760 


A6GGCCGCAT 


CAGCATTGGT 


AAGGATGCCG ATTTTGTACT AATTAAACCT AATGCGCCGT 


8820 


ATATCTTAAA 


AGCAGAAGAT 


TTAGAATACC GAAATAAAAT TAGTCCTTAC ATTGGTCGTG 


8880 


AAATTGGGGC 


CCAAGTGATT 


CAAACAATTT TAAGAGGCGA AACAATTTAC GCACAAGAAA 


8940 


CTGGGGTAAC 


CGAGGCGTTT 


AACGGCGTAT TTATCAAAAA TTAAACACCT ATTTTAGAAA 


9000 


TGAAAAAGGA 


CAGAAGCGTG 


ATTGAATTAG CGCCTCTGTC CACGAGTTAG AGGATGTTTG 


9060 


TTGGTTGTTT 


TGCACAAAAA AAGGCTTAAT TTTGTTTGAT AAGTACAATG AAGATGGCTA 


9120 


GGGAAACGTT 


TACAATCAGA 


GTGTAATCAG TCAGGAGGTT TTGTGATGGA CTTGGAAAAA 


9180 


GTTTTAAAAC AAAGAATCGA AGAGTTGTCA GCAATTGGGA GTGATCCTGC AGGAGGGATG 


9240 


ACACGCTTAC 


TTTATACAGA 


TTCGTGGTTG GCAGCACAAA AGTATGTGCA AAGCCAAACT 


9300 


GAGGCGTTTG 


GACTGGAAAC 


GCAATTTGAC GAAGTCGGGa ATCTGTTTTG TCGAGTGGCA 


9360 
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GGAACGGAAT TTCCTCAGGA AACGATTTTA ACTGGGTCAC ATATCGACAC AGTAGTTAAT 9420 

GGTGGGACCT TAGATGGTCA ATATGGTGTC ATTGCTTCCA TGACAGCTGT TCAATATTTA 9480 

TTAGACAAAT ATGGGAAACC ACGGCGTTCT TTAGAAGTGC TTTCCATGGC TGAAGAAGAA 9540 

GGGAGTCGTT TTCCAACGGT TTTCTGGGGA AGCAAAAATG TCGTTGGTGA AGCCAAACGT 9600 

GAAGAAGTAG TCGATATCTC TGACAACGAA G6TAAAAAGT TTGTTGATGA GATGCATCGC 9660 

CAAGGCTTTG ATTTTCGGCA AGAGCAAACT AGACGTGAGG ACATTGAAGC ATTTGTCGAA 9720 

ATTCATATTG AACAAGGCAA TATTTTAGAG AACGAAAAGC TCCAAGTCGG TGTTGTTCAT 9780 

AGTATTGTAG GGCAACGTCG CTACACGGTT AACTTAAAAG GGCAAGCCAA TCATGCAGGT 9840 

ACTACGCCAA TGAGTTACCG TCATGATGCG GTGTACGGCT TTGCTAAAAT TTGTGCAGAA 9900 

GCCATTGATC GTGCGAACGA AATTGGGGAT CCGTTAGTTT TAACATTCGG CAAAGTCATT 9960 

CCGAAACCCA ATACAGTCAA CGTGGTGCCT GGAGAAGTGG AATTTACAAT TGACTGCCGT 10020 

CATACCGACG CTGCTTTTTT ACGACATTTT ACAGGTGAAT TAGAAGAACG GATGAAGACA 10080 

ATTGCTCAAG AATTAGGCTT GACGATTGCG ATTGATCGCT GGATGGATGA AGCCCCTGTA 10140 

CCGATGAATC AAGCAATTGT AGAAGTTATT GAACAAAAAG CCAAAGAAGG CAAGTATCAG 10200 

TATCGTATGA TGCATAGTGG TGCTGGGCAT GATTCGCAAA TTATTGCGCC TCATTACCCG 10260 

ACAGCGATGA TATTTGTGCC AAGTATTGGT GGGATTAGTC ATAATCCTGC AGAAGCAACG 10320 

GCATTTCCAG ATTTGGTTGA AGGTGTCAAG CTTTTAGCAG ATACCTTATA TGAATTAGCG 10380 

TATAAATAAG ATAGAAAAGG AAGAATTGTG ATGGGTTATA AAAATAATCG AGTTGGTTAC 10440 

CAAAAAGAAT TGTTAACATC AAGAGCTGTG ATAAAAAAAG ACAATTACGC CATTATTCCC 10500 

CATGATGGTT TGGTGCAAAA TGCAGTACCT GGTTTTGAAA ATGTCGATAT TTCCATTTTG 10560 

GGATCACCTA AATTAGGTGC GACATTTGTG GATTACATTG CTACGTTTCA TAAAAATGGT 10620 

CAACAAACAA CTGGTTTTGG TGGCGACGGG ATCCAAACGT TGGTTTATGT CATTGACGGT 10680 

CGTTTGCGCG TTTCAGATGG TCAAGAAACT CATGAACTTG AAGCCGGGGG ctACGCCTAC 10740 

TTTACGCCTG AAATGAAAAT GTATTTAGCG AACGCACAGG AAGCGGACAC AGAAGTTTTC 10800 

TTATATAAAA AACGCTATCA GCCGTTGGCA GGCCATCAAC CTTATAAAGT GGTTGGGTCA 10860 

ATTCATGACC AACAACCCGA AGAATATGAA GGAATGACAG ACGTGTTACT TTGGAGTTTA 10920 

TTGCCgAAAG AGTTTGACTT TGATATGAAT ATGCATATCT TATCTTTTGA ACCAGGTGCT 10980 

AGTCATGCCT ACATTGAAAC GCATGTCCAA GAGCATGGCG CGTATTTAAT TAGTGGCCAA 11040 

GGGATGTACA ATTTGGACAA CGAATGGTAT CCAGTGGAAA AAGGGGACTA TATTTTCATG 11100 

AGTGCCTATG TGCCACAAGC AGCCTATGCG GTTGGTCGTG AAGAACCGCT AATGTACGTT 11160 

TATTCCAAAG ATGCCAATCG TGAACCAGAA TTATAAAGAA AAGCCGAATG TAAAGTAATG 11220 

ACCTCGGAAC AATAGGATTT GAACACGCAG tGgTGAACGG TAGAGTCCTT CTTCTTGGTA 11280 

GACGTTACTT TTATTCGGCT ATTTTCCTGT TAATTATTTT TCTGAAAGTT CTAACGCTAC 11340 
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CCGTAAGAGT AAACTCGTCT CAGGCTCTTG- GATAGAGGTG CCTAATAATT TTTCACATTG 11400 

ATTCATGCGA TATTTAATGG TGTTTCTGTG AAGGTAAAGA GCGTTGGCCG TTTTAGTAAT 11460 

TTCGCAATTG AAATCTAAAA AAACTTTCAA GGTTTTTCGT AATTCTTGTA ACGTCGGTTC 11520 

CGTCGGATAG GCCAACTCTT TCAATTGTTG CTGGCAAAAG TATTCAACAT CTTCAGTGCC 11580 

AATCTTTTCA AAAAGTCCAG CTAGGCCTTT GGGATGGAAC AATTGCACAG TCGCTGGTTT 11640 

TTGAGCGTGT AAAGAAGCCT CTAATGTCGA GGAAGCTTCA ATATAAGAAT TAGGCAAATC 11700 

TTCTAGGTTT TCATAAGCAT TGCCTAAAGC AAAGCGAATG GTAATAGGCA AAGCCTGTTG 11760 

CAAACGTTCA GCGAGGTTTT GCAAAATCAT CA7UVTGGTCA TTTTTTTTAG ATTGAAAAAT 11820 

CAAGATACTT TTATTTTGAT TTTTTAATTT AAACAAGGCG ACATCTGGCA GGATTTCCGG 11880 

CAATTGTTCT TTGAGCCATT GGAAAATTAG TTGCCCTTCC GCTTGCTGAT AACGAATATG 11940 

GGTTTCATTT TCTGGTTTAG TCACGCCGTA GACAATCGCT AATTGATAAT AATCACTGTT 12000 

GATTAAACGA TAATTTTTCC AAAGTTCCAA CCAATTTTGA TGTTTACTCA GCGCTTCTTG 12060 

ATGTGTATCA AGCAAGCGGT CCAAAAAGTC TGTTTTGAGA TGCTCAAAAG ATTCAGCAAT 12120 

TTTCTGGTTT TTAAATAACA TAAAAGTCAA AACTAAAATG GCCTGATCAA TGGCAAATTC 12180 

AGAAATAGGA TAAGGAATCT GTTCAGGAGA TAAGACCAAT AGATAATAGG GAAAATAATC 12240 

GTTCACACGA ATTGGAAAGC CAGCTACTTG AATGTTTTCT TGTCGCTCAT CCTGTAAACG 12300 

AAAGATTTTC TTTTCTTGTG CCAATTGTTG AAAGTGGTCT TTTGATAATT GTTCGATGAA 12360 

AAACTCCGCC GATTTTTGAT TGCCATAAAA ATAATTCGAG TGGGCAATCA CCTGTTGCCA 12420 

AGGACTTAAT AAAATAATTG GCGCATTTAA AATTTTCGCA AATTCAGCGA TGAAGCGCGT 12480 

AGCATCAACA TCTTGCATTA ATAATGTTGA AAAACGCTTT TGTATGTCAA ATGCTACAGA 12540 

CATCTGTTCG GTTTTGCTAT CACGJ\AGATA GCCTACAATT TCATGCAACA ACCCGCCTAG 12600 

TGGCTGTGTG CTAGGAATTT TAATCAGGGG CAAATCAACG GCACTGGCAT AGGCAACAAT 12660 

TTCTGGTGAA ATTTCGTCAA GGAACCGCCC AACTTTGATT CCTAAAGCCG TACACTCTGC 12720 

TTGTTTTAGC GAGTCAATAA AAGGTTTCAA TTTTTCTTGA TCATCTTTAT AAATCATTGC 12780 

CGTTGTTAAA ATCATGACAT TTTTAGGGAT AAAATCAGCC ACATCAGGTG TTTCCGTGAT 12840 

TTCTACACTT TCCAAGGGCT GTGTTAAGTT TGAGTGAGAC GATAATAATT GTAAGTCAGA 12900 

AAATCTTGGT TU^GCCAAGA CCTCAGCTAA TATTTTCATC ACAACAACCT CCTTTTTTTA 12960 

TAGTATAGCG AAGAATAAAA AGGAGTACAA AAGATAGTTA TTGAATATGT CTAAAAAACA 13020 

AAGTCAAAAT AGTGCAAATT GAACAATGAA TTTCTGGGTT TTTCCAAAGA TAATAAAAGA 13080 

GAAGAATTAA AAATGAAGTG AGGGATTACT ATGTATCAAC AACCAAATTT TGTACGGCGA 13140 

ACAATCATGA CGCCGGGACC AGTCGAAGCC CATCCATCGG TACTTCGTCA AATGGGTCAA 13200 

CCAATTTTAG GGCAATTTGA TCCA6AATTT CTACAGATTA TGGACGAAGT TCGGGAAATG 13260 

ATTAAAGTAC CTTTTGCCAC AAAGAATCAA CAAGCCTTTG CCATTGATGG CACGTCACGT 13320 
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TCAGGCTTGG AAGCGGGGTT ATTTGCGTTG- ATTGAGCCAG GCGACAAAGT GCTTGTGCCT 13380 

ACGTATGGCC GGTTTGCTTA TTTATTAGGT GAAATTTGCG AACGCGCCCG GGCCGAAGTC 13440 

ATTTATTTAG AAAAGGACTG GCTCGCACCG TTTGAACAAA CAACGGTGAT TGAAGCGATT 13500 

AAAGAACATC AACCAAAAAT TGTTGCGATG GTCCATGGCG AAACGGCGAA TGGACAAATG 13560 

CAACCACTGG ATCAAATTGG CGCCTTTTGT CGGGAAAATG AAATTTTCTT TGTAGTAGAT 13620 

ATGGTGGCGA CTTACGGCGG TGTTGAAACA AAGGTCGATG ATTGGAAGAT TGATATTGCG 13680 

GTAGCAGGCA CTCAAAAATG TGTCAGCGTA CCTTCAGGCC TTTCTTTAAT TACCTACAAC 13740 

CAACGTGTAG CCGATTATCT GGCGGGCCGT TACCAAAAAG AGTTAGGGCT TGGCGCGGAT 13800 

GCGCGGAATG AACGGTTCAT TCAAAGTAAC TACTTGGATT TATCGCAATT AGAAAAATAT 13860 

TGGGGAGCAG AGCGCTTAAA TCACCATACA GAAGCGACTA CAATGATTTA TGGGCTACAT 13920 

GAAGGGCTGC GCTTGTTACT TCAAGAAGGC ATGGAAAATG TCTATGCACG CCATCGGAAA 13980 

AACAATCGTA TTTTAGTTGA AAGTCTGCAA AAAATGGGcT TAGAAATTTT CGGGAAGTTA 14040 

GACACAAAAA CACCAACAGT CATTCCTGTG GTCATTCCAG AAGGAATTGA TGGTGAAAAA 14100 

GTGCGGAGCT TATTACTGGA TCATTTTAAA GTTGAAATTG CCTCTTCcTT CGGTGATTTA 14160 

AAAGGAAAAA TTTGGCGGGT CGGTAATATG GGTTACAGCA GTCGCGAAGA CAATGTGTTG 14220 

CATTTCTTAA GTGCCTTTGA GACTGTATTG AAACACCAAG GCTATCAATT TGAAGGCGGT 14280 

AGCGGTAGTA CTCATGCATT GGCTGAGTAT TTAAACTAAA AAAGAAGGTA GGACAAAGGT 14340 

CACTACGTAA AAAGCGTAGT GCTTTTGTTC CTTGTGAAAC GTTAACCAAA AAAATGATTG 14400 

GTGGAGGAAA AGAAAATGAC AGACAAGACA AAAAGTTCGT TGGCTTATTG GAAAAAAATC 14460 

GTTATTTTAC TATGTATGGG CTGGGTAACA ATCTGGATTT ATCGTTCCGC GCTTTCTCCT 14520 

GTTTATCCAC AAATTCGTGC -GTCTCTAAAT GGTAACGCTT CCGATACGGC CCTAGGGTCC 14580 

ATTTCTAGCT TTTATTTTCT CGGTTACGTA GTCATGCAGA TTCCTGCAGG CTTCTTGGTA 14640 

GACAAAATTG GGAAAAAACG TGTATTAATT CCAGGCTTTA TCTTGTTTGC GCTGGCAGCA 14700 

TTACTGATTG CCCAAGCCCA AACGATTGGA ACAATTTACT TAGGTAGCTT ATTGGCAGGC 14760 

TTAGGATGTG GTTCTTTCTA TGGCTCGGCT TACTCGTTGA CGTCGCAAAA CATTCCTCAA 14820 

GCCAAAAAAA GTTTTGCAAC AGCTATTGTC AACAGTGGTT CAGCCGTTGG TTCAGGTTTA 14880 

GGGATGATTT TATCTAGTTA TTTAGTTGCC CAAAAAGGAT TACCTTGGCA AGTGATGATG 14940 

TACATTTCCG CCTTTATGAT TATTTGTATG TTAGTGGCTT TTCAAGTGAT TATTCGTAAT 15000 

CATAAAGAAG ACATGGCCTT 6ATTCAGCCA ACCGTTGTTG ATGAAAAAGA TAACGGAGCG 15060 

GTAAAAGAAA AAGTGCCCTT TAAAAAACTG TTTGCGCCAC ATATGCTATT TGCGTATATT 15120 

TTATATTTTG GCACTTGTTA TGCATATTAT ATGATTGTGA CTTGGTTACC TAATTTTTTA 15180 

AGTACCGAAC GTGGTTTCCA AGGAGCAGCA ATAGGGTTAT CCTCTTCACT GGTCGCATTT 15240 

GCCTCTATTC CAGGAGCGTT ATTTTTTAGT CGGTTGGCGG ATAAGTACAT GCATAAAAAA 15300 
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GTTCAATTCA TCGTTGTTTT AGAATTTTTA GCAACAGCCA TGCrCTTGTT AACTGTACAA 
GTAAGTAATG CGACGTT6TT ATTAATTGCT TTAATTATGT ATGG3TTCTT ^^GAAAATTA 
GCTGTAGAAC CGATTATTAT TTCTTGGCTA GGTGAAAATG CACC^AAAAT TGGAATTGGC 
ACTACTTTGG GCGTGTTTAA CTTTTTCGGC ATGATGTCAT CTGT3ATTGC CCGAGCTTTA 
ACTGGGAAAA TTTCTGACAC AACGGGTTCG AAAGTCATGG GC^-TATAT CGCCGTCGTG 
ATTTTACTAA TTGGCACGAT TCTTTTTATG CTTGTCAATT TAA?jyU^CG CAGGGCAGAT 
TCGTCTAATA AATAAAAAAA CAACGCAAAC TTATCTTTCG CTGAATAAGT TTGCGTtGTT 
TTAgcTTTTT GACAATCATA TTAATCGCTT GTGCTAATCG CTCTrCTTGT 7CTTTAGGAT 
CTTTTTCAGG ATTTTCAAAG CAATGCTTTA AATTCTCCGC AAC-TCATC CCCATTACGC 
GATCAATACT AGAACGGACC GCACTCAACT GAGTGATAAC ATCGATACAT TCTTTTTCTT 
CATCAATCAT TTTTTGAATC CCGCGAATTT GaCCTTCTGT 3CGGCGTAAA CGATTCAGCA 
CTTTCTTTTT TTCTTCAGTC ATTTTGCCCC TCCTTAAGAG TAGTGATTGG TTTAAAATAT 
ACCTACACGA GGTATTTTTG TCAACTATCT TTTCTTTGAA TC?A?JU^TG GCAGAAATCT 
CTTCAGTTGA CAAGtCTAaC TTTTtCATGT tAAATATmCC CCGTPAGgTA 7ATAAGAGAA 
GGAGAAAAaG rTGTTATTTC AATTGTTTAA AAAAACGCCA AGTA7TTCrA CAAAGGAATT 
GCAGTCACGT TTATCTAAAG AGATAACGTT ATTAGATGrC CGGP.CACCTA GCGAATACCG 
CGCAGGGCAT ATTCCACAAG CGATTAATGT CCCACTTAAC AAAP.TTCCCG CCTACAACAA 
ATCCGCTAAC GAAGTCTATG TGATTTGCCA GTCCGGCATG CGCAgCAAAA ATGCGGCCAA 
AATTTTAGCG AGAAAAAACT ATCACGTGAT TAATGTTCGT GGCSGGATGT CCCAATGGTC 
TGGTCAAATT AAAGGAGGAA AATAAATGAA AATTGTAATT ATCGGAGGCG 7GGCTGGTGG 
TATGTCAGCA GCGACACGTT TACGTCGTTT AATGGAAaAT GCTG^JUWTG TTGTCATGGA 
AAAAGGCCCT TTTGTTTCAT TTGCAAACTG TGGTTTGCCT TACTACGTTT CTGGCGAAAT 
CGCAGAAAGA GAGCAACTGC TTGTTCAAAC ACCC6AAGCG TTAAAGGCAC GGTTTAATTT 
AGATGTTCGT CCTCACCATG AAGTCGTGGC TATTGATCCA ATi^-GAAAAAG TGATCACAGT 
GAAACATGAA ACAGAAATTT TAACAGAACA CTATGACAAA TTAArTTTAT CACCAGGGGC 
GAAACCTTTT GTGCCACCAA TTACAG6ATT GGCAGAAGCC AAAAATGTTT TTTCATTAAG 
AAATGTTCCT GATTTAGATC AAATTATGAC AGCCTTGACA CCAGAAACAA AACGAGCCGT 
CGTGATTGGC GCAGGCTTCA TTGGCTTGGA AATGGCAGAA AACTTGCAAA AACGCGGATT 
AGAAGTCACT CTCGTGGAAA AAGCGCCTCA TGTTTTACCG CCATTAGACG AAGAAATGGC 
CGCTTTTGTC AAAGCTGAAT TAAGCAAAAA CAATGTTCAA GTAATTACGG GACAATCTGC 
GGTTGCTTTT GAAGAAGAAG GGCAAGTGAT TCGCTTAGAA GACGGTCAAA CATTAGCTTC 
TGATTTAACC ATTTTGTCGG TGGGGGTCCA ACCAGAAAAT ACCTTAGCAG 7TGAAGCAGG 
TGTAGCAACT GGTTTACGTG GCGGTATTGT TGTTGATGAA CACTATCAAA CGAATCAACC 



15360 

15420 

15480 

15540 

15600 

15660 

15720 

15780 

15840 

15900 

15960 

16020 

16080 

16140 

16200 

16260 

16320 

16380 

16440 

16500 

16560 

16620 

16680 

16740 

16800 

16860 

16920 

16980 

17040 

17100 

17160 

17220 

17280 
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CGATATTTAT GCGGTTGGGG ATGCTATTGT AGTGAAACAA CAAATCACTC AAGAAGATGC 17340 

GCTGATTTCT TTAGCTTCTC CCGCCAATCG CCAAGGACGT CAAGTAGCGG ATGTGATTGC 17400 

TGGGTTAGAG AGAAAAAATC AAGGAAGCAT TGGGACCGCC ATTGTGCGAG TCTTTGATTT 17460 

AACCGCTGCT TCAACTGGTT TAAGCGAACG GGCTGCTAAA GCTGCTGGAC TAACAACAGC 17520. 

TGTTGTGCAT ATCAGTGGAA AAGACCATGC GGGGTATTAT CCTGGCGCAA CAGATCTTCA 17580 

GTTAAAATTA GTTTTCCATC CTACGACAGG GGAAATTTAT GGCGCACAAG GAATTGGGGC 17640 

AAAGGGCGTA GATAAGCGGA TTGATATTCT TGCGACCGCT ATTAAAGGAC AGTTAACTAT 17700 

TTTTGATTTG CCTGAATTAG AGTTTACCTA TGCGCCGCCG TTTGGTTCAG CGAAAGATCC 17760 

TGTGAACATG TTAGGCTATG CAGCGATGAA CCTTGTAGAA GGATTGAGTG AAAACGTTCA 17820 

ATGGTATGAG TTATCCAACG AATTAGCTAA TGGAGCTGTT TTATTAGATG TCCGTAATCC 17880 

CGCCGAACGA GCCAATGGTC AATTTAAAAA TGCTGTGTCT ATTCCTTTAA ATGAGTTAAG 17940 

AGAACGTTTG GAGGAATTAG ACAAGTCAAC GGAGTACATT GTTAGTTGTC ACAGTGGTTT 18000 

GCGTAGTTAT ATTGCAGAAC GGATGCTAAA ACAAGCAGGC ATCTCAGCCA AAAATTTAGA 18060 

TGGTGCTTTT GCGCTATATC GAATGGTAAA ACCG6AGGAA CTAGAAAATG TATAACACGA 18120 

TTGGCATGCC TGAATTTTAT CaGGAAGCAA AACGAAAAAA ATTACCGATT ATTGATGTAC 18180 

GAGAAGAAAT GGAATACCAA CTGGGGCATG TCCCTCAGGC CATCAACTTA CCTTTAAGTA 18240 

CTTTAACCGA AAATTATCAG CAATTAGATA AAAACCAACC CTATTATCTG ATtTGTCAAA 18300 

TGGGGTCGCG CTCCGCACAA GCGTGTGCGT TTCTCAGTCA ACAAGGGTAT CAAGTAACCA 18360 

ATGTTCTAGG TGGTACATCT GGCTGGTTGG GTCAATTAGA ACAATAAAAT AACAAAAAAC 18420 

GTTTATTCTT TTACACAGGA TAAACGTTTT TTTGTTAGAT AAAGATGTGT CAAAACTGTA 18480 

AGAGGAATGA TAGGAAGATG AAGGGACCAG TTTTTTTATT CATTATATAC TAAATAAAAT 18540 

GACAAACTCG TCAACAGAAA TCAAGGAGGA ACAAATAAAT GAAAAAGTCA GGCCAGAAGA 18600 

AGCGAAATAA AAAAGTGTGG TTCGGTATTG GTGCCGCAGT GGTCGTCGTC GGATTTATCG 18660 
GAGCAAAAAC AGTTTTTTCT TCAAAAGAAG TTGAACCTGA ATACACAACG TATACCATTA 18720 
CAGAAATGGC CTCATTAAAA TTAGATGGTC AAGTGAGTTT TCTAGATACT CGGGATATTT 18780 
TCTTTGATCC TTCTTTAGGA AAAATTGCCG AAATAAATGT TGAAAATGGT AAAGAAGTCA 18840 
AGAAAGATAG TCCATTGTTA ACGTACAATA ACTCAGACAT CCAAGCCACA GAAACAGAAC 18900 
AAGCGAATGC TGTTAATCGG AACAATCTTC AAGTTCAACA AGCACAAGAG AACGTTAACT 18960 
TAGCCACTCA AAAATACAAT GAAGCGTTAA ACAAAGTTGC TGCAGCC7UVG CAAAAATTAA 19020 
ACACCGCAAA AGAGGCTGAA GAGAAAGAAA CACTAAATGC GGAAATTCAG CAATTGAATG 19080 
AAGCCGTTAG TGCTGCGAAT AGTGAAGTCG CCCAAGCTAA CCAAGCCCTT CAATTAGCCA 19140 
ACAGTGATGC CGTCGGAGCA GCAAACACGC TTGAACAAAC ACGTGGCAAA GTCAACACAG 19200 
TCGTTACAGC ACCAATTGAT GGCCAAGTCA CTGTTGATGC TAGTGCCATG AGTAGTACCG 19260 



wo 98/50555 PCTAiS98/08985 

749 

ACAAACCAGT CATTAAAATT GCAACACAAA AGAAAAATAT TCAAGGAAAA GTAACCGAGT 19320 

ATGATTACGA TAAATTACAG ACAGGGGAAG AAGTAACAGT GACGACCGTT GGTTCAGGTA 19380 

AATCAGCACC AGGCAAAATT GTTTCCATTG CGCAAACCCC AATAGCTAAA AATGAAGGCA 19440 

ATCCAGTAGT GAGTTATCAA TTTACAGTAG AAGGTGATTT TCCTTGGGCA GAAGGCTTGT 19500 

CTACTAGTAT CGCTGTGCCA CAAAAGCAAA TGATTATACC GACAGCAGCA ATCCAAAAAG 19560 

AAGGCCAAAA AGAATTTv-VC TATGTTTACA AAGCGGGCAA AGCGAAAAAA ACACCGATTG 19620 

AAACAGAAAC AAATTTAGGT CGTAAAGTTG TCAAAAGTGG CTTGAATTGG AAAGACCAAG 19680 

TGATTGCCAA TCCCAATAAA GAATTAAAAG ATAATCAAGA TGTTCAGGTA GCTGCCAATG 19740 

ATTAAGCTAG CCCATATTCA TAAATATTAT TATTCCGAAG AAGAAACACT GCATGTGTTA 19800 

GATGATATTA ACCTCCAAGT AGACGCTGGA GAATTTTTAG CGATTATGGG TCCTTCTGGT 19860 

TCTGGGAAAT CGACTTTGAT TAATTTGCTA GGCTTTATTG ATAAAAAGTT TGAAGGAACG 19920 

TATTTATTTG AAGACCGTGA AATTGGCGAC TTTTCTGATA AGGAACTATC CCGAATTAGA 19980 

AATGAAGCAG TCGGCTTTGT CTTTCAGAAT TTTAGTTTAA TTGAAACACT AACAGTTGAA 20040 

GAAAACATTG aATTACCTCT TTTATATAGT GGGTTAACCC CAAAAGAAGC CAAAGATCGT 20100 

GTCCACGAAG TTCTGACAAA AGTCGGCTTG CCAGATAAAG GCAAGAAGCA TCCAAAACAA 20160 

CTTTCGGGCG GACAACAACA ACGTGTTGCC ATTGCGCGTG CGATTGTCAA TCGGCCAAGT 20220 

TTTATTATTG CAGATGAACC AACTGGGGCT TTAGATAGTA AAACATCGGA AGAAATTTTA 20280 

ACGCTCTTTC AGCAATTAAA CAATGAAGGT GTCACCATTA TTTTAGTGAC P.CATGATGAA 20340 

GAAACAATTG AATACTGCAA TCGTTTGATT AAAGTTCGTG ATGGAAAGAT TTTAGAGGAG 20400 

GTGCTGACAT GAAACGTTCA ATTATTTGGA AAACCGCCTT CCGTTCCATT TTGAAAAATA 20460 

AGCGACGCAG CTTGTTAACG ATGCTAGGGA TTGTCATCGG GATCGCTTCG GTGATTACCA 20520 

TCGTGGCTAT CGGTAACGGC TTTAAAGAAG ACATGGTCGA TAAATTATCT GCTGAAAAAC 20580 

AAAAAGAAAA TGTGAAAAAG ATTTCTTTTT CGGCCTACAA CACTTCGGAT ATGTTTAGTG 20640 

ACCAAGCGAT GTTTACCGAC AATGATTTAG GGGTCGTGCG CATGGTTCCT GGCGTAGAAA 20700 

AAATTGATTT TGATCAAAGA GAAGTTGATG GCGCCCAAAA AGGTACACTG AATTTTCAAG 20760 

CAGGACCGAA AAATCTTTCT GTAACCTATG AACTTGCCAA CAAAACCACC AAAGAATTGT 20820 

TGGCGGGACG CCAACTGAAT GATGGAGACA ATGCAAAACT GGATAAAACA GTGGTGGTGG 20880 

ATGAACTGGT TGCCAAAGAA CTCTACCAAG ACTCGACTGC AGCAGTTAAT CGAACTTTTC 20940 

CATATAAAGA GCAACTTTTT ACCATTGTTG GCGTGACAAC CAATACCAGC GGTGCCATTG 21000 

GTCCAGGTAA TGATGACTCA TTGCTTTATT TTCCCAAAAA GACCTATGAA CATTATTTCG 21060 

GCAAGCTAAA AGATACATCT ACGTTGAAAC TAACAGTAGC ACCTGGCTAT CAACCAGATC 21120 

AAGTATTGAA AGAAACAATA AAAACTCTCT CTCAACAAGG AACCATGAAA AACAGTGGGA 21180 

CGTATCAAGA ATATAATGTT AAAGATACCA TCAAAGAAAT GGGCTCTTTA TTAAATAATT 21240 
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TAACCTTGTT 


TATTTCCGCA 


ATTGCGGCTA. 




AA 1 TG uTGGG 


GTCGGCGTTA 




TGAATATGAT 


GTACATTTCG 


GTCTCAGAAC 


GGATTAAGGA 


7k 7k fTlfn/^/^fn 7k f^/^ 

AATTGGTATC 


7k 7k n* 

CGACGTGCAT 


213dD 


TGGGCGGGAC 


AGCCAGTGAT 


Ik mm Ik Ik nk m Ik 

ATTAAAAA6C 


AATTCTTAAC 


7k ^7k 7k ^^/*t7k mm 

AGAAGGGATT 


GCCTTGACCT 


21420 


TAATCGGCGG 


AATTACTGGT 


TATCTTTTAG 


GAATGATTAT 


TGCCTACTTA 


GCGTCAATGG 


21480 


CACTACCGTT 


TTCTGTTCGA 


CCAGATTTAA 


TGACCGTTTC 


ATTAGCCATT 


GGGATTTCTG 


21540 


TCTTTATCGG 


GGTTGTATTT 


AGCTATTTCC 


CTGCCTCGGC 


AGCTTCAAAA 


AAAGACTTGA 


21600 


TCGATATTAT 


GAAATAAATA 


TAAACAGCAA 


AGAAACAGCC 


ATTTTTGGCT 


GTTTTTTTTA 


21660 


GTGAAAGTAG 


AAAAAAATCC 


AAAGATTTCA 


AGAGAAAGTA 


GCTAACAAAC 


GTCAAAAGGT 


21720 


TGATATTAGC 


TACCAACCTT 


TTGACGTTTG 


TCAGTTAAAT 


AAAAAAGAAA 


AAAATTAAAA 


21780 


AAATAGCACG 


TTGAGGATTA 


AATAACAAAC 


ATTTTTTAAA 


TAAGACTATT 


GGAAATCCTT> 


21840 


TATAAATCGG 


GGATTACAGG 


AATGACTGTT 


AATGACTGAT 


TTGCTTGTAA 


ATTTCTAAAT 


21900 


TTTAATTCGA 


AAAATTTCCA 


GTATACTAAA 


TATATCAAAA 


CAAAACACGA 


AAATTGAGAA 


21960 


AATTTTGAAC 


CAAAAAAGAA 


TTAAGGAGGA 


ATCAAGAATG 


AAAATAATGG 


TCATCACAAA 


22020 


AAATATTTTA 


TCAGAAACTG 


CTTTTCAAAA 


CCGCTTACAG 


CATTTAGATC 


ACCAAGTTTT 


22080 


TTGTACGACG 


TGTCTGATTG 


6TGAAATTAA 


GGAAAGTCAT 


TTGTTTTATC 


GAAAATTTGA 


22140 


TGCCATCATT 


CTCAGCGAAA 


CGATTCCTCG 


CAAAGAACAA 


TATTTATATC 


TAAAGGAATT 


22200 


AGCCAGAGTT 


AAAACGCCGT 


TGTTTCTTCG 


CTCGGAAGAC 


ATTCCTACCT 


CGGAAGAATT 


22260 


GAGCATTAGC 


TATCAATATG 


GCATTCTCAA 


TTGGTTACGA 


TTGGATGCGA 


CTTTGGATAC 


22320 


GTTAAGAGAA 


AATCTCCTTT 


TGGCAGAATT 


AAATAAACCC 


ATCCAACGAG 


AAGATGAAAA 


22380 


GTTAGATGTA 


GAGGGAATAA 


AGCTAACAGC 


CGCAGAACGT 


GCACTACTGG 


CTGTCTTGTA 


22440 


TAGTACGCCA 


AAAGAACACT 


TACCAATGGA 


GCGCCTGTGT 


AAATTAATAT 


ATGACACAGA 


22500 


AGAAACAGCC 


AATAATCGAA 


AAAAAATGAC 


CAAACAGCTG 


ACAAAATTAA 


ATAAAAAAAT 


22560 


TCAAGTAGGG 


GGCTTTAAAA 


AACAAGGTAT 


CCAAAATACA 


CGGTATAAAG 


GCTACAAACT 


22620 


GCATCCGCAg 


TTGCTTCCTT 


ACATTGGCGA 


ATTATTTACT 


TAAATTAGAG 


AAGCACTTGA 


22680 


CGAATGAAAA 


TCACCATCAC 


AGGATGACCT 


CTAAAAACCA 


TGCTATACTA 


ATACGGAATA 


22740 


G T AT AGGAGG 


TGGGTCTGTA 


TGAAAGACGG 


ACTATATTGG 


GTACAACAAT 


tn Ik 9k vk^ 9k y^9k 9k ^^9k 

TAAAAGAACA 


O 1 O A 

22800 


GAAAGTAAGT 


CGCATAGAAC 


TCTTAAAGGA 


nr^ 7k ^ik ts ik 

TATTGAGAAA 


AAAGTCCAAA 


9k #^ fn^n 9k 9k 9k fw%^^^^ 

AGTTAAATCC 


22860 


AAAGATCAAT 


r*f^i MiifiMiM *fiiik Ik 

GGCTTTGTAA 


CCTTTGATAG 


f>r> 7k 7k ^ 7k 7k/^^7k 

CCAAGAAGCA 


^ 7k 7k f^l^ 7k 7k ^mw» 

GAAGCAACTT 


7> 7k 7k 7k 7k 7k 7k 7k 

ATCAAAAAAA 


o o f\ o r\ 
22920 


TAAACAGCAG 


GACACACTAT 


TTGCTGGTTT 


GCCATTTCCA 


TTGAAAATGT 


TAGGCCAAGA 


22980 


AAAAGCGGGG 


TGGTCAGCAA 


CGGCCGGCTC 


GCAACTTTTT 


AAAAATAGTA 


CTGCAACAAC 


23040 


GACCTCTAAC 


TTTGTCAAAC 


AAGCAGAAAC 


GATTGGCTTG 


ATGCCTTTAG 


GACAAACCAA 


23100 


CGCTCCAGAG 


TTTGGCTTTA 


AAAATATTAC 


GGACCCTGCG 


CTTTACGGGC 


CAGCCCGCAA 


23160 


TCCTTGGAAC 


CTGGAACATT 


CGCCAGGCGG 


TTCCAGTGGC 


GGCGCTGCCG 


CAGTTGTCGC 


23220 
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TAGCGGGATT 


GTTCCTATAG 


CCGGCGCTAG . 


CGATGGCGGT GGblCCATCC GAATTCCCGC 


23280 


CTCTTTTAGT 


GGCTTAATTG 


GCTTAAAACC 


TAGCCGTGGC 


TCCATGCCTG 


TCGGCCCAGA 


23340 


AGGCTGGCGA GGCTGGCAAG 


GTGCCTCGAT 


TGACTTTGCG 


TTGACTATTT 


CTATGAGAGA 


23400 


CACCAAAGCG 


TTATTTTATG 


GCCTACGTGG 


TAGCCACAGT 


GGCGCCCCCT 


ACCAAGCGCC 


23460 


CTTGGCAGAA TGGCAAACAC ACCCTAAAAA ACAACGGCTA AAAATTGCTC 


TGTGTAcCGc 


23520 


TTCgcCTATC GGCTCACTGA 


TTTCGCCAGA 


AGCAACACAA 


GCGGCGAAAC 


AAGCGGCTGA 


23580 


TTTTCTGGCA 


GCCGCTGGCC 


ATGAAATCAT 


TGAGATTCCT 


TATCCAGTCG 


ATGGTGCAGC 


23640 


GCTGATTCGC 


TCCTATTATC 


AAAT6AACGG 


CGCAGAAACT 


ACAGCGATGA 


TGAACAGCAT 


23700 


TCAACAGGGG 


CTCGGGCGCC 


CAATTCGTAA AGAAGAAATC 


fiAATfTTTTT 


CATGGACCAT 


23760 


GCATCAATTT 


GGCCAAAAGA 


TTCCAGCAGC 


CACCTATGTT 


PATTPATTAP 


AGTTGTGGGA 


23820 


TCAAGCAGCA 


GTAACGATGG 


AAGAACTTTT 


TCAAAAATTT 


GATCTGTTTT 


TATCACCAAC 


23880 


AACGGCTTTT 


TCAGCACCAA AAATTAACGA 


AGATTTGCAA 


AGTGAGCACA 


TTCGTCAACG 


23940 


AATGGCGCAA GCAGCAGAAT 


TAACAGAAGC 


AGAATTAGCT 


GAATTAATCT 


ATGATTATTT 


24000 


TGATAAAAGT 


TTACAGCTAA CACCGTATAC 


TCAACTAGCT 


AACTTGACAG 


GACAACCAGC 


24060 


CATCAGCTTG 


CCCACACATG 


TTACGGCAAC 


GGGGTTACCC 


TTAGGCATTC 


AATTGCTGGC 


24120 


TGCTCGTGGA 


CGTGAAGATC 


TCTTATTCCA AGTAGGCGAA 


CAGTTTGAGC 


AAGAAGGCAA 


24180 


ATTTAAATTA 


CCAGAAAGTT 


ACCGCTAAAT 


AGGGTGTTAT 


TGTGACCAAA 


TTTCAGATAA 


24240 


AATAGACACA 


AGACAAAAAA 


CTGAGGTGGA 


AAAATGCTGT 


ACGTAGTGAG 


ACATGGTGAA 


24300 


ACAGATTATA ATGTGGCCCG 


TAGAATTTGT 


GGTCATGCGG 


AAGCGGAATT 


GACCGAAAAA 


24360 


GGCTATCAAC AAGCGGAATT GGTAGCTGAA AAAATTGCCA AACAAGGAAT TCAAATCGAC 


24420 


CGTTTACTGG 


CGTCGCCTTT 


AAAAAGGGCG 


CAAGAAACAG 


CCCGTAAAAT 


TGCAGAAC6A 


24480 


AATCAATTGA 


CGATTGAAAC 


AGAACCTCGT 


TTAATTGAGA 


TGAATTTTGG 


CATTTATGAT 


24540 


GGTGAACCGA 


TTGAAACAAC 


GGCTTTCCAA 


GAAAATCGTT 


CTGAGATTTC 


CTTGCCGTTT 


24600 


CCAGAAGGGG 


AATCAGTCCT 


AGATGTCGCG 


GGTCGAATTT 


ATCCCCTGAT 


TGAGGAAGCC 


24660 


TTGGCTTCAG 


AGGAAACTTA 


CCTATTTGTT 


TGCCACAATG 


CGGTCATGCG 


AGTCATTGAT 


24720 


AACTACTTCA 


ATGGTAAAAA 


AATCCAAGAT 


TTCTTGGACT 


TTCATTGTGA 


AAATACACAA 


24780 


CTTGTACAAT 


ACGGAGAATA 


AAAATCAAGT 


CTTTAATCTT 


TTGGTTAAAG 


ACTTGATTTT 


24840 


TTTTATGGTA 


AATAAGAAAC 


CGCTTCCGTT 


TTAATGTATA ATAAATTAAC 


TTTTGGATCA 


24900 


AAAAAGAAAT 


GGGGAAAAGA CTATGAGAAT 


T6ATAATATT 


CATCAAGCGT 


TAAAACAGTT 


24960 


TCACGTACAA 


AATCCAAGTA 


GCAAAGAAAC 


CTTGTTGCCC 


GCAACGGcTT 


CGCAATTAAG 


25020 


TCAGGCCTCT 


GGCTATACCC 


GAACCGCTGT 


TAGTGAAGAG 


CTTTCAGAAT 


TAGTGAGAAA 


25080 


AGACCAAGTC 


GTTAAAGTTA AATCAAGACC 


GGTGTTATTT 


TTTGATAAAG 


ACTACTTAGA 


25140 


GGAACAGTTT 


CCTTTTTTTA AAAGTAGCAC 


GTGCTATGAT 


TCCCTCGcTT 


CGCTGCAAGA 


25200 
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GGACATTTAT CAACAGACAG AACAGATACA TAAGAAAAAG CAGCTACTGA AAATGAATCC 


25260 


ATTTGCGAAC 


GTAATTGGTT TTGATGGGAG CTTAGCAGAC GCCATCAAGA AGTTGAAAGC 




CGCTATTTTA TATCCGCCTA ATGGCTTGAA TATTTTACTT ACTGGAGAAT CTGGGGTGGG 




GAAAACCTTA ATTGCAGAAC AATTACACCA ATTTTATCAA GTTAAAATGA ACGAAGAAGT 




TCCGTTTATT 


TATTTTAATT GTGCAGAATA TTTCAACAAT CCAGAATTAT TAACTTCGCA 




TCTGTTTGGC 


TATAAAAAAG GCAGTTTCAC TGGTGCCGTC AATGACCAAA AAGGTTTAGT 


£DDO\J 


GGAGTTGGCA 


GATGGCGGTT TTTTATTTTT AGATGAAGTC CATCGATTAA CGAGCGAAGG 


o c con 


CCAAGAAA7UV 


TTGTTTACAG TTTTGGATAA AGGGTATTTT ACACGGATGG GCGArCgAAA 


ZDboU 


AACAAAGAAA 


TGTATCGATA AAATTTATTT TTGCT^CAAC CGAAGCAGTC AATCAACATT 




TTTTGAAAAC 


GTTTTTAAGA AGGATCCCTG TGACATTAGC CATTCCAGCT TTAGCCGAGC 




GCTCTTTGAA AGAAAAAATT CAATTGATTT TGTCCTTTTT CTTAGAAGAA AGTCAACGGA 




TTCAAAAG7UV 


TATCCATATT CGTTATGAGG CGTTGAAACA ATTAACATTT ACTCATTTTG 




AAGCCAACGT 


AGGCGAATTG AAAGCGGCGA TTCAATTTAT GAGCGCACAA GCCTATTTAG 


25980 


AGGTGTTCGA 


TCAATTAACG GAAACTATTT GCATTGAAGC AAACGAGGGG ACAGTCAAAG 


ZoxjhU 


AGCAACCTGA 


TGTAGAAAGT CTGCTGGCTT TTAATGAGTG GATACCTAAC GATGGCCTCA 


o >r ^ A A 

26100 


CGATTCAATA 


CAACCATTTT TATTTAAGCA GTCAAGAGTT ACTTAAATCA GAAAGTCTGA 


CA 


AAAATGATCG 


TTTTTATAAT TTTCTATTGA AAGAATATGC GGATTTGAAA AAATTAACTC 


2o2Z(J 


TTTCGACAGA 


GGATGCGCTA GCAATCATGC AAAACAAAGT CGAGCAACTT TATGATATGG 




ATATTTTTGC 


TCAAGAACCG CTAGCCATAG AGGCCATTGA CGAAACATTG CAGCCAAAAA 




TTCAAGAAGC 


CATTGTCTTT TTTGAACAAT TAATCCAAGA AAAATTGACA GAGGATTTCC 


ZD4UU 


AGAATATTTT 


AATTGTCCAC CTTTATTCTG CTGTTTTTTA TTCTGCACAA ATGAATGAAA 


O CA 


CATTTTATTC 


AGATACCAAA AATATCTATT CTGGCAAAGT CACCCAGTAT CAAGCTGCTA 




AGACCATGGT 


GGACTTTCTA CAAACATTGT TTGGGATTTC TTTTGCTGAT ACGGAAATCC 




TCTTTTTCGA 


TCTATTTATT CGTAAAGTCA CAGCTAAAAA AAGAGCGCAC TCAGATGAAG 




AAAACTGTGG 


CGTTATCGTC GTTGCACATG GTGCCGCGAC GGCCAGTAGC ATGGTTGAGT 


96700 


ACACCAACCT 


TCTGTTTTCA ACAACGGTCA TGCGTGCAGT TGATATGCCG ATTAATCAAT 


26760 


CAGTGGAAGA AACATTAGAA AAAGTCCGTA CAATTGTAAA GCAAAATCAG TATAAAAAAA 


26820 


TTATTTTAAT 


GGTGGACATC GGTTCCCTGG TTTACTTTGG TAATGCTATC AGTCAAAAGT 




TTCAAAGGGA AGTGCTGCTA ATTAGCAATA TCAATCTATT AACATTGCTA GAAGTGGCCC 


26940 


GAGAAGTCAT TTATGAATCG ACGGCGTTTG AGTATCTATT ACCAGTCTTA AAAGAAAAAA 


27000 


ATCATCGCGC 


CTTGATTTGC AGTAAAGGTC GCTTCAACGA GGCAAAAGTT CTCATTATTT 


27060 


CCTGTCTAAC TGGCATGGGG ACTGCTATTA AAATTGAACA GTTGTTACTA AAAACTTTTC 


27120 


CCAATGAATT 


ACTAACGAAT GTACGCATCA TTACTTTAGA GAAAAGAGAA GTGGAAGACA 


27180 
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mm^^ mr^/^/^mm /^/*»ri»m^*»f»»i»n»»¥» r*'i*t*r*f*f*'^'A.1k.fl TkHfl'^flT^l^li.TiT f^fZ^fflfZTHTC* lVTTf3f2f*Af OR 
T TGATCGCTT GCATTCTTT 1 vj 1 1 UooiviAla wi/4«M i wva i a sasa /\ i x \9V9 wiwv^u 


27240 


TAAAGACAGA AGTGCCAGAT ATTCCGTTTA TilUiliAv^ Aiafvi i iav^ia i v^ioAAia^iTiaVa 


27300 


GCATTCAGCT AGTTTTCGAA TTATTAGGCC ATCATTTAGA AGAAAAACAA AACCAAGAGT 




TACTAAAAAA AGTTTCAGCT AAATATATTC AAGGATTATC AATCGAAGCT ATTACAGACT 


A on 


TACTAACGGT TTTGAATCCG ACAACCATTT CAATCGAATT AGCTAAGATT TATTCAGACA 


2 /4oU 


TTTGCTTGCA AACTAAAATA AAAAGTGACG AAAAGATTTT ACTTCGTTTT ATCATTCATT 




GTGCTTGTAT 


GTTAGAACGG CAATTGTTGA ATCCAGAATA TGACCCGACA GCTTATGaAG 


£. / OUU 


TGTCaTATCG 


AGAGTTGCCA GAAGCTGTAT CTGTCATAAA AATGGCGTTT CGACCGCTTG 


z / oou 


AAGTGTCCTA 


TAATCTGTTG ATTCCACCTT TGGAAGTGCG CTATATTTAT GAATTGCTCT 


Z. f /zu 


TTGAAAAGGC 


TTAATTCAAA CGGAAATGAA AGTTGGCACA CTTATTGCTT TTAATAAAAT 


Z / /oU 


GATTAATTAG 


CAAAGGAGTT TGTTTATGGA AAATGAACCA ATTTTACTTT TAACACACAA 


27840 


TGGTTGGGGC 


TTGGAATTAG TCAAAAGCGT CCAGATGATT GTTGGAGAAG TCACAAATGT 


o T n r\ 
2790U 


ACATGAAGTT 


GCATTACAAG CAGAGGATTC ACTGGGGGAT TATTTAGAAC GAGTCACAGA 


27960 


AAAAATTGAC 


CAATTAACGT GGCACAACCA GTTGCTAATT TTAACAGATG TTAAAGGCGG 


O O A O A 

28020 


CACACCTAGT 


AACGTTGCAT TACGTTTATC AAAAGATTAT GACCTCCTGA TTGTGTCAGG 


28080 


ATTATGCACA 


TCGTTATTAT TAGAATCCGT CATGAAACAA AGTGGTCCTG GATTTCGTTT 


28140 


AGAAGATGCA GAGATGATTC AACAAGCAGC AGTAGACAGT TGTCAAATTT TAGAAATTCC 


OO O AA 


AAAATAAGAA 


AGTACAGGTG AACAGAATGT CAAAAATCGT ATTAACAAGA GTCGATAGCC 


ZDZOU 


GTTTAATCCA TGGACAAGTA GTGACTAAAT GGTTACAACA ATCAGGTGCA AATGAAATTT 




TGGTAGTGAG 


TGATGAACTA GAGCAAGATG AATTTTTACA AAGTATTTAT TTAATGGCGG 


ZD JDU 


CGCCTCCAGG 


GGTTTCTGTA GTAATTAAAG GGATTGAAAG TGCCAAAGAA TATTGGG/^T 




CGCAAGAAAA 


GGAAGAGAAA AAGGTGTTAT TTTTAGTGCC AGATTTAACG ACCTTAAAAG 




CGATGGTGGA 


CCAAGAAATT ATTAAAGAGG AGATTCAAGT TGGTGGCTTA GGTGGCGGAC 




CAAATCGGAA 


AAATGTTTTA AAAAATATCA ATTTATCTGA AGAAGATGTT GCAATTTTGA 


ZO DZu 


CTGAACTTTT 


AACGAAGGGC ATCCATGTTT TCTTCCAAGC GATTCCCGAA GAAAATCCCC 


ZO DO V 


TACCAATTCA 


AAAATTAATT GAAAAATATC AATCACTATA ATCAGAGAAA AAGGAGAGAA 




TCATATGGGC 


GTTTTAGGTG TTTCAATTAT TATGGGACTT TATTATTGGT TTGCACGTTT 


ZO 0 uu 


GCGTTTTGGG TACACATTTT CAGGCATGCT GTCTCAGCCA TTAGGCGCAg cGTTGGTGGC 


4.00 SJV 


,TGGTTTAGCA ACAGGAAAAC TTAGTGAAGC AATGGTTGTG GGGGCGGGGA TTCAACTTGT 


28920 


TTATCTCGGA GTCACCTCCA CACCGGGCGG CAATGTGCCT AGTGATCCGG CTTTAGCTTC 


28980 


AGCAATCGCG 


ATTCCGATTG CTTTAGGTGT GGGAATGAAT GCCGAGGCGG CTGTTGCACT 


29040 


AGCTGTTCCT 


TTTGGTGTGT TAGGCGTCTT TATGGATCAA TTACGGCGCA CCATCAATGC 


29100 


GACGTGGGTG CATATGGCTG ATAAATATGC GGAAGAGTTA AATTATGGGG GGATTTACCG 


29160 
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r\\3\^ 1 1X1 


\3X 1 xxix\«v«n\9 


PTTTAGCGGG 

^ X X X nUV«V9wV9 


ATTTTTAATT 


CGCTTTCCGT 


TGGTTTTTGC 


29220 


o\7V«V7nnV.r 1 X i 


X X X\9VjxvMrv\x 


v* X \7 X UrVa X onn 


TAAAGTCTTA 


GAAGTTATCC 


CAGCATGGTT 


29280 


GATGCACGGC 


TTTGAAGTAA 


X Vy\70 X V7V7 1 n 1 


TTTACCAGCG 


T TAGGCT TCG 


CCTTAACAAT 


29340 


TATGGTAATT 


GGCAAGAAAA 




T *P 2V T *P T T 1* *P A 
X XnX X 1 X X xr\ 


T%X X\9v9X 1 X XV9 


TTGCAGTAGT 
X X 19 v^nia X nvi x 


29400 


ATATTTTGGC 


GCAGAAGTTA 




lAl X X XnooA 


ATCTGTATTG 


CCTTTTTAGT 




TAGAAACAAG 


GCATTGAGTG 




ATAATRfiAAf 
Al AA 1 lJ\3AA^ 


AAATGACACA 


AGCAAATGAA 




CAACAATTGG AACAGAAAAA 


AAx xAUlAAA 


a AA<2AATTAT 
AAAuAAX lAX 


CCAAAGCCTT 


TTGGATTTAT 




CAACTAGGCT 


GTGAATTATC 


AAA T TCTTA 1 


IstAAlrfis X ^ 1 


AAAGTTTAGT 


TTTTTGTGCA 




TCGATGATTC 


CTGCAATTAA 


AAAACTTTAL. 


Uv.Al3AU\3A X Vs 


AAGAACAACA. AAGAGAAGCT 




CTGAAACGTC 


ATTTGAATTT 


TT n AAl ACl 


VsAAviijAAWlo 


TAGGGGCTTC 


CATTCAAGGG 


^ 7 ' ov# 


ATTGCAATTG 


CTATGGAGGA 


AGAAAAATCC 


AAT GGC vjC*Ag 


cGATCTCTGA 


TACGGCGATC 


0 QQ ori 


ACATCGATTA 


AAACTGGGCT 


GATGGGTCCA 


CTOGC 1 WjtiA 


TTGGCGATTC 


GATTATTTGG 


0 QQ on 
Z 700U 


GCAGCATTAA 


TGCCGTTAAT 


CATTTCCATT 


TTTATTCCAA 


TGGCTAAAGG 


AGGTAACGTG 


0 QQ>i n 


ATTGGAAGTA TCGGTCCGTT 


GGTTCTGTAC 


ACAGCCaTTA 


CATTATACAT 


CAGTTGGACG 




TTAGTGAATA 


AGTCCTATAC 


ATTAGGGCGT 


AATTCAATCT 


TAAGTTTGTT 


AAAAGACGGC 




AAAATTAAGC 


AAGTCATTTA 


TTCAGCAAAT 


GTTT TAGGAA 


TGATGATGAT 


GGGCGCTTTA 


JUJLZU 


AGTGCGAGTT 


ATGTTAAAAT 


TGCTAGCCCG 


ATCaACl? X i i A 


AAGTTACTGG 


CGGTGCCACG 




ATTGTTTTAC 


AAGATATCCT 


AGtATuAAATT 


Ax oAAAbuo 1 


TACTTCCTTT 


AGCAGCAGTC 




ATGGCTATTT 


ATTTCTTCAT 


UGTAAAAAAA 




ACGGTATCAT 


TATCGGCACG 




ATTGTCTTGG 


TTAGTTTGGT 


UA^ X xUAi i u 


1 1 IbisX 1 xlai 


T A r AAAoVs A\3 


aaaaTaaT/za 
AAAAlAAlvaA 




ATAGTTACGA AAAATTCCAC 


TTAAAAGAAG 


Xw\TTAAl ov. 


TTCTGGCAAA 


ATGACCATTC 




TAGGTGTCTC 


CAAAGTTTCG 


«k fr*r*yk f^t*r*f* 
(3AUoCiAl3TL.U 


1 AuwUaC 1 UA 


ACGATTCGGC 


GGCGAACATT 


an^ on 
ji/s 01/ 


TCTTTGAAAT 


GAGTGAACTT 


AGTGTlaUAAA 


UAui7V3UUL.i 1 


TTTAGCCAAC 


TTGTTGAAGG 


.'ansdn 


TAGAAGATGC 


CCAAATCGTT 


TCCTCGGCTT 


L>uurd3VV3bA 1 


TGCCCAATCG 


GTGGCTGCCT 


JUDlil/ 


TGATTGGAAA 


AGGCAGCTTG 




aTr* a T r*r*r*T a 


TACAGAAAAA 


ATCG7UVCAGC 


30660 


GGGAGATTGT 


TTTACCTAAA 






CACGCCTGTA 


GAAGTCATGG 


30720 


TAGCGCAAGG 


TGGCGGCCAA 


riTi^/^Trir: A a n 

O 1 VJVJ 1 V3V3/v\0 


1. i3M k^ 1 n^u\>^ 


CAACATGTGC 


AGTCCAGAAC 


30780 


ACGTGGAAAT 


GATGATTTCT 


nAAAAAAPAR 




ATACATCAAA 


AGCCACCATA 


30840 


CCGTGCAAAA AAGTATGCTG 


ACGGTGGCCG 


AAGCGGCCAA 


AGTGGCGCAA 


CGCCACAAGG 


30900 


TGCCGCTAAT 


TGTTGACGCG 


GCAGCAGAAG 


AAGATCTTTT 


CAAATACACT 


GAAGCAGGCG 


30960 


CTGATTTGGT 


GATTTACAGC 


GGCGCCAAAG 


CAATTGAAGG 


GCCAAGTGCC 


GGTTTAGTGG 


31020 


TCGGGAAAAA AGAATACATT 


GACTGGGTGC 


GCCTACAAGG 


CAAAGGCATT 


6GTCGAGCAA 


31080 


TGAAAATTGG 


CAAAGACAAT 


ATCTTGGGCT 


TTACGCAAGC 


AGTGGAAGAA 


TACTTGGCAC 


31140 
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ATGGCAGTGA ATCAGGGGCT TCGATGCAAG -AGCGCTTAAA ACCGTTTGTT GAAGCCATCA 


31200 


ACAACTTATC AGATTTAACC GCCAAAATCG 


TCCAAGATGG 


CGCTGGCCGA GACATTTATC 


31260 


GCGCCAGTGT CAAAGTGGAC GGCAGAAAAA CCGCTAAGGA AGTCATCCAA GCATTAAAAG 


31320 


CAGAAAGTCC AGCCATTTAT ACGCGAGAAT 


ATCAAGCCAA 


CAATGGCATC 


ATTGAATTTG 


31380 


ACATTCGTTC TGTCAATCAA GAAGAAATGA 


ACAAAATCGT 


GCAACGATTA 


CAAGAAATTA 


31440 


TGGACAAAAA GGAGAAATAA GTCATGTCAT 


TAAAACCTAA TTATTTAGAA 


GAACGCATTT 


31500 


GTTTAAACGT CTTAGCCAAT TCAGTAGAAA ATGCCCAAGC CTGTTATGAA GCAGCAGAAG 


31560 


GGCATGTTGT GTTAGGCGTC CTTTCAAAAA 


ACTATGAAAC 


CGATGAAGCA 


GCTATTGACG 


31620 


ACATGAAAAA ATACCAAGCA GCAACCAACA 


ACGCTTTATC 


TGTCGGCTTA 


GGGGCTGGCG 


31680 


ATCCTAATCA AAGTCAAATG GTGGCTCGCT 


TATCAGAAGT 


TCTACAACCG 


CAACACGTGA 


- 31740 


ACCAAGTCTT CACAGGTGTC GGcgCtTCGC 


GGGCCTTACT 


TCGTCAAGAC 


GAAACAGTGA 


«J X O V w 


TTAATGGCTT GGTTTCACCC ACAGGCAAAG 


TTGGCTATGT 


CAACATTGCG 


ACAGGTCCGT 


O X O V \J 


TAAGCTCTGG TGCGCCAGCG GCGGAAGTGC 


CTATCGAAAC 


CGCCATCAAA 


TTGTTGAAAG 




ACATGGGGGG CAGTTCCATT AAATATTTCC 


CGATGAAAGG 


CTTGGCTCAC 


AAAGAAGAAT 


J JL 30U 


ACCAAGCAGT AGCAGCAGCT TGTGCCAAGT 


ATGACTTTTA 


TTTAGAGCCA ACAGGAGGCA 




TTGATTTAGA AAACTTTGAA GAAATTGTTC AGATTGCGGT GGATGCCGGC 


GTTAAAAAAA 




TCATTCCTCA TGTCTACAGC TCAATTATCG 


ATCAAGAAAC AGGGGACACT 


AGAACAGAAG 




ACGTGAAAAC GTTACTAACC ATGATGAAAA 


ACACATTGAA 


TAAGTAGGTG 


GCAAAGATGA 




AAATTGCTGC GTTTGGTGAA GTCATGCTtC 


GTTTCACTCC 


GCCAGAGTAT 


CTAATGTTAG 


32280 


AACAAACAGA GCAGTTAAGA ATGAACTTTG 


TAGGGACTGG 


CGTAAATCTC 


TTAGCAAATT 


32340 


TGGCTCATTT TCAACTAGAA ACAGCATTGA 


TTACGAAATT 


ACCTGCCAAT 


CGTCTTGGTG 




AAGCTGGCAA AGCAGCGCTA AGGAAGTTAG 


GAATTTCAGA 


TCAATGGGTA 


GGAGAAAAGG 


32460 


GCGATCATAT TGGTAGTTTT TTTGCAGAAA 


TGGGTTATGG 


7VATCCGCCCC 


ACACAAGTAA 




CTTATCAAAA TCGCCATCAG AGTGCTTTTG 


GCATCAGCGA 


GGCGAAGGAC 


TACGATTTTG 


32580 


AGGCCTTTTT AGCGGAAGTA GATATGGTGC 


ATATTTGTGG 


TATTTCGTTA AGTTTAACTG 


32640 


AAAAAACTCG TGATGCTGCT TTGATACTTG 


CTCAAAAGGC 


GCATGCCTAT 


CAAAAGAAAG 


32700 


TTTGTTTTGA CTTTAATTAT CGACCTAGTT 


TGAATACAGC 


AAATAGCGCA 


CTCTTTATGC 


32760 


GTCAACAG 








32768 


(2) INFORMATION FOR SEQ ID NO: 124: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

TTTATTTACA CCTGCATTGG AAGTCGAAGA TGCCGAAAAG AGTAGTTGGA CTTATCCCGT 60 

TTATGGTTAT TTAGATAGCG AAAATCCTTG GGAAAAAATT GCGGCTTTAT TAAATAAACG 120 

AACACAAGGC ACACCACGCT TTGCTTTAGA AAAACAAGCT CTTTCTTTAG CTCGTTTCGA 180 

TCAATTGAAA ACCTATTTCC CAGCTAGCGA TTTTTCGCAT GATGTTACGC CTTTAATTGA 240 

AAAACTTCAA TTAATTAAAA CAGAACCTGA AATTCAACGT TTACTAGAAG CTGGTAGTTG 300 

GGCTGATGTC GCTTTTGAGA TTGGCTTCAA AGCAATTCAA GCAGGCGTTG CGGAACAAGA 360 

AATTGTTGCA GAAATCGAGT ATCAATTAAA ACGTCAAGGT ATTCGCTCAA TGAGTTTTGA 420 

TACGTTAGTC TTAACAGGAA AAAACGGAGC TAGTCCTCAC GGGGTACCTG GCGAAACAAA 480 

AATTGAGCCC CATGACTTAG TGTTATTTGA TTTAGGTGTC GTTCATAATG GCTATTGTAG 540 

TGATGCCACA CGAACAGTCA GCTACTTGGA ACCTTCTGAT TTCCAAAAAG AAATTTATGG 600 

AATTGTCTTA GAAgCACAAT TAGCAGCAAC TGAAGCGGTG AAACCTGGTG TTACAGCAGG 660 

cGAATTAGAT GATATTGCCC GAGGTGTAAT TACmAAAGCT GGTTATGGTG AATACTTTAA 720 

CCATCGCTTA GGCCATGGCA TTGGAACCAC TGTCCATGAA TACCCATCGC TAGTTCATGG 780 

CAATGATTTA GTCATTGAAG AAGGTATGTG TTTCTCTATT GAGCCCGGTA TTTACATTCC 840 

TGGAAAAGTA GGCGTTCGTA TTGAAGATTG TTTACACGTA ACGAAAACAG GCTCCGAACC 900 

ATTCACTAAG ACAACCAAAG AATTGCAAAT TATTCAATAG CATAAAAATA AAATGAGGCC 960 

ATCAGTCAAC CGGACTGATG GCCTCATTTC TAAATTTTAA GGTAAGCCGC TGAACAAAAG 1020 

CAACACGATC CGTTGCTAAC TTTCATAAGC TAAACGCTTG TTACCCTTTA AATTCTTTTT 1080 

TTACGTCTTT CGCTGCATCT TTAACATCTT CTTTTGCTTC TGCAGCTTTT TCAGGAACAT 1140 

CTTTGGTGGT TTCTkTTGCT TCATCGACAC CAGCAGAAAC AGTTTCTTTG GCCGCTGCCG 1200 

CAGAATCTTT AACGTCAATA TAGATATCTT CTGCTGCATC TTCGGCAATT TCACCTAAAT 1260 

CTTCTGCGTG ATCTTGAGCA TCGTCAGCTG CTTTTTTAAA TTGATCAGAT AAATCACCTG 1320 

TTTGTTTTTT AAAGTTATCA AGCATGTCGC CAGAAACACC TTGTGCTTTA TCCAATGAAT 1380 

CTTTTGTTTT TTCTTTGACA GAACCTGCCA AATCAGAGGC TTGATCTGAT AAAACGCCGG 1440 

CTTTTTGTTT TGCGATTTCT GTTAATTCTG TTCCTTTTTG AACAGCATAA TCTGTGTAAT 1500 

CTTGCGCTTT GTTTTTTAAA TCATCTGTTT GATTTGATAA ATCATCACGT AATTCTTTAC 1560 

CTGATTTTGG TGCAAGTAAT AATGCGGCAA CGGCTGCTGC TGTTCCACCA ATTACTGCyC 1620 

CTAAGAAAAA TCCGCCTTTT TTAGCCATAA AAATCTCCyC CAATTAGTTA TTkGTTTTTT 1680 

TATTTTTTGA ACGCATTGCT CTAAAAGCAG AACCGCCTAC TTTACTCACT nAAACCAGcT 1740 

TTCGCTGtGC CTTTyCCTAC GGTTCCTACC TTTGTAATCA AATGTTTACT TGAGGTGTTC 1800 

AATTCAGAGA CACTTTCGCT TAAATCAGCA ACGGCCGTAA ACAACGGATC AATCGTTGCC 1860 

ACTTTTTGAT TCACATCTGT CAATAGTTCA TTACTTTTAA CAAGCAAGCC TTCCACTTGG 1920 
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CGACTCAATA 


CATTCACATC 


GCTAGTCACG ACTTTGATTG 


TJGTATTTGT 


TTCATCAATT 


1980 


GTATCTTCAA 


TTTTTGAAAC 


AACTTGAGAG ACTTTATATA ACACAAGTAC TA&AAAGATG 


2040 


ACTAAAACAA 


CAAATGcTAC 


CGCTGCAATT 


AAGGCAGCAA 


TTTCCCCACC 


TGACATATCA 


2100 


TTACTCCCCT 


CTGAAATTTT 


ATTTATAATC 


tTAGGATAAC 


ACCAACTTAT 


ATTTTTCACA 


2160 


ACTCATAAAG 


CTAAGAATGA 


CAATTTATTT 


TATAA/U\TTC 


TCATGCCCTT 


TTCTTTAAGA 


2220 


AAAAAAGC6T 


AAAATTTCCT 


TCTAACGAAT 


TGAAATTTTA 


CGCtTATTTT 


TATTTATTTT 


2280 


TCTTTTGGTT 


GCTCTTTTTT 


CGCTAATTGA GCACCTAAAT 


TAATAATGTA 


TTCTCGTAAA 


2340 


TTATCTTTTA 


CTTCTGGATG 


AGTCAAACCA 


TATTCAATGC 


TTGTTTTCAT 


AAACCCAAAT 


2400 


TTATCGCCAA 


CGTCGTAACG 


TTCTCCTTTA 


AACTCACGAG 


CAAAAACGCG 


TTGTGTTTTA 


2460 


TTTAAAGTGT 


CAATCGCATC 


TGTTAACTGA ATTTCGTTAC 


CAGCGCCTGG 


TTTTTGATTT 


2520 


TCAAGTACAT 


CAAAAATTTC 


AGGGGTTAAT 


AAATAACGGC 


CGATAATCGC 


TAAATCACTT 


2580 


GGTGCTTCCT 


CGGGTTTTGG 


TTTTTCAACA AAATTCTCGA 


CATTGTATAG 


TCCTTTTTCG 


2640 


ATTTCTTTTT 


CTGGGTTGAT 


AATACCATAT 


TTTGAAGTAT 


CTTCATGGGG 


TACTTTCATT 


2700 


ACCGCAATTG 


TCGAAGCATG 


TGTTTGTTCG 


TAATCATCCA 


TTAATTGTTT 


CGTTAATGGC 


2760 


ACTTTATCTT 


CCATTAAATC 


GTCGCCTAAC 


ATCACGACAA 


AAGGTTCATT 


GCCAACAAAT 


2820 


GCTTTCGCTT 


GTAAAACAGC 


GTGTCCTAAG 


CCTTTCGGAT 


GGGATTGACG 


AATAAAATGT 


2880 


AAATTAACAT 


CCGTTGTTTC 


TTCGACTAAT 


TTCAATAAAT 


CGGTTTTATT 


TTTTTCTTTT 


2940 


AAGTTATTTT 


CTAATTCAAC 


ATTTGAATCA 


AAATGATCTT 


CAATCGGACG 


TTTGGCTTTG 


3000 


CCTGTCACAA 


TTAAAATATC 


TTCGATTCCT 


GAAGCTAGTG 


CTTCTTCTAC 


AATAAATTGA 


3060 


ATCGTTGGTT 


TATCAACAAT 


CGGTAACATT 


TCTTTTGCCA 


TCGCTTTGGT 


AGCCGGCAAG 


3120 


AAACGTGTTC 


CTAACCCAGC 


TGCTGGAATA 


ACTGCCTTTT 


TAACTTTCAT 


CTTTTTTCCT 


3180 


CCTAAAAATC 


AAGTGTAAAT 


TCGTTTTCGG 


TTTTGCCATC 


GCGCAACATG 


ATTTCTTTTG 


3240 


CTGCTTGTTG 


TACATCTTCG 


TCATTGTACA AGACATTGTA 


GATTGTTTCT 


GTGATTGGCA 


3300 


TTTCGACTTC 


TAGTTGTTGT 


GCCAATTCAT 


AGGCTGCTTT 


AGTTGTTGAA ACACCTTCCA 


3360 


CAATCATGCC 


CATGTTTTCT 


AAGACTTCAT 


CTAACTTATG 


GCCTTTTCCT 


AATAAATTAC 


3420 


CCGCACGCCA 


ATTTCGAGAA 


TGGACACTTG 


TACAAGTAAC AATTAAATCC 


CCAACACCAC 


3480 


TTAAACCAAT 


AAATGTTAAT 


GGGTTTGCTC 


CCATCGCAAC 


CCCTAAACGA 


CTAATTTCTG 


3540 


CTAAACCACG 


AGTCATAATT 


GCCGCTTTGG 


CATTATCACC 


AAAGCCTAAG 


CCATGAATCG 


3600 


CTCCGGCGCC 


CAAAGCAATA 


ATATTTTTTA AGGCCGCCCC 


TGTTTCTACA 


CCAATCACAT 


3660 


CATCATTCGT 


ATAAATTCTA 


AAGTAATCAT 


TCATAAACAA CTCTTGCACA 


TACACAGCGT 


3720 


CTGCTAAATT 


TTCGCTTGCT 


GCTGTAATGG 


TTGTAATGTC 


ATGAACAGCG 


ACTTCTTCTG 


3780 


CATGGCTTGG 


ACCAGATAAA 


ACCACAATCC 


CTTGCCGTTT 


TTCTGCTGGA 


ATCTCTTCCG 


3840 


CTATCACTTC 


TGAAATCCGC 


TTATGTGTAC 


CTTGTTCGAG 


CCCTTTACTT 


GCATGAATAA 


3900 
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TAATTGGTTT TGTTTTTAAA TGTTGGGCCA CCTCTTGTGC TACTGAACGA ATGGCTTTAG 3960 

TTGGTACCAC AAACAGTACC GCATCAACAT CTACTAAAGC AGTGGCTAAA TCAGTGGTTG 4020 

CCTGAATTGA AGTAGGTAAG ATTAATTCTG GTAAATAATG TTTGTTGGTG TGTTTGTATT 4080 

TATTTCATCG ATTTGCTCTG GTTTATTCCC CAAATACAGA CCTCGTGTCC ATTTcGGCTA 4140 

ATACTgAGCT AAAGCTGTcC CCCAGGAACC TGGACCTAAA cAGCGACTTT TgTTTCATCG 4200 

GTGAAACTCT CCTTTCGCTA AACAACGTTA CTTCTTTAGT TTACCATATT CTTCAATCAA 4260 

GTTGGATTAA GAAGTTGTTA TTTTTTTCTT TTGTTTCTCC CGATTGTAAA ATTTCAAATT 4320 

GGGATTTCTT CTGCGAA:GA TCACAATAAC AATTGCTGCC ACGAACATCA CTAAAGACAA 4380 

CAATTGTGAG ACACGAATGT TACTAAACGC ATACAAACTG TCCATCCGTA AGCCTTCGAT 4440 

GAAGAAACGG CCGAAA6AAT ACCATATTAT ATAACCTAAA AAGACTTCGC CTTCTTTTAA 4500 

AAAGTGTGGC TTTTTGCGTA ATAACACTAA AACGATAAAG CCTAAAACAT TCCAGACAGA 4560 

TTCATATAAA AATGTTGGTT GATGATATGT TCCGTTGATA TTCATGTTAT CAATGATAAA 4620 

CGTTGGTAAA TGGAGATTTT CTAAAAATTG TCTGGTGGTC GCTGGCCCAT AGGCTTCATG 4680 

ATTCATAAAG TTGCCCCAGC GACCAATTGC TTGCGCTAAA ATCACACTTG GTGCGGCAAT 4740 

ATCTAAAAAA GTCCATGTAG AAATGAAACG ATGACGCGTA AAGAAAAACA GCGCCaGTCC 4800 

CCCACCAATA AGACCACCAT AAATCGCTAA GCCACCATTT CGAGTAAAGA AAATTTCAAT 4860 

CGGATTGTCT ACATAATCTT GCCATTGAAA AGCGACGTAA TACAAGCGCG CGCCAACAAT 4920 

TGCTGCAGGT AATCCCCAGA GCATAAAGTC AAAGACATCA TCTTCTTTTA AACCAACACG 4980 

AACGGCTTCA CGACTACTTA GCCAAACAGC TAAAGCAATC CCTGAAACAA TAATAATTGC 5040 

ATACCAATAG ACGGGAATAC CGAAAAGGCG AAATGCAATT GAATTTACTT GGGCTAACAT 5100 

CATCACATCT TCTCCTTCTA TTCTCCTGAA TTTTCTTCAA TCAAACGGGT TAAGCGCTCT 5160 

TCGAACGTTT TTGTTGCATC ATAGCCCATT GTTTTCGCGC GGAAGTTCAT TGCTGCTACT 5220 

TCAATGATAA TGGCCACATT TCGGCCGGTC TTAACAGGAA TACGAATCTG AGGGACATCA 5280 

ACGTTAGCTA TTTCAACCAT TGCATCATCA CTACCAAGAC GATCATATTT CTTGTCTTTT 5340 

TCCCATGCCT CTAAATAAAC GACCAATTGA ACTTGCATAA AGCCACGGAC CGCACTAGCT 5400 

CCAAACAGGT TCATGACGTC AATAATTCCG ATACXACGAA TTTCAATTAA ATGCTGCAAA 5460 

ATTTTTGGTG GTTCTCCAAC CACGGTTAAT TCATCTTGCT GATAAACATC GACGCGATCG 5520 

TCTGCGATTA GCCGATGTCC ACGTTTAATA AGCTCTAAAG CTGTTTCACT TTTACCAATA 5580 

CCGCTATCTC CTTGAATCAA AACACCAAGT CCATAAACAT CAACTAAAAC TCCGTGGACA 5640 

CTTGTCCGAA CAGCTAAACG GCCATCTAAA TAACTGGATA GTTCCCCTAG TAAACGGGAA 5700 

GTTGAAATCG GTGAACGTAA TACAGAAACG CCATTTTCTT TTGCTGCTGT AATTAATTCT 5760 

TCGGGGATTT CTAGTCCTCT GGATACTATA AAGGCTGGAG TATCTTTCGC ACATAAACGG 5820 

CGCATCACTA ATAAACGCTC TTCAGGCATC ATTCGTTCCG CAAAGGTAAT TTCTTtGCTC 5880 
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CCAAATAATT GTAAACGGTC ATGGGAATAA TAATtArAAT AQCCaGTTAA TTCTAAACCT 5940 
GGCCGGGAAA TTTCACCAGT TTTAATTGTT CTATTTAAGC TTtCTtCATC TCCGTATACT 6000 
ACTTCTAAAG AnAGATTTTC TACCAGT 6027 
(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4915 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 



CAACAGTCCC 


AGATGATAAG 


GTTCTGCTAC 


CATTACGAAA 


CTCTACATCA 


GTATATAGTT 


60 


TATCTGACAC 


TCGCATAATA 


ATTTGGTGCG 


GATTTTCAGT 


AAAGTAATTA 


TCTGAATTTG 


120 


ATACCAATTT 


TGTAACATAT 


GATTTGTTAG 


CAGTTATATA 


ACCTTTGGCA GkCyTTAATC 


180 


yAGGTAATCC GTTTTCGGTA 


TAATCAACAC TaAAACTTgA 


TTAATGTATC 


TTTAGAAACA 


240 


AACTCACtTT 


mmfTl TV /''lll/^ TV 1\ TV 

T TTACTGAAA 


TTTACATCTT 


GATAATAATA 


ATCATTTGTT 


TTCATAACTA 


300 


CAT ACn U I K3\a 




AGATAATCTT 


TGATGTTTGG 


CACAACTTTT 


TGTGCATAAC 


360 


mfnmfp TV *P*P TV r*C 




CCATTTTGGG 


TAATCAAACG 


AGGAACTTGA 


TTTTCACGGT 


420 


Av^ 1 L>oA 1 1 L*v« 




ACTAAACTTC 


CCTTGGAAAC 


AGCTGCTCCT 


TTATATAAGA 


480 


AGTCTGAATC 


CCCATAAAAA 


TTATCACTAT 


TTTTCATAAT 


AATTTGCCCT 


GGATTAACCG 


540 


TATGATAGTT 


AGCAATCCAA 


GATCCAGTAG 


CTACTACATA 


GTTCTTGTTA GCCGTGAAAT 


600 


AATACCCATT 


TGCCATCTTC 


AATCTAGGAA 


TTCCTGATTT 


TGTTTTCTCA ATAGCAGTTA 


660 


CTTTTAAAAC 


ACTGCCTGCA 


GATACTTTAT 


TTGCTTTTGT 


CTTGAATTCT 


ACATCTTTGT 


720 


AATAGGTATC 


TTCATTTCTA 


GTAATTACTT 


GTCCAGGATT 


AGATGTGTAG 


TAAGTATCTT 


780 


CTAATCCTTT 


ATATTGTTCA 


ACTATATTTT 


TATTTGCAGT 


CAAATATCCT 


TTCGCTGTTT 


840 


TCAATCGAAA 


TATACCGCTT 


TGAGTATAAG 


CTAAATCTTC 


AACTTCTACT 


ATTGTTCCCT 


900 


TCGAAACCTT 


AGACAACCTT 


TGCTTAAATT 


CTGGATCAGC 


ATAGAAATAA 


TCATCGCTTT 


960 


TCAAACGAAC 


TTTTTTAGGA 


TTTGTCGTAA 


AATACAAATC 


AATATTGCTT 


TGTGCCGCTA 


1020 


ATACATAATC 


TTTATTTGCT 


GTTAAATAAC 


CTTGGTCTGT 


GACCAATCGA 


GGAATTCCTG 


1080 


CTGAACTATA 


TTCAATCCCC 


TTAATGGTAA CTAGTGTATT 


TTTTTTAACT 


CGCCAACCAG 


1140 


GAGTTCTAAA 


CGCTACATCT 


TGGTAAAAAG 


TATCGTCATT 


TTTCATGATC ACTCGACCTG 


1200 


GaTTAGTTGT 


GTAATACTTA 


CTTAAGTcAG 


GCCCACTACT 


ATTACCCATA TTTAaaAAAT 


1260 


TACTATAGGT 


CwGGAAtATC 


AAACGGACGC 


CCTTcATAAT 


tAGCTAAACC AGGgAAATAC 


1320 


ATTTGCGAGA 


CCATTGCCAA 


GCAccTGgTC ATTATyCCmC 


TGCATGGAAC 


TATCTGGaGT 


1380 


ATATGGATAT 


tGCGCTACCC 


aTACTCTGTC TCTTCCgAAG 


GCGGAGGTGT 


CGATATAGCC 


1440 


ATTkGTTAAc 


CACCATTTGC 


CAACATACAA 


CGCATCATTT 


TTATACCCTA ATGCCTTTAA 


1500 
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TTGCTGGTTA AACGCAACAG 


AATTAGcATG 


CGTTTCGCCC 


AT-TGTTTGTA 


AGCGTTGGAT 


1 Rcn 

J.3DU 


CTTCCGCATC 


ATTAAACATT 


ATGGTATTTT 


TATCTAAGCC 


AGAGTTAGCC 


GCCGCCTGAG 


1 on 


CAAAATATCT 


TGCTTCATCT 


TGTGtGTCGC 


AGCAGAGACA 


TACATACTAT 


AGTGATATGC 


1 con 
XboU 


1 o/innV.^ 1111 


AAGCCCGCAG 


CTTTGGCATT 


TCTAATTTGG 


CCCGCAGCAT 


ATGGATTAAC 


1740 


A i /lLrl.«Anu 1 1 


CCTTCTGTCA ATTTAACCGA AACACCTGTT ACACCATAGC 


TTTTAATTTT 


1800 


1 iuAl/\l lOA 


GCTACTGAAA 


TGTCTCCAi 1 


C UAAw i AUAA 


a r'/^TT" a a T a a 
AUo 1 UAAl AA 


AATTCTTGCC 


1860 


AooLr/iHMl lA 


GGATCATTCG 


CATAGACGTT 


f^f*f* a a /^»pr"»p»p 
LC9UAAL1 b X 1 


\j\0 1 CijCbL. 1 A 


CCCTAGAAGA 


1920 


AAooVrfCvj 111 


TTCATAGCAT 


ATGTTCCTTT 


TTTATGGTTC 


GTTCCTTCAC 


GTGCTGCTTC 


1980 


i 1 t»A X AA 1 


GCTTCATCTT 


GGTCAATGGT 


CTTTTTTTCT 


ACT TC Ab ACj 


TTTGTGATGC 


2040 


1 Idol 1 1 1 A 


CTATCTGTTG 


TGTTTGTTTC 


TGAAACAGAA 


GAATGTTCTT 


CCGTTGTACT 


-2100 


ATTTGTTGAA 


GAATCAAGAG 


ACTCTTCTTT 


TG TGCT TGAA 


GTAGAACTAT 


CTCTTTCAGA 


2160 


GGTATTTTCT 


TCTGTAACAG 


AAGATGTATC 


CTCCGTTGAA 


TTAGTCGTTG 


TTTCTGACGT 


2220 


AGCTTGAGTG 


ACCAAGCCAG 


TcTCCCCCGT 


TGTTTCTGTT 


TTTACAAGCT 


CTTCTCCTGA 


2280 


TGCAATCACA 


CTAAAACTAT 


TTAATAATAC 


AGTACAAATT 


a a a *p a /^r*»p^ 


AAATGATTTT 


2340 


TTCATGAAAC 


TCTCCTTATA 


TATTTAATAA 


TATGTATCTT 


r 1 AAAUAC 


TTGCCATTCG 


2400 


CTCTCGAAAA ATGTGTATCC 


TATAACCTTC 


AACTAATTGT 


1 1 X uAU A XXX 


TAAAACTAAC 


2460 


ACCAATTATT 


ATCTCATACC 


TTTATTTCAA 


ATTCAATTAC 


laV^X X X X X XAA 


CATTCCTATA 


2520 


TCAATTTTTT 


ACTTTTAAAT 


GGTAAAAAGA 


TGAAAACAGA 


XVirfX X XAAlv^V? 


ATTGTTTTCA 


2580 


TCTTTTATTT 


ATTTTTTAAG 


ATATTCTGTG 


TACGATTTTT 


1 Ao 1 A 1 ^^pMA^ 


AATTAGCGCT 


2640 


TCATGATGGG 


GTTGGCGATC 


TTCCTCTGGT 


GGCAATTGCA 


TATAATfPPf 


AAAAACCTCT 


2700 


GTTAAGTAGC 


GCTTGTAGCC 


CACTGGTATT 


GGCATTTTTG 


TATCTTCAAA 


TGGAACAAAA 


2760 


ATAGCAGATT 


TAAAATCTTC 


CAAATGATAA ATATTACCCA 


TGTAGCGTGG 


TCCGACACAT 


2820 


AATTCTGTAA 


CATAAGGAGT 


GGAGCCGAAG 


GAATACTTAG 


TCATTCTTTT 


TTCAGCAAAG 


2880 


CGCCAAATAT 


GATAGCGAAC 


CTTTTTCGAT 


GGAAAGATAT 


TCAATAATAC 


CCGACTCCCC 


2940 


GTTGCTAAAA 


TGCCACCATG 


TTTTTCTGGT 


ACCACTTGTG 


AACAGAATAA 


AGCATAAATT 


JUUU 


AATGCCCAAA 


TTTTTTGTTT 


CTTCCTTTGA 


AAATTTCCTT 


TTGGCGCACC 


ATCTAAAGGA 




AAAATATCGA 


TAGGAATACC 


ATGAGGAATC 


GCTAATCCAT 


CTTGATAGGT 


TTTAATAAAC 




GTTGTCTGTG 


CATCCCTAAT 


TGTCATAAAT 


GAATTATGAT 


CATTATAAGT 


TTTTGAAGCA 


JXDU 


ACTAACAATG 


GATAACATTC 


TGTATCTGCA TATTTAGGCC 


AAAGCACTTT 


CaATTTTTCa 


3240 


TAATCTGGTC 


TTGGCATAAA 


GAAATCTAAG 


TCATCATCCC 


aTGGAATAAA ACCTTGACTG 


3300 


CGGATTGCTC 


CGATACAACC 


GCCTCCACAA AAATAACAGG 


TTAGATTATG 


TTCTTCACAA 


3360 


AAAGCCACAA 


AATATTTTGA 


CATGCGTAAA ACGACTTGTT 


GAATTTCTAC 


TAATTGCGTC 


3420 


TGTGTTGTTT 


GTTCCACTTT 


TCCTTTTCCT 


CCCTATTCAT 


TCTTTGCATA 


TTTTKAATGTT 


3480 
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GTATAGAATA 


AAAAACGCAT 


TAAGTAGAAT GAACTTTTAA TAACTGAAAA CTGTTCATTT 


3540 


TGACGCAAAA 


TTTTCCAAAC 


CCAACTGATT GTATTCCATT 


TGTTAGAAGA TGTAGAGGTG 


3600 


GTTAGCTTAC 


GATAAATGGC 


TAATTTTTCG TTAATAAAAT 


AGACCGTATT 


TATTTCTTTC 

X AAA AAA %^ 


3660 


AAAATGTTTA 


ACCAAGTAGC 


aTAHTPTTPA TGTTTTAAAT 


CCGGCATTAA 


ATGGCGTTTA 


3720 


AGCAGATTAG 


ATTCGACTAC 




xnx X XV9X XXX 


TAAAARATPT 

X /Uv\nV3 A X \^ X 


3780 


TGATAGGTTT 


GTGGTCCCTC 


AvtlAMMnxXA V.»Uv» XVvnX Uno 


1 /Iri X O X X V9^V» 


AX 1 X i^Al lA 


3840 


CATACAAAAT 


AAGAACTACA 






^11 IxAL* 1 1 u A 


3900 


ACTTCTAATT 


TATTAGGAAG 






Ai Ai LKal V'V^V^ 


3960 


TTTGAAGCTT 


CAATTCCTTT 


Ailit^lljV^lU 1 1 1 VaUAMioVrf 




L i wii ill Iv^ 


4020 


AGTAAAAAGA 


TTCTTGAATC 


TTTTTCCATA AAObCAAQ^yvj 


^AATAT^AAT 

UAAIAIUAAI 


A/^ A A'PI^AT^T 


4080 


TTACTTCCAT 


CATCAATGAG 


TAAT AAT TCC L.AA ri L. i laA 1 


AGGTTTGTTC 


CAGAACGCTT 


4140 


TCAATAGATT 


TTGATAAAAA 


TTTGCCAGCA TTGTACATGG 


GCATAATAAT 


TGAAACAAGT 


4200 


GCTGTTTCCA 


TkTGTTCACT 


CCTCGTTGCA TTTTATGTAA 


GAATGTAACA 


GTTTATATTA 


4260 


TCGCATAGCT 


CTTTTATTAT 


TTCAACGTTC CTTGTTTCCA 


AAAAAAGTTT 


GAGACATAAG 


4320 


CTTAAATAAC 


TTATGTCTCA 


AACTTTTAAC GGTAAATTCT 


T^TAGCATTC 


TCTTTACAAT 


4380 


AATCCATCGT 


CAATTCTTCA 


CGTTCTTCAG ACTCATCCAA 


TCCTCTtGAT 


GATACTTTAT 


4440 


CAAAGAGAAT 


TTTGATGGTT 


tgAAGCAGAA TTTTCATATC CaATAGCAwG aATAATTCTT 


4500 


GATGTACAGT 


AAATCAAAGT 


TCAATTTACT ATTGTAGTCA 


GAAGCATACT 


TCCCATAAAC 


4560 


TTGGGCATAG 


CCAGTAATCC 


CCGCACGTAC ATTGTGACGT 


AAATAGTAAT 


TCGGGTTTTC 


4620 


CTGATTAAAT 


TGATCAACAA 


AGAATGGTCG TTCAGGACGT 


GGTCCGACAA 


TGGACATATC 


4680 


CCCGCGAATA 


ACATTGATTA 


GTTGTGGCAA CTCATCAATA 


CGTAAAGCGC 


GCAAGTATTT 


4740 


ACCAACTGTA 


GTGACGCGAC 


TGTCGTTACT TGTAGATAAG 


ACTGGGCCTG 


ATTTTGCTTC 


4800 


TGCTGTTGCC 


GACATCGTTC 


TAAATTTTAA AATCGAGAAC 


TCACGTTGAT 


CTAATGTAAT 


4860 


CCGAGTTTGT 


TTGTAAATCA 


CTGGTCCCGG TGAGGTTGCT 


TTGACTAAAA 


TGCGC 


4915 


(2) INFORMATION FOR SEQ ID NO: 126: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3149 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

ATTAATGCAA TAGCTGCCCC TACAAGCGCT ACTCCTAATA AAGGTATTTT CATATAAACA 60 

GCCATTACAA AACCAATAAC AAGATAACTA AAATATTCTT TTGCAGGTAA ATATCGCA6T 120 

AGTAACCCTA TACCGACAGC TGGTAGTAAA CCTCCAGCAA CAGTTAATCC ATTGGTTAGC 180 
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CATTCTGGAG TATACGATAG AATTGTATTA ATTAATGATG GACCGAAAAT TACTGCCAAT 240 

GCTACTGGTA TGCCTGTTGT TGCTGCT6TT AAAAACACGC CAAACAtXTQ .CAHTXJGGTA 300 

ATCATACTAT AATCACCTTT TTTCACACCT GCTTCAGCTT TGTGAACCAT CCAGATACCT 360 

ATGGTATTTC TGATAACATC TAGTTGGACC ATCAATAAAG CTACTGGTAT ACCAATAGAG 420 

ATTCCTACAG ATGGTTTTTG CCCAGTAGTA ATTGTAATAA AAGTTGCTAC GATACTTGCT 480 

GTTTGATAGT CAGGTATTGA AGCTCCTCCc AAAATTACTT ATTCCCaAAG ATAGTAACTG 540 

TAAAGTACCa CCGATAAATA AACCTGtTGA TATATCmCCt AGGATTAATC CTGTAACAAA 600 

ACCCGCTGnT TACCGGTTGA TAAGTTCCAA ACaTTGTTGA GTTTTTTTCA AAGTTAATAA 660 

AAAATCCATA CAGCACAATT AATACTAATT GAAAAATACT TGCTTCCATT AATTTTCCTC 720 

CTTAATTACA TCAAATAATT TTGAAAATTT ACCTTTTTCA CTTGTTTTGA TAAAAATCGG 780 

GATTTGTTGA ATGTCTGCTT CGACTACCAC TGTTTTTCCA CCATCAAAGT TTTCTCTTGT 840 

CCATAGGTTT GTCCAACTTT CACCTTTTGG TAAATAAACT TCTCGACTAG TCTCTTTATA 900 

ATTTTCAATA GGCGCAACAA GAATCGCATC TCCAAATAAA TAGGTATCTT CAATATTCCA 960 

AGCATTCTGA TCATGTGGAT ATTCATAAAA TAATGTTCTC ATAAGTGGAT CACCAAACTC 1020 

ATGGGCCTGT CTCATCAACT TACTAATATA AGGCTTCAAT TCTTCTCGTA tTTTTATATA 1080 

TTTCGtTAGT ATTTGTTCGA CTTCTTTACC ATAAGACCAA ATTTCATTTG GAGCTCCGGT 1140 

AACCATTGAT CCTCCTCCAT GATCAGATAA TGGTTTGCTG TGTGGTAAGC GATCACCATG 1200 

CATTCGTAAA ATAGGCGAAA ATGTTGCATA TTGGAACCAG CGGACTAGTA ATTCTCTAAA 1260 

CTCAGGATCT TTAGGATCTC CTCCATGAAA TCCTCCAATA TCTGTTGTCC ACCAAGGTAA 1320 

CCCAGCCAAC CCAACGTTTA ATCCAGTATT TACTTGATTA CGAAAAGCTT CAAAGCTAGA 1380 

ATCAATATCT CCTGACCATA ACAAAGCTCC ATAGCGTTGT GCCCCTGCCC ATCCACCTCT 14 40 

TACAAGTGTA ACAACTGATT GATTATTTCC TAAACCTTCA GAAATCATTG ATAAATAATC 1500 

GATTGGATAC AAATTCCCAA TTTGTAAATC gGTTCCTTTT TTATATCGAT AATTATCGAA 1560 

ATCATATACA GAATACCCAG GCTCAGCTAC ATCTAGCCAA AAATATTTAA CTCCTTTGTT 1620 

AACATAATTC TTTTTAATGA GGTTCCAAAC ATATTCTCGT GCTTTCTCAT TAGTCATATCT 1680 

AACAAAAATT GTATTTCCTT GAATTTGCAT TGTTAATCTA ACGCCACGAT TTACTTTCAC 1740 

TAAGTAACCA TTTTCTAGAT ATTCATTAAA ATTCATTGCA TCAGTTTGTA CAGTAGGCCA 1800 

TATGGAAATA ATTGGCTCTA CATTATATTC ATCTCTTAAT GTcTTAAAAA TTGTTCAGGT 1860 

TGAGGCCAAT ATnCTcATCA AATCGATATT CACCTTGTTT AGGCCAATGG AAATAGTCAA 1920 

TAGCTATAGT TGAAAGTTTT ATTCCCAAAG CATAGTATTT TTTTACAACA TCCATTACTT 1980 

CATTTGGAGA TCGATAGCGT AGTTTACTTT GCCATAATCC CAATAAGTTT TCAGGCATTT 2040 

TAGGAACTTT ACCAGTTAAA TCACTATAAT TTCGaAGTAT CTCTTTTGGG GTATCACCTG 2100 

CAATTACTAC ATAGTCAATA TAATTAGTAG AAGCCATTTT CCATCGAGTC ATGTTTTTAG 2160 
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CAAATGATAC 


CTCTCCTATA 


CCTGGATTAT 


TCCATAAAAA TCCATATCCT AAACTCGATA 


2220 


CATAGAATGG 


TATAGATACT 


TGTGAGTTTC 


TTTGAGCTAA 


TTCTAGAACT 


GTATTCTTCA 


2280 


AATCAAGATA 


AGATTGTTGA 


TATTGTCCCA 


TTCCAAAAAT 


CTTCTCACTT 


GATTCAGATT 


2340 


CAAATCTAGT 


TGTAACCTGA 


AAACCACCTT 


GTAAATTAGA 


GTGGTATTCT 


CTAGACTTCA 


2400 


ACTTCAGCGT 


TGAATTAAAA 


TCTTTAGTTA 


TTTCAATTGT 


ACCTGCATCT 


TCACTTCCGT 


2460 


CATCATGTTT 


GACAGCTCTC 


AATCTGATAA 


ACTCTTTTAA 


TAGCACTTCA 


TTTTTTTCAT 


2320 


TATAAAAAGT 


CAATCGGTCC 


CGATGGTCTA 


ACACAACTCT 


AATTTTCCCG 


TTTATTAATT 


2580 


CTGTCTTATT 


TTCGTCTTGA 


TTTATTATGA 


CTGTACCTAA AGCATCAGGC 


TCACCAATTA 


2640 


AAGAATTAAG 


TCTATTTTCA 


AAGTTCATAT 


CTACAAACGA 


TCTGACTCTA 


ACGGCATTTT 


2700 


CACCCCAAGC 


TTCTACAATC 


AATAATTCAT 


CATCAAAAGT 


TCTCcGAAAA AGATTTTGGT 


2760 


GTAACTTCAA 


TCACATCTTC 


CAACCCACTT 


TCAATTTCTT 


CTTAATTAGT 


TTGGTATATG 


2820 


ATAACAATAC 


TAAGTAATCG 


CAACAATACA 


AACGCTTTCC 


ATACGCACGC 


CTTTTGCTCT 


2880 


CTCTGTGTcm 


ACAAAAGGTT 


ATAGAtTAAT 


TTGTTTTCmC 


GAtAAACTAA 


TTAACTAAAA 


2940 


AAATAAACAT 


AAGATAATAA 


GCATCAATGA 


CTTGAAATTT 


ATTCGTTAGC 


ACGTTAAACT 


3000 


AATTAATAAA 


TAGACTATAA 


AACAGGCTCA 


GTAAAATGTC 


AACTGTAAAC 


GGCTATATTT 


3060 


TTTTTGGATA 


AATTAATTGG 


CTTTAGGCAA 


TAGGGAATCT 


TAACCTTAAT 


TnCACTAATT 


3120 


TATTCnAATT 


TAGGGGGGAA 


ATGGTTTCC 








3149 



(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7019 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 



ATTcCCCkGC tTCyTTTyCC ATAAAAATAC yCTTTtCTAG 


ATATTTTAAa 


GCATTTTaTr 


60 


GAAACGTTtC 


ACCAAAATAT 


GAACATCTTT 


TTCTTAAAAA 


TAATTCTGTT 


AAATAACAGA 


120 


AAAAAGTCTG 


GTCAATCCGA 


CCAGACTTTT 


TAAATATAGA 


TAGTAAAATG 


TGTAATTTCA 


180 


GGGATCACCC 


CAAAACGGGC 


ATGAATGGCG 


GTCGTTCCTA 


AGCCTGTATT 


AACATATAGT 


240 


TGCGTTTCTk 


GGTCCAACTG 


ATACAATCCT 


CTGGTATACT 


TTTCTGCTAA 


AACATTTGTG 


300 


ATTGGATAAA AAGGAATCCA AATTTGTCCG 


CCATGGCTAT 


GTCCTGCTAA AACTAACTGT 


360 


GTACCCGTAC 


CAACAAATGC 


ATCCGCGACA 


TCTGGTTCAT 


GGGTCATTAA 


TAACGCATAG 


420 


TCATACGTTT 


CTCGATAGGA 


TAATGTTTCT 


GAAACGGAAG 


GATTTCCCAA 


TAACGAGTCA 


480 


TGTAAACCAC 


CAATAAAAAG 


ACGGCGTCCA 


TCTGCTAAGG 


TTACTGTTTC 


TCCTTGGTTT 


540 


TTCATAACGG 


TAAAGCCACT 


GGCCTCCATG 


aCTTCrnCAT 


ACACCGCAGA AGCACCGCCA 


600 


CCATAATCAT 


GATTTCCCCA 


AACTGCATAT 


TTTCCAATTG 


TCGCTTGTAA 


CTGTTTTAAC 


660 
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TTTTCAATCA TCGGTTCTTT TTGTTGCGGA 'TTTTTGCCAT AATTGTCAAA TAAATCACCT 


720 


^in« ji» ni*r»*^a 'Piiik'Pik*pr"P/^^ TTTTTr^Rr^r^li TT<2Zir'TTTTT TTATCACTTT 
GTAAAAACGiA TAAlAlQ-lOb AliilUA.CoA iilsAUliiii i 1 1 


ATCTAACCGC 


780 


TTCGTCTCAT AAAATTCTGA AACTTGTAIA lUt*lj/4/i/\vji i ti^Mw«\w4i i 




840 


CGTTGATTTT TCTTTCCCAA AGTCACCTGg ATTAACbioa AUiAAAiii^Li 


\a\0 X V^iini ^ 1 i 


900 


CCACGcATAG ATTGGGaGTG CGaTAATTAA tiawUiAAftAAi i/\ 




960 


AATTTTTTTA ACCACATCAC TCGCTCCTTl ClillUiliLi Aiiiu^-^AAia 


a r"rT a/^ T a T a 


1020 


GAGAAAAATC TTCTTAACCA CAATCTAAAA CAAAbGAAAl lAAbLrdiib 


TTaaar*aaop 


1080 


TAAAG/^TTT CCACAATCTC TTATCCTTTT CTTAGCTTTT TTATAAAAAA 


TV rT'T R ^ » T ft ft 


1140 


TTTCATCTGT CGATGAAATT ATCTAGCTAG AACATTCTTC TCTTAATGAT 


bAl bi X X Xbv> 


1200 


AAACGTTTGA TAACCAATTA AGCAAAATAA AATCCCCAOT AAT^TCATTC 


UAb X 1 Ab 1 b A 


1260 


AAATAAGCCT GCTAACATCT TTCGCATCCT CTCTGTTGCT AATCTTCCTT 




1320 


ACTATTTCTT GTTGTAAAAT TGGTTGAAAT AGACGACGTT CTGCTAATTT 


TCTTGcTAAA 


1380 


ATTGCTTCAC TTTTTTTCGT AAAGATACCT AAGTTATGGC GCTTGCCTTT 


CAAAGAGATA 


1440 


TAAGCCAGCC ATTTCCCAGA AGCGGTCTGG CTCACCCCTT TGTGTCCACT 


GCTATTATTT 


1500 


TTGGGGCGTT TACTTTGTAA CTTCCCTGGA ATTACGCCAG CAACTCTTTG 


TCGGTAATCC 


1560 


TTCTTTTTAC GATGACCGCA ACTTTTGGTA TGGCCATTCA ACAAATTTTC 


ACAGCGAACC 


1620 


CAGCAAGATT GTCCGCACGA ACACAGACAA TACGCTTTCA TTGTCTGTTG 


AAXACl I lUA 


1680 


ATCCGTACCA CGACTAATTC ATGATAGTTT TTTTTCAACA TTGTTTGATA 


bbbb X bb X oA 


1740 


GCACATTTCG TTCGATACCC ACGAATTAGT TGTCTTCTTG AAGCTTCTGC 


A 1 Abb 1 bbbA 


1800 


CAAGAACAGC GACAGGCCCA AACAATCAGC CCTTCTTTTC TTTTTGCCGT 


i bb Xbb X A^ 1 


1860 


ACTGTCAACT CACCAAACTT TTGGCCAGTT AAATCTTlAr Abl lAbbibb 


1 XbX^X k«AAl 


1920 


TGACTATCCT TCGTATTCTT GTTCTCGCCA CATCTCGATT GGGTCTCTCT 


CATAGTGCTC 


1980 


ATTTTTCTGA GTCTCTCCCT TTCTTAGTTC TCTGACCACC TCAGCAAATA 


CTTGCCAATC 


2040 


TTTTGTTTCC GGCAAATGAC GCCATGTCAA TTGACGACAC CGTACTTTCG 


AACTATAAAA 


2100 


ATCGGACAAA TACAAATCGG TTTCTAAGGT TAATTTGTGA TCAATCATGA 


CATCCATATA 


2160 


ATTGAAAGAA GCAATAATTT TTGCAATAAA GCGGTTATAA TCCTCCCCAC 


GAGAAAAATC 


2220 


AACAAATACA TGAATCGTTT TTTCAATTGC GCAAi^AAGAC CGATCATTTA ACAAACAAAA 


2280 


CATAAAATCA TAGTATAGAT GCACGCGTTC ATACATTAAA AATCGTTTTT 


GATACAGCGA 


2340 


CTCTGTCTTT TGTATAAAAC CATCTAAGGC GCGATGAATT TGAGGAAACC 


GCTCTTCAAA 


2400 


GAAACTGCTC TOCTTGGCGG GAATAAATGC AGCCACCGAA AAGTATAGGC 


GGTCCCAGCG 


2460 


ATAACAAATC AACTTTAATT TTTCTTTTGT TTCCACCGTA ATTCGCTTTT 


CAAAGAAAGG 


2520 


AATAACACTA TTCATATGAT CAATAAATTC GTCTAACATG GTTAACTTAT 


CTGGAAAAAC 


2580 


AATCGGACTA TCAATGTATT CATTTAAACT TAAAAAAAGA AAAATATATG 


AAGTCTCTTT 


2640 
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GCCAAAATCC 


TCTTTAAAAA GAGGGGTTTC GCTTGTATAA ATTTTTTCAA ACGCATGCTG 


2700 


TAAACAGCTA 


GGCACATGAA 


CTTCCGTAAA AGAGTCCACT 


ACATGATCAG 


AAAATACACG 


2760 


AAAACGTTGG 


ATCGTTAAAA 


GAATAAATAG 


TTGTGCGCTT 


TGATTAAGAC 


TAAACGTTCT 


O & V 


TTGATACATC 


AGTTGTAAAT 


CGTTTATAAA 


TCGTTTAATG 


CCAGGACTAT 


TTTCCAGCAA 




AATGTTAGGC 


AGCCTTTCTT 


GATTACTTCC 


ATAAAAATAA 


TACAAGCATT 


GAAAAC7\AAA 


OO A A 


ACTTCGGATA 


T TTTTCTCTT 


CTCCT^CCAA 


GCCTGTTGAC 


TTTAATTTAA 


TATGCCAATT 


JUUU 


AGCTAAAATC 


AGATTTAGTT 


TATACGAGAA 


GGCATAAATT 


TTGCTTTTGC 


TCATAAAATG 


JUoU 


AATTTGAGAA 


AATTCTTGAA 


TCGTGTGCTT 


TTGATAGAGG 


CCCACATCAA 


GAAGCAACTT 




GGCCATTAGT 


GATTGATCAA 


AAT AAT AT TT 


TCTTAAGGCT 


TGATAATCAA 


TTTGCGATAA 




TTTTTGCGCA 


TTAATCAGTC 


CTTTAGCTGG 


AAAAAGAATT 


TGACTCCTTG 


AAGAAATCTG 


"^O A(\ 


TGTACATTCA 


TAATTTATTT 


GACGGCACAA 


CGATTCTAAC 


TTTTGAGTGG 


ATAAATCAAA 


"3 "a AA 


ATATCCATAA 


AATTCTTCTT 


TTAATAATGG 


ACPTTCACTA 


TTCATAAAAG 


TTTGATAAAT 




AAGTAATTTA 


TCTCGGAATT 


CTTGGTCTAA 


TAATTCATCT 


AACAAAGTAG 


TCACATTTTC 




AACTCCAAAC 


TATCACGCCT 


TAAGTATCTA 


TTTTTTTAAC 


AAGACGTGAG 


TGTCAATCTT 


^ il O A 

3480 


GCTTATTTTT 


TCCACAAATT 


TAAAAATGAT 


TCTTTCAATA 


TTTAGGAATC 


TTCATAGAAA 


O C il A 


AGTATTATTT 


TAACCAAAAA AAGCCCCCTT 


GCGTCACCAA 

X ^^4^X^%^*X*X 


GGAGGCTTTT 


TTTGTTATTT 


«a£ AA 


TTTTACTTTA AATTTAAAGA AAAGCTGATT 


TGTTTGACTA 


ATTTTCTGTT 


TCTTTTCTAG 


.JOOU 


CCTATTCGTC 


AAAAATGATT 


GAAACTTTTG 


GTTCGTCTTC 


TATACTATCT 


TCCTATAAAG 


OTA A 


TTCTTACTCT 


TTTAAGTATC 


AGCTTCTGTC 


AATAAAAGTC 


CTCATTTGGC 


ATTGCTAATC 


0-7Q A 


TGAATACAGT 


ATGACTACTT 


GTTGCCGCTA 


TTTTTTGCTG 


TTATTTTACT 


CTTTAATTGA 


3o4U 


AGCGCTAGAA 


ACGCTGTCAA CTGTTGGCTT 


TTCGGTCGCT 


GACTCTTCTT 


CATGGACTGA 


JtUU 


TGCACTTGAG 


ACACTATCTG 


CCGTTTCTGG 


CGCTTCTTTT 


TTGGAGGCAC 


CACTGACTGC 


J7DU 


ATCGACGTTT 


TTTCCACCGA 


CTAATTCTTG 


CCCAGTTTGT 


TTTAAATACC 


AATCAACAAT 




TGGTTTAAAG 


TCTAAAATAT 


ACTCATTAAT 


TCCCGCGTAA 


TTTCCATTAT 


CATCATACAT 


4 V 0 w 


CGCTTGATAG 


TTATGCACTA 


CATAGGTATC 


TGGTCCGTGC 


ATTGGAACAT 


GAACGCGAAT 


4140 


AGCATCTGTT 


TTTCCTGACC 


GTAATTGTTG 


AATCACCCAT 


TCTACATTTT 


TCAATGCTTT 


4200 


TGGCGGATGG 


CAATTCGCTA ATGGATTACC 


AACTTGTCCT 


GGTTGCCGTT 


TAGCAAACAT 


4260 


TTCTTCAGCA 


GCCATTTTTT 


TATTGTAATA 


TAAAAATTGA 


TTGTTACTGT 


CTGCATAAGT 


4320 


CAATTCCATT 


GGCATTGATT 


TTAAGAAATA 


ATTTAGCTGA 


TCTACGGTTA AAATGCCTCG 


4380 


ATCTAATTTT ACATATgTGt 


CGCCTTCCAC 


AGCCTGAAcC 


GCtTCTGCTG 


CTTGtTCTAC 


4440 


CGAATCATCT 


GCTGCCATAT 


CAACCCCTTC 


GATCGTTGTT 


TCAATGGTTC 


CTTTTGCATG 


4500 


TAAATCCTCA 


GGTAAGTCTT 


TTGGCTTAGC 


TCCTTCTAAT 


TTTGAAATCA 


CATCAATGAA 


4560 


TTCCaCGAAT 


TTTTGTAATG 


TGCTTTCTAA 


AAATTGACGT 


GTTCCCtCCG 


CAATTAAGTT 


4620 
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GCCGTTTTCA 


TCAAATGCTT 


CTTTAGCTTT CCCTAATAAA AACTCGTTCC 


CaGGCATCaC 


H OO VI 


GATTGCATTG 


ACACCAGGTG 


CATCAAGAAT TTGACGGAGA 


TGTAATTGCG 


CACGCGAGGT 




TCCTTGATCA TAATAAGAGC AACCAACAAT CATTACTGGT 


TTATTTTCCA 


GCGGATGAAT 


*i QUI/ 


TTTGAATGAT AACCATTCTA ACACACTTTT CAAACCGGCA GGAATGGTAT GGTTATGTTC 


HOvKf 


GGGCGTCGCA ATAATTACGC CGTCTGCTTG TAAAATTTTG 


CGATTCATAT 


TCTGAATCAA 




GACACTGTTC 


GTTTGATCCT 


TACTTTGATT AAACATTGGA 


ATATCTTTAA 


TTTCTAAAAT 


4 you 


TTCCAAGTCG 


AACATTITAT 


AAAACTCTTT GCCAATATAT 


TGCAGTAGTA ACCGATTATA 


5040 


AGAAGAATCA 


GCGTTTGATC 


CGACGATCCC AATTAATTTC 


ATTTTATTCA 




5100 


TTAAATCGTT 


TCCCATGAGA 


AATTTTCTGC TTCTTTTTTA 


TTTTGCGCAT • 




5160 


CAGCTGTTCT 


GTAATTTCAA 


TAAATACACA AA7VATCTGCA AACAGACCAG 




C O O 

5220 


TATTTGTTCT 


GAATCTTTTA 


ACGCATTGTT TTCATCAAAA 


GCTTGTAAGG 




5280 


TAGAAATTCA 


GAGCTTGGCA 


TAATTCGTGC TTTCAATTCA 


GGAGCATCTA 




5340 


CAGATGCGCT 


TGTGCTCTAG 


AGGAACCTAA CGTGCCATAG 


GATGCACCAG 




5400 


TGGTTTATCA ACAAACGGAA AAATCCCGTA AGATAACCAA CTTAGGGCAT 




54 60 


AGCCGGCACC 


GCATGGTCAT 


ATTCAGGCGT ACTGATAATG 


ACGCCATCTG 




5520 


TTTTTCAGCA 


ACCGTTTTAA 


CTATTTCTGG CAAAACTTTG 


TCTTCTGGCT 


TCSTTAAAPAT 


55o0 


AGGAAAGTCA 


ACAATTTCAA 


CTAGTTCAAT ATCAGCCTCC 


TGCGCAAAAT 


ACTGTTGCAT 


5o4U 


AAATTGCAAA 


AGTTGACGAT 


TGGTTGATTG CTCTGAGTTC 


GTTCCTACTA 


AGCCGATAAT 


5 /OO 


TTTTTTCATA 


ATGAGCTGTC 


CTTTCTTTTT CGGTTCTATG 


AATGGATAAT 


GTGAAGAGTA 


D / DO 


TCTGCTAACC 


CGCTGCTATA 


TAAAATTTGC TGGTCTTGTG 


TAATGATGAC 


TCCTTCAATC 




CCATCTAGTT 


GATTGAGGAT 


TTCAAGCGCT TCTGGAATTG 


GATAACCAAA 


TAAACGCGTT 


CQ QO 


GTCCAAATCT 


CACCATCCAC 


TGAAGCATCC GAGATAATTG 


TGAGGCTGGC 


CATTTCAGTT 


CC%A/\ 

55*40 


TCCAATGGGT 


AACCGGTTGT 


CGGATCCAAC AGATGGTGAT 


AGACTCGCCC 


AGCTTCGGTT 


bOOU 


AAACTTCTTT 


CATAAATGCC 


CGACGTCACA ACCGATTGAT 


TAGCCACTTC 


GACAAGTAAG 


bUbU 


CTGATTGTTT 


CTCTGGATTC 


TTGCGGATTG CGAATACCAA 


TTCGCCATTT 


TTTGTTTTGA 




TGCGTAGAGG 


GcCCCAACGT 


CACAATATTG CCGCCCAGAT 


TAATCAAAGC 


AGACGTCACA 


Ox o w 


TGGACTTCTT 


TCAAATAGGc 


AATTAGTAAA TCAGCGATAT 


AGCCTTTTGc 


TAAAGCCCCC 




AGATCTAATT 


TCATTCCtTC 


TTCTTCCAAA AAAACCGTTT 


GTTTTAAGGG 


ATTtAAGTGA 


6100 


ATCTTTTCtG 


GATTAATTTT 


TTTCAAACAG GCTTTAATTT 


CCTCTTCTGA 


CGGGACACGG 


6360 


GCATCTTTAA 


AACCGATTCG 


CCATGTTTGA ACTAACGGGC 


CAATGGTAAC 


ATTCAAATGA 


6420 


CTAGCTGGAT 


CACAACTATG 


TTTTTTCCCT AATGCAATTA ATTGATACAA 


CTCTGGATGA 


6480 


ACGCTGACTG 


GTTTTTGCCC 


AGCTTGTTGA TTGACAGCCA 


TCAGTTCCGA 


AGTTGAATCA 


6540 


TTTGCACTAA 


AGCGTTGTTC 


ATAAGTGATC AATCGTTGAT 


GCACTTCTTC 


TAGAATTTTT 


6600 
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TCAGATTCTT CATGGTCTAC AAATACATCT* ATTACTGTGC CCATCA/^ATA AATAGTTTGA 6660 

CTTTGTTGCA CCTCAAGACT TCCTCTCCAC GAAAGATTGT GTTTTATTTC ACATTTATTC 6720 

ACAAATGAAT CATACGCCTT TATTTGATGC CAGTCAAGTC GTTTATATAT TTTTTTATAT 6780 

AAAATAACGA CCTTAAAATT TTCATTTAAG GTCGTAACTT GTCTTGAACT CTTTTAAAAA 6840 

ACGCTTTTTT TATTATATTT TGAAAATAAA TATATAATTT TCTATATAAA CTAATTTCCC 6900 

TTAGATAAAA TTGTTTCTTT CAATGTTTGA ATTGCTTCCA ACAAGATGCG ATTTTTATTG 6960 

^.j..j.q.q.YGCAT ACGCGCGGGC ATCTTGTAGT AGTTCTAAAA TTTGCTCTTG CGGCTTTTC 7019 
(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32768 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 



TATTTATTCT 


TTTAACGAAA 


CACACAATGA 


AGGTAAAGAG 


aCTCTtGGTG 


GTAAAGGGGC 


60 


AAACTTGGCA 


GAAATGACAC 


GCTTAGGTTT 


GCCGATTCCA 


CAAGGATTCA 


CGATTACAAC 


120 


CCGTTGCTGT 


ATGGACTATT 


TGGCAGATGC 


TACTTTCTTT 


GAAGAACACT 


TACAATCAGA 


180 


AATTTTGAAA 


GCGGTTAAAA 


ATCTAGAAAC 


TGAAACAGGC 


AAATCTTTTA 


CCGCGGACAA 


240 


TGAGATTCTC 


TTAGTTTCCG 


TTCGCAGCGG 


TGCTGccTtC 


TCGATGCCTG 


GTATGATGGA 


300 


TACTATTTTA 


AATTTAGGTT 


TAAATGATCA 


ACGTGTGAAA 


AAATTTGCGA 


CCTTAACCTC 


360 


TCCTGGTTTT 


GCTTTTGACT 


GTTATCGCCG 


ACTTATTCAA 


ATGTTTGGAG 


ACGTCGTCTA 


420 


TCACATTCCC 


AAAGAATTAT 


TTGACCAACA 


AAAAGAGCGC 


TTGG7ACAAG 


AGCTCAACAA 


480 


AAAAATTACC 


GCTTTTCAAG 


AAGAAGATCA 


TTTTGCCTTG 


ATTGTCCGCT 


ACCAAGAAGT 


540 


TTTTGAACAA 


CATCAGGTGG 


TCTTCCCGCA 


AGACCCTGTC 


GCACAACTTT 


TTGAAGCAAT 


600 


CAAAGCTGTT 


TTTGATTCAT 


GGAATAATCA 


ACGTGCGGTG 


GTTTATCGAA ACTTACATCA 


660 


TATTGCACAT 


GATTTAGGTA 


CAGCTGTCAA 


CATTCAAGAA 


ATGGTTTTCG 


GCAATCGTGG 


720 


ACTAGACAGT 


GGCACTGGCG 


TTGTTTTCAC 


TAGAAATCCG 


GTCACAGGGG 


AAAATCAACT 


780 


GTTTGGAGAG 


TTTCTTTTGA 


ACGCGCAAGG 


AGAAGATGTA 


GTGGCTGGTA 


TTCGGACGCC 


840 


AGAACCCATT 


CGTCGTTTAC 


GTTTGACTAT 


GCCAAAAGTT 


TATCAAGATT 


TTCGCCATTA 


900 


CGCAGAACTT 


TTAGAATATC 


ATTATCGTGA 


CATGCAAGAT 


ATTGAATTTA 


CCATTGAAAA 


960 


TGAAAAATTA 


TATATTTTAC 


AAACAAGAAA 


CGGGAAACGA 


ACCGCCGCAG 


CTACGGTTAA 


1020 


AATTGCTTTA 


• GACTTAGCCA 


AAGAAAATCG 


CATTACTAAA 


CAAGAAGCAC 


TGTTACGTGT 


1080 


GACACCTGAT 


ACGATTGATC 


AATTAATTCA 


TCCCGTGTTT 


GATCAAGAGA 


AACGACAACA 


1140 


CATGGAACGT 


TTAGCTATGG 


GACTTCCTGC 


TAGTCCTGGT 


GCAGCAAGTG 


GCCAAATTGT 


1200 
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TTTTACAGCA GAAAAAGCCA AAGAACTGAC 


CAATCTAGGA AAGAAAGTCA TTCTTGTTCG 


1260 


TCAAGAAACT 


TCACCAGAAG 


ATATTGAAGG 


CATGGTTGTA 


AGCGAACAAt 


TGTGACCAGT 


1320 


CGTGGCGGAA 


TGACTTCTCA 


TGCCGCTGTG 


GTCGCACGAG 


GAATGGGAAC 


TTGCTGTGTG 


1380 


ACTGGTTGTG 


AAAGTTTAAC 


CGTCAATGAA 


GAAACCAAAC 


AACTACACTG 


CGGGCCACAG 


1440 


GTCATCTTAG 


AAGGAACAAT 


CATTTCCGTC 


GATGGCTCAA 


CGGGGGAAAT 


TTATCTTGGC 


1500 


GAAATTCCTA 


CAATTTCTGC 


TGATAATAAC 


GATGmCCtTC 


ArGAGCTCTT 


GTCTTGGGCA 


1560 


GATGCGTaCG 


CTGATtTAcT 


GTTCGTGCCA 


ATGCAGAAAC 


TACaCAAGAT 


TTAGAGACTG 


1620 


CTATCCGTTT 


TGGCGCAGCG 


GGGATTGGTT 


TAGCGCGAAC 


AGAGCATATG 


TTTTTCGGTG 


1680 


AAGAGCGTGT 


CTTGGAAATG 


CGTCGCCTGA 


TTTTAGCAGA 


GTCTGAAAAG 


GAAGCAACCT 


1740 


ATGCGTTGGA ACAACTTCTT 


CATTTTCAAC 


AGGAAGACTT 


TTATCAAATG 


CTGAAAGTAG 


1800 


TTCAAGACAA 


GCCCATGGTG 


GTCCGCTTGT 


TAGATCCGCC 


GATGCATGAG 


TTTTTACCTC 


1860 


ATGAAAAGAA TGATATTCAA TTATTGGCGA AgCAACTACA ACgGTTCCCG GTGACCATCG 


1920 


CCAAACAAAT 


TGAACGGTTA 


CAAGAAACCA 


ATCCAATGTT 


GGGTCATCGT 


GGTTGTCGCT 


1980 


TAGGTGTGAC 


CCAACCACAA 


ATATACAAAA 


TGCAAGTTAC 


AGCCTTATTT 


ACTAGTGCCA 


2040 


TTCGATTAGT 


CAAAGAAGGA 


ATTACTGTTT 


ATCCAGAAGT 


GATGATTCCT 


CTAATCGCCG 


2100 


AAAAAGAAGA 


ACTTCTCTAT 


CTCAAACGCA 


TTTTAAAAGA 


AACGATTGAC 


GGTCTCTTTG 


2160 


AAGAACATAA 


AATGACTCCT 


TTCCCTTACG 


AAATTGGGAC 


TUVTGATTGAA 


CTACCTAGAG 


2220 


CTTGTCTGAT 


TGCGGATCAG 


TTAGCAGAAG 


AAGCAGACTT 


TTTCAGCTTT 


GGGACAAATG 


2280 


ATTTAACACA 


AATGACCTAT 


GGTTTTTCTC 


GTGATGATAT 


CGGTAAATTC 


ATTAATACGT 


2340 


ATCGGGAAAA 


GAAAATCATT 


ACCCAAGATC 


CGTTTCAATC 


ATTAGATCAA 


ACAGGTGTTG 


2400 


GTCAATTGAT 


TCAATTAGCT 


GTAGAGAAAG 


CGCGTCGCAC 


TAAACCAAAC 


ATGAGTATTG 


24 60 


GCGTATGTGG 


AGTUVGTCGGT 


GGTGATCCGC 


AATCCATCAC 


ATTTTTCCAA 


ACACTTGGCC 


2520 


TAAACTACAT 


CTCTTGCTCA 


CCTTATCGGG 


TACCTATTGC 


GAAACTCGCA 


GCTGCCCAAG 


2580 


CCAAAATTTT 


AACCGAACCG 


GCTGTGACTG 


AAGAACAAAT 


GGTCCTTCTT 


TAATTTAACG 


2640 


TGTGTAACAA 


ATGATTCATC 


CTTACAAGAT 


AAAAAGTGCT 


AGGrACATCA 


GTCGAAACGC 


2700 


TTGATGTTCC 


TAGCaCTAAA TTTTAAGATC 


AATGATTGTA 


CTCTGGTTAT 


TTTCTTATAA 


2760 


ACGCGGCATC 


TTTTGTCAAG 


GGGTCT6TTA 


CATCTGATAC 


ATATCTTTCT 


ACTTTCATAT 


2820 


GCAAATCATA 


TTTTTCTCTA 


AGTTCCGTAA 


TACGGGTCAT 


CATTGGAGAA 


GCGTGGTGTT 


2880 


TGTCTAATGC 


TTCCTGATCT 


GTCCAACTAT 


CAATCAGCAG 


AACTGTTTCT 


TCATCGTCCA 


2940 


TTGGAAAAAA 


GTATTCATAG 


CGACTATTCC 


CTTTTTCTGC 


ACGAATCGCC 


TCTACAATTC 


3000 


CGCTGGCAAT 


• CATTTCTTTG 


GCAAATTCTC 


TTGCACCTCC 


ATTTGTCCCA 


CTATAATAAA 


3060 


TATTTATAGT 


AATCGCCATC 


TGTCTTCCCC 


CCTAAATTTA 


CTTGCCTTTT 


TACGTTCCCG 


3120 


ACTGGAATTG 


AACCAGCGAC 


CTACCGCTTA 


GGAGGCGGTk 


GCTCTATCCT 


ACTGaGCTAC 


3180 
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GAGAACAATC 


TGTTTTAATT 


TTAGAAGTCT 


TCACTACATC 


TQTCAATAGA CAAAAAGAAA 


3240 


GTATCCCGCA 


TAAACAATTA 


TGCGAGATAC 


TTTCTTTCAG 


TGAAaCACGT 


CGTTCGTTTA 


3300 


TCTATTATTT 


TTTTTGCCGC 


CGATCAAGAA 


TCATTTTTCC 


TAAATAATAA 


AGTCCTGTCG 


3360 


CAATAGCTAT 


ATATTGCCCA 


ATTTTATTAT 


TGCGTGCAAC 


TTTTTCTTGT 


TTCGTTTTCG 


3420 


CCATTGTCCC 


CATCACCTTT 


CTAAAATAAT 


TCTTTTTACA 


TTTTTATTAT 


AGTAGGTTCT 


3480 


GTCATTTTTT 


GCGAATAAAC 


ACACTTTTAA AAATAATCTT 


CTAAAATTGG 


CATGCCTTTG 


3540 


ACTAAACGTT 


GGCCtAAACG 


TTGAGCTATC 


GTTGCGTCTC 


CTTGCAAGCG 


TTCATAAAAA 


3600 


GATAACGTCT 


CTGCTGAACG 


ATAGCCCAAG 


AATAGTTGCG 


TCCACGTTTG 


GATATCTGCT 


3660 


TTCAMGCAG 


CAGTCCCTTC 


TTTTTCCGCA 


GCACCTTTAG 


TCACTGTTGC 


TTTTCCTTGT 


3720 


TCGTCAATTG TGATTGTCCA AATGCCTTCA TTCCATGGCC CATAGGAATC TTCAATCTCT 


3780 


AAAGAATACG 


TTTCTTTTTC 


TCCTGATTGA AATGGATACT 


TTTCCAGAAA 


TGTTTGTAAC 


3840 


TCAACGATTC 


GTGCCATCAT 


GT7WVGGT7VAA 


ATTTTTACAG 


AAGCTGCTGG 


CGTTGGCATT 


3900 


AAATCATTTA AATCTTTTCC 


AGCAAAACCA 


TTAATCCAAT 


GGAACGATTG 


AACGGAACCA 


3960 


CTATGTGAGC 


CAATAAACCC 


AGCTAAAGCT 


TTAAAAGCTG 


TATTAGTTAA 


ATAGTTCCAT 


4020 


TCTACGATTT 


CAAAGGTGCC 


AGCAGCTATC 


CGATAAATCA 


CATAGCCTTC 


TGCTTTCCCT 


4080 


TCTGACGAAT 


AATAAATCGC 


TTGATTATTT 


GGTTTACTTG 


CACGATTCAG 


CGTATAATCC 


4140 


AGCCACCAAG 


TTTCACGGAT 


CACACCACCA 


GAGTGCGCTC 


TTTGATTTTC 


aAGGTATACG 


4200 


TCTTTAATGA 


CCTCTTTACC 


GTCAGCCCAA 


GaAACTCGtT 


TAATGGTACC TGGAACTCgT 


4260 


TTAAcACGTG GCCaATCTTC CgTTTTAAtT 


GTATACTCTG 


CTTGTTCaAA 


TGTTTGTTCA 


4320 


TAGCCATATT 


GACGATAAAA 


TGGATATGAA AACGGTGCTA AGTAAGAAAG 


GGCAACCTTT 


4380 


TGTTTCGCTA 


AATCAGCCAA 


CATTTCTTTC 


ATAATAGCAG 


AGATGCCGCC 


TTCACCACGA 


4440 


TATTCTGGAT 


AAGAAGCCAC 


ATAGCCAATT 


CCAGCCATGG 


GATAGCGGAC 


ACCATGGAAA 


4500 


TTCACTTGAA AAGGTGTCGC 


CATCACTTGA CTTGTTAATT 


GCTCATCAAT 


TAGGAAGCCA 


4560 


TAGCTTTGCG 


TATGTGACAA 


CAATTTTTCA AAACGTTCTT 


GcGTTCCGCC 


GTGGGTTCCT 


4620 


GATTAAAAGC ATAGATAACA AGATCAAACA TCTCTTTCAT 


TTCTTCTTTG 


CCCATTTTTT 


4680 


TGACTCTTTT 


AGTTGTCATT 


TTCTTTCCTC 


sTCGCACATT 


GTATCAATTC ATTTTATCAC 


4740 


AGAAACGATT 


CTCTTTCTAA AAGCCGTTAT 


TTGCTGTCAT 


TTCTTTGAAC 


CAACGACCGC 


4800 


TTTTCTTAAT 


GGTCCGTTTC 


GCATCATCAT 


CTAAATCCAC 


AGCGATGAAG 


CCATAACGAT 


4860 


TTTTATAGGC 


ATTGAGCCAA 


GACCAGTTAT 


CCATACACGT 


CCACATATGA 


TATCCTTGAA 


4920 


CATTACTACC 


TTCTTGAATC 


GCTTGGTGCA 


CCCATTTCAA 


GTGATCTTGC 


ACGAATTCAA 


4980 


TCCGATAATC 


ATCTTCAATG 


ACTCCTTGTT 


CATTGACAAA 


GCGTTCTTCT 


CCtTCTACGC 


5040 


CCATGCCATT 


TTCTGAAATG 


TAGCAGCGAA 


TGTTGCCATA 


ATTTTCTTTT 


AAATTCATTA 


5100 


ATGTATCATA 


AATACCTTTT 


TCGTAGATTT 


CCCAGCCACG 


GTAAGGATTC 


ATTTTTTTGC 


5160 
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CAGGCATATC 


GTAGACATCA 


TAAAAATCTT 


CTGGAAGTAA 


GCCTTCTGGT 


CGTGTTTCAA 


5220 


GAGGTGTTTC 


TTTTGCTTTG 


GCACGACGTG 


GTTGATAGTA 


GTTTACTCCT 


AATAAGTCCA 


5280 


CAGTATTTTC 


AGCAATTATT 


GCTAAATCTT 


CTGGCGTTGT 


TTCTGGCAAT 


AAGTCATTTG 


5340 


cTTTCGCCCA 


AGCAACCATA 


TCTTCTGGAA 


AATGACCTTT 


AACAGCCGGG 


TCTAAAAACG 


5400 


AACGGTTAAA GAAACCATCG ACAAATTTAG CCGCATTTAC ATCGGCTTCG TTGTGCTCAT 


5460 


CACGAGGATA AGTCGGCGTT AAGTTTAAAA TAATGCCAAT TTCACCTGAT AAGTTCATAG 


5520 


AGTGATAGAC 


GGCAATCGCT 


TTGGCACTAG 


CTAAAGCTTC 


ATGATAGAGT 


ACTTGAATCC 


5580 


CTTCTTTTAG 


ATTCACTTTA 


TCAGGATAAT 


GCCAGCCATA 


GAGATAACCA 


CCTTCAACTG 


5640 


GTACAATTGG 


TTCATTGTGC 


GTAAACCATT 


TTTTCACGCG 


ATCGCCAAAC 


AATTCAAAAC 


5700 


AGGTTTTGGC AAAGTCAACA TACAAATCAA CTACTTCACG GCTTTCCCAA CCGCCTTTTT 


5760 


CTTGAAGTAC 


CATCGGCATA 


TCAAAATGAT 


ACAAGTTCAT 


AAAGGGCTCA 


ATCCCGTGTT 


5820 


CTAATAAATC 


ATCAATGACT 


TGATTATAAA AATCAACTGC 


TGTTTGATTG 


ACTTTCCCTG 


5880 


TTGTAGGATC 


TGGAATTAAA 


CGGCTCCATT 


GAATGGATGT 


TCGAAAAGAG 


TTATGTCCCG 


5940 


TTTCTTTCAT 


TAACTGAATA 


TCTTCTTGGT 


ATTTTTTATA 


AAATTGTGAG 


GTCTTATCTG 


6000 


GACCGACTTG 


TTGAAAAAAT 


TTTTCTGGCG 


CTTCTTGATA 


CCAAAAATCC 


CAAATATTTT 


6060 


GACCTTTACC 


ATCACCTTCA 


AAAACGCCTT 


CTGTTTGAGG 


ACCGCTGGCT 


GCAGAACCCC 


6120 


ACCAA/^TT 


TTCTGGAAAT 


TGATATTTCA 


TGTTATTTGC 


TCCTTTACGA 


TTTAATCATT 


6180 


AAAAAAAGGA 


TTGAACCCTC 


CGTAAGAGTC 


CGGACGATTC 


AATCCTTCGA 


TTTTTTACTC 


624 0 


GGCTGCTTCT 


GCGGCTGCTT 


CTTGTTTCAA 


GTATTGATCT 


GTTGCTACTT 


TTACGAAAGG 


6300 


CATGTAGATC 


AAAATAGAAA 


TCAATAAGTT 


AACAGCTGCT 


AAAACTCCGC 


CGGCGATACT 


6360 


CTTCGTTGCT 


AAAAATCCGC 


CAATAATTGG 


TG6TGTTACC 


CAAGGTGGCA 


TAAAGGTTGC 


6420 


AGCTGGTACT 


AAACCAGTTG 


CTGTCGCCAC 


ATACGCAACA GTTACAAGAA 


CCATTGGGAC 


6480 


TAAAATGAAT 


GGGATAAACA 


TAATTGGGTT 


TAAAACAATT 


GGTAAACCGA ACATGACTGG 


6540 


TTCATTAATA 


TTAAATAGTC 


CTGGTGCTAA ACTTAAGTTT 


GTAATGACTT 


TGTATGGTTT 


6600 


GTTACGACGA CCTACGATAA AGATAGCAAT 


TAACAGACCT 


AATGTTGCCC 


CTGTTCCACC 


6660 


TAAGTTTACA AATGAATCAA AGAAAGGTTT 


GTTTACAATA TAAGGAATTG 


CTTTACCTGC 


6720 


TTCTAAAGCT 


TTAATGTTCG 


CATCAATCGC 


TGGTACGTTG ATTGTTTGCA 


TTAATGGGTC 


6780 


AACCATGTTG 


GCACCATGTA AACCAAAGAA CCATAAGAAT 


GGTGTAATGA 


AAGCTAACAA 


6840 


TAATGCTGAT 


GGGTAGCTGT 


TTGCAAGTCC 


CmTAAATGGT 


TGTTGAACTG 


CTTCATAGAA 


6900 


TGAAATAACG 


ATATTGTCAA 


TTCCTAAAGC 


AAATAGAATT 


GATGTAATTA 


AACTAAATAA 


6960 


ACTGATJcGTA 


ATCATTGCTG 


GTAATAATGC 


CgCAAATGAT TTTGCTACTG 


CTGGTGGTAC 


7020 


ACCATCAGGC 


ATTTTGATAA 


CTAATTTAGG 


ATTGCCAGAT 


AAACGTTGGA 


ATAGTTCTCC 


7080 


TGAAATAAGT 


GCAATAATCA 


AAGCAATAAA 


TAATCCAGTT 


GCGCCCATGC 


CTTCTAAGCC 


7140 
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GCCAACTGCG AAGAATGAAG CTACAGAAAC TGTACCGGCT GCAATGCCGT CTTTTcCATA 


7200 


AGATTT7\ACT 


AAATTGTGTG 


CTACTAAAAA JGGCTACTAAA ACGGCAAAGA 


TTGCG7UVTGT 


7260 


ACCGTTCCAA 


ATGTTACCAC 


CAAAGCTTTT 


CCAAATTGGT GCATCGTTAT 


ACATCATTGG 


7320 


GAAAATTTTG 


TTTAAAAAGT 


CCTCATATCC 


TGGAATTGGA ATATTATTAA 


TTAATACCGC 


7380 


CAAAGCCCCT 


AAAATCATTA 


ATGGCATCGT 


TACGACAAAC GCGTCACGAA 


TTGCTACCAA 


7440 


GTGCCGTTGC 


GCTGCGATAA 


CTGACGCTTT 


CGGCATAAAG TGTTTCTCCA 


TCCATTGAAC 


7500 


AAATCCGTCC 


ATTGTTATTC 


CTCCTAATTA ACGCTTTCTC AAGCGCGTTT 


TTCTTATAAG 


7560 


ACGCATTGAC AACGAGATTA CTCGTTATCA ATACGGCGAT AAAGATCGAT AATTTCTTTT 


7620 


GCTAAATCTG 


TAAATGCAAT 


CGCAGTCATT 


AAATGATCTT GACTGTGCAC 


TGTTAACAAT 


7680 


GTTACTTGCA 


TGTTATTGCC 


TTTTGCTTCT 


TCTGTTAACA TTCCTGTTTG 


CGAATGATGT 


7740 


GCATGTACAA GTGATTCTTC TGCATCTGCA ATTTTTTTGT CTGCTAATTC AAAATCCCCC 


7800 


GATTTTGCTG 


CGGCAATCGC 


TTCCATTGCA 


TCGCTTTTAG CATTTCCGCC 


ATACATGATT 


7860 


AAACCCATGA 


TTGCTTCTAA 


GTTCTGTTGT 


CCGTCCATTA CCTCTACCCC 


ATTTTCTGCT 


7920 


CTAGTTTGAT 


GATCCCATTA 


ATGAAATTGC 


TTGATCTAAA ACTTTTTCGC 


CATTCATCAT 


7980 


GCCATAGTCC 


GCCATGTTAA 


TTACATCTAA 


AGGAATCCCT TTTGGTTGTA 


ATTTTTGTTC 


8040 


AAATTGCCCT 


TTCATGAAAC 


GAACTTGTGG 


GCCTAATAAT AAAACATTCA 


CCTCTTTATT 


8100 


TTCCAAGTTT 


GTATCTGCTT 


CAGAAGCCGA 


TACTGCAAAG ATGTCTGCTT 


CCATACCACG 


8160 


ATCTTCTGCT 


GCTTTTTGCA 


TTTTTGTTAC 


TAATAAACTT GTGCTCATTC 


tGCGGAACAA 


8220 


ACTAACATAA 


TTGTTTTTTT 


TGCCATTTTC 


CTTCATCTCC TAAATTCTCT 


TTGTTTTGCT 


8280 


TTACACCTTT 


ATATAAGCAA 


GAAGCGTGCC 


AAGTTTTAAC ACATTAAAAA 


CCTTGATATG 


8340 


ACAACGTTTC 


TTGATATAGA 


AAGCGTGTAC 


ACACTAAAAA TAAAACGATT 


ACACACAGCT 


8400 


TTTTTGTGTG 


CGTTTCCAAA 


ATAATAGGGC 


AACAAAAGAA AGTGAGCGAT 


AACCCG7U\AC 


8460 


GACTCATCGC TCACTTTCTA AAGCTGAAGA GAATCAACTG GACAAAAGCC GCTTTTTCCC 


8520 


TTTAAGCAGT 


TTCCATCATT 


TTAAGCATTT 


TATAAAACTG TTTCTTTTTA AAATAACTTA 


8580 


AAAAGATACC 


TAATACTGCA 


TCATTCAGTT 


TTTGTAACGC CCCATAAGAA ACCATTGTTT 


8640 


CTGTATAACG 


CAGTTCACAG 


CTTTTTTCAT 


CAATTGGCTG TAGCTCATAC 


CGCACCGAGA 


8700 


AATCATTTTT 


AGTTGTGGAT 


GTTCGAAATT 


GATAGGTCTT ATTTTCTACA GCTTCTTCAA 


8760 


TAACTATTTT 


CGCGCGACTA 


TTTTTACTAT 


ATTCTTTGAT ATATTCAAAC 


TTACGTAACT 


8820 


GCTTTCTTGT 


AAGAGTTTTG 


CCTGTTTGCT 


TACGGATATC AAAAATAACA 


GAATTCATTA 


8880 


ATTGCGTGTA 


AAATTCATCT 


GCTGAAATAG 


TTAATGTTTT CGTAATTTCC 


ATTACCGTTT 


8940 


CACCTCATCC 


' TTTGTCGCCT 


CTTCTTCAGC 


AACTTTTTTT CCTTTAAAAA 


ACAGGTAACC 


9000 


ACCAAGTCCA 


ATGAGAAAAA 


TCCCCGTTGT 


CAGACTTAAT AAATCATAGT 


TTAAGGAGCT 


9060 


GGGATTCAAC 


GAATACAAAA 


CAATCAAGAA 


GCCAATACTA CATAAAAATA 


ATCCATTTCT 


9120 
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AATCATTTTC ATCCATCCTA TCTTTTTAGA CTACCGTTGC TTQGAAGTAT AAAACGCCAC 9180 

CCCAaGGTTG TTCCTTAATC AGCTGtTTTG CAACTTGTGC AGCAGTTTCA . GGGJCTCCT3L . 9240 

CAACTTCAAA AAATAACTTA ATACCATTTT TTTCTTTAAA AGTAAATGTT TCATTTTCCC 9300 

CTTGATATGC CTCTAATAAA GTAATAATTT CCTCTTGCTT CATTGCACTT GCAATCGAAA 9360 

TCATGCCGCT CGCTCCTTTC TATACAAGTA ATTGCTGTTT TTATTGCTTA TTGCTCTCTT 9420 

TCGCCTAATG CAGGATCCAT TTTGTTAGCA GCGATAACAA ACGGTGTCCA AATCAAGAAA 9480 

GTGACTACCA TACATACCAA CGCTACGATT GGTGCGCGCC AGTCGGCACC TGTCGCTAAG 9540 

AAAGATAATA GCAACGGCGG TGTCACCCAT GCGACTTGTT GAGAAACTGG GGCGACTAGC 9600 

CCAAGATACG TTGCAACCCA GCCAATTGTA GTAGCAACTA ATGGCGCAAT TACAAATGGA 9660 

ATAAACATAA TTGGATtCAG AACAACTGGT AAACCAAACA TAATCGGTTC ATTGATATTA 9720 

AAGATCCCAG GCCCTAATGA TAATTTACCA ACTGTTAAAT AATCTGCTCG TTTGGAGAAT 9780 

AACAAGATTG CAATAATTAG GACGATTGTA CCGCCTGAAC CACCAAACCA AGCAAAAGCA 9840 

TCAAACGAAC CACGGACCCA CATGTAGGCT TTACCGTCGT CAATAGCGGC TAAAACCGCA 9900 

GCTGTTCCTG TTTTTCCTTC ATAACCTTTT TGGAAAATAT CAATATTGAT TAATTGTGCT 9960 

TGTCCCCAAA TTCCTTCAAG AACTGGTGCC AAAACGTTTG GCCCATGAAT ACCAAAGAAC 10020 

CAGAAAATTT GAACAAAAAC CGTTACGATT AACACAGCGC CTAGTCCTTG TGATAACCCA 10080 

AGGAACGGTT CCGCGATATA TTTTTGAATT AAGTCAATTA ATAATTGACC GCCTGAAAGT 10140 

TTTGACACCG TAAAGTTAAT AATTGCTACG ACATACAATG CGATTGTTGC TGGTAAAATT 10200 

GCCGCAAACG CTTTTGAAAC TGCTGGTGGT ACAGAATCAG GCATTTTTAT TGTTAAATCC 10260 

GCTTGCATTA ATTTACAAAA AATAATGGTT GATAAAGCGC CCATAATCAT CACCGTAAAG 10320 

AAAGCGTTAC CATTTAGATG GTTGAGATTT AACCAACCCC ATGGATTGAC AGTCAATTGA 10380 

TTATTAGCAA TTGACGCACC AATTTCTGCC CCATTAATTG CGTTGGCAAT CTTTTCTGGC 10440 

ACTTTTACAG CTAATTCAGC TGTGTTATTT GGTGCAAAAG CAACACCTGA AATTAAGGTA 10500 

GCTAAACTGA CAATCCCACC AGCTAGCTCA TTCACATTGT ATGCTTTCGC TAAGTTATAG 10560 

CCCCAAGAAA AAGCGAAAAT CAAACCAGCG ATAsCcAATG TACCATTCCA AACAAAACCA 10620 

TTAATCCCAA TGATGACATT CAATACTGCA AAAATACCTT TGTCTGCTTC CAAACTTGCC 10680 

GCATAAGTTG AGACAAACTC TTGGGGAAGA TCTCGAATAA TTGCGTTAAT CATTACGGCG 10740 

ACTGATCCAG CCATTGTCGC AGGTAATGTA CCAATAAAAG CATCCCTTAG TGCCACCAGA 10800 

TGTTTCTGTG cACCAATTTT TGCGGCAGTT GGTAAGATAT ATTTTTCCAT CCATGCCATT 10860 

AATCCGTTCA TTTTTATTCC TCCTGTATCC ATTTATATTT ATTAAAAAGG CAGTAGCCTC 10920 

ATTAAACGAA TGAATCGAGC GTACAAAACA AAACCTTTGG TAACATGCAC TATCCTTATT 10980 

TGCTATTCAC TTTTCCTAAA CGTTTTAATT CCTTATCCCA TTAATGAGAT TGCTTGATCT 1104 0 

AAAACTTTTT CGCCATTCAT CATGCCATAA TCTGCCATGT TAATTACATC TAAAGGAATC 11100 
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CCTTTTGGTT GTAATTTTTG TTCAAATTGC CCTTTCATGA AAQGAACTTG TGGACCTAAA 


11160 


AGTAAAACAT 


TCACCTCTTT ATTTTCCAAG 


TTTGTATCTG 


CTTCAGAAGC 


CGATACTGCA 


11220 


AAGATGTCTG 


CTTCCATGCC ACGATCTTCT 


GCTGCTTTTT 


GCATTTTTGT 


TACTAATAAA 


11280 


CTCGTGCTCA 


TTCCTGCGGA ACAAACTAAC 


ATAATTGTTT 


TTTTTGCCAT 


TTTTTCTTCA 


11340 


TCCCCTTAAA AGTTATGACG TTATACACTT 


TATTTATAGC 


AATAATTGTG 


CCAAGTTTTT 


11400 


TCCTCACTAA AATACTGATA TAACAGCTTT TGTCATCGAA AAATGCGTGC ACACACTAAA 


11460 


AGAAAACGCT 


TACACACATG AAATTGTGTG 


TAACTTACAA 


AAAGAAAACA 


TCAAGAAAAA 


11520 


AGAGAGCGTT 


TTCTCATTTC TTGATGTTTT 


TTTTATTTAT 


TATTGTTTAC 


TTTTAATTTC 


11580 


ATTTTCTAAA 


AACATTTGGG TAACATGCGC 


AACTTCCGCT 


TCTGGTyVTCG 


TCACACGATA 


11640 


CGCCCGTTCA ATTAACATAA AATTCGTGCG TACGACATCG ATTTCTAAAC GATGTGTTTT 


11700 


AACATATTCC 


GCTAAGTTTG GAAAATTTCT 


TGTTTCTTCT 


CCTTTT^TTA 


AAGCATCTAC 


11760 


CAGAAAAGCC 


AAATGAATCA TGATTCCGGC 


ATCCACCCCT 


GGTTCAACAA 


TAATATGCAT 


11820 


GTCTGTTTGT 


ATTTGATGAA CTGTTTTTTG 


T7\ACATGAGA 


ATTAATTTTT 


GTAAGGAGTC 


11880 


CACAGAAGTA 


ATGACGCCAC TTAGTGAATG 


AACAATTGTA 


TCAATCGCCA 


CTTCATCACT 


11940 


AGCAATCCGT 


TTCAGTACAT TT7UVTTTTTC 


ATCATCA7VAA 


ATATCATAGA 


CGGAAAAGAA 


12000 


GGGAATATTT 


TGATATTCAA CATCCaCAGT 


TCCCGCAATG 


GCCTTAATTT 


CATATTCTTC 


12060 


CATCAAGTTG 


TCGATATGTT TTTTAAATGT 


TTCTCGTTCA 


ATAAATTGCA 


TTTGAATCAG 


12120 


TTCCACTTTT 


GTTTCATCAA TAACTGGTAA 


GATTCGTTGG 


TATAGTTTTG 


CGGCAACCCC 


12180 


TTCTCCAGTA 


AAGCACGTCA CGATTACTGC 


TTTTTTTACG 


TTTTGACGTT 


TTTGCAGACT 


12240 


AGAACGAAAT 


TGTTCTCTCA CAACGCTTTC 


AAAAGAAAGT 


TGAATATTTT 


GGTAAATGTC 


12300 


CtCTAAACTt 


CGTCCCACGC TTGCCATTCG 


AATAGCTTCT 


AAGACT^TCA 


TTGTGCTCGT 


12360 


CATTGTAATC 


GCCTTTGTTC GGATACCTGT 


TTCTTCAAAC 


AACATATTTC 


CAAACGAATT 


12420 


TAATGAACCC 


ATATCCGTCA GTAATArAAT 


CCCATTATTT 


AACGATTCTT 


TTTGCGTAAT 


12480 


TACTTGGTTT 


CGTAATTGTT cATACATTGT 


TTGAACTTCC 


ATCGTTAACG 


GCATATTCAT 


12540 


CGCTGTTCCG 


ATTGACGTTC CtAAAAGTTC 


TTGGGCGGTT 


TTCAACATAC 


TTGAGGCAGT 


12600 


AGAATCCCCa 


TGCATTAAGA CGAACACACC 


AACATTATTT 


TCTGGAGTTC 


XT TCTTCT TT 


12660 


ACCGACATTA ATAGATAAAA ACATACTAAT 


AAAGCCAATT 


TCATCGAAAG 


GAATCTCTAT 


12720 


GTTGTTCTCT 


TCTTCAATAA TAGAAGATAA 


ATCTAAGGCT 


ACTTGAAATT 


CTTTTTTCAT 


12780 


ATTTTTACGG 


ACATTATTTA AATCAGGATG 


GACAATCACG 


TGACCTTCTT 


TGACACGTTC 


12840 


AATCGTACTT 


TGTAAATGAA GCGCAAAGGC 


aAAGCGCgCa 


GTTTCGTTGT 


ACGTGTGTTC 


12900 


CaAACGCTCT 


TCTGCGaTAT CATaAAGACG 


ATTCGTTAAG 


TCCCAAATAG 


CGTCTGGTAA 


12960 


TAACTCCTTA 


TGTAAAGAAT GTTGGGTTAA 


TTCnTCGACA 


TAATTTTTAA 


AATACGCATT 


13020 


CATATCTTTG 


GAAATCAATT CTTCTAAATC 


AACATTTTCT 


AAGTCAGCTT 


CGGACAAGTC 


13080 
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AGCGACTTTT 


TCTyCGATTT 


CaTTGTAAAC TTGCATGTCT 


CTAGCTGGAT 


CTTGTGACCA 


13140 


AACAATTTTG 


TTGGTGCCTG 


GCTGGAAAGA TAAATAATTA 


CTTTTACTAT 


CGACAAATTG 


13200 


ATCCAAGCGC 


TCAGGTACTT 


CTTTAATTTT TAATAATCCT 


TTTTGGACAG 


GTAACGACAA 


13260 


ATCCCGTTTA 


CGGATAACCA AATGTTCTAA ATTGTGCGTC 


CGATAATAAA 


GGAATGACTT 


13320 


AGCACAAACT 


AATTTTAAGT 


CCCGCTTCAC CTGACCAATA 


TTTCCTTCTG 


CATCGTATAA 


13380 


CATAAAAGCC 


AAAACTGCTT 


CGCGTTCGAT ATCAATTTGT 


TGGCCCAAAC 


GTGTTGCCTC 


13440 


TTGCTGAATA AATAACGAAA TGATGGCATA ACGCTCATCT 


AGCGTCCGTG 


CATCTAAAGA 


13500 


TGGTAATGTA 


ATCGACATCG 


GAATCCGCCG ATTAAAAGTG 


GTTAAAAAGT 


TTTCTGAGGA 


13560 


TTCCGTTGTC 


GCACCAATAA 


TTTGAACAGA GGCTTCATAG 


GTTTGACCAC 


TTTCTCCTAA 


13620 


TGGACGGTAC 


ACGCCTTTAT 


CAATAAACGT AAAGAGCATT 


TCTTGACCTT 


CTGGCGGTAA 


13680 


GCGATGAATT 


TCATCTAAAA 


AGAGAATCCC GCCATCTGCT 


TTTGCCATTA 


AGCCTGGACT 


13740 


ATCTTCATTC 


GCACCAGTAT 


AGGCACCTTT TTTTACGCCA 


AAGATATGAC 


CAAATAATAA 


13800 


TTGTGGGTTT 


TGTGCATAAT 


CAGCACAATT AAAAGAGACA AAAGGGGCAT 


CTTCATCTAA 


13860 


GGTTTTTGAA 


TCAATGGCAA 


AACGGTACAT ACATTCAGCA AAAAGAGATT 


TCCCTGTTCC 


13920 


TGTTTTCCCA 


AAAATAATCG 


TATGCAAACC TCTTGGTGGA 


TATAAAATAG 


CCGCCTTAGC 


13980 


CTGCTGAATA 


CTCACTTTTA 


AAGAATCATT GGCACCTACT 


AATGTGTCAA 


AACTAACAGA 


14040 


GGTGGACTCA 


TCTACATGGA 


CTTGTTCTCC TTCATTGATT 


GTTGGCACTG 


AATAGATAAC 


14100 


AGGACGTCCA 


GTACTTCTAG 


TAATTTTCTT TTCTTTGAAT 


AATTCATTTA 


AATAACGGCT 


14160 


GGCATTACTG 


CGATCAATCA 


TTAACTGTTC TGCTACCGCT 


CCTGCCGTTA 


AGCCAGTTTC 


14220 


' ATTTTTTTGA 


AGTAACGCTA AGATTTCTTC CTTCCGAGAT 


TTCATGGAAT 


CCTCCTCGAT 


14280 


ACAACGTTCT 


GCTTGArGCT 


CGTAAAGCGT TATTTCTTCG 


CTTTACGAGC 


yCCTTTGTTT 


14340 


TTCGTGTTTT TCTTTTTGTT TTTCTTCTTT yTGCGTGTTT CTTGTGGTTC 


TTTGACAACA 


14400 


GAATTTACTG 


TTTCTGGTTT 


TCTTTCTTTC GTACTCTTTT 


TTTCTGAATA 


CGTATCTTTG 


14460 


ACTACGTTCT 


TTTTTTCTAG 


TTCATCTTTG GCTTTTGGCT 


GTTCATGAAG 


TGCACCATCA 


14520 


TATAAAAAGA 


TTTCCTGATA 


AGGAATGGCC AATTGTTTCA 


TCAAACGTTG 


ATAATCCCTT 


14580 


TTGGTGCCAT 


CGTTGATAAA 


CGTAATCACT GCCCCTTGTG 


CACCCATCCG 


GCCAACGCGA 


14640 


CCAGAACGAT 


GAATATAGCT 


CTCTTCGGTT AAAGGAACAT 


CTACGTTTAC 


GAC6TAAGGG 


14700 


ACCCCAGTGA 


AATCTAAGCC 


CCGTGCCGCA ATATCCGTGG 


TTAATAACAA 


CTTCACTCGT 


14760 


TCTGCTTTAA ATTGATCGAT 


CGCCAATTTA CGCAACTGTT 


TACTTTGATC 


TGAAGCCAGA 


14820 


CCAATAGCCG 


GAACATTCTC 


ATAAACTAAT TTTTCTTCTG 


CTGCACCTAA 


ATCAGCAACT 


14880 


TGATTGAAAA 


• AGACCATGCT 


TCTGAATGCT TCTGTATGGG 


CCAAACGACG 


TAAGTAATCA 


14940 


GCTTTTTTTC 


TTGGTGCTAA 


CCGAATAAAG TAGTGGGCGA 


CTTGGCCTGC 


TGAAGTATCT 


15000 


TCTTCCGTCA 


CATCAATCAC 


CTGTAATTTT TGGCTTAAAG 


ATTGCGCTTG 


ATTCACGACA 


15060 
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CGATCAGCAG TCGCTGAATA AAAAACAAGT TGGTATTCGG TTGGTGTGTG TGTCAAAATT 15120 

TGCT.XGGTTA AACTTAGCTC TTCCTCTTGG AAAAGTTGAT CAACTTCATC CATTACGATT ' 15180 

GTTTTTAACA ACTGAGCTTT GACTTTTTTA TTCTTCATTA ACTCTAAAAT TCGACCTGGT 15240 

GTTCCGATTA ACACTTCGGG ACGTTTTTTT AACTTGTCAA TCTGGCGTGA AACATTTGCC 15300 

CCGCCGATTA ACGTTTGAAC CTTTAATTGA AGTGGCTTAG CCCAAGTCCG AGCAACTTCA 15360 

GCAATTTGCA TAGCTAATTC TTGAGACGGA GCAATAATCA ACAATTGATT TCCCTGTCCT 15420 

TTTTCAACAG TTAAGAGTAA AGGCmGCmTG TAAGCAAGTG TTTTTcCmGT TCcTGTCGGC 15480 

GAAATTCCTA AAACATTTTC TCCTtCGCGA AGGGGCGTAA AACTTTTTTC TTGAATTGTT 15540 

GATGGTTCGT TAAAGCCAGA AGCTTGCCAC TGTTCTTGCC AAGCCTCTGG AAGTTGTTCT 15600 

AAAAATCTCA TATTGTTCCT TTCCTATCGC TTAATTCTTA TTTATCAGCA TCAAAAATAA 15660 

TGCCGGCACT ATGACGTAAA TCAGTGACTA CTTTATTGAC ATTTCGAGCA AGCTCAACCC 15720 

ATTCTTCATA CTTGGTGCCC CAATACTTAT CTGTTGGGTT ATTTAAAACT GCTGCAAAAT 15780 

CACGGGCTTC TTCAATCATT GGATTATCGG CTACTTGAAT ATCTAAGACA TCACGTTCTT 15840 

CATGTGCGCG ATCATGGTAC TCAGCCGAAG AAATAGCATT GACACTATCT AAAATCAAGG 15900 

TCCCGTTTTC AAAGTAAATT TCGGATGGCG CAAAAGAATC ACTAATTTTC CCAGTGTTTA 15960 

ACGTCACGTC AAACTGATCA TATCGTAAAA TAATTGTTCC TAAGCCATCG ACACCTGTCG 16020 

CAATTTTACT AGCAAAATAA TGGCTTTCGT TTGGTACGCC AAACCAGCCT AAGGCTGCAT 16080 

AAACTAAATA aACGCCTAAA TCCGCTAAGG CTCCCCCAGA AAAATGAGgA rAAAAtATTT 16140 

GGCTCTTTGC CTTCTAATAC TTGATCATAG CGAGAAGAAT ATTTCATATA CGTAAAATTC 16200 

GCACCTAAAA TTTGGTTTTT TAATGGCAAT AATTCAGCAA TTTTTTGGAA ACTTTGCTCA 16260 

TGAATATTAC GGGCCGCTTC AAAGAAATAC ACACGATTTT TATTCGCTAA TTCAATAATT 16320 

TCTGCCATCT CATCAGGCGT TGAAAAAGCT GGTTTTTCAA CAATTACATT TTTTTTCGCT 16380 

AAAATTGCTT GTTTCGCTTG TTCAAAATGT AATGAATTTG GTGAGGCAAT ATAAACCGTA 16440 

TCCATATGTG CAATACCAAA AAACGTCTCT AAATCAATTG CATATTCCAC ATCCCCATAG 16500 

CGAGAACCAA ATTCTTGTGC CGTTGCTAGT TTCCGAGAAT AAACAGCCGT TAAATCATAC 16560 

GCACCCGTTT CTAAAGCTGC TGCAACAAAT TGATGACTAA TCCAATTGGT ACCAATTACG 16620 

CCTAAATGTA TCATAATCAA AAACCTCCAT TTTTTGTTTA TTTACTCTCT TTATAATAGT 16680 

AATCTCTCGT ACAAAAGTCA CGTAATTACC CTGCCTTAAA GCGCTAGGAG TTCTTCAAGT 16740 

TATTTATTAG GATCATCAAA TGCCTGTAAT AAATCAAAGG TTACCTGTGA CTTAAGTTCA 16800 

ATCGTTGCTG GTGTTACGTG GCAATTGTAA GCAAGGACTT GCACACCTTG ATCAATTGCA 16860 

GCACCCACAT TTTCAGCAAA GGCGGGTTGC ATCTCTTGGT GAAGCGTTGC TTGCTTAATT 16920 

TCTTCAAATT GGACCACAAA CAAAATGTAA CAGCGGTAAC CCGCTTTAGT TGCAGCCATT 16980 

AGTTCAGTGA CATGCTTCAA GCCACGAAGT GTTGGCGCGT CAGGAAAAGC ACCGATGCCT 17040 
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TTATTTTCTA 


AAGTCATGCC 


TTTCACTTCG 


ACAAAAGCTT 


GTXCGTCCGC 


ATCCGTTTCC 


17100 


ACTAGAAAAT 


CAAATTTTGA 


ATGTGCAAAT 


CGTTGCTCAC 


GTTTAACAGT 


TTGGATCGTA 


17160 


CCAACTAAAC 


CAGGAAGCAC 


AATTTGACCA 


TTTTTTAACG 


CTTCATTGAC 


TAATGTATTC 


17220 


GGTACTTGGC 


TATCAATATT 


AATCCAAAAA 


GAACCTTTTT 


TTACTGCAAT 


CAAATCGTAA 


17280 


TCTGTTTTTC 


TTTTAGGCGA 


TGGTTGATAA 


CTAAGCGCCA 


CAACAGCACC 


TGGTAAAAAA 


17340 


ACTTCTTTCC 


CTCTGCCAGT 


ATTTTTGACA 


TGTGTGATTA 


CTTCTTCATT 


CGTTTCCATT 


17400 


AAACGACAGT 


GTGCAATAAA CCGATTGGGT 


CGTTCGATAA 


AATAAGCAAG 


TTGGACATTC 


17460 


GGGTACTTCA 


TACACAGACT 


TCCTTTCTTA CACACTATAC 


CATAAAAAAA 


CAAGGAAAAG 


17520 


TATAGACCAA AACCAACAAT 


TTTACTATCT 


TGTGTCAATC 


ATTAAATATT 


TGTCAATTCT 


17580 


CAGCGATTTA 


GTCGAATATT 


TTAGACAAGT 


TCTATATAAT 


AAAGGTAACG 


AAGACGAATA 


17640 


AATGAAAATG 


AAGGTTAGGA GGAATTTCTT 


TGAAAAAACA 


TTGGACTTTC 


TTGGTTGGTG 


17700 


CATCTTTACT 


TACTGGATTA 


GCAGGCTCTA 


TCCTCTGGAA 


AAAAAATAAA 


CAGCTCCCTT 


17760 


CCCCACAGGA 


GCTTCAACAC 


AACTATACTG 


GGCATTGGTG 


GTTTGTGGAC 


GTGAAGAAAG 


17820 


CCACGCAACA 


CACCTTAGAA 


ATCACCTCAT 


CTTTCAATGT 


TTTCATTGAT 


GGCAAACA7VA 


17880 


CGCATGCCTT 


GCTTGTTGAA 


TTAACCCCTA 


AACGCTTAGT 


TATCTTAGAC 


AAGTTAGGTT 


17940 


ACCATTTAAT 


CATTCAATGT 


GAAAATGGTG 


TCCCTGCGAC 


GCTTTACGAT 


GAAGCCGACG 


18000 


ACGAAACCTA 


CACACTAGAA 


CGGGCCATTT 


ACAATTAATA 


AATCACCCAC 


GACACATAAA 


18060 


ATGAAGTATG 


TGCGGTGGGT 


GATTTATTTT 


TTTACATCCC 


TTGCAAAAGC 


ATTCAGAAAG 


18120 


TTTTCTTACA 


ACTTGAATAT 


TATTAAATAG 


TCAAATGCGC 


CTTTTTCTGA 


TAAACTTAGG 


18180 


ATGATGTTTT 


TTTGAAAGGA 


TGAGCCCCTA 


TGACACCAAA 


TCGCGAAGAC 


TATTTAAAAT 


18240 


TAATTTTTGA 


ATTAGGTGGC 


GACGAAGTTA 


AAGTGAATAA 


TAAACAAATT 


GTTTGTGGAC 


18300 


TCGATGTTTC 


GGCAGCTTCG 


GTTAGTGAGA 


TGATTTCAAA 


GTTAGTAAAA 


GAAGATTTGG 


18360 


TTGAGCATTC 


TCCTTATCaA GGGGTACAAT 


TAACTGAAAA AGGCTTAAAA AAAGCGAGTA 


18420 


CGTTAATTCG CAAACACCGA ATCTGGGAAG TCTTTTTAGT AGAGCACTTA AATTACACTT 


18480 


GGAATGATGT 


GCACGAAGAG 


GCAGAAGTTT 


TAGAACATGT 


TACTTCACAG 


ACGCTTGTGA 


18540 


ACCGTTTAGC GGATTATTTA AATCATCCAG AATTTTGTCC ACACGGTGGT GTTATTCCCG 


18600 


AAGATAATCA ACCCATTCAT 


GAGGAGAAAC 


GCCAAACGTT 


AACAGACTAC 


CCTGTTGGCA 


18660 


CAAAAATTCG 


GATTGCACGT 


GTCTTAGACG 


AAAAAGAATT 


ACTGGATTAT 


TTAGTTTCCA 


18720 


TTGATTTAAA 


TATTCAAGAA 


GAATATACGA 


TTAAAGAAAT 


TGCTGCATAT 


GAAGGACCGA 


18780 


TCACCATTTA 


TAATGAAAAC 


AAAGAATTAT 


CCGTCAGCTT 


TAAAGCAGCA AACACAATTT 


18840 


TTGTTGAGCC 


GTTGATTAGA 


GAAAGTGAGG 


AAAACTAATG 


AATGAAAAAG 


TAGCATTATT 


18900 


TACTATTTTC 


GGGGGGACAG 


GTGATCTTGC 


CAAACGTAAA 


TTGTATCCTT 


CCCTGTTCCG 


18960 


CTTATATAAA 


AAAGGATTAT 


TAGCTGAGCG 


TTTTGCGGTG 


ATTGGTACCG 


CACGTCGGGA 


19020 
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ATGGACAGAC 


GAATACTATC GTGAAATCGT CCGTGAAACC ATJCAAGACT TAAATCCCAC 


19080 


GGCCGAAGAG GCAACTGAAT TTTCTAGTCA TTTCTATTAC CAATCACATA AXGTGAATG^ 


JLS140 


TACGiSAACAT 


TATAATACCT 


TAAAAGAATT 


ATCTGATCGC 


CTAAATGAAC 


AATACCACTT 


19200 


AGAAGGCAAT 


CATGTTTACT 


ATTTAGCTAT 


GGCCCCTCAA 


TTCTTTGGAA 


CCATTGTTAA 


19260 


TCACTTGAAA 


TCACAACATA 


TTATTTCGGA 


AGAAGGCTTT 


GATCGTCTAA 


TTATCGAAAA 


19320 


GCCATTTGGT 


rCTGATTATG 


AATCTGCCTA 


CGAATTAAAT 


GAAGAAATTC 


GTGCTGCATT 


19380 


TCCTGAAC7VA 


GATATTTTCC 


GAATTGACCA 


TTACTTAGGC 


AAAGAAATGA 


TCCAAAATAT 


19440 


TTCAGCCATT 


CGTTTTGCCA 


ATAATATTTT 


TGAATCACAA 


TGGAACAATC 


GTTACATTGA 


19500 


TAATGTCCAA 


ATCaCCTTCG 


CTGAAAGCTT 


AGGTGTCGAG 


GATCGTGGCG 


GCTATTATGA 


19560 


CCACAGTGGC 


GCACTAAAAG ATATGGTTCA AAATCATATT 


TTACAAGTCG 


TTGCATTGCT 


19620 


TTCTATGGAG 


CCTCCTGTTG CCTTTTCTGA AAAAGAAATT 


CGGACTGAAA 


AAGTAAAAGC 


19680 


TTTACGTGCC 


ATCCGCCTTT 


ATTCTGAAGA AGAAGCCCTT 


CAAAATTTCG 


TGCGTGGACA 


19740 


ATACGGCGAA 


GGTCAGCTAG 


CTGATCAAAC 


ATTTGCAGCC 


TATCGTGATG 


AACCAAATGT 


19800 


GGCAGAAACT 


TCATCCACTG 


AAACTTTTGT 


TGCTGGTAAA 


TTTTTAATTG 


ATAATTTCCG 


19860 


TTGGTCTGGC 


GTCCCTTTCT 


ATGTTCGCAC 


AGGTAAACGA 


TTAACTGAAA 


AAGGCACACG 


19920 


CATTAACATC 


GTCTTTAAAC 


AAGTACCTAT 


TAATGTCTTC 


AAAGATGATA 


CGTGTGAAGA 


19980 


ATGCGATAAA 


ACCGATTTAC 


CACCGAATGT 


TTTAACTATC 


TATATCCAAC 


CAACTGAAGG 


20040 


ATTTTCATTA 


ACCTTAAATG 


GAAAAGAAAT 


CGGTCAAGGG 


TTCAATACAA 


CTCCAGTGAA 


20100 


ACTTGATTTC 


CGTCAAAGTG 


CTGAAATGAC 


TGAAAATAGT 


CCAGAAGCAT 


ACGAAAAACT 


20160 


TCTATTAGAC 


TGCCTAAATG 


GCGATAGTAC 


AAACTTCTCT 


CATTGGGACG 


AAGTTGCCCA 


20220 


ATCTTGGCGC 


ATTGTTGACA 


TTATTCGACA 


TGCTTGGGAT 


AAAACAGACG 


TTTCTTTCCC 


20280 


TAATTATGCC 


GCTGGCACAA 


TGGGCCCGCA AGCTGCCTTC 


GACTTATTAG 


AAAAAGATGG 


20340 


CTTTACTTGG 


GAATGGCAAC 


CAGATAACTG 


GTACCGTGAC 


CGCGGTCAAC 


TTGATCAATA 


20400 


AATAAGCCAA AAAAGATCCG CTAGTTAACG 


ACTAGCGGAT 


CTTTTTTCTT 


GTTTATTCGA 


20460 


ATCCTTCAGC 


AACCGCTTCT 


GGATAATTTT 


CTTCAACAAT 


GTGTGCACAA 


CGGCCaCAAA 


20520 


GTGTTGGTAA 


TTTTTCATCT 


GAGCCGACAT 


CTTGACGAAC 


GGAACGACAA 


CGGTCACACG 


20580 


TTTCGCCTTC 


TGCTTTTTCA 


ACTAAAATAG 


CCATGTCTTC 


AAATTGAACA 


GCTTCACTAG 


20640 


GTGCTACTTC 


TTTTGATACT 


TCAAAGTCGG 


AAACAATTAG 


TAATTGAGCA 


ATATCTGCAT 


20700 


CAACAGCTGT 


CATTAATTGA 


CGAATTTGTT 


CATTTGGATA AACGGTCACT 


TTGGCTTCTA 


20760 


ACGATTTACC 


GATTAATTTA 


GAATGACGAG 


CTTCTTCCAA 


CGCTTTTAAG 


ACATTGTCAC 


20820 


GGAAATCCAT 


• AAAGGCTGCC 


CATGTATCCA 


TCAATTCTTC 


TTCATTAGTA 


AACGTTTCGT 


20880 


AACCTGGGAA 


TTCAGCTAAT 


TGCACATACT 


CTTCTTCTTC 


TTGTAAGAAA 


CTCCAAATTT 


20940 


CTTCCGCTGT 


ATGTGGAATA 


ATTGGTGTCA ATAATTTTGT 


TAACGAAACC 


AATGTTTGAT 


21000 
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AGAAAACAGT 


CTGCATACAA CGACGTTGAT AATCGTTTTC AGfTTCAATA TAAACGACAT 


21060 


CTTTCGCAAA GTCTAAATAG AAAGAAGATA AATCTACAGT TAAGAAGTTC ATAACTGTCC 


21120 


GATAAATATG 


CATGAAATTA 


TATTTTTCAT AACCATTTTC ACGGATTTCT TGAATGACTT 


21180 


GATTTAAACG 


AACCGTCATG 


TATTTATCTA CAGAACGTAA ATCGGCATAA GCTACAGTAT 


21240 


GCTCAGCTGG 


ATTAAAATCA 


CTCGTATTCG CTAACAAGAA ACGCATAGTA TTACGGATTT 


21300 


TCCGGTACAC 


TTCAGATACT 


TGGTTTAAGA TATCCATTGA GACACGCACA TCGGCTTCCG 


21360 


CGTCAACACT 


ACTTACCCAC 


AAGCGTAAGA TATCTGCCCC CATTTGATTG ATAACTTTTT 


21420 


CAGGTAA6AT 


AGTATTGCCT 


AACGATTTAC TCATCTTACG GCCTTCGCCA TCGAGAACCA 


21480 


TCCCTTGAGA AATAATTGAT 


TTATAAGGCG CTACTCCATT GATAGCGACA CTTGTTGTAA 


21540 


TACTTGAGTT 


AAACCAACCA 


CGATATTGGT CAGAACCTTC TAAATACATA TCTGCTGGGA 


21600 


AAGTTAACTC 


TTCTCTTTCT 


CGTAAAACTC CTTCATGTGA AGAACCTGAG TCAAACCAAA 


21660 


CGTCCATGAT 


ATCTGTTTCT 


TTGGTAAACT CGCCATTTGG TGAACCTGGA TGTGT7VAAGC 


21720 


CTGCTGGTAG 


TAACTCTTTT 


GcTTCACGCA TAAACCAGAT ATTTGATCCA TGTTCTGCAA 


21780 


ATAAATTCGC 


TACATGTTCA ATTGTTTCTG GCGTAATGAT GGCTTCCCCA TTTTCAGCAT 


21840 


AGAAGATCGG 


TAATGGTACA 


CCCCACGCTC TTTGGCGAGA GATTACCCAA TCACCACGGT 


21900 


CACGAATCAT 


ATTATATAAA 


CGTGTTTTTC CCCAAGGAAT TAGCCAGTCA ACTTTCTCAA 


21960 


CTTCATCTAA 


AATATCTTGA 


CGGAATTTAG AGATTGAAGC AAACCATTGT GGCGTTGCAC 


22020 


GGTAGATAAC 


TGGTTTTTTG 


GTACGCCAAT CATGTGGATA ACTATGCGTA AAGAAATCTA 


22080 


ATTTTAATAA 


CGCGCCTTTT 


TCTTCTAATA ACTCAGTAAT CATTGGATTC GCTTTGTCAT 


22140 


AGAAAATTCC 


TTCAAAACCA 


GGTGCTTCAT CTGTAAAGAC CCCACGGCTG TCAACAGGAG 


22200 


AAATAACAGG 


TAAATCATAT 


TTACGGCTAA CGATATAGTC ATCTTCACCA TGTCCTGGTG 


22260 


CGGTATGAAC TAAGCCAGTC CCGGCATCTA GCGTCACGTG GTCGCCTAgC ATAACAAGTG 


22320 


aTGTTCGATC 


ATAGAATGGA 


TGTTGAGCTG TCATACGATC TAATTTTTCA CCAGAAAATT 


22380 


CTCGCAAGAC 


TTCTACTTCT 


TsCCAACCAA TCGCTTCTTT AACTGTTGTT AATAAATCTT 


22440 


TAGCAATTAC 


AAATTaCGAC 


CGTCTGCCTT CACTTCAACA TACGTAAAAT CAGGATTTUVC 


22500 


TGAAATACCT AAGTTGGCTG GTAATGTCCA AGGGGTTGTC GTCCAGATGA CGAAGGCTGT 


22560 


CTCGTTATCT 


AAAAGACCTT 


TACCATCGGC CACATTAAAA GCTACGTAAA TAGAAGGTGA 


22620 


TTTTACATCT 


TTGTATTCAA 


TTTCTGCTTC TGCTAAAGAA GATTCACTTG AAGGAGACCA 


22680 


ATAAATTGGT 


TTTAAACCTT 


TATAAATATA GCCTTTTTCT GCCATCTTAC CAAAAACACG 


22740 


AATTTCTGCT 


GCTTCATACT 


CTGGATCTAA CGTAATGTAT GGATGTTCCC AATCGCCTGA 


22800 


CACACCTAAA 


•CGTTTAAAAT 


CGTTACGTTG TTTATCGACT TGTGATAAGG CATACTCTTT 


22860 


ACATTTTTCG 


CGATACTCAG 


CGACAGTCAT TTCTTTACGT TTAACCCCTT TATTGGtTAA 


22920 


CACTTGCTCa 


ATTGGTAAAC 


CATGAGtATC CCAACCAGGc ACATAAGGAG AACGGAAACC 


22980 
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AGACATTGAT 

ATGAATATTT 

GTTTAATTTT 

TTCACGGTTT 

TTTCATTTTC 

ACTAAAGATG 

CGAAGTAcTC 

TGATAACGGT 

GGATCTGTTT 

AAACATTAAA 

TCAATTACAG 

TCATTTTCGT 

CCGACATAAC 

AATTGAGTTT 

TCTTCTGTTT 

GATTTCGCTT 

TCTTTGGCTT 

TTCACTTTAT 

TCATtGT^GT 

TTTTGTAATG 

TGGTTATAGC 

GCCATATTCC 

ACTAATCCTG 

ACAGCCAAAA 

GACACGAAAT 

TCCCCGTATC 

CCAATTGACT 

AAATACGGCG 

CCAACCATCC 

TTTACCAATT 

CTGCCACTTA 

CATCAaGGTG 

kTCAAATAAT 



TTTGaACGaA 
CCGTTTGCAT 
TGACGTTGTT 
GGCAAGTTCC 
ATATGCATTT 
AACGATAGCT 
GTGTTACCAC 
AATTATCCGG 
AATGCCAGTG 
CGTTCTGTTC 
GCTTTTTTTC 
TGTCAAGGTC 
TAGAAAAAGG 
CAAGCATTAA 
CCCCAGCTAA 
CAGCATCTGC 
GGTTATCTGC 
CAGCGGTATC 
ATtGTAATTT 
CAtCTTCATA 
CTCTCATTTT 
TTCACCTCTT 
AAATAGCAAA 
TGGTTTTCTT 
TTTTCTTTCT 
GAAACAATAT 
TTTACTTTAC 
GAAATCACAC 
TTCGGTATCA 
TTGTCGACTT 
TCACCATCGG 
CCTAAAATTT 
ACAACACCAA 



TAATAATATC 
ACGGAGGGCC 
CATATAAGCC 
CACGCATTGG 
GCCTCGCTTT 
GATAAAAGTA 
CCAAATTCcA 
CAGTTCCTAC 
TCTGTCAAAC 
TATCTATTTT 
TACTACAGCA 
TTGTTCATCA 
CTTTAAAATC 
GCTTAAGCGT 
TTGACGGGCA 
AATCATTGCA 
CGAAGTGATA 
TtGTGCAACA 
TTCTtCTGCG 
ATCACGTGTG 
TGTTGAGAAG 
TGGATTATTT 
TATCTTACTT 
AAACAAAAGT 
TGGTTTTTCC 
CTAATAAGTC 
CAGATTCGAT 
TATCCAGCCG 
AAATTTGTGT 
GGTTAACAAC 
AAATAATATC 
TCCCaTGaGA 
AATCTGTCTC 



779 
TTTACTGATT 
ATCATGTAAA 
TTTTTCTTCC 
AAAAGCTGTT 
CATTTTATCG 
GATAAAAAAA 
AAGTAGAGTT 
TCAAGTGTTC 
TTTCACTATC 
GATTCTTCTT 
TCTGTCTTCT 
AGAATTTCCT 
TCTTCCCACT 
TGaTGGAAGA 
CGTTCAATTG 
TTTGATTTAC 
ATCATTTCTG 
ATAATTGATT 
TGTTTTAAtG 
ACTTGATCTA 
tCTTTGTTTT 
TTGTAAAACC 
TTTTTATTAT 
TCGAGATTAT 
TTCTAATTCT 
TAAAGCAAAA 
CAATTGTTTt 
CAAAGAGCTC 
ATAATCCCGT 
GAAGCCAGCT 
TCCAAAGGCT 
TAaTGTCGCA 
TTCTGGTATG 



TTATTCAAAG 
ACAAAGGTTG 
CAGTCTTTTT 
TTTCCTAATT 
ATTTAATTGT 
GAGTACTCCa 
GCCCCTACTT 
AGAACTGCAT 
ATTGACTCGC 
CAATTGTTTC 
CTTCAGAGTT 
TAACTGCTGT 
CTTCACTCTT 
CACGTGTTTT 
CTTCTGCTAA 
GTTCAGCTTC 
ATTCTTTATT 
GATTCAATGC 
AwTTCTCAAG 
AAAAGTCATC 
GAATAwCTAA 
CCTTATGAAC 
ACACAACTTC 
TTTCTCAATA 
TGAATTTGAA 
TCTGGTCGTG 
GATCGCTGCC 
ATAGTCGTTC 
TCTTCTAAGC 
ACTTCTTGTG 
TCCcTACGAA 
AaTTTAGATG 
TAATACGCTG 



AATGTCCTAA 
GTTTTCCTTC 
GCCaTTCTGC 
GCAATGTTTC 
GTTAGTTCAT 
TCCCaAGGGA 
TCTCTGAAAT 
CTAGGATAGA 
TGCTAAGAAT 
TTCAGTGACT 
TACAACAGTT 
ATGTTTGTCT 
TACTTGcTCT 
CTTCTTCAAG 
GATTTGTGTT 
TACCAAGGTT 
GGCACTTGAT 
ATCTTTCAAT 
TTCACGATTT 
TACATCGTCT 
tGGAGTTAAT 
ACATTCTCAT 
GTCTGAATGA 
CCCCTAAGAG 
GGCGCCCAAA 
TCGTTTCTGT 
TTGAAATATT 
GCTCTTCCTG 
GAACAGTAAT 
CTATAAAAAC 
CCCCAACATT 
GATAAATAaT 
GATAAATCAA 



23040 

23100 

23160 

23220 

23280 

23340 

23400 

234 60 

23520 

23580 

23640 

23700 

23760 

23820 

23880 

23940 

24000 

24060 

24120 

24180 

24240 

24300 

24360 

24420 

24480 

24540 

24600 

24660 

24720 

24780 

24840 

24900 

24960 
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GCAGCGTTTT CGTTCTGCTT GCTCATAACC ACCATAAAAA GXAAACGATA AATCACTGTT 


25020 


TTGTCGAATC 


AACGTTTCCA 


AAATATACGC 


CTGTCTAGGA 


TCAAGAAAGT 


CTGTCAAATA 


2508a 


AGGCGCGTAT 


TGCATCTCCA 


CTTGCTCTAG 


CCAATCCCCG 


ACAGTGTCAA 


TAAAAGGATG 


25140 


TTCATCTTTG 


CGGAAATGTT 


GATACACATT 


TGCGTTCATA 


GATAAATCAT 


CCTTTCTAAA 


25200 


AGTTATTAAT 


AAACAATAGC 


TCGTAAAATG 


ACTGCTAAGC 


CACCCGCTGC 


TAGGTTAAGG 


25260 


ACAATAATCC 


CTACCATTAC 


GTTAAATCCA 


ACCATTCCAA 


TCCTTAAATT 


CAAACGGTCA 


25320 


AACAAACTTA 


AGTATGGTTC 


ACAAATTCTA 


GAGAGCAAAC 


GCCCGAAAGC 


AGAATCATAT 


25380 


GCTCCTGGAA 


ACCATGAAAG 


TAGCGCGTAA ATCACTAGTA ATCCTGAATA 


GATTTGAACG 


25440 


CCTTTGTATA ATAGAGTAAT 


CAGTGTTGCG 


ATAATAACAC 


CTCCCTAAGG 


CTTAGAAATC 


25500 


AAAGAATTGT 


TTGTTGGCTA 


AAGATTGTGC 


TGTCGCACTA 


TCAATTTCCA 


TATTCGCTGG 


25560 


TGTGCATAAG 


AAGATTTCAT 


TGCCAACGCG 


CTGAATATCT 


CCGTCTAAAG 


CATACACAGT 


25620 


ACCAGTTAAG 


AAATCAACGA 


TGCGTCGTGC 


TTGGTCTTCT 


TCTACTAATG 


TAAAATTAAC 


25680 


TAGCACAGCT 


TCTTCAGCAA 


ATAAATGTTT 


CGCAATGCTC 


ATTGCTTCTG 


AATACACACG 


25740 


TGGTTCAATA 


ATAGTAATTT 


TATGTGTTTT 


TGCATTGGCG 


GTGCTTTCTT 


GTGCTCGTCT 


25800 


TGTATTCGTA GTCGCTGATT 


GATTAGAACT 


ACGCATAGAA 


ACCACTTTTT 


TCTCTGTGTA 


25860 


AGCAGCTTGT 


TGTGTGTCTC 


TTTCTTGATG 


CGCTTCAGCA 


GAACGATAGC 


GTGCAGAAGG 


25920 


TTTTTCAACT 


ACAGGCTCAC 


TTTGTTGTGT 


ATAAGCTGGC 


TGACTGTAAC 


TTTCCTGTTG 


25980 


TTGAACAGGG 


CGTTGTGGTT 


TAGGCCGAGC 


TGCACGACGC 


GGTGGTTCAT 


TAACTGCTTT 


26040 


GCGCTCTTCA 


TATTCTTCGT 


AGTTATCATA 


ATAATCATCT 


TCTTCGCCAG 


ATAAGCCAAA 


26100 


AAAGCTCGAT 


AACGCATCCT 


TGTTAAAAAT 


TGACATAAAT 


TCCTCTCCTT 


TACGCATCTC 


26160 


TGAATAACGC 


TGTTCCGATT 


CGTATAAAAG 


TTGCACCTTC 


TTCAACTGCG 


ATTGGGAAAT 


26220 


CGTtACTCAT 


TCCCATGCTA AGTTCTGTAC 


AAGGAGCAAA 


AGATAAATGT 


TTCTCATTTA 


26280 


CGGTCATTTG 


CAGCTCTTTT 


AATtCTCTAA 


ATGTTGCATG 


TAATACTTCT 


TCACTTGCCC 


26340 


CAAAAGGTGC 


CATAGTCATT 


AGACCGACGA 


TTTGAATTTT 


GTCCAAATCA 


GCAAGTTCTT 


26400 


GAATGAAAGG 


TAAGACGTCT 


TCTTTACGAA AGCCGTGCTT 


GCTTTCTTCC 


CCAGTTATAT 


26460 


TCACTTCAAC 


AAAACAACGA ATAACTTTCG 


TTGCTCTTTT 


TTGAATTTCT 


TTGGCTAGAC 


26520 


TCAGACTATC 


TAATGCATGA AAATAATCTA 


TCTCATTAAT 


AATTAATTTT 


ACCTTCCTAC 


26580 


GTTGTAAATT 


ACCAATTAAA 


TGCCACTGAA 


TATCATTAAA 


TTTTTCCAAT 


CCAGCTTTCT 


26640 


TTTCTAGTAA 


CTTATCTACT 


CGGTTTTCAG 


CTAAATTCTG 


GACACCAAGC 


TGAACAAGCT 


26700 


CTGCTGCTTC 


CTCAATTCCT 


ACTGATTTAG 


TGACGCCAAT 


CATTGTCACA 


TCAGCAACAG 


26760 


AGCGCTGTGA 


GCTTTTGCAT 


GCTAGCTGAA 


GTTGATTATT 


AATATTTGTT 


AAGTTTTCAG 


26820 


CTATCATTTC 


TAATCGATGC 


TCCTTATCGT 


TTTCTGCGGA 


AAAATGGTGG 


CGTACTTAAT 


26880 


TCGTCATCAT 


TGTGATTTGG 


TTCTTCACGG 


TGGAATGTGT 


CAAATTCTTT 


TTTGTCTACT 


26940 
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TGTTCTAATG AAGATTCATC AACTTTTGGA CGTGTATTTT GTTCTCTGCG AATATCCCAA 27000 

TCGCCAAAAG CACTTGCTTC TTCTXGTGAT ITT.GGXTGAT CCATTTCCAC AGATTGAGTT * 27060 

GTTTGTTGCA TTGGTTGAAC CGCTTGTCTT GTTTGACGGT GCGGTTTTCT ATCCTTTTTT 27120 

GATTCATCAA TACCTGTAGC GATTACTGTT ACACGAATTT CGTCGCCTAA ATCTTCGTTA 27180 

ATTGATGTAC CTAGGATGAT GTTCACATCA CCTGAAGCAG CATTTGTAAC GATATCTGAA 27240 

GCATCTTGCG CTTCAAACAA GGTCATATCT AAGCCACCAG TAATGTTTAA TAAGACTTGT 27300 

TCCGCACCAT CGATTGATGT TTCTAACAAT GGTGAAGAAA TAGCTTTTTT CGTTGCTTCA 27360 

ATTACACGTT CTTCGCCGCT CGCTACACCA ATTCCCATTA AAGCTGTTCC TTGGTTTTCC 27420 

ATCACTGTTT TCACGTCAGC AAAGTCCAAG TTTACGTAAC CTGGTGCAGT GaTTAAATCT 27480 

GAAATCCCTT GAACACCTTG aCGTAATACA TTATCAGCTT CTCTAAATGC TTCAAGCATT 27540 

GGCGTTTTCT TGTCAACGAC TTCTAATAAG CGGTTATTTG AGATAATTAA TAGTGTATCA 27600 

ACGTTTTCTT TTAATAAGGC AATTCCTTCA GCanGCAAAc GACCACGTTT TGGACCTTCA 27660 

AAACTAAATG GACGAGTTAC TACACCAACT GTTAAAGCGC CTAATTCTTT AGCGATTTTT 27720 

GCAACTACTG GCGCAGCACC TGTACCAGTT CCGCCACCCA TACCAGCAGT AATGAAAATC 27780 

ATATCCGCGC CTTGTAATGA TTCTGAAATC ACTTGTTCAC TTTCTTCTGc AGCTTTTTGG 27840 

CCAACTTCAG GTTGtGAACC GGCACCTAAA CCACGAGTGT ATTTAGGGCC TAATTGAATC 27900 

ACTGTTTCTG CTTTTGAATG TTTTAATGCT TGAACGTCTG TATTGGCTGT GATAAATTCC 27960 

ACGCCTTTAA CGTTTTCTTC GATCATACGG TTTACAGCGT TTCCGCCGCC GCCACCTACA 28020 

CCGATAACTT TAATGACTGC ACCGTTGTTA ATGTTATTGT CTAATGAAAA TTCCATGATA 28080 

TCGTATCCTC CTGTTTGTGT GATTAGTCGA AAATGTTCGA GAAGAAGTCT TTGATTTTGC 28140 

CTGTCACTTT TTCACCAGAC TCACGTTCGT TGAATTCTTC GTATTCTTCT TGAGGCTGTT 28200 

CAGCATATGT GTCATAGCGA ACTTCTTGTT GGACAGCGAC TGATTGTGCT GGTTTCGATT 28260 

TTTCACCAGG GATTGCATAT TTGGCAATGT GATAAATATC GTTTAGTTGC GCTGAATATT 28320 

CCACAATGCT AATGACATTA GCAAAGACTG GGTTACGTAA GCCCATGTGA TTTGGTACGT 28380 

ATAGTTTTAC GTTGGCTTCA AAAATCTCTT GCGCCAAATC GACAATACCT GGCATACTAG 28440 

CGGCGCCACC AGtTAACACA ACGCCTCCAG GTAATTCmAA CGCATCAATT TCATCCAGAA 28500 

CTTCTTTTGA TTTTCTCAAA ATTTGTTCCA CACGCGCTTC 7VATAATTTCT GAAAGGTAAC 28560 

GTTCGTCCAC TCTTACAGGC TCTGACTTAC CGATAACATC AACAGGAAAC TCTTCGTTTG 28620 

CTGAAGTTCT TTCTGGATAA GCATCCCCAT AATTAATTTT TAAGGCTTCG GCGTTGTTAA 28680 

AAGAAGTATT CAATACGATT GAGATATCTT TCGTGATGAA CTCGCCGCCT TCTTGGTTAA 28740 

CATGAGTAAA TTTCAATTGT TTGTCGTGGg ATAACAGATG TCGTTGtTTG aCCGCCACCC 28800 

aTATCaATGa CAATkGtTCC aAAGTCTTTT TCGcCATCTG tTaAAATTGT TTCCGTTAAA 28860 

GCAAGCGGTG tAATCACTAA TTCGtTAATT CCTAAgCCAG CTTTTTCAAC ACATTTACGG 28920 
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ATGTTATGGA TGaTTGTTTT TGGtCCAGTA TAGACAACGC CA^UVCATCTC CATGCGAACG 
CCAAGCATTC CACGTGGGTC TTTGATGCCT TCAAAGCCGT CGACGGTAAA .GTCTTGCGGT 
aAGATTGCAA CAATTTGACG TTCAGGAGGA GTTGAACGAA CCAAAGCTGC AGATGCGACG 
TTGCGCACAT CCTCATCTGT AATTTCTTTT GATTCACTGC TAACAGCTAT CATTCCTTGA 
CAGCTTTCTA CTTCTAATAA ATTCGCTGGT AATCCAACAT TTACGCTTTT AATTTGAATG 
CCGGCTTTTT CTTCGGCTTG TCGAACGGCC CGTTGGATTG CTTGAACTGT TTGATCAATA 
TCGACGACAA TCCCTCGATT CAAACCGTCT GATTTTGCAT TTCCAACGCC GATGATGTTC 
ATTTGACCTT CAATATATTC AGCGACAACT ACTTTGACAG AAGTTGTACC AATATCGAGG 
CCTACATACA TTCCTGTTTT TGCCATGAAC GGAATTCCTC CTCCTATTAA TTCCAATATC 
TAAGCGGACA AAGGCGTCCG TTTAACTATC TCTTTTTTAG TATTATTCAC TTCTTCCAAT 
TTTACCATAA AAATGGGGGT TTTAGAAAGC AGATTATACG AATTTGTTCG TTAATTTTTG 
AAACATTTTT GTAAAAAACT TATTCTGCTT GTTGCACTTC TTGATTTTCT CCATTTGTCG 
GCGTACCGTT ATTCGCTTCA TCCGTTGCTA CGCCTGCATT TGGGTCCACA ACTTCTTGAT 
TTTCAACCGC TGAATCTTCG GAAACTTCAC TACCCGTTTC TTCACTTTCT TTTTCATACG 
GATAAGAAAA AATGCCGACT TCCATATCAA CGATTCCAGG TTTGTCCATT TGCTCGGCTA 
CTTGTGAATA ATAAGCCATT TTTTCACTTA AATCAGAAAT ATTAACAATC ACTCGATTTC 
CATCATTCAT ATAGACATTA ATCAAGTCTT TATTGGTTTT GGATGGTGCA TATTTAATTT 
CTGAAATTCC TTGTTTAATT TCTTGAGGTA ATTTGTTGTA AGAGGCCATT AATTCAGGGA 
TTAATTTATC CTCTTTG.AAA TTCTCAAAAA TTGGTTTGCC ACTTTCAGCA GCTTTGGTGG 
TTTCCGCCAA AGTTTTACCA TTCTCTAAAA TCGGATGATA GCCGCCTTTA GTCGCAGCAA 
GAGCGACAAT TTGATACTCT TGAATATCAA TTTTAAATGA ATTAATACCA TTTAACTTAA 
TGTTCGCCTT TTTTATAATT GGAAACTTTT TCTGAATATT TGCAGAAAAA TT^TTTCTAT 
TTGAATACTG CTCCCATAAG CCACTGCCCG TTTCAAGTTT TGACTGCTGA ATAATCGCTT 
GGCTCTCCAC ACTTTTGTTA CCTGAAACTG TTACTTCACT CAATCGACTA AGAGGCGAAA 
CAAAATACAA TGCCACTAAA ATGGCTGTCC CTAAACAAGT TAAAATAAAG GCTAAACGCC 
TATATAAGCG CTTGTTCCGT GTTTTCTTAA CGTTCGGTAA ACGATCTGCA AAATGCTCCT 
CTTCTTCCAC TGTTTCTTGT GGAAGATGTT GTTGAGGCTC TTCTTGACTT TGTTCTTGTA 
AAGTGATGGT GACTTCTGCT TGTTCGTTTT CACCTTTTGA AGCAGCTTCT TCCGCTTGCT 
TCTTCAAATA TTCTTGGTTT TGTTTTTGCC AAGGTGTCAG TTCAGGTTGA GACTCTTCCG 
^TYTTTTTGG AGGTAATGGT TTTCTTTTTT TGAATATATC GTTCTCGTTA CTAATCTTCC 
ACACCTCCTA TCTCAGACTA ACGTTTTCAC AACTTGATAC AAACGATCAC TTGCATCTGG 
GATGCCTTCC CCTTTGGACG CCGTTGCCAT CTGTTGTCTT TTTTCATTGT TTAATAAAAT 
ATCATCTATT GCCGCAACTA AACGAGCGCC GGTCAATTCT GCATCTGGAA TCATTTCAAC 



28980 

29040 

29100 

29160 

29220 

29280 

29340 

29400 

29460 

29520 

29580 

29640 

29700 

29760 

29820 

29880 

29940 

30000 

30060 

30120 

30180 

30240 

30300 

30360 

30420 

30480 

30540 

30600 

30660 

30720 

30780 

30840 

30900 
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AGCACCAACT 


TTTACTAAGC 


TTTGCGCGTT 


TTTCGTTTGA 


TGATCATTCG 


TGACGTAAGG 


30960 


ACTCGGAATT 


AAAATTGCTG 


GTAATCCTAA 


TGCGGTGAAC 


TCCGCAATTG 


AGGTTGCTCC 


31020 


TGCACGTCCC 


ACCATTAAAT 


CGGTATTAGC 


CATCACTTCA 


ACCATTTTAT 


CAATATACGG 


31080 


TTGCACACTT ATGTTTGTCA ATTTCTTTTC AGAAAACTTT 


AGACTTTCTT 


GAAGTTCCTG 


31140 


ATAATATCTT 


TCTCCTGAAG 


CGTATAAAAC 


TTGATACTCT 


CTCTCTTCAA ACAAAGGAAA 


31200 


GGCCTGTTCA AACGCTTGAT 


TGATTTTCAA GGCGCCACGA 


CTTCCGCCAA 


ATAAAACCAC 


31260 


CGTTTTCTTT GCTGGATCCA AGCCA7VACTC TGATAAAATA GCTGACTTTT CCACCGTTAC 


31320 


GACTTCTTGT 


CCCCGCGGGT 


TCCCTGTT7UV 


AATTGTTTTT 


TCTTTCGGGA 


AAAAACTAGC 


31380 


GACATCCGGA AAACAAATCG 


CAATTTTATC 


CACATAACGA 


CTTAAAAATT 


TATTTGTCAT 


31440 


TCCGGGGATA 


CTATTTTGTT 


CATGAATAAT 


TGTCGGTATT 


TTTAATTGGT 


GGGCCGCATA 


31500 


AACCACTGCA 


CCAGAAACAT 


AACCGCCTGT 


ACCAATGACC 


ACATCTGGTT 


GAAATTCACG 


31560 


AATAATTTTT 


TTTGCTTTAT 


TAATGCTTGT 


TAAAAATAAA 


TAAATTGTTT 


TAAAATTTTG 


31620 


AGGACTTAGC 


GAACGCTTGA 


ATCCTTGAAT 


TTTAATCGTT 


TTAAATGGAA 


TTTTCGCCTT 


31680 


TGGCACAATT 


TGACTTTCTA 


GACCATTtTC 


TGTTCCCACA 


TACAAGAATT 


CTGTTGCAGG 


31740 


TGCTTCTTTT 


TTGACGTGTT 


CGACGAAGGA 


TAACGCAGGA 


TAGATATGCC 


CACCTGTGCC 


31800 


ACCACCAGTT 


ACTAGTATCT 


TCATTTTCTC 


ACATTTCACT 


TTCTTTTAAT 


TGTTGAACAG 


31860 


CTTGCATAAA GGCTTCCCCG 


CGTACTTCAA 


AATTCGGGTA 


TTGGTCCCAA 


CTTGCGCAAG 


31920 


CAGGTGATAG 


TAAAATAGTA 


TCATCTTTTT 


CCGAATAATC 


AAAGGCAATG 


GTAACCGCCG 


31980 


TTTGAACATT 


TTCAGCAAAT 


AAAATTGTTT 


CAATGTTCGC 


TTTTTTAGCA 


GCTTCCGCCA 


32040 


ATTTTTCTTT GGTTTCTCCA AACAAAACAA TTGCTTTGAG 


TCCCAACAGG 


GCAGGAACCA 


32100 


ATTCATC7\AA 


TGAGTTACCG 


CGATCCAAGC 


CACCCGCAAG 


TAAAAGTAGC 


TTTTGGTTGT 


32160 


CAAACCCACT 


TAACGCCATC 


TCTGTAGCTA AAATATTGGT 


TGCTTTTGAA 


TCGTTATAAA 


32220 


AACGTCTTTG 


CTGAACTTCG 


CCAACAAATT 


GCGTTCGATG 


AGGAACGCCT 


GAAAAGTTTG 


32280 


TCAAAGTTTG 


TCTAATCTGA 


GCATTCGATA CATTTTTTAA 


TTTAGCTACA 


CAAATCGCTG 


32340 


CGAGTGCATT 


TTCAATATTG 


TGACTGCCAG 


GAATCCCTAG 


CTCATCGGCG 


GGCATAATAT 


32400 


ATTCTTCATT 


GAAATATAAT 


TTCCCATCTA AAAGATAAGC 


CCCTTCTACT 


GCTTCTTTCG 


32460 


TTGAAAAAGG 


CAATACGTTG 


GCAGCTGTGG TTTTGGCTAA 


CGTTTGAAGC 


TCTACTTGAT 


32520 


TCCAATTTAA 


AATCAAGGTG 


TCCTCTGCGG 


TCATGTTTTT 


TTGAATGGCC 


CATTTTGCAG 


32580 


CAACATATTC 


TTTCCGCGAA 


CCATGATAAT 


CCAAGTGTGC 


CTCAAAAATA 


TTTGTAATTA 


32640 


CTGCAATTTG 


TGGGTGAAAC 


GTCTCAATTC 


CCATTAACTG 


AAAACTAGAA 


AGTTCCATCA 


32700 


CAAGATTATC 


CTTGGCCGTT 


GCTTCTTGAG 


CCACCGTACT 


CGCTGGAAAA 


CCAATATTTC 


32760 


CCGCCAAA 












32768 


(2) INFORMATION FOR SEQ ID NO: 129: 
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(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 6259 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

GGCTGGATTT TCCTAGTAAG AGCTTGTTGG AmCAGCTTAA TAAAGACACA AGCCTTTTTT 60 

GACGTCGTCG CTCAGGATTT TGCTAAAATG GCCCAGTTTA CGTATCATTT ACAAGAAGCA 120 

AAAAAAGTCA CAGAGTATAC AAATTCTTTA AGATTAAAGG ATTTACAAGG AAAATAAGCT 180 

GAAAACAACA TAAAAATAGT GTATAATTGA AAAAATTAAT CTGCGAGAGA AAGGGAACCT 240 

TTTTTGATTG TAGATCACCG CAGTTGTTTG TTGCCTAGTC TCTAAGAAAT AAACCTAAAA 300 

TTATTGATAT AAAGGATGAA CAAATGAAAA AAGAAGAAAT GCAAAT6CGT AATACACGTC 360 

GTCAAAAATC AGGAAAAAAT AATAAAAAGA AAGTAATTAT TACTTCTTTG GTTGGACTAG 420 

CTCTGGTTGC TGGGGGCAGT TATGTTTATT TTCAAAGTCA CTTTTTmCCA ACCACAAAAG 480 

TAAATGGAGT TTCTGTAGGC TGGTTAAATG TAAATGCTGC AGAAGAAAAA TTAGCGCAAG 540 

TTAATCAAAC CGAAGAAGTT GTGGTTCAAA CGGGGACAAA AGAAGAAAAA ATTCAACTTC 600 

CTTiAAAAATA CCAATTGGAT CAAAAATTTT TAAAAGACCA TTTACACAGT AGCAAGGTGA 660 

AGCTACCGTT AAACGAGGCA TTCAAAAAAG AACTAGAAGC C7VAATTAGCA ACTTTGAGTT 720 

TTCCAGAGGG GAAACCAAGC AAAAATGCGA GTATCCGTCG AGGCAATGGC ACTTTTGAAA 780 

TTGTTCCCGA AGAACAAGGC ACAGTAGTGG ACACACAGCG CTT/^CCAG CAGATTATTG 840 

CGGATGTTGA AGCGGGAAAA GGCAACTATC AATATAATGC CAAAGATTTT TATAAAGCCC 900 

CTGAAATTAC AAAAGAGGAT CAAACGTTAA AGGCAACATT GACAACGCTC AATAACAAGT 960 

TAAATAAAAC AATTACAGTT GATATTAATG GTGAAAAAGT AGCCTTTGAT AAAACACAAA 1020 

TTCAAAACGT GCTGAATGAT GATGGCACAA TCAACAAAGA AAAACTAACT ACTTGGGTGA 1080 

CACAATTAGA AACAACATAT GGTTCTGCTA ATCAACCAGT TTTATTTACA GATGTTCACG 1140 

GCACGACACG TCGTTTTAAA AACAACGGAA GTTATGGCTG GTCGATTGAT GGGGCCAAAA 1200 

CGCAAGAACT ACTAGTAAAC GCGCTGAATA GCCAAGAACA AACGAATGCA ATCACTGCTC 1260 

CGTTGGTTGG TGATACCAAA GAAAATAGTA AAATTGCCAA TAATTACATT GAAATTGATT 1320 

TAAAAGATCA AAAAATGTAT TGTTTCATTG ATGGCAAAAA AATAGTCACC ACAGATGTCA 1380 

TTACTGGCAG ATATAACAAA GGAACCGCAA CAGTACCAGG ATTCCATACA ATTTTATATC 1440 

GGACAACCGA TGTGAATTTA GAAGGTCAAA TGCTTGATGG TTCTCGATAC AGTGTGCCAG 1500 

TAAAATATTG GATGCCGTTA TTAAGTCAAG GGGGCGTTGT CACACAAATC GGGATTCATG 1560 

ACTCCGACCA TAAATTGGAT AAGTATGGCG ATAAAGAAGC CTTTAAAACC GATGCTGGTA 1620 

GTAATGGCTG TATCAATACG CCAGGAACAG AAGTTTCAAA AATCTTTGAT GTATCCTATG 1680 

ACGGAATGCC GGTAATTATT TATGGACATA TCTATGATGA TGCACCAGGT GAATTTGATA 1740 
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AACCTGTAGA TTACGGC&A AAGAGATATT AGGcAAGACC GTACTTTGGT 

V^GCTGTct'aTCTGM?^^^ 7AA.-jyVTTSG AGACGTCAGC GTAACGACTG ACGTCCCCAA 
TTrTTTATTG ACAAGCG^CA 7AACCCGCTC TATGATACAA ACAGAACAAG CAATTTAAAT 
TGGGGGAACG AAGATGCC-.T rTGTACATGT AGAATTAATT GAAGGCCGCA CAGAAGAACA 
GTTAACTAAT ATGGTCAAPjG ATATTACAGA AGCTGTGTCA AAAAACGCTG GTGCACCAAA 
AGAGAATATC CaTGTGAT7G TCT^-TGaATT GAAAAAgACC GCTATGCACA AGGCGGCGAA 
TGGAAAAAAT AAGTTTTC:;.JG AGAATTTCTG TTGAATCACT TGCTTTTTCT GAAAATAAGT 
GGTTAAATAA GTGCATAAAT GATTGATGAG AAAGACAACT GAACTTGCCC GCGATCTATT 
AGAGAGGAAA ATCTTGGC7G AAAATTTTCC AGATTGGCAC AAGGGATGAC CACTTTCGAA 
ACCTTTGTTG AAACAAAGGC GGTCGCACCG TTAACACGCT AGAGGAACAC AGATTATTTG 
TGTTCAAAAT TAGGTGGAAC CACGATATTT TCGTCCTAGT ATTCGTTAGA ATACTAGGAC 
TTTTTGTTTT TTAACAAGGG GAACAGGTGG CTGTTTCGTT TCATTGTGAA AAACAGGAAC 
ACAATGAAAC GCTTCAJVA7C AAAAAAATTA AAAGGAGAGA ATTTTATTAT GTCTATTCAC 
ATTACTTTTC CAGATGGCGC TGTTAAACCG TTTGATTCTG GAATTACAAC ATTTGATGTT 
GCTAAAAGTA TTAGCAACAG 7TTAGCCAAA AAAGCTTTAG CTGGTAAATT CAATGGTGTT 
TTAATCGATT TAGATCGTCC TATCGTA6AA 6ATGGTTCGC TTGAAATCGT GACACCTGAT 
CATGAAGATG CTTTAGGAAT TTTACGTCAT TCATCAGCTC ATTTAATGGC TAATGCCTTA 
CGCCGTCTTT TCCCTAACAT TAAATTTGGC GTAGGTCCTG CGATTGATTC TGGGTTCTAT 
TA?GATACAG ATAATGGAGA ATCCCCTGTG ACAGCGGAAG ATTTACCTGC AATTGAAGCC 
GAAATGATGA AAATTGTG^A GGAAAATAAC CCAATCGTTC GTAAAGAAAT CTCACGTGCA 
GAAGCGTTAG AATTATTTGC TGATGATCCT TACAAAGTTG AATTAATTAC AGATTTGCCA 
ga:vgatgaaa TCATCACTGT CTATGATCAA GGCGATTTTG TTGATTTATG TCGTGGTGTT 
CACGTCCCTT CAACAGGACG 6ATTCAAG7C TTTAAATTAC TTTCAGTAGC TGGTGCTTAT 
TGGCGCGGAA ACTCTGACAA TCATATGA7G CAACGGATTT ATGGCACTGC CTTTTTTGAT 
AAAAAAGATT TAAAAGAGTT TATCAAAAT6 CGCGAGGAAG CCAAAGAACG TGACCACCGT 
AAATTAGGaA AgAATTAGAT TTATTTATGG TTTCACAAGA AGTTGGTTCA GGGTTACCTT 
TCTGGTTACC AAAAGGCGCA ACCATTCGTC GTACAATTGA ACGTTATATT GTGGACAAAG 
AAATTAGCTT AGGTTACCAA CATGTGTATA CACCAATTAT GGCAGATGTG GAATTATACA 
AAACATCTGG TCACTGGGAT CATTACCATG AAGATATGTT CCCACCAATG GATATGGGTG 
ATGGCGAAAT GCTGGTATTA CGTCCAATGA ACTGTCCACA CCATATGATG GTTTATAAAA 
ATGACATTCA TAGTTACCGC GAATTGCCAA TTCGAATCGC TGAATTAGGG ATGATGCACC 
GCTATGAAAA ATCTGGCGCA TTATCAGGGT TACAACGTGT TCGTGAAATG ACTTTAAACG 
ATGGCCATAC TTTTGTTCGT CCTGACCAAA TTAAAGACGA ATTTAAACGT ACTTTGGAGT 



1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 
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TAATGGTGGC AGTCTATGCT GACTTTAAGA TTACGGATTA TCGTTTCCGC 


TTAAGCTATC 


3780 


GTGATCCAAA 


TAATACAGAC 


AAATATTTTG 


ATGATGATGC 


GATGTGGGAA 


AAAGCGCAAG 


3840 


CGATGTTAAA AGCTGCCATG GATGAATTAG AATTAGATTA CTTTGAAGCA GAAGGCGAAG 


3900 


CTGCCTTTTA 


CGGTCCGAAG 


TTAGATGTTC 


AAGTAAAAAC 


AGCTTTAGGA 


ATGGAAGAAA 


3960 


CATTATCAAC 


CATCCAATTA GACTTCTTAT 


TACCAGAACG 


TTTTGACTTA 


ACTTATGTTG 


4020 


GCGAAGATGG 


TGAAAATACA 


CATCGCCCAG 


TTGTTATCCA 


CCGTGGTATT 


GTCTCAACAA 


4080 


TGGAACGATT 


TGTGGCTTAC 


TTAACAGAAG 


TTTACAAAGG 


CGCTTTCCCT 


ACTTGGTTAG 


4140 


CACCAATTCA 


AGCAACTATT 


ATCCCAGTTT 


CTGTAGAAGC 


GCATTCTGAG 


TATGCTTATG 


4200 


AAATCAAAGA 


ACGTTTACAA 


GCACAAGGCT 


TACGTGTTGA 


AGTCGATGAT 


CGTAACGAAA 


4260 


AAATGGGCTA 


CAAAATTCGG 


GCCTCTCAAA 


CACAAAAAGT 


ACCTTATCAA 


TTAGTGGTCG 


4320 


GGGACAAAGA 


AATGGAAGAC 


GCAACGGTGA 


ACGTCCGTCG 


TTATGGAAGC 


AAAGAAACGT 


4380 


CTGTCGAAGA 


TTTATCAATT 


TTCATTGACA 


GCATGGCTGC 


TG7WVGTTCAC 


AATTACAGCC 


4440 


GTTAAAAAAA 


GAGAnAAAGT 


CAAAAGGACT 


TCTGAATTCA 


GAAGTCCTTT 


TGACTTTTTT 


4500 


CAACAACAAA 


ATAATTAAAT 


ATTTCCTTAG 


AGCTGGGAAT 


AATCCTGATT 


TTTTGACTAT 


4560 


TTTTTGTTAC 


AATGAGAATG 


CGAAACTTTT 


TAGTGAAGGG 


ACTAGAGGTA 


TGAAGAAAAA 


4620 


CTCATTTATA 


GnTrAaATAA AAAAAaCAGA 


TGCCGCAGCA AAGAGTCAAC 


GAGGTAGAAA 


4680 


GCCACATATT 


CCTTTTCGAT 


T6AACTTATT 


ATTTTTTGTG 


ATATTCACTT 


TATTTGTTTC 


4740 


ATTAATTGTC 


CGTTTAGGGT 


ATTTACAGAT 


TGTAGAAGGA 


GAAGAGTTTA 


ACAAAAAAAT 


4800 


CACTGCCAAT 


TCTTCACTAC 


AAATCACGAC 


ACCATCCCCT 


CGAGGACAAA 


TTTATGATTC 


4860 


ACAAGGAAAA 


GTCTTGGTTT 


CTAACAAAGC 


CAATTTAGCA 


ATCACATACA 


CACGAGGAAA 


4920 


AAATATTGAA 


GGAAAAGATA 


TTTTACCGAT 


TGCGAATAAA 


GTTAACGAAC 


TTATTAACGT 


4980 


TCCAGTTGAT 


CCAAACTTAA 


CGGATCGTGA 


CAAAAAAGAT 


TATTGGTTAG 


CGAATCCTGA 


5040 


AAATTTAAAA 


GCAGCCCAAT 


CCCGTTTAAC 


CGATCAAGAT 


AAAGAAGATG 


AAAAAGGCAA 


5100 


CAAAATTACG 


GACGAAGGCA 


CATTATATGC 


CAAAGCAGTT 


GAAAAAGTCA 


CCCCTGAAGA 


5160 


AATTGCGTTT 


GATGACCGAA 


CCTTACAAGC 


AGTAACGATT 


TTTAAACGAA 


TGAATGCAGC 


5220 


ATCACAAATG 


AACACGGTCT 


TCATCAAGAA 


TGAAGGGGTA ACAGAAGGTG 


AAATTGCGAC 


5280 


AATTGGTGAA 


CACACAGCGG 


AAATATCTGG 


TGTTTCAACA 


GGGACTGATT 


GGACGCGTGA 


5340 


TTATTCTCAA AGTGGCGCAT 


TACGTAGCCT 


GTTGGGCACc 


GTTTCAACTG 


AAAAACAAGG 


5400 


ATTgCCAGCC GAAGAaGTCG ATGanTATTt GAAAAAaGGC tATGCaCGAA aTGACCGTGT 


5460 


CGGGACTAGC 


TATTTAGAAA AACAATATGA AGACGTTCTA CAAGGTAAAA AAGCAAAATC 


5520 


AGAAGTCGTT 


TTAGACAATA ACGGAAAGAT 


TGTTTCACAA 


ACCCCAATTT 


CCAAaGGGGA 


5580 


AAAAGGCTCC 


AACTTAAAAT 


TAACGATTGA 


TTCAAATTTC 


CAAAACAAAG 


TTGATGAAAT 


5640 


TTTACAACGA AACTATTCAC AAATAGTCAA AACCATTGGA CCATACTCAG AAAATGCATA 


5700 
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TGTCGTTGCT ATGAACCCTC AAACAGGAGC- TATTTTAGCG AT.GTCCGGAT TTCATCATGA 5760 

TTTAGCAACT GGAGAAGTCA CACCTAATCC TTTAGCACCT ATTTTGAATT CTGAAGTTCC 5820 

TGGATCAGTT GTGAAAGCAG GTACTTTAAC AGCTGGTTAT GAAACAGGCG TCATTAAAGG 5880 

CAATGACGTT TTAACAGATG AAGCTATTTT ACTAGCAGGT AGTAATCCAA AAGCTTCTTG 5940 

GTGGAATGCT TCTGGCGGAA CAACCATGCA ATTAACTGCG GAgCAAGCGT TAGAATATTC 6000 

TTCGAATGCT TATATGATGA AATTAGTCTT TAAAATGATG GGTGTGAATT ATTATCCCAA 6060 

TATGATTTTC CCTTATGAAG TAGGAGATGA CACTGTCTTT AAAGAATTAA GaAAAGCTTT 6120 

TGCAGAATAT GGTATGGGAA CGAAAACTGG AATTGATATA CCAGGAGAAA CAACAGGAAT 6180 

TCAArACAAG GACTTTAAGG ATTCGTCATC tGCGCCaCAA GGCGGTAATC TTCTTGACCT 624 0 

ATCTTTTGGA CAATATGAT 6259 
(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9767 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 



CGCTGACTCA 


ATTAGCGTAA 


TGGAGTTTGT 


TTTAGAACTT 


GAAGATGAAT 


TTGGAACAGA 


60 


GATTTCAGAT 


GAAGATGCAG 


AAAAAATTGA 


AACAGTTGGT 


GCCGCTGTAG 


ATTATATTGT 


120 


GAGTAATTCA 


TAAAAAATAA 


TAAGACAAAG 


TCGATTAACA 


TTCCTGTTAA 


TCGACTTTTT 


180 


TTGCGTACAT 


AATGGTTATT 


TTATTAAGCA 


ATAAGGCTAT 


TTTTACACAT 


ATTTTTGTAA 


240 


AAAAAATCAT 


ACATGTAAAA 


AAATAACTGC 


GAAAAAAGTT 


ATTTTATCTT 


GCTTTTTATG 


300 


GAATAATTCT 


ATAAAATTAT 


GAAAAAACAA 


TGAGCTGTAA 


ATTAACATTG 


ATAAAAATTA 


360 


GAAAGTGCTA 


TCATTT^AATA 


TTAGTTAATT 


TATATAATGT 


TTTTGAAAGG 


ACGTAAGGGG 


420 


TGCGTATGGA 


TGCAAGGAAA 


TGAACAATTA 


CTAGAGGTTT 


CAGGATTAGA 


AACAGCATTT 


480 


AGAATCAAAG 


ATGATTATTA 


TAATGCCGTC 


GATGATGTTT 


CATTTGAATT 


GAAGAAAAAT 


540 - 


GAGATTCTGG 


CAATCGTCGG 


AGAATCTGGT 


TGTGGAAAAA 


GTACCTTAGC 


AACAACGATT 


600 


ATGGGCTTAC 


ATGATCCCAA 


TCATACCAAA 


ATTACTGGGG 


AAATTTTATA 


CAACAATTTA 


660 


AACTTAACTA 


CGTTTAATGA 


AACATTGTAC 


AACAAAGTCA 


GAGGAAACGA 


TATTGGGATG 


720 


ATTTTCCAAG 


ATCCGTTATC 


TGCACTGAAT 


CCGCTCATGC 


GAATTGAAGA 


CCAAATTAAA 


780 


GAAAGTCTCA 


CGTATCATAC 


CAAAATGACC 


GCACAACAAA 


AACAAGCACG 


GGCCATTGAA 


840 


CTGTTAGATC 


AAGTGGGGAT 


TCCCAACCCA 


GAGCGTGTCG 


GGAAACAGTA 


TCCTCaTGaA 


900 


TTGTCTGGcG 


GcATGCGCcA 


ACGGGTGaTT 


aTTGcCATTG 


CAATCGCGTG 


tAAACCACCG 


960 


aTTTTaATTG 


cmGrTGaACC 


AACAACGGCC 


TTAGACGTTA 


CCATTCAGGC 


ACAAATTCTC 


1020 



wo 98/50555 PCTAJS98/08985 

788 



GATTTGTTAA 


AAGATTTACm 


AGaAGAAAcA 


GGGAGcGGGA 


TTATTTTAAt 


TACCCATGAC 


1080 


TTAGGTGTAG 


TTGCGGAAAT 


GGCTGATCGT 


GTAGCGGTTA 


TGTATGCGGG 


TCAGTTTGTA 


1140 


GAAGTAGCCA 


CTGCTGAAGA 


ATTGTTTAGT 


CATCCGAAAC 


ATCCATATAC 


ACGTTCCTTA 


1200 


CTTCAGTCCA 


TTCCCCAAGA 


GAATTCAGAT 


GAAGAAGAAC 


TTCATGTGAT 


TGAAGGGATT 


1260 


GTTCCTTCGT 


TAACCAAATT 


ACCTAGAAAA 


GGTTGTCGGT 


TTGCCCCACG 


AATTCCTTGG 


1320 


ATTGATGCAT 


CGGAACATGA 


AGAAAATCCA 


ACATTACATG 


AAGTTGCACC 


GAATCATTAT 


1380 


GTTCGTTGTA 


CATGTTATAA 


ACATTTTCAT 


TTTAGAGACG 


GGGAAGGGGA 


AGCGTAATGA 


1440 


CAGAAATTAT 


TCAAATTAAA 


GATTTAAAAG 


TTCATTATCC 


AATTCGCAGC 


GGCTTCTTCA 


1500 


ATCGAATCAC 


TGATCATGTT 


TTGGCGGTTG 


ATGGTGTGGA 


TTTCATGATT 


GAACAAGGTA 


1560 


AAACGTATGG 


GTTAGTTGGT 


GAATCGGGTT 


CTGGAAAATC 


AACGACAGGA 


AAAGCAATTA 


1620 


TTGGTTTAGA 


TIAAAATTACA 


AATGGTGAAA 


TTATTTATCA 


AGGACAAGAT 


GTAACCAAGC 


1680 


CGCGAAGTCG 


AAAAGCGATT 


GGATACAACA 


AAGATGTCCA 


AATGATTTTC 


CAAGATTCGA 


1740 


TGTCAAGTTT 


AAACCCTAAA 


AAACGAGTAC 


TAGATATTAT 


CGCCGAACCA 


ATTCGAAATT 


1800 


TTGAACGGTT 


AAGTGACCAA 


GAAGAAAAGA 


AAAAAGTCAA 


AAGTTTGTTG 


GATATTGTGG 


1860 


GGATGCCAGA 


AGATGCGTTG 


TACAAATATC 


CTCATGAATT 


TTCTGGTGGA 


CAACGCCAAC 


1920 


GTTTAGGCGT 


CGCCCGAGCG 


GTGGCTACCA 


GCCCCAAATT 


AATTATTGCA 


GATGAACCTG 


1980 


TTTCAGCACT 


GGACTTATCT 


GTGCAAGCAC 


AGGTATTGAA 


TTTTATGAAA 


AACATTCAAC 


2040 


AGGAATATGG 


GTTAAGTTAC 


TTATTTATTT 


CCCATGATTT 


GGGCGTAGTT 


AAACATATGT 


2100 


GTGACAACAT 


TGCCATTATG 


TACAAAGGCC 


GTTTTGTTGA 


AATCGGTACA 


CGCCAAGATA 


2160 


TTTACACTAA 


TCCACAACAC 


ATTTATACCA 


AACGCTTACT 


TTCAGCAATT 


CCTAAAATTG 


2220 


ACGTAGCGAA 


CCGAGAAGCT 


CACAAAGAAG 


AACGAAGAAG 


AGTTGAGCAA 


GAGTATCGGG 


2280 


AAAATCATAA 


AGACTATTAT 


GATGAAAATG 


GGCGGGTTTA 


TAACTTACAT 


GCGATTAGTC ' 


2340 


CAACCCACCA 


AGTAGCGTTA 


AAAAATGGAG 


GTGCTGAATA 


ATGTGGAAAA 


CAATTTTACG 


2400 


CCGTTTGTTA 


TTGATGATTC 


CGCAAGTGAT 


TATTTTAAGT 


GTCTTGATTT 


TCCTGTTGGC 


2460 


TAAAATGATG 


CCTGGTGATC 


CGTTTACAGG 


ATTGATTAAT 


CCGAACCAAG 


ACCCGAAAGT 


2520 


GATTGAAGCC 


ATGCGCGAAg 


cTGCCGGGTT 


AAATGATCCT 


TGGTACGAAC 


AATATTTCCG 


2580 


TTGGATTGGC 


AATGCCTTAC 


ATGGAGATTT 


TGGGCATAGT 


TTTATCTTTA 


AATTACCTGT 


2640 


GTCCACGCTC 


ATTGCTGGTC 


GGGTGGGGAA 


TACGATTGCA 


TTGGCCGCTG 


TGTCAGTTAT 


2700 








XXX rXt\ X X \D\^ X 


X X 


AAAATTPATfi 

ryr\r\r\ X X v«n X O 


2760 


GTTTGACAAA 


ATGGTGGTTA 


TCTACAACTT 


CTTCAGTTTT 


GCCGTGCCAT 


TATTCATCTT 


2820 


TGCCTTGATT 


ATGCTCTTTA 


TTTTTGGGTA 


TCGCTTAGAC 


TGGTTCCCAA 


CGAGTGGAAC 


2880 


CGTGACTGTT 


GGTTTGGCTG 


AGGGAACGTG 


GCCGTATTAT 


TTAGATAAAC 


TGAAGCATTT 


2940 


AATTTTACCA 


GGGGTCACTC 


AAGCGTTGCT 


AGGGACAGCG 


GTCACTATTC 


AATATTTGCG 


3000 
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CAGTGMGTG 


ATTGATGTTA 


GCCAACGAAT 


AAAATTTTTA 


TCAATTAGGT 


TATGAAATTA 


TGCTTTCCCA 


GGAATTGGAA 


TATTACCGCT 


CTCGTGTTGA 


TATTGTAATG 


AGTATTGTTG 


GGTGATAAAA 


ATGGAATTGA 


ACCAATGGGC 


TTTCGGATGA 


TTTAATCCTT 


TTGGTCATTA 


AAGTGCCGTT 


ATGCATGTAA 


GAATGGCACC 


AAGTTCATTT 


GATTATCGGG 


GCTAGGAACT 


TATCGGGGTC 


GGTTTAGGGA 


GATGCGTATC 


GTGGATTTTA 


TTCCGTGATT 


CCTAAATATA 


GGTTGGAAAA 


GCGCGGTTGT 


TAGTGCCTCT 


AAGACAATGG 


GAATTTAAGT 


TCATTAATTA 


CGAAACAACG 


TTAACTTTCT 


CTTAAtTGgc 


TACGCCAGCA 


TGCATCAATT 


CTAATTTTAG 


GCGCTCAGCA 


GATGCACGAC 


AGATGAACAA 


GAAACGGATT 


TAGCTGCATG 


TGGTGGCGGC 


TTTCAAAAAT 


GCCAATCGCT 


ATGTCGCTGT 


AGTTATGGAT 


ACAACTATGA 


TGCACAATAC 


ACTTTAAGAT 


TGTCGATGGG 


CAACCATTAA 


ATTACGTCAC 


ACGTGATTTT 


CTCTTATGAA 


ATAACTTTAC 


GAATATTGTT 


CTGGCATAGA 


AAAAGTCAAT 


GAATGCAACA 


ATTAGGTGGC 
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AAAACATGGA CTTTGTTCGT ACTGCTCGTT CAAAAGGAGT 3060 

ATCGACACAT TTTTAGAAAT GCAGCATTAC CAATTGCCTC 3120 

CTGCTTTGAT TGCCGGATCA GTCGTTATTG AAAAGATTTT 3180 

AATTATTTAT TGATAGTATC ATTCAACGTG ACTATTCCGT 3240 

TTTTGGGATT GGCAACCCTT ATCGGAACCT TAATTTCTGA 3300 

ATCCAAGAAT TCGAATTCAG TAGAAAAAAC ACGAAGAAAG 3360 

CAGAAGAAAA AAGAGAAGAA GTCTTACAAG AAAGTATTCC 3420 

TTGCGCGAGA ATTCGTCAAA GAAAAAATGG CGATGTTTTC 3480 

TCTTATTAGC CGTTTTTATC GGTTCGTTAG TGTTAGACCA 3540 

GTATTTTAGA TAAATATGCG GAACCAGGTA CCGTTACAAT 3600 

TAGGTGCCGA TGAAGGGGGT CGTGATGTTT TAGGACAGTT 3660 

CGATTTTAAT CGGTTTTGCC ATTACGATTA TTACGTCTAT 3720 

TTATTTCCGG TTATTATGGA GGCATGATTG ATAACATTTT 3780 

TTATGATTTT GCCAATTATG TTAATTATTA TTGTGTTCGT 3840 

GTATCTGGTC CTTTATTTGG ATTATGTGTG CGTTTTATTG 3900 

TCCGCAGTAA AACGTTATCT GAAGGGCGCC GAGATTACGT 3960 

GAACAAGCGA TTTTAAAATT ATGTTTCGAG AAATTATGCC 4020 

TTACGAACTT AACGATTAAT TTCGCGGCCA ATATCGGGAT 4080 

TAGGCTTCGG GTTACCACAA AGTGTCCCAA GTTTAGGAAC 4140 

GTGGGGATGT ATTAGTCAAT AAACAATGGG TTTGGTTACC 4200 

TCTTGATGTT AAGTATAAAC TATGTTGGAC AAGCATTTAA 4260 

AACGATTAGG CTAATTAGCC AAAAAATGAG GGAGGAAAAG 4320 

TTAGGTGCAA TCACGTTAGC TTCTGTGTTA GTATTCGGGT 4380 

AATAAAGGCG GGGGCAATAA AGCAACGGAA ACAGAAGACA 4440 

GTTAAAAATG ATAAAAAAGC AATTGATGGC GGTACATTAG 4500 

ACACAATTCC AAGGACTTTT CCAGCAAGAA TTTTATCAAG 4560 

ATGCTTCCAA CGGTACAGCC ATTATTTAAC AATGATGCAG 4620 

GGTCcTGCGG ATCTGAAATT AGATGAAGAT GCCAATACAG 4680 

AATTTGAAAT GGTCTGACGG TAAAGATGTG ACAGCCGATG 474 0 

GTCATTGGTC ATAAAGACTA TACAGGGATT CGTTATGATG 4800 

GGCATGGAAG ACTACCATGA TGGTAAATCG CCAACCATTT 4860 

GATAAAGAAG TTAAAATCAC TTATAAAGAA GTTCACCCAG 4920 

GGTGTTTGGG GCTCAGTTTT ACCAAAACAT GCCTTTGAAG 4980 
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GAATTGCTGT TAAAGACATG GAATCAAGCG 
GACCATACTA CAXGAGTAAT ATTGTGACAG 
ATTACTACGG TGGTAAACCT AAATTAGATA 
GCATTGTAGA AGCGATGAAA GCGAAACAAT 
CGTATCCAAC ATACAAAGAT ACTGAAGGGT 
ACACGTATAT TGGCTTTAAA ATGGGTACGT 
ATCCAAAAGC TAAAATGGCA GATAAAAGCT 
ATGATGCAGT CGGCCAAAAA TTCTACAACG 
CACCAGTCTT CAAGAGCTTG CATGATAGCG 
AAGCGAAAAA ATTATTAGAC GATGCTGGTT 
AAGACAAAGA AGGCAAACCA CTAGAAATCA 
CACAACCACT TGCTGATTAC TATGTCCAAC 
ATACAACAGG ACGCTTAATT GATTTCCAAG 
CAGAAGTAGA TATCTATCAA GGCGCGTGGG 
TATATGGTCC AAACTCAGCC TTTAACTATA 
TACTTGATGC GATTGATTCA AAAGCATCAT 
ACGATTGGCA AGAGTATGCC ATTGATGAAG 
AAGTCTTGCC TGTCAACcIAC CGTGTAGTTG 
ATCCATGGGC AACGGTGGGT GTCACAGCAG 
AGACATAAAG CTCTTCTAGA ATCTGTTTTT 
TTTTATGTGA AAATTTTTGT TACTTCAAGC 
TTGTTATAGA TGAATTGTTT TTTATGCAGT 
ATTTTATTTA GTGTGTATAT CTACATGTTT 
TATAATACTC CCTTTACGTA TAAAGAACGA 
TTTTTGAAGG TGTGTTTGGT ATTGTTTATA 
ATGGATATGC CACTTTATAT GGAGAATTAT 
GAAATAACTG TAAAATGGGT TTGGGCAGTT 
GGTTATGCAG TTTACTGTAG TTGGGTAGGT 
GTTCCCGACT TGGTACACGT TACCTTCAAT 
CCTCTATTTT ATGGAGGCTT CTTGTTAAGA 
AAAAATTTGA TAGGAAAGTT TTAGATAATA 
GTGTGAnTGT AGGTATAAAT GGAAGTGGGA 
TGAAAGATGC AACGTCTGGA AAGGTATTTA 
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ATGCAGTTCG TAAAAACCCT GTGACTATTG 5040 

GTGAATCTGT TGAATACCTA CCAAATGAGC 5100 

AATTAGTGTT CAAATCTGTT CCTTCTGCGA 5160 

ACGATATTGC ATTATCAATG CCAACAGATA 5220 

ATCAAATCTT AGGACGTCCC GAACAAGCCT 5280 

TTGACAAAGA AACAAATACA GTGAAATACA 5340 

TACGTCAAGC CATGGGCTAT GCAATTGACA 5400 

GCTTACGAAC AGGGGCAACA ACGTTAATCC 5460 

AAgcGaAAGG CTATACGCTT GATTTAGACA 5520 

ATAAAGACGT AGACGGCGAT GGCATTCGCG 5580 

AGTTTGCTTC AATGTCAGGC GGCGAAACTG 5640 

AATGGAAAGA AATTGGCTTA AACGTAACGT 5700 

CATTCTATGA TAAATTGAAA AATGATGACC 5760 

GCACAGGTTC AGATCCTTCA CCAACCGGCT 5820 

CACGTTTTGA GTCAGAAGAA AATACTAAAT 5880 

TTGATGAAGA AAAACGTAAA AAAGCCTTCT 5940 

CGTTTGTAAT CCCAACGCTT TACAGAAATG 6000 

ACTTTACTTG GGCAGTTGAT ACGAAAGATA 6060 

ACTCACGGAA ATAAAAAAAC AGATTCTAGA 6120 

TTGGATAGGA TAGTTTCTTA GGCCATCTTA 6180 

AATTACAATT TATATGTATG CGTTATTTTT 6240 

ATGGTTTTTG ATGGGAGGTT CTGTTTTTTT 6300 

GTTATGAAAT GTTATTTTTT TTATTATATA 6360 

TAAGTTACAA TTAGATTAAG TTTTAGTTAG 6420 

ACTAAAACTT ATATTTAAAG AAAGGAGTGA 6480 

TTGCTTGCAT TGCAATCAAA ATATTCACAA 6540 

GTAATATTAC TTTCAGTGAT TGGCTGTATT 6600 

GGAACATTTG CTGGGAGTAT TAAAATTGGT 6660 

TGTAAACGTT GATATTTAAG GAGGAAGGAG 6720 

AACTTATGAT AAGAATTGAA AATTTAACAA 6780 

TAACTATCTC ATTGCCTAAA AACCGAGTTA 6840 

AAACTACTTT ATTAGATTGT AATGTAGGAT 6900 

TTGAGAGCTA TTCTAATGAT TCTGAAAAGT 6960 



wo 98/50555 PCTAJS98/08985 

791 



TTAAAGAATG 


TATATTCTAT 


ATACCATCGG 


AATTTTATCT 


ACCTAATTAT 


ATGACGGGTA 


7020 


AAGAGTACTT 


AAACTTTGTT 


TTATCAAGAT 


AXCGCTGTAG JTGATATAGAA 


CGGATAGATG 


7080 


ATTTTTTAGA 


ACTATTTGAT 


TTAAAATTTG 


CTGGAACTAA 


TTTAATAGAA 


TCGTACTCAT 


7140 


TTGGTATGAA 


GAAAAAAATA 


CAAATAGTtG 


CAGCAGCACT 


AGCTAACACT 


GATTATATAT 


7200 


TAGGTGATGA 


AATATTTAAT 


GGACTTGACT 


TTGAGACAAC 


ATTGCTTACG 


TTAGAGCTGT 


7260 


TT6AAAATTT 


ATCTAGAGAA 


GTAGGGATAG 


TGATTATTTC 


TCATAATAAA 


TTAATTATTG 


7320 


AAAGGTTTTC 


AGAGAATATT 


TTATTGATGT 


CCAATGGCAA 


TTTAACTCCA 


TTTTTAGGGG 


7380 


CGTCAGAG7\A 


TTTAGAAAAA 


GAAGTGATAA 


GTACGGAGAA 


AATTCATGAA 


AAAATTAAAT 


7440 


ACATCAAAGG 


ATATCATCCT 


ATTAATCGAG 


TTATTTATTG 


ATAATGCTTT 


ACGAGGTTTG 


7500 


CTCAAGCACT 


CTATTTTGTC 


AAAAAAGAGT 


ACTCGCATCG 


TAATAGGTGT 


GTGTGTATTT 


7560 


TTTATCTATT 


TTGCATACTT 


TTTTTTTAAT 


ATGAGTGAAC 


TAGCTAGAAT 


TGTACCAGAT 


7620 


TCAGAAAAAA 


TTAGTCATAT 


TTTGATAGAA CAAGGTCGAA 


AAATAACTTT 


TTATAGTTAT 


7680 


TTTAGTAATA 


CATTTGTTTT 


AGGTATAATA GCATATATTT 


TAGTTGaTAA 


TACTGTTGCa 


7740 


TTAGATAAAA 


ACTCCTTATT 


TTTTGTGAAA 


ACATTGCCtT 


TTAGAAAGAA 


AGACATCAGC 


7800 


TTATCTTTTA 


TGCTATTCAA 


ATTTGTAATT 


ATGATATTGC 


TTTATGAATT 


AGTTATGATC 


7860 


ATTTCTACGC 


CAGCTATTAA 


ACTTGTGACG 


ACTGTTCCTA 


TTGAATATGT 


CATTTTTTTT 


7920 


ATTGTTCAGC 


ATCTTTTTTA 


TCTAGTGGTA 


ATTGGGGTGA 


TTGAGTTTAT 


ACATTGTTTG 


7980 


TTTACATTCT 


TTTTGAAACG 


AAGAGTGAAA CAAATTAATA 


GTCTCAAAGT 


GATAGGTGAT 


8040 


AGTTTATTAA 


TGATTTTTGC 


TACTTTTTAT TTTTTtGATT 


TTCGTTATGG 


TTTAGAATTA 


8100 


TTTCTAGCAA 


ACCAATTGTG 


GAGTATCTCC 


TATATGATTC 


CTATCACTTT 


TTTTCTAATA 


8160 


GTGGTATGTT 


TAATGCTTAT 


TAGTATAGGC 


CTATTAAGAC 


TAATTAGTAT 


TCTTGAAAAT 


8220 


CAAATGTCTC 


AACATTCAAA 


ATATATATAC 


ATTCCATTCT 


TAAATAAAGT 


tATATTTCGT 


8280 


TATAAAGCGA 


ATTGGTATTT 


TATTAGTTTT 


ACTACAGTTG 


TTATGTTGGT 


TATTTTTTTC 


8340 


CAAAGTGGTT 


TAAGAACAAT 


GTTATTCATT 


TTGACAACTG 


TTATGGCCTT 


TTCAGGCGTT 


8400 


TTATTATTAA 


GCTATGGAGA 


TATTACTGCT 


GATTTCAGAA 


AACAATATGA 


TTTATTGAGA 


8460 


ATAAAAATTA 


GAAATGAATG 


GTTGAGTCmA CTATTGCkAG 


TGATAATGTT 


GGCAATGCCA 


8520 


TtGtTACTAC 


TTgtTTTTGG 


GGATTtGGGG 


AGTTAGCTCA 


ACTAATTACA 


GCGTTGTCGT 


8580 


TATCATTAGT 


AGCTATAATT 


TTGGGATATG 


TATTTCCAAA ATCACAAGGA AGTTTAAATG 


8640 




ATTATTHTTA 


TTATTTATTG 


TATTTGTTCT 


AGTTAGTTTA 


TTGACAAATC 


8700 


GCTCTTTTGG 


TTGGTTGATT 


CTAATTGTTT 


TAGTAnTCTT 


GCATTTGTTA 


GTAATTAAGA 


8760 


AGGTGAGAAA 


TGAAAAAGAA 


TAGTAAATAT 


TTAGGAAGTA 


TAACAGTTAT 


CTATTTGTTA 


8820 


GTTCTTGTGA 


TCGAAACATT 


ATGGGAGCTA TTCCATACCA 


GTCGCACAGA 


ATCAACGAAA 


8880 


GTCACTACCT 


TATTAGGAAT 


TACGATTGAT 


AATCGTATAA 


GTAAACATGA 


AATTTCAACT 


8940 
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ACTTTTGGTT 


TGACAATTAA 


GGTATTAGTA 


•CTCTATTTGT 


TGTTATTGTT 


AGTTGTCTAT 


9000 


ATTTTGACTG 


GGTTATTTGG 


AAAAAAGAAA 


ATATGATTAT 


TAAGTTATCA 


ATATTGTATG 


5060 


TGTTGTCAAT 


ATATGTCGTG 


TTACTTGTAC 


ATGAGTGGTT 


ACATTTTATT 


TTAGCCmAAT 


9120 


TCTTCAAATA 


CAATGcTTAT 


ATAAAGAGAG 


TTGGTCTATA 


CCCATTTAAA 


GTGGTGTATA 


9180 


CGAACAGAAA 


TAATCCtCTA 


GATAATTTAT 


TGATTTCAGC 


GATATCTCCG 


CTTTTCTTGG 


9240 


TTATAGTAGG 


AATAATTTTG 


CCATTAAATT 


ATTATACTGT 


GATTTTAAAA 


GTATCTTGTA 


9300 


TTAGTAATAT 


TTTTAATCTC 


TTACCGTTTA 


CTGCAGATGG 


AGAAATAATT 


TTGCTTTCAA 


9360 


TTTTTCAAAT 


TTTTAGGAGG 


AGAAATA7UVT 


GAAAAAAAGA 


AAAGTGTTCA 


TTATTTTTCT 


9420 


TGTTAGTTTG 


ATTATATTTG 


TTTTTGGTTA 


TTACTTAGCG 


TTTAATCAAA 


ATAAGTATGT 


9480 


ATCATATAAT 


GACGAGACAA 


AACAGTTTGT 


ACATACTGAT 


ACTTCAAAAA 


TAATGGAAAA 


9540 


TCTGGTGAAC 


AGACAACAAG 


GAATTTATTA 


TTTTGGCTTT 


CCGACTTGTC 


CTTGGTGTCT 


9600 


AGAATTATTA 


CCAATACTAG 


ATAAAGAATT 


AGAGAAGGAA 


AGTATGAATG 


CTTATGCTGT 


9660 


AAATACTCGT 


GGTGATGACT 


ATACaGAAAA 


TGATGAkGAA 


TTATTACaAA 


AATTTkATCa 


9720 


AAAATATACT 


GGGGATGAGT 


CGCTTTCTGT 


ACCTTTTATT 


GTTGCTA 




9767 



(2) INFORMATION FOR SEQ ID NO; 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

AAAATCGTTT TGGATTTTTT CTCGGnTnCT TTCTATTTAA GTTATTTTAG TTGGATATCG 60 

- TTATTTAATA AGGCTATTAT ATGGTATAAT GATAATTAAC TGAAGTGTGG AGGAAATAAA 120 

ATGTTATCAA TTGTTGTCCC TTGTTATAAT GAGGAAGCAG CGATTCCGCg TTTTTTgAAG 180 

AAGTAGAAAA AATTAGCCAG AAACTGTCTC ATTCAGTAGA ATATATATTT GTGAATGATG 240 

GCTCAAAAGA CAATACGTTA GCTGTTTTAA GACAGCTTTA TCGCGAACAT CCAGAGAAGG 300 

TCCGTTATCT TTCTTTTTCA CGAAATTTTG GGAAAGAAGC GGGCTTATAC GCTGGTTTAA 360 

AAGAAGCAAC AGGTGACTTA GTGACAGTCA TGGACGTTGA TTTACAAGAC CCGCCTGAGT 420 

TATTACCTCA AATGATTGAG ATGATTGAAA CCAGTACAGA TTTAGATTGT GTTGGCACAC 480 

GACGCATAAC AAGAGACGGC GAACCGCCCA TTCGTAGCTT TTTTGCACGG ATGTTCTATA 540 

AACTCATTAA TCGAATTGCT GAGACGGAAA TGGTGGATGG TGCCCGTGAC TTTCGTGTCA 600 

TGACCCGGCA AATGGTCGAT GCGATTTTAG AGTTATCTGA GTATAATCGT TTCTCAAAAG 660 

GGATTTTTAG TTGGGTCGGC TTTAAAACAG AATATATTGA ATTTAAAAAT CGAGAACGAA 720 

TTGCTGGAGA AACCTCTTGG TCTTTCTGGA GTCTATTAAG TTATTCAATT GACGGGATTG 780 

TCAATTTTTC AGAAACACCA TTGAATATTG CTTCATACGT TGGCGCCTTT TCGTGTATTG 840 
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AtjCAATGTTA 


GTGATTATGT 


TTCGAACATT 


AGTTAATGGT 


GATCCAAC6A 


900 




TTCAATGGTT 


TGTATTGTAC 


TTTTTGTAGG 


TGGTTTGCAA 


CTATTATGTT 


960 


TAGGAATTAT 


TGGCAAATAT 


ATCGGTAAAA 


TTTTCTTAGA AACGAAAAAA 


CGACCAATTT 


1020 


ATATTGTAAA 


GGAATCAGAA 


AAAGATTCTA 


AAAAATAACC AAAAAAGCAG 


AGAAAAATTC 


1080 


TCTGTTTTTT 


TGTTGTGCAT 


TAAAAAAAGT 




X 1 1 i U AVaAAA 


AACAACATTT 


1140 


TTTACATAAA 


AAATACAfTA ACTTTACCTA 


1 i i X i Aid i 


AA risAAACAG 


CTTTGCTCTC 


1200 


GCGGTTGCAA 


CaAAATAAATG 


T T AAAATCAA 


AAovj 1 A 1 OL> 1 


ATT AAATG T T 


TTTTTGGAGG 


1260 


TTGGGTcATG 


ATAGAAGTTA 


TCGACT TGAA 


AAAAU 1 A 1 1 


GATAATGGGT 


TTGAAGCGTT 


1320 


AAAATCGGTC 


AAT T TT ACGA 


TTGAGCAAGG 


GGACTTGGTT 


TGTTTGTTAG 


GACCAAGTGG 


1380 


GTGTGGGAAA 


TCCACAATTT 


TAAACTTAAT 


TGCAGGTTTA 


TTGCATCCAA 


GTGATGGAGA 


1440 


TATTCAGTTT 


CGACAGCAAT 


CGGTTGTCAA 


AACAGCACCA 


AAAGATCGGA 


ACATCGGGTT 


1500 


CGTTTTTCAA 


AACTATGCCT 


TATATCCACA 


TATGACTGTT 


CTAGAAAACG 


TAATGTTTCC 


1560 


TTTGACGGTA 


GGAAGTAAGA 


AAGTTCCCAA 


GGCAGAGGCA 


CAAGCTATTG 


CCGAAGAATA 


1620 


TATGAAACTA 


ACGAATATTG 


AAGAGTTAAG 


CCATAAAAAG 


CCAGGCACAC 


TTTCTGGTGG 


1680 


TCAACAACAG 


CGTGTTGCGA 


TTACCCGGGC 


GTTAGTTCAA 


AAACCAGATG 


TTTTATTATT 


1740 


AGATGAGCCT 


TTAAGTAACT 


TGGATGCACG 


TTTGCGTTTG 


AAAATTCGTG 


AAGAAATCCG 


1800 


CCGCTTGGTG 


AAAGAAGTAG 


GGATTACAAC 


AATCTTTGTA ACGCACGACC 


AAGAGGAAGC 


1860 


GCTATCAATC 


AGTGATAAAA 


TTATTTTATT 


AAACGAAGGG 


GTTATTCAAC 


AAAATGATGA 


1920 


ACCTCAAAAC 


CTTTACTTAG 


AACCAAATAA 


CTTATTTGTT 


GCTCAGTTCA 


TAGGTAACCC 


1980 


AATTATTAAT 


TTATTGTCTG 


TTGAAGTGAA 


AGACGGCAAA ATGTACCACA 


AAAGTTTTGA 


2040 


AATTCCGCTT 


GAGCGTTTTG 


AGCAAGCACG. 


CTTTAAAATG 


CCAATGACTG 


ATGGGAAATA 


2100 


TACGTTTGCT 


TCTCGTCCgG AAGATGTGGT 


ACCAGCTGAA ACAGGTCTCT 


TTACCACAAC 


2160 


GACAGATTTA 


GTGGAATTGA 


TTGGTCGAGA 


ACGTATTTTA 


CGATTTACAT 


TAGGAAATGA 


2220 


ACAAGTGAAA 


TCAATTGTAA 


GTGTAGAAGA 


AGCGATTGAA 


GAAGGAGACA 


CTTTATCTTT 


2280 


TGATTT TTCA 


TATAAAAAAG 


TATTTATCTT 


TAACGAAGCG 


GGAGACCGGG 


TTTACTAATG 


2340 


AAAAAATACA 


ATCCAGAAAA 


TCAACCAAAA 


GCATGGCTCT 


TCCTTCTGCC 


ATCATTAGGA 


2400 


AiUAI I ITAC 


TGTTTAGCGT 


TTATCCACTG 


TTTCGTTCTT 


TGTGGATGAG 


TTTCCAAAAA 


2460 




TTAACCAACG 


cTACGCTGGT 


TTAGAAAATT 


ATCAACGTGT 


ATTGAATGAT 


2520 


CCAATCTTCT 


ATAAAGCGTT 


AAAAAATACA 


GCGCTCTATG 


CATTTGCTGT 


GGTACCGATT 


2580 


GCCTTGATAA 


TTTCCCTAGC 


AATTGCTTGG 


ATTATCTTTG AAAAAGTTAA 


ACATAAGAGT 


2640 


TTCTTTGAAA 


CGATTTTCTT 


TATGCCTTAT 


GTAACGAGTA 


CGATTGCCAT 


TGGGATTGTC 


2700 


TTCCGTTACT 


TCTTTAATGG 


CGATTATGGA 


ATTGTCAATT 


ACGTCTTAGG 


CTTTTTCGGC 


2760 


ATTCCTTCTG 


TCAACTGGCT 


AGATAATGTT 


CAAATGAGTA 


TGCCAACATT 


GATTATTTTT 


2820 
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GGGGTTTGGA 


CGAGTTTAGC 


ATTTAATATT 


ATTATTTTGT 


TGGCTGGGTT 


GAGAAATATT 


2880 


GATGAAGAAC 


ATTTTAAAAT 


TGCAAAAATG 


TTTGGCGCCT 


CAGACGGCGA 


AATTTTCCGA 


2940 


CGTATTACGT 


TTCCGCAGCT 


AGTCCCGACC 


ATTGCCTTTT 


TATTAACGGT 


CAACTTAATT 


3000 


GGCGCCTTTA 


AAGTTTATAC 


ACAAGTTTAT 


GCTTTATTCG 


GTGGCCGTGC 


GGGGATTGCC 


3060 


AATAGTGCCA 


CAACCGCAGT 


GTACTATATT 


TATGACAAAT 


TCCACATTGC 


AGGACGTCCC 


3120 


GGAATTGCGA 


TGGCCGCCAC 


AGTGATTTTA 


TTTGTGATTA 


TTTTAGTGGT 


CACTTTCTTA 


3180 


CAAAATAAAC 


TCTTAAAGAA 


AGTGGGGCAA 


TAAGCCAATG 


AAAAAAGTTT 


TAACGATTAT 


3240 


CGCATTTGTT 


TTCTTAGGCA 


TTTTAGCCGT 


TATTACGTTA 


TTTCCATTTG 


TCTATATGAT 


3300 


TTTAGCTGGT 


TTAATGAGTT 


ACTCTGAAGC 


GACCAGCATG 


CCGCCAACCA 


TGTTTCCGAA 


3360 


ACAACCACAA 


TGGCTU^AACT 


ACACAGAGGT 


CTTTCAAAAA 


GCACCCTTTC 


TTCGGTATTT 


3420 


CTTGAACACA 


GTCTTTGTTT 


CAGGCGTTAC 


AACGATTGCT 


ACGGTAGTCA 


CCGCCGTGTT 


3480 


AGCTTCATTC 


GCATTAACGA 


GTTTGAAGTT 


CCGTTTTAAA 


7VATGTGGTCA 


TTGCTTTGAT 


3540 


GATTTCGTTG 


TTAATGGTTC 


CGTATGAATC 


TATTATTTTT 


ACCAATTATC 


AAACGATTGC 


3600 


GCAATTAGGT 


TTGTTAAATA 


CTTACAGCGC 


CTTAATTATT 


CCATTTTTAA 


CTAGTATTTT 


3660 


TTATATTTAC 


TATTTAAACG 


GTTATTTAAA 


AGGTATTCCT 


GATACTTTTT 


ACAAGGCTGC 


3720 


CAAAATTGAT 


GGCGCTAGTG 


ATTTAGAATA 


TATTTGGCGT 


ATTTTAGTAC 


CAATGTCAAA 


3780 


ACCAGCGTTG 


GTAACAGTAG 


GGATTTTAAC 


ATTTATTTCT 


AGTTGGAATT 


CTTTCTTGTG 


3840 


GCCATTGTTG 


GTGACGAACG 


AGAAAAAATA 


CCGTCTATTG 


AATAATGGAC 


TATCGGCCTT 


3900 


TGCGACAGAG 


AGCGGTAGTG 


ACGTACATTT 


ACAAATGGCT 


GCTGCAACAT 


TAACCGTTAT 


3960 


TCCAATTTTA 


ATTATTTACT 


TGATTTTCAG 


AAAAGAAATT 


ATCAGAGGAG 


TTGCAAAAAA 


4020 


TGGAATCAAA 


GGCTAAAACA 


ACTGTGACTT 


TTCACAGTGG 


CATTTTAACA 


ATTGGAGGAA 


4080 


CCGTGATTGA 


AGTGGCGTAT 


AAAGATGCCC 


ACATCTTCTT 


TGATTTTGGT 


ACAGAATTTC 


4140 


GACCAGAATT 


AGATTTGCCT 


GATGATCACA 


TTGAAACCTT 


AATTAATAAT 


CGTTTAGTGC 


4200 


CAGAATTAAA 


AGACTTATAT 


GATCCACGTT 


TAGGTTATGA 


ATATCATGGT 


GCAGAAGACA 


4260 


AAGACTATCA 


ACACACCGCT 


GTTTTCTTAT 


CTCATGCCCA 


TTTAGATCAT 


TCACGCATGA 


4320 


TTAATTATTT 


AGACCCAGCT 


GTTCCGTTGT 


ACACCCTAAA 


AGAAACGAAA 


ATGATTCTAA 


4380 


ATAGTTTAAA 


TCG/y^AAGGT 


GATTTTTTGA 


TTCCATCACC 


GTTTGAAGAA 


AAGAATTTTA 


4440 


CCCGAGAAAT 


GATTGGTCTT 


AATAAAAACG 


ATGTGATTAA 


AGTTGGCGAA 


ATTTCAGTGG 


4500 


AAATCGTGCC 


TGTTGATCAT 


GATGCGTACG 


GTGCTTCGGC 


ATTACTGATT 


CGCACGCCTG 


4560 


ACCATTTCAT 


CACATACACA 


GGTGATTTGC 


GCTTACATGG 


CCATAATCGT 


GAAGAGACTT 


4620 


TAGCTTTTTG 


TGAAAAAGCC 


AAACATACTG 


AATTATTAAT 


GATGGAAGGC 


GTAAGCATTA 


4680 


GCTTCCCAGA 


ACGTGAACCA 


GATCCAGCCC 


AAATAGCGGT 


TGTCAGTGAA 


GAAGATCTTG 


4740 


TTCAGCACTT 


GGTTCGTTTA 


GAACTAGAAA 


ACCCGAATCG 


ACAAATTACC 


TTTAATGGTT 


4800 
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ATCCAGCAAA 


CGTGGaACGT 


TTTGCTAAGA 


•TTATTGAAAA 


GTCACCACGT 


ACAGTCGTTT 


. 4860 


TAGAAGCAAA 


TATGGcTGCG 


TTGTTACTTG 


AAGTATTTGG 


AATAGAAGTT 


CGTTATTATT 


4920 


ATGCTGAATC 


TGGTAAAATA 


CCAGAATTGA 


ATCCAGCGTT 


AGAAATCCCg 


TATGACACGT 


4980 


TACTAAAGGA 


CAAAACAGAC 


TATTTGTGGc 


AAGTTGTGAA 


CCAGTTTGAC 


AACCTCCAAG 


5040 


AAGGTAGTTT 


ATACATTCAC 


AGTGATGCAC 


AACCGTTAGG 


GGACTTTGAT 


CCACAGTATC 


5100 


GCGTGTTTTT 


AGATTTGTTG 


GCTAAAAAAG 


ACATTACTTT 


TGTCCGCTTA 


GCTTGTTCAG 


5160 


GACATGCAAT 


TCCAGAAGAT 


CTGGATAAAA 


TTATTGCATT 


GATTGAACCT 


CAAGTATTGG 


5220 


TTCCAATCCA 


TACGTTAAAA 


CCAGAAAAAC 


TGGAAAACCC 


GTATGGTGAA 


AGAATATTGC 


5280 


CAGAACGTGG 


CGAGCAAATT 


GTTTTATAAA 


TTTTTTTAAG 


GGAGAGAAAA 


AAATGAAGTT 


5340 


CAAAACTCTA 


GCAACAACAG 


TGTTAGCAAC 


CGCAGCTATT 


TTCGCATTGG 


GGGCTTGTGG 


5400 


TAACGGTAAT 


GGGGCCAAAG 


AATCAAACGA 


TATTGTGAAA 


GAAGTGAAGG 


AAGATACGAC 


5460 


AATCACTTTC 


TGGCATGCAA 


TGAATGGGGT 


TCAAGAAGAA 


GCGTTAACAA 


AATTAACGAA 


5520 


AGACTTCATG 


AAAGAAAATC 


CAAAAATTAA 


AGTGGAATTA 


CAAAATCAAT 


CTGCTTACCC 


5580 


TGATTTACAA 


GCCAAAATCA 


ATTCGACTTT 


AACTTCACCA 


AAAGATTTAC 


CAACAATTAC 


5640 


GCAAGCGTAC 


CCAGGCTGGT 


TATGGAATGC 


TGCACAAGAT 


GAAATGTTAG 


TGGACTTAAA 


5700 


ACCATATATG 


GATGATGACA 


CAATCGGCTG 


GAAAGATGCA 


GAGCCAATTC 


GTGAAGTATT 


5760 


GTTAGACGGC 


GCCAAAATCG 


ACGGCAAACA 


ATACGGCATT 


CCATTTAATA 


AATCGACAGA 


5820 


AATGTTATTC 


TATAATGCTG 


ATTTGTTGAA 


AGAATATGGT 


GTTGAAGTAC 


CGAAAACATT 


5880 


AGAGGAATTA 


AAAGAAGCTT 


CTAAAACAAT 


TTACGAAAAA 


TCCAACAAAG 


AAGTCGTTGG 


5940 


TGCTGGTTTT 


GACTCGTTAA 


ATAACTATTA 


CGCAATTGGA 


ATGAAAAACA 


AAGGCGTTGA 


6000 


TTTTAATAAA 


GACTTAGATT 


TAACAAGCAA 


AGATTCACAA 


GAAGTCGTGG 


ACTATTACCG 


6060 


TGATGGTATC 


GAAGCAGGTT 


ACTTCCGCAC 


AGCTGGTTCA 


GATAAATATT 


TATCTGGCCC 


6120 


ATTTGCAAAC 


AAAAAGGTAG 


CAATGTTTGT 


CGGTAGTATT 


GCTGGTGCTG 


GTTTTGTTCA 


6180 


AAAAGATGCT 


GAAGCTGGTG 


GCTATGAATA 


CGGTGTTGCA 


CCACGTCCTG 


AAAAAATCAA 


6240 


CTTACAACAA 


GGAACAGATA 


TTTATATGTT 


CGATAGTGcT 


ACGCCAGAAC 


AACGGACAGC 


6300 


GGCATTTGAA 


TTCATGAAAT 


TCTTAGCTAC 


TCCTGATTCA 


CAATTGTACT 


GGGCACAACA 


6360 


AACAGGTTAT 


ATGcCAATTT 


TAGAATCTGT 


TTTACACAGT 


GATGAGTACA 


AAAATTCTAA 


6420 


GACAACCAAA 


GTACCTGCAC 


AACTTGAAAA 


CGCAGTAAAA 


GATTTATTCG 


CTATCCCAGT 


6480 


AGAAGAAAAT 


GCTGATTCAG 


CCTATAATGA 


AATGCGGACA 


ATTATGGAAA 


GTATTTTTGC 


6540 


TTCATCAAAT 


AAAGACACGA 


GAAAATTATT 


GAAAGATGCA 


ACATCACAAT 


TTGAACAAGC 


6600 


ATGGAACCAA 


TAATCAATAA 


ATAAACAGAC 


GTGGGAGGTT 


GGGATCTGAG 


TCGTTATGGC 


6660 


TCGTGTTCTA 


ACCTTCCTGT 


TCTTTTGGGC 


GGAAGAAGGA 


GAGAAACAAA 


ATGCAAATAA 


6720 


AAATTTTAGC 


AACAAGTGAT 


ATGCATGGGT 


ACATTATGCC 


AACAAGTTAT 


AGTGAAAAAA 


6780 
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AGATGGATTT 


ACCTTTTGGA 


ACCGCAAAAG 


CAGCAACCAT 


GCTGAAAAAG 


TTACGGGCGT 


6840 


CTGCCAAAGG 


ACCAGTTTTT 


CAAATTGAAA 


ATGGGGATTT 


TATTCAAGGT 


TCGCCGCTTA 


6900 


GTTACTACGT 


AAGAAAAGCA 


GAAACACATT 


CAGTGGCTGC 


CATCACTAAA ATCATTAATC 


6960 


AAATGAATTA 


TGATGTTAGC 


ATTTTAGGTA 


ACCATGAATT 


TAATTACGGT 


TTAGATTATT 


7020 


TAAAGGAAAC 


GATTGCCAGT 


TATCAACAGC 


CAGTTTTAGC 


TGCCAATATT 


CTTGGTAAAG 


7080 


ATGGACAACC 


TTATTTCGGT 


CAGCCGTATG 


TCATCATAGA 


AAAACAAGGG 


GTTAAAGTAG 


7140 


CCATCCTTGG 


CGTGACGACT 


CAATATATTC 


CTCATTGGGA 


ACAGCCGGCA 


ACAGTCAAAG 


7200 


ATTTAACGTT 


TAAAAGTGTC 


GTTGAAACAG 


CGGCGGAATA 


CGTGCCAAAA 


CTGCGGGAAG 


7260 


AAGCGGATCT 


AGTTGTTGTC 


GCTTATCATG 


GCGGTTTTGA 


AAAAGATTTA 


GAAACAGGGG 


7320 


AACCCACGGA 


ATTACTAACA 


GGAGAAAATG 


AAGGCTACGA 


TTTATTGGAA 


AAAGTTCCTG 


7380 


GGATTGATGC 


CTTAGTCACT 


GGTCACCAAC 


ATCGAGAAAT 


TGCTACAAAA 


CTGAATGGAA 


7440 


TACCTGTGAT 


TCAACCAGGT 


TTTCGAGGAG 


CTTTCGTTGG 


AGAAATTACT 


TTAGAAATTG 


7500 


AACCTATGGC 


TAAAGGCTAT 


CACGTTATTG 


GAAGTGACGC 


AGCTATTCAT 


CCTGTTGGAA 


7560 


ACGAACAACC 


AGATACAGAA 


GTTTTAGCTT 


TGACAACAGC 


GTTACATGAC 


GAGGTAGAAG 


7620 


AGTGGcTTGA 


TCAACCAGTG 


GGAAATGTGG 


AAGGGGATAT 


GACGATTCAA 


AATCC6AATG 


7680 


CCGTACGCCT 


CAAAGAACAT 


CCATATATAG 


aATTTATTAA 


TAACGTCCAA 


ATGGCTTCAA 


7740 


GC6GAACGGA 


TATTTCAGGG 


ACTGCACTTT 


TTAACAATGA 


AGGCAAAGGG 


TTTAATAACC 


7800 


AAATTACGAT 


GCGCGATATC 


ATTACCAACT 


ACATTTATCC 


AAATACCTTG 


GCTGTATTGC 


7860 


GTGTCACTGG 


ACAAGACCTC 


CGTGAAGCGT 


TAGAACAAAG 






7900 



(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 



TCCTTACATT TCATGTTATA AAATGTAACA TAGAAATTAA GAACAAAAGA TTAAGATCAG 60 

TTGAAGTCCG AAACAGCAAA AGTGGCGAAA ATATGTTCGG TATGTTAAAC TAATAAGTGG 120 

ACATTCTGCC AGAATGGGCC GATAATTTAA GAGAAATTAG ACACTTTGAG GGGGAGTTAT 180 

AAATGATCGA AAGAGAAACT TCCAATACCT TTCATTTGGC TTCTAAATAT GAGCCTGCCG 240 

GAGATCAGCC AGCCGCTATT GCTGAACTAG TTGACGGTGT CAAAGGTGGA GAAAAAGCTC 300 

AAATTTTACT TGGAGCGACC GGAACAGGTA AAACATTTAC AATCTCTAAT GTGATTCAAG 360 

AAGTGAACAA ACCAACATTA GTGATTGCAC ATAATAAAAC ACTAGCTGGA CAATTATACG 420 

GGGAATTTAA AGAATTTTTC CCAGATAATG CGGTCGAGTA TTTTGTCAGT TACTATGATT 480 
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ATTATCAACC AGAAGCATAT GTACCCTCAA 
TTAATGATGA AATTGATAAA CTACGCCATT 
ATGTCATCGT GGTAGCATCA GTTTCTTGTA 
GTCAACAAGT TGTTTCATTA CGTGTGGGAA 
GCTTAGTCGA TATTCAATTC GAACGTVAATG 
TCCGCGGTGA TGTGGTTGAG ATATTCCCAG 
AATTTTTCGG AGACGAAATT GATCGAATTC 
TCGGCGAAAC AGAGCATGTC GCTATTTTTC 
ATATGGAACA CGCCATATCG CAGATCCAAG 
GCAGTGAAAA TAAATTATTG GAAGCTCAAC 
A6ATGATGAG AGAGATGGGC TATACGTCAG 
GTCGCCAAGA AGGAGAGCCG CCATATACGC 
TTGTAATCGA TGAGTCTCAT GTAACAATGC 
GAGCACGGAA ACAAATGTTA GTGGACTATG 
GACCGCTTCG TTTAGAAGAA TTTGAACAAC 
CACCAGGCCC TTATGAAATG GAACAAACAG 
CAGGTTTACT AGATCCAGAA GTAGAAATCC 
TGGGCGAAAT TCATGAaCGG ATCGAAAAAG 
AGAAAATGGC CGAAGATTTA ACCGACTATT 
TGCATAGTGA TATAAAAACA CTGGAAAGAA 
AATTTGATAT TCTAATAGGT ATTAACTTAT 
CCTTAATTGC GATTCTAGAT GCGGATAAAG 
TGCAAACGAT GGGACGGGCT GCCCGGAACG 
AAATCACCGA TTCGATGCAA CGAGCTATGG 
AAGCATATAA TGAAGAACAT GGTATTGAAC 
TGATTTCTAT TTCCAAAACA GCTGATAAAG 
ATAAAGATTT ATCAAGACAA GAAAAAGCTG 
AGGATGCGGC CAAAGCATTA GACTTCGAAA 
AATTGAAAGC TGCCAAATAA GTCAACCAAC 
TATCTTTTAC TAATCTTTTC TCTATAGAAT 
GTAAAAGGAG AACATATATG GCAAATGATA 
ACTTAAAAAA TATTGATGTC ACGATTCCTC 
CTGGTTCTGG CAAAAGTTCA TTAGCGTTTG 
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GTGATACATA 


TATTGAAAAA 


GATTCCACTA 


540 


CAGCGACAAG 


TTCTTTATTG 


GAACGAAACG 


600 


TTTTTGGTTT 


AGGGGATCCG 


CGGGAATACA 


660 


TGGAAATGGA 


TCGTAATGAG 


TTACTAAAAA 


720 


ATATTGATTT 


TCAACGGGGG 


CGTTTTCGTG 


780 


CTTCACGGGA 


TGAGCATGCG 


CTACGTGTTG 


840 


GTGAAGTAGA 


TGCGTTAACT 


GGTGAAATTG 


900 


CGGCAACACA 


CTTCGTGACG 


AACGAAGAAC 


960 


AAGAATTAGA 


GGCACGTTTG 


AAAGTGCTTC 


1020 


GTTTGGAACA 


GCGAACCAAC 


TATGATATTG 


1080 


GGATTGAGAA 


TTACTCTCGA 


CATATGGATG 


1140 


TGTTGGACTT 


TTTCCCAGAT 


GACTTCTTAC 


1200 


CGCAAATTAG 


AGGAATGTAT 


AATGGCGACC 


1260 


GTTTCCGGTT 


ACCAAGTGCA 


CTTGATAACC 


1320 


ATGTAAACCA 


AATTGTTTAT 


GTATCGGCAA 


1380 


AGACTGTTGT 


TCAACAAATC 


ATTCGACCAA 


1440 


GCCCAATTAT 


GGGTCAAATt 


GATGACTTGG 


1500 


ATCAGCGGGT 


GTTTGTCACA 


ACCTTAACGA 


1560 


TCAAAGAATT 


AGGCTTAAAA 


GTTAAATATT 


1620 


CGGAGATTAT 


TCGCGATTTA 


CGATTAGGCG 


1680 


TACGTGAAGG 


AATCGATGTG 


CCAGAAGTTT 


1740 


AAGGCTTCTT 


ACGTAGTGAA 


CGCTCGTTAG 


1800 


CTGAAGGAAA 


AGTCATTATG 


TATGCAGATA 


1860 


ATGAAACGGC 


ACGACGTCGG 


GCAATCCAAG 


1920 


CAAAAACAAT 


TATTAAAGAA 


ATTCGTGATT 


1980 


ATGAAACAGT 


GGTTCAATTG 


GATAAATCAT 


2040 


ATTTATTAAT 


GAAACTTGAA 


AGAGAAATGA 


2100 


CTGCAGCCAC 


ATTACGAGAT 


ACCATTCTTG 


2160 


bGAT i AAbTA 


AAAG AT AC b A 


f UAACCAl LCa 




GACTTGAAAT 


GATTTGACGT 


AAGAGATTTA 


2280 


AAATTGTGAT 


TCATGGTGCA 


CGCGCCCATA 


2340 


GTGACAAAAT 


GGTTGTCGTA 


ACTGGACTAT 


2400 


ATACGCTTTA 


CGCAGAAGGG 


CAACGACGTT 


2460 
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ATGTAGAGAG 


TCTCTCTGCT 


TATGCGCGAC 


AATTTTTGGG 


ACAGATGGAT 


AAACCAGATG 


2520 


TTGACAGCAT 


CGATGGTTTA 


AGTCCAGCTA 


TTTCAATTGA 


TCAAAAGACG 


ACAAGTAAAA 


2580 


ACCCACGCTC 


AACGGTGGGG 


ACGGTCACAG 


AAATCAATGA 


TTATTTACGA 


TTGTTATTTG 


2640 


CTCGGGTAGG 


TCATCCAATT 


TGTCCTAACG 


ATCATATTGA 


AATTACCAGT 


CAGTCTGTTG 


2700 


AACAGATGGT 


TGAT7VAAGTC 


TTAGAGCTAC 


CAGAGCGAAC 


AAAAATTCAA 


ATTTTGGCCC 


2760 


CTGTTGTGGT 


CAAGAAAAAA 


GGCCAACATA 


AAAAAGTGTT 


TGAAATGATC 


CAACGTGAAG 


2820 


GGTATGTCAG 


AATGCGGGTG 


GATGGTGAAA 


CCTATGATGT 


TAGTGAGGCA 


CCAGAACTTG 


2880 


AAAAAAATAA 


AAAACATGAT 


ATTGCGATTG 


TGATTGACCG 


TATCGTTGTG 


AAAGAAGGCA 


2940 


TTCGCTCTCG 


CTTGTTTGAT 


TCGTTTGAAG 


CGGCTTTGCG 


TTTAGCTGAA 


GGTTACGCAA 


3000 


TTGTTGATGT 


GATTGGTCAA 


GAAGAAATGT 


TGTTTAGTGA 


ACATTATGCT 


TGTCCTTATT 


3060 


GTGGGTTCAC 


AGTCGGTGAA 


TTGGAACCGC 


GCTTGTTTTC 


ATTTAATGCA 


CCATTTGGCG 


3120 


CTTGTCCCGA 


TTGTGATGGT 


TTAGGTGTTA 


AATTAGAAGT 


GGACAAAGAC 


TTAGTGATTC 


3180 


CAGATCCCAC 


TAAAACCTTG 


AGAGAAGGGG 


CCATTGTTCC 


GTGGAACCCT 


ATTAGTTCCC 


3240 


AATATTATCC 


ACAAATGTTG 


GAGCAAGCAG 


CCACCAGTTT 


TGGGATTGAT 


ATGGATACGC 


3300 


CGTTTGAAGA 


ATTACCTGCA 


GACCAACAAG 


A/VATTATTTT 


AAATGGTTCT 


GGCGAGAAAA 


3360 


APTTTCATTT 


TCATTATGAA 

X X X X 


AATGACTTTG 


GTGGTGTTCG 


TGATGTGGAA 


GTGCCATTTG 


3420 


AAGGGATTTT 


AAAAAATATT 


AAACGACGTT 


ATCATGAAAC 


GAATAGCGAT 


TTTACACGAG 


3480 


ATCAAATGCG 


GTTATACATG 


ACAGAATTGA 


CTTGTCGAAG 


CTGTCAAGGG 


TATCGTTTGA 


3540 


ATCCGCAAGC 


TTTAGCTGTA 


AAAATcAATG 


GCACGCACAT 


TGGTGAAGTC 


AGTGAATTAG 


3600 


CAATA7U\AAA 


TGCGGTCCAA 


TTTTTTGAAG 


GTGTGTCTTT 


ATCTGAACAA 


GAAACAACGA 


3660 


TTGCTCGGCC 


AATTTTAAAA 


GAAGTCGAAG 


ATCGGTTAAC 


CTTCTTAAi\A 


AATGTTGGGT 


3720 


TAGATTACTT 


AACCTTAAGT 


CGAGCTGCTG 


GTACACTTTC 


TGGCGGAGAA 


GCACAAAGAA 


3780 


TTCGCTTAGC 


AACACAAATT 


GGATCAAACT 


TATCAGGCGT 


TCTTTACATT 


TTAGATGAAC 


3840 


CATCAATTGG 


TTTGCATCAA 


CGAGATAACG 


ACCGCTTGAT 


TGACTCATTG 


AAAAAAATGC 


3900 


6GGACTTAGG 


CAATACATTA 


ATTGTGGTGG 


AACATGATGA AGATACAATG 


ATGGCTTCGG 


3960 


ATTACTTGAT 


TGATGTGGGA 


CCTGGCGCTG 


GACATCTGGG 


CGGTGAAATT 


GTAGCCGCTG 


4020 


GTACGCCGGA 


AGAGGTTGCT 


AAAAATCCGC 


ATTCATTGAC 


CGGGCAATAT 


CTTTCTGGGA 


4080 


AAAAAGTGAT 


TCCTGTACCA 


AAAGAACGTC 


GCAAAGGTAA 


TGGCAAAGCG 


ATCAAAGTCA 


4140 


CTGGTGCCAG 


TGA7VAATAAT 


TTAAAAAATG 


TCAGTGTTGA 


ATTTCCGCTG 


GGTGAATTTG 


A onn 
4 zuu 


TTGCTGTCAC 


AGGCGTTTCA 


GGTTCAGGGA 


AAAGTACTTT 


AGTCAATCAA ATTTTAAAAA 


4260 


AAGCGCTGGC 


CCAAAAATTA 


AATCGTAATT 


CCAATAAACC 


CGGGAAACAC 


AAAAGCATCA 


4320 


CTGGTTATGA 


AGcCATTGAA 


AAGATTGTTG 


ACATTGACCA AAGTCCAATT 


GGTCGAACAC 


4380 


CAAGAAGTAA 


TCCAGCGACT 


TATACTAGTG 


TTTTTGATGA 


CATTCGCGAT 


TTGTTTGCCC 


4440 
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AAACVJiTGA AGCCAAGGTT CSTGGCTATA AAAAA3GTCG CTTTAGTTTT 

GTGGCCGTTG TGAAGCTTGT CGCGGGGATG 3TATCL-.TCAA AATTGAAATG 

CTGATGrCTA TGTGCCTTGC GAAGTTTG7C ATGGC^tAACG TTATAATTCT 

AAGT7C-.TTA TAAAGGCAAA AACATTTCTG ATATTrTAGA TATGACGGTG 

rAGAATCTTT CAAACATATT CCTAAAATTC ATCGT.-AATT GCAAACAATT 

GCTTAGGCTA TSTGACATTG 3GSCAGCC3G CAACTACTTT ATCTGGTGGG 

GGATGA---GCT AGCGAGTGAA CTTCATAAAA ATTCTAATGG GAAAAACTTT 

ATGAACCAAC AACAGGCTTG CATACCSATG ATATTGCGCG ATTGCTTCTA 

GTTTAGTGGA AGCGGGCAAT ACCGTTTTAG TGATTGAACA TAACTTGGAT 

CTGCGGATCA TGTGATTGAT CTAGGGCCAG AAGGTG3CGA CGGTGGTGGG 

CGACCGGAAC ACCAGAAGAA GTTGCTAAAG 7CAAAGACAG TTACACAGGA 

AACGTGTTCT GAAATAAACA TTAAT6ATTA GATAAAGAAC CTTGTTTTTA 

GGTrrTCTAT TATAAAATAA CAAATATTTT TTAATAAAAA TTAAAAATAA 

AATGTGrAAT TAAAAGATAT ACTATGTTTG 7TAACAAAAT AACAATGAAT 

GTCTTAAATG AJ^AAACAAAA AAATAACAAC AAGTTTATTA ACAGCAGGAA 

TATCGGAATC ACACCTTTTG 7ACATAGTAA CACATGGAAG CAGCGACCTT 

rCAGACCATG AAAAGTAATG AGTTTTACCA GAAAGACTAA AAGAAGCACG 

GGGCCGGCGC AGTCCAGCGT 

:2) XNTORMATION FOR SEQ ID KO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A; LENGTH: 10391 base pairs 
(B: TYPE: nucleic acid 
(C: STRANDECSESS: double 
(D; TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 
CAGGCTrTTT AATGACGAAT GTCGTTCAAC AATCCATGCG CGATTTGCGT 
AAGAaAAAAT CAATCGTTTG CCAGT7TCTT ATTTTGATAA GAACCAACAA 
TGTCACGGGT GACGAACGAT GTGGATGCTG TCAGCAATGC GATGCAACAA 
ATATTGTTTC AGCAGTCTTA GGTATTGTGA 7GGCGGTAGT GATGATGTTC 
CGCTGA7GGC GATTTTTTCA GTGATTATGA TTCCGTTGTC TCTGATTATT 
TTGTTAAAAT CTCCCA6AAA TATTTCCAAG 3AAT6CAAAA TTCTTTAGGA 
GTTATG7CCA AGAAAATATG ACTGGGTTCA GTGTCTTAAA ACTATATGGT 
AAACCCTTGA AGGCTTTAAA CAAGTCAATC ATCGT7TAAA TGGTTTTGGT 
CCTTTATCTC AGGATTAATG 7TGCCATTGG TTCAGATGAC CGCTTATGGG 
GGGTAGCTGT CCTTGGTAGT TACTArGTGG TTGCTGGTGT GATCGTAGTG 



AACGTCAAAG 
CACTTTTTAC 
GAGACGTTAG 
GAAGATGCCG 
GTGGATGTAG 
GAAGCACAGC 
TATATTCTGG 
GTATTA6AGC 
GTCATCAAGT 
ACGATCGTGG 
TATTATTTAA 
TGGGAAACAA 
ATGTTGACTT 
ACGAAGGGAA 
TGGTCTTAGC 
AATTTCACAA 
GGCATCAATT 



CGCGATATTG 

GGAAATATTT 

AGTTTTATCA 

TTAATCAATC 

TCCAGAACAA 

GACTTAAATG 

CGGGAAAAAG 

TTCAAAGCAT 

ACCTATATCG 

GGGCAATTAC 



4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

54 60 

5480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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AAGCGTTTAT 


TCAATATATT TGGCAAATTA- GCCAACCAAT GGGGAATATT ACGCAGTTGT 


660 


CTGCAGCTTT 


ACAAAGCGCT TCAGCTTCGA CCATGCGGAT TTTTGAAATC CTAGATGAAC 


720 


CAGAAGAAGA ACTTAACGAA CAAGATGTTC CTTTGCCAGA ACCTATTTTA GGCTCTGTTG 


780 


AATTTGAAAA 


TGTCAGCTTT AGTTATGACC CAGAAAAACC GTTAATTCGT AATTTGAACT 


840 


TTAAAGTTGA 


TGCGGGCCAA ATGGTTGCGA TTGTGGGACC AACTGGCGCT GGGAAAACAA 


900 


CCTTAATCAA CTTACTGATG CGTTTTTATG ATGTAACAGA AGGCGCCATT AAAATTGATG 


960 


GCATTGACAC 


GAAAAAAATG AACCGTAGTG ATGTCCGATC TGTATTTGGA ATGGTATTGC 


1020 


AAGATGCTTG 


GTTGTATAAA GGTACCATTG CAGATAACAT TCGTTTTGGG aAGTTAGATG 


1080 


CCACGGaTTA 


TGaAGTTGTC GATGCaGCGA AAACGGcCAa TGTGGATCAC TTCaTTCGGA 


1140 


CAATGCCAGA 


CGGGTATGaA ATGGAaATCa ATTCTGAGGG AGATAACGTT TCCCTTGGtC 


1200 


aAAAACAATT 


GTTGACCATT GCCCgAGCGG TAATTTCTGA TCCGAAAATT TTGATTTTAG 


1260 


ATGAGGCGAC 


TAGTTCAGTC GATACACGCT TGGAAGCCTT AATTCAAAAA GCAATGGATC 


1320 


GTGTTATGGA 


AGGACGAACG AGTTTCGTTA TTGCCCACCG CCTATCAACT ATTCGTGAAG 


1380 


CTGATTTAAT 


TCTTGTTATG AAACAAGGAG AAATCATTGA AAAAGGTACG CATCATGAGT 


1440 


TGCTGGAACA 


AGGTGGCTTC TATGAAAAAC TATACAATAG TCAATTTGCT GAAGAAGGCG 


1500 


ACTATGAGGA ATAAAAAGAG GTCGGGACAG AAGCGTTTAA CTCCGAGAAA TAAGAAGGAA 


1560 


TTTCCGAAAA TTGTTCTGTA ATTTTTGGAG AATTCCGGCT TATTTCCGAA GGAGTTGCTT 


1620 


TTGTTCCCGC 


CGTTTATCAG TTTTTGAGCG TGGAGCAAAA ATCCAAAGTG ATTTTTGTCC 


1680 


CACGCTCTTT 


TCTTGTCTTT TTTACTATGA AAGTGAGGAA TTCTTGTCTA TACTTAATGA 


1740 


AAAGAGGTGA 


CAGAGTTTGA AAAAAGAATA TTTTATTGCT GCTAGTGCAG ATGTCTATGG 


1800 


AAACGTACAA 


CTAGCAAAGG ATAGTAATAT TTGGTTTCAG TCGGTCTTAA GAGGTGACAG 


1860 


TAACACAATT 


ACCATTGGTG AAGGTAGTAA TATTCAAGAT GGCACGATTA TTCATGTTGA 


1920 


TGAAGATGCC 


CCGACAATTG TAGGCAAGTA TGTAACGGTG GGCCATCAAT GCATGTTACA 


1980 


CGGTTGTAAA 


ATCGGTGATG GTGCTTTGAT TGGCATGGGT TCAACCATTC TAAATAATGC 


2040 


TATTATTGGA 


GAAAACAGCC TGATTGGTGC GGGCTCTTTA GTGACAGAAG GCAAAATTAT 


2100 


CCCGCCAAAT 


GTTTTAGCAT TTGGTCGACC CGCGAAGGTT ATCCGTCCGT TAACGGCAGC 


2160 


AGAAATAGCC AGTAATCATA CGAACGCCTT ACATTATATT CAACGAGCAG AAGAATTTCG 


2220 


CCAACAAAAA TACCCGAAAC TAACATAGAA TTAGGAGGAA AATGATGAAG AAAATTAAAG 


2280 


TAATGACGGT 


TTTTGGAACA CGGCCAGAAG CAATCAAAAT GGCTCCCTTA ATCAAAGTTT 


2340 


TAGAAGAGCA AAGTCAAGGA TTTGACTCTG TAGTAGTCGT AACAGCGCAA CATCGCCAAA 


2400 


TGTTGGATCA 


AGTGTTAGAA GATTTTCAAA TTACACCGAA TCATGATTTA AATATCATGA 


2460 


AAGATGGGCA 


AACCTTAACA GATATTACAA GTCGTGTTTT GAATTTATTA ACTGAGGTTC 


2520 


TTGTGACAGA ACAGCCGGAT ATTGTCTTGG TTCATGGCGA TACAACCACG TCATTTGCGG 


2580 
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111 1 Al 


Ik li a /2 a TTr* 

LJ\ALi>Al3Al lU 


\«Au 1 \3Ul3A^A 


I ua lalaAAGGA 


GGATTGCGGA 


2640 




nlAl IL^lUCo 


ill C^uoAAb 


a a a T/: a a T/^r* 

AAAluAAlCla 


1 v«AAU 1 GGTA 


GACGTGTTAA 


0 1 /% 

2700 




1 1 1 i bCCUUA 


ACuACl UAAA 


r"P a a nr* a r* a a 

la 1 AAuvi Av«AA 


1 ilAATCAAA 


^ik 7k 7k tt m^iB m#^ 

GAAAATCATC 


2760 




T. Al TTATGTG 


TV /Tp/"»^/"» Jk Ik rp Ik 

ACTGGCAATA 


CAGLAAT TGA 


CGCGATGGCG 


TAGACCGTTG 


2820 


ATGCCCACTA 


TCAAAAT -lAT 


mf»m mm^r^lk Ik jk 

TTATTGGAAA 


Ik |k ]k iT*f^f*f^*J* Ik f 

AAATCCCTAC 


AGACC7\ACGC 


ATCGTGCTAA 


2880 


TTACAATGCA 


m^^m^^^fik^ Ik Ik 

TCGTCGTGAA 


AATTTAGGCC 


TGCCAATGGC 


AAATGTGTTT 


AAAGCTGTTC 


2940 


GTCGACTTGT 


Ik Ik Vk Ik ^ Ik m 

AATGGAACAT 


^^1k /^1k Ik Ik IH>'i>^ 

CCAGAAATTG 


Ik Ik ^rn^ik niiiifiifn 

AAGTCATTTT 


CCCTATGCAC 


A/\AAATCCCA 


3000 


AAGTCCGTGA 


AATAGTAGCG 


GAACACTTAG 


GTGAATTAGC 


TCGTGTGCAT 


TTAATTGAAC 


3060 


CATTGGATGT 


CAAAGATTTT 


CAAAATTTTG 


CAGCGAAAAG 


TGCTTTAATT 


TTAACTGATT 


3120 


CTGGCGGTGT 


GCAAGAAGAA 


GCTCCTTCCT 


TAGGTGTGCC 


AG'FTTTAGTT 


TTACGAGATA 


3180 


CAACGGAACG 


TCCAGAAGGC 


GTAgCGGCAG 


GAACATTGAA 


ATTGGTTGGT 


ACAGACGAAC 


3240 


AAGTAGTCTA 


TCAAGAGGCG 


AAAACATTAT 


TGACTGATTC 


TGATGCTTAC 


CACGCGATGG 


3300 


CGCATGCACA 


AAATCCGTAT 


GGGGATGGTC 


AAGCTAGCCG 


CCGAATCGTT 


GAAGCCATTG 


3360 


CCTATGAAAT 


GCAGAAATCA 


GATAAAAAAC 


CAGATACGTT 


TACAGCAAAA 


TAAATAAAAA 


3420 


GGCAGAGAAG 


CGCTGAAAAA 


TGTGCTTCTT 


TTGCGTATCT 


TATTTTGAAG 


AATCAATTAG 


3480 


Ik #**vk Ik Ik Ik Ik Ik 

ACAAAAAATG 


CTAGAAAGTA 


GTGAAGTGAA 


GATGTATCAA 


ACGATTTTAT 


TTGATTTAGA 


3540 


fn^<v^ik ^ik Ik mm 

TGGCACAATT 


ACGGATTCTG 


GTAGCGGCAT 


TATGCGTTCT 


ATTTTATACG 


CAACAGAACA 


3600 


ATTAGGCTGG 


CCAGCTCJTA 


GCGAGGAGAC 


ATTACGTTCT 


TTCATAGGGC 


CGCCTTTATA 


3660 


TGAGTCGTTT 


TTGCACATGG 


CGCCTTCAGC 


TGAAGCAGCG 


CAACAAGCAG 


TGGGTCATTA 


3720 


TCGAGCGTAT 


TATCAGAGAA 


AAGGTATGTT 


TGAAAATCAT 


GTGTATCCAG 


GGATCCCAGA 


3780 


GGTACTAATA 


AGGTTAAAGG 


AAGCGGGAGC 


AAAGTTATAC 


ATTGCGACTT 


CCAAGCCAGA 


3840 


AGAATTTGCT 


AAAAAGATTA 


TTACTCATTT 


TGATTTAGAT 


CGTTATTTTA 


CAGGCATATA 


3900 


TCsGAGCGAGT 


ATGGACGGTC 


ATCGTTCTAA 


AAAAGCAGAT 


GTGATTCAGT 


ATGCATTAAC 


3960 


tibAAVjCTCAA 


CTAGACCCAA 


CCAAAGAAGC 


TIATTATCATG 


GTTGGTGATC 


GTAATCATGA 


4020 




VjCTCAACAAA 


ATGGCTTAGA 


TAG T AT AGGT 


/^m/*>n>m/^mik m 

GTCTTGTATG 


GATTTGGTGA 


4080 


ftnA&&pzir:2ic 


1 1 ALAAvsAGG 


LGGGAGCGAC 


ATTCCTAGTA 


CACTCGCCTA 


Ik 7k Tk /^mmTk r*f^ 

AAGACTTAGG 


4140 


A 1 X X 1 1 v? 


\i«l/iL.\3AAAi 1 


Ul lAAlAVsAA 


AAliaVaAl iTl 


CG A 1 AAAT T T 


7k/^/^7k ^f>7kmm»p 

AGGAGGATTT 


4^UU 




lAlCul 1 iLiA 


2i*P2ir* IV a B ar*/* 
A 1 AuAAAAoU 


(jCuATlIATAG 


GCTTTAAACG 


T TATC AATGA 


4zdU 


TGAATTTTTT 


ATCGGGGAAT 


GACGGTCTTT 


TTATTAAAAA 


CATTTAATTT 


TAAAGAGGTT 


4320 


TCATGTATAA 


TAGTTTCAGT 


CATAAACGAT 


AGAATA6TAA 


GGAGGGTTAG 


TTTTGGCTAA 


4380 


TGATAACCAA 


AATAACCAAG 


ATCCCAAATC 


TTCGCTTAGA 


GATCAAGTAA 


CTGGGTCTTT 


4440 


AAATGGTAGA 


AATGATGGCG 


ATCAACCTGA 


TTCCTCTGAA 


AAAAATGATA 


GGTCACCCCA 


4500 


ACCTTCATCT 


GATGAATCAC 


AAGAAACAGC 


GAGCCGCACA 


ACACAAACAC 


GTGCAGGTTC 


4560 
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mAAnApfiTC 


f^AAAr'ar^aa 


- riAr'A Af*ar'a*^ 


ar*ar'T/^r'a af 


AGULrAAbCUU 


4 620 




nApr^AAAAnp 


r*aar*aaATar* 


aaaaaaanaa 
AAAAAAAuAA 


ar^arT"paaaa 
AUAub X AAAA 


A Ar* A A/^ A 

AAGAAGATCG 


4680 


ill iva 1 k»uV9 1 


V^o X nx V^U 1 VI 1 


T H JVPT/^TIV^I* 

lAAl ilalAvjX 


i Xv^iblAl XA 


/^rptpfpip Ik Ik m/->ik 

bX X X xAAXGA 


TGGCAATCTT 


4740 




ill lAlAAAl 


Alb 1 AGATGC 


TGGGCTACAG 


f^f^ A A ^ A ^ A 

CCACTAGACA 


AGAACAATAA 


4800 


GAAGCTTGTT 


^ TV TV f^t*r^f* IV *T* IV 

CAAGTGCATA 


*W*f>f*r^ TV ^ TV IV 

TCCCAGAAGG 


Ik m^ffim^^m m ik m 

ATCTTCTAAT 


AAGCAAATTG 


CTGCGGTTTT 


4860 


AGAAGAAAGC 


IV TVO^vp^IV fPI* IV 

AACGTCATTA 


TV TV TV f*t^f^r* IV IV VP 

AAAGCGGAAT 


^^mni1*i|Mii Ik Ik ^ 

GGTTTTTAAC 


fiiik Ik ^^mik Ik 

TACTACGTAA 


AATTTAAAAA 


4920 


CTTAACAGAT 


TTTCAAGCGG 


GCTATTATCA 


Ik Ik m^m^fffi^^ik 

AATGTCTCCA 


AGCATGACAC 


TCGATGAAAT 


4980 


TGGT G AAATG 


« Ik vk w 4k 'Vk ^ 

CTAAAAGAAG 


GAGGTACGCC 


AGAACCAACT 


AAAATTGCAG 


ATGGCAAAGT 


5040 


AACGATTCCT 


GAAGGCTACG 


Ik ^% fk Ik ^ fn fk Ik 

ATATAGATAA 


AATTGGCGAA 


GCTATTGAAA 


AAAATACAGA 


5100 


TTTC7\AGAAA 


GCAGATTTCA 


TTGCATTAAT 


GAAAAACGAA 


GACTTCTTCA 


ATCAAATGAA 


5160 


AGCTAAGTAT 


CCTGATTTAC 


TTGAAAGTGC 


TGCAACTGCA 


GAAGGTGTTC 


GTTATCGCTT 


5220 


agaaggctat 


TTGTTCCCGG 


CAACTTATGA 


TTACTACAAA 


AAAGCAACAC 


TTCCTGAATT 


5280 


TGTCGAACAA 


ATGATTGCTA 


AAATGAACAC 


AGTAATGGAA 


CAATATACGC 


CGACAATTCA 


5340 


TGCAAAGAAT 


TTAACCAATC 


AGCAAGTCTT 


AACGTTAGCT 


TCATTGGTTG 


AAAAAGAAGG 


5400 


AGTCAAAGAA 


GCAGACCGCA 


AACAAATTGC 


GCAAGTATTC 


TTTAACCGCT 


TGGCTGCCGA 


5460 


TATGCCAATC 


CAATCAGATA 




GTATGCTCTA 


GGAGAACACA 


AAGAAACTGT 


5520 


AACGTATGCT 


GACTTGGAA6 


TGGACTCGTC 


TTATAACTTG 


TATAAAAATA 


CTGGTTATGG 


5580 


ACCAGGACCA 


TTAGATAGCC 


Ik ^^#Vk,^^ Ik ^^^^ ^ 

CAAGTGAGGA 


GTCAATTAAA 


GCAGTCTTGA 


ATCCGACACC 


5640 


AAbTGACTAT 


TTATACTTTg 


TAGCAGATAT 


TTCTACAGGA 


AAAGTCTACT 


TCTCAAAAAC 


5700 


ATATGAAGAA 


^fv fn^iv Tv^n*Tvn> 

CATCAAGTAT 


m Ik ^*fn Ik ^ 9k ik 

TAGTAGATCA 


^mik m^m^^ik ik vn 


AATTCAAGTA 


GTGAATAAAA 


5760 


f^r* TV Ti Ik 71 /^iv nft IV 
TGAAAACATA 


TV TV TkOIV/^^WI*^ 

AAAGAGCTTG 


GTGAATCAAT 




TCACCAAGCT 


TTTAGATACT 


5820 


an IV IV/^TYV IV 

UAAAAAGTAA 


TCTGTTTAAC 


TV ^ TV TV TV Tk^ 

AGAGGGAAAG 


Ik ^**mik ^^ik Ik 

AGTAGCAACT 


AG/y^TTTAT 


ATAAATAAAA 


5880 


TT rAuCAAAA 


*t*f^ Ik Ik Ik Ik Ik Ik m 

TCACAAAAAT 


TCAATAATTA 


TATTTACAAT 


TTAGAAAAAG 


CGTGGTACTC 


5940 


i 1 i i VI 1 A 1 


*J*f^ A TV TV f^f^f^r^ TV 

X CaAAAGCGGA 


ATTTTTTTCA 


Ik rr* Ik Ik Ik ik ^ 

ATAACCAACC 


^ Ik ^ ^^mm ^^^^ 

AAGAACTACG 


TATCGTTAAG 


6000 






r'f^'J* A ^ TV IV IV TV A 

(jCsx AbAAAAA 


oX AlArC..v.J\A 


»P/^Ik ^FTl/^fTink^ Ik 

TGACTCTTGA 


Ik f*f* Ik Ik Ik Ik ^ik Ik 
AGGAAAAGAA 


6060 


AAATTAfZAAf^ 


A A r* a ^2/*T a a 


ar*R ATTA AA A 

AuAAi i AAAA 


A A ATT A A Ar' 

P»S.J\x\x iAAAL> 


/^^PA A A ^ A A A 

GTAAAGAAAT 


CGTAGAGCGT 


olzU 


ATTAAAATHf; 


I^T^OTa^^TT 

\* X X 1 1 


ibUouAiUi I 


X (JAviAUAAC X 


^ A ^ A /^>P A fPI^ A 

CAGAGTATGA 


AfP^A^^nVA A A 

ATCAGCTAAA 


£ 1 OA 




PTT Tr*<^ a 
will V,rla 1 1 VjA 




ar»a a/^r*'PTar' 

AL^AACb X X Ab 


A A A A T A fT*r* A T 

AAAATATGAT 


fpr**^mrpip rp^^ TV 

TCGTTTTGCA 


b^4U 


GAAATCATCG 


ATAATGATGG 


CGTGGACCAA 


GACGAAnTTT 


CCATCGGTAA 


AACAGTTACT 


6300 


TTCCAAGAAT 


TGCCTGATGG 


AGAAGAAGAG 


GAATACACAA 


TCGTAGGAAG 


TGCCGAAGCC 


6360 


GATCCATTTT 


CTGGTAAAAT 


TTCTAACGAT 


TCACCGATTG 


CGCAACCnTT 


AATTGGTAAA 


6420 


CAATTAGGCG 


ACCAAGTGAT 


TATTGCAACA 


CCTGGTGGCG 


ACATGACAGT 


TAA7UVTTACA 


6480 


AAAGTTGAAG 


CAAACTAAAT 


AATTGAGTAG 


GGCAGGCAGC 


TTCCAAGGCC 


tGTCCTTTCG 


6540 
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CAATCTATGA CATTTAGGGC 


TAAAGTCCTA 


TGGTAACCTG 




wU.»iAl(3Al 1 


OQ.UU 


AAATAAAATA 


AACGAGTAAA 


ATAAATGACA GAGGGAGGAG 




oHA 1 AACCCT 


ODDU 


TGGCGCTTTT 


TTATCGTCGC 


AGAAGCATTA CTTTTTATTC 


1 Viv^Cbl TATG 


GCAAATTGTA 


6720 


CATAATCCTG 


GaTTAGCTGT 


TTtATTAACA 


ATTGGCGTTT 


1 AO* T TGTGGC 


CTACGTTTCC 


6780 


AGGAAAGCAT 


CTAAAACACA 


TTTTAACAAC 


TTTCAATTCG 


TCCTCGGCGT 


TGTTTTTATT 


6840 


GTCATTGGTG 


CAATGAATAG 


CACGGCTGTT 


TGGTTTATGT 


TGATTTTTGG 


CGTACTCTTT 


6900 


ATCGGCTTAA 


AAGGCTTTGA 


GATTTCAGGC 


GTGGATATAG 


CTGAGCGAGC 


ACCTTGGCGA 


6960 


AAAAAACAAA 


TGATTATGGT 


GGAGACGGC6 


GCAAAAGAAC 


CTAAAAATGG 


CAAACGGTTT 


7020 


AAACGCCGCT 


GGTTTGCCAA 


CGAACGCATT 


GGTAACAATA 


TCTATGAATG 


GGACGATATC 


7080 


AATATTGATT 


TAATCTCTGG 


GGACACCATT 


ATTGATTTAG 


GTAATACGCT 


ACTACCGAAA 


7140 


GAAGACAATA 


TTATTATTAT 


TCGTAAAGGT 


TTTGGCCGCA 


CGCGAATTCT 


AGTGCCGTTA 


7200 


GgTGTAGCTA TTTTGTTAGA 


ACATTCAACT 


TTTTACGGAA 


CGGTACGTTT 


TGAAGAAGAA 


7260 


AAATATCAAT 


TGAAAAACGA 


ATCATTAAAA ATTTACAGCA 


ATGATTATGA 


TACCAATCTT 


7320 


CGTCGTTTGA AAATTATGAC 


GAACACTTTA 


GTAGGAGATG 


TTGAGGTGAT 


CCGTGTATGA 


7380 


CCGATCGGAT 


TTCAAGACGC 


ATGATTTCAT 


TATATGCGTC 


CCTTAGCACC 


TTTATTGTTA 


7440 


TCTTAATTAC 


ATTGTTTTCA 


TATTTTCATT 


CGATTAAACA 


AAACCGGTGG 


TTATTAGAGC 


7500 


TTCTTCAGAG 


AAAAGTCTTT 


TATTTACCAC 


TAATTGTGCA 


CATTGTTCTC 


ATATCCTTAC 


7560 


TAATAGGCTT 


ATTGACCTTT 


TTACTGATTT 


CATTGGTTCA 


AAAAGGGCAA 


TATGGACGGA 


7620 


TTGAAGAAAA 


ACTTCGGTTA 


TTGGCCAACG 


GTAATTATGA 


AAGTCCAGTC 


TTAAACAAAC 


7680 


CAACGACCAG 


TGAAAATCAA 


GACCATTATC 


TAACCGAAGT 


CGAACAAGAT 


ATTTGGTCGA 


7740 


TTAAAAATAA ATTATTAGAG 


ATGTCTAAAG AATTGCAATT 


ATTAAACAGT 


C6ACCGCAAT 


7800 


TAATGGATGG 


GCAAACAAAA 


GAAGAAATTT 


TAGAGAACGA 


GCGGCATCGT 


TTGGCGCGGG 


7860 


AGTTGCATGA 


TTCAGTCAGT 


CAACAACTTT 


TTGCAGCCAT 


GATGATGTTG 


TCTGCATTAA 


7920 


ATGAACAAGC 


ACAACGAACA 


GAAACCCCGG 


AACCATATCG 


TAAACAACTA 


GCCATGGTGG 


7980 


CAGAAATCAT 


TAATGCCTCC 


CAATCGGAAA 


TGCGCGCGCT 


ACTATTGCAC 


TTGCGTCCTA 


oU40 


TCAGTCTAGA 


AGGAAAAAGT 


TTGCGTAAAG 


GTATTGAACA 


Hi lACloAAA 


A A /^n* A A A A 

vxAAC r AC AAA 


0 1 

olUU 


CAAAAATTAA 


AATTGAATTG 


ATTTGGGATG 


TTGAAGATGT 




AbUACyLrAx i G 


0 1 Ci(\ 


AGGATCATCT 


TTTCCGAATT 


GTGCAGGAGT 


TACTTTCAAA 




r* A *i»r»/^^ A A A 0 
UAl (aCuAAAu 




CAAAGGAATT 


AGAGGTATAC 


TTACACCAAG 


TCGATAAAAA 


CGTGTTATTG 


CGTATTGTTG 


8280 


ATGATGGTGT 


CGGCTTTGAT 


ATGAAGGAAC 


AAAGTAATAA 


AGCCGGTAGT 


TATGGCTTAA 


8340 


ATAATATTCG 


AGAACGTGTT 


GTCGGCATGG 


GCGGTACAGT 


TAAAATTATT 


AGTTTTAAAG 


8400 


GGCAGGGAAC 


CAGCGTTGAA 


ATTAAAGTTC 


CTGTCATAAA 


GGAGGAAACT 


GCAAGTGATC 


8460 


AAAGTAATGT 


TAGTGGATGA 


CCATGAAATG 


GTCCGTTTAG 


GCGTTTCATC 


ATATTTATCT 


8520 
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a TTcivzvr^ a n 

All Krr\r\\3t\\J \J 




pr:Tafv:p<iaa ■ 

1 AvTU v>ljAn 


• rrr*ar:aaaapn 


GTAAGATTGG CTATGAAAAA 








lul lAl 1 1 ivi 


AXoutAX X Xuu 


TAATGGAAGA 


AATGGACGGC 




AX luAi iwln 


paaaaru^riaT 


pTTPaaapaT 

iuAAAiaAl 


Tcnppana a^2 


CCAAGATTAT 


TATTGTGACG 




AGTi riAl lb 


A 1 uAl oAAAA 


A(j 1 o 1 A i 


bv^iaAX XuAAb 


CTGGTGCAGC 


GGGCTACCTA 


a / DU 


TTAAAGAv- A i 


CAAVfAla^AWtA 




uAXoCAAX LKr 


GGGCGACTTA 


TCGCGGAGAG 


aoZU 


CGTGTGTTGG 


A A f^r"T*^ A A ^T 

AAOC 1 (sAAla 1 


AOr*^ A T A A/^ 

GACGCA X AAia 


A X uA X AAU 


GGTTAACAAA 


AAAACAAGAG 


Q O O O 


CCGGTGTTGC 


A t^t^ A A^A ^'PfP 

ACGAAGA 1 TT 


^A^A A AO^/*/* 

GaCAAAuCGG 


/^A A/^Ar'^AA A 

GAACAuGAAA 


TTTTAATGTT 


GATTGCACAA 


074U 


GGTAAAAGTa 


Vk r**^ A A Af>> 

ATCAGGAAAT 


Tk r^f^T^t^ A A A 

AGCTGATGAA 


CTCTTTATCA 


CTTTGAAAAC 


AGTTAAAACA 




CATGTTTCAA 


ACAT T T T AGC 


TkATkAOrPA^AT^ 

AAAAC TAG AT 


GTGGATGATC 


GGACCCAAGC 


GGCGATTTAT 




GCTTTTGAaC 


ATGGTTTAGC 


CAAATAAATA 


T C AG AT AAAT 


CCATAAATAA 


ATTCTAGGAA 




ATCCTTCTAG 


GAATTTATTT 


ATGGATTTGT 


TATACTAATT 


TTAACGAACA 


GACAGGAGCG 


A 1 O A 

y lou 


GTCAGAAGAA 


ATGAAACAAA 


ATTTTGCAAT 


TATTGGTTTA 


GGACGTTTTG 


GCGGCAGCAT 


A1 il A 


ATGCCAAACT 


TTGGTGGAAG 


CTGGTCAAGA 


AGTTTTGGCC 


ATTGACAGCA 


GTGAAGACCG 


AO A A 


TGTCAATGAA 


TATATGAATA 


TT6CAACGCA 


TGCAGTGGTG 


gcTAACGCCC AAGATG7U\AT 


AO £ A 


GACGTTGCGT 


TCTTTAGGGA 


TTCGGAATTT 


m ^ Ik m^ Ik ^^m ik 

TGATCACGTA 


GTGkTGGCAA 


TTGGTGAAGA 


A il O A 

y4^u 


TATTCAAGCG 


AGTATTTTAG 


TGACACTAAT 


GGTTAAAGAG 


ATGGGGGTTC 


CCAATGTCTT 


A il O A 

y4ou 


GGCCAAAGCA 


GTCAATGAAT 


ATCATGCCCG 


TGTCTTAGAT 


AAAATTGGGG 


CAGACATGGT 


A C il A 

yD4U 


CGTTCACCCA 


GAGCGGGATA 


TGGGCATTGG 


AATTGGTGAC 


AAACTTGTTT 


CACGCAATAT 


A£ AA 

7DUU 


TTTAGATTAT 


ATTGAATTAT 


CTAGCGAATT 


TTCATTAGCT 


GAGGTTCGGG 


TCACCAATCC 


yoou 


TAAATTTTAL 


A A fPA AAA f^f^*T* 

AATAAAAGCT 


'PA^^A/**A A*PIP 

TAGCAGAATT 


AAAT 1 1 L.L.G X 


CAGCGATTTG 


GTTT7VACTGT 


y / 


GGTAGCGATT 


CGCCGCTCGA 


A A A O A ^ A A^n* 

AAACAGAAGT 


AATTbCTTv^G 


CCAGATGCTT 


CTGAAATTGT 


y /oU 


TCGAGAAAAC 


^ A A Ik rwif IMIk K m 

GACAATTTAT 


TAGTTATCGG 


TGATAGGGGA 


GACGTGGATT 


TATTAGAT6A 


QO it n 
yo4U 


AAAAATGAAT 


GGCTAAGCAC 


TTTATTCTCT 


A A A Ar^/^Ar'Tr* 
AAAAGGAL. I b 


CGCTGTGCTA 


CAATAAATGG 




f^C A/^Ar^A^A»l» 

CGACAGAGAT 


GTGAAGGAGG 


AA/^AAA/^A 7\ f 

AACAAAGAAT 


A A A 1*1* A A A 

GAAAT r AbAA 


GTGACACCGA 


AAGCCCAACA 


y y OU 


AIGOa i i. AAA 


A A A A A cr^f* 
GAAGAAACt-G 


GGG X X uAGCL 


X bAb AL«Abb b 


ATTCGTTTTT 


TCGGAAAAAT 






Aw\^U 1 b 1 1 c 


AXoAXvsi^X X X 


TTPPaTTPPP 
1 X^OAX Xuv^u 


ATGTCTGTAG 


AAGCGCCAGA 


10080 




ATTaaa^ia^za 

Al lAAAoAlon 


A^L> X AuA X OUI 


fzaTTapPTaT 


TTTATTGaAG 


AAACAGATGA 


10140 


TTGGTTCTTT 


AAAGGGTATG 


ATTTGATTGT 


TGATTATGAT 


GAAAAGAATG 


ATGAACCGCG 


10200 


TTATCmATTT 


GAAGCAAACC 


mAGrAGaTTT 


AACmAAATAA 


TTTTAGTGCT 


AAACAAAAGA 


10260 


GGCTGAAGTA 


GTGCGTAGGA 


CTATTTCTGT 


TGCTTTCGTT 


TAACCGCTAG 


TCAnnGTGAG 


10320 


AAAAAATCGT 


TTTGGATTTT 


TTCTCAGGTC 


TTTCTATTTA 


AGTATTTTAG 


TTGGATATCG 


10380 


CTATTTAATA 


A 










10391 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 





Ai mnn X nn 


TAGAGTTAAA 
X n^snss x x ^irm 


GTTGAAAAAG 


tGaAGTTAGA 


TTATGTAGGA 
X X n X V3 X n\j\3n 




Aval Ai iAnlio 


X J^oAl AK\9l3A 


TTTATTAGAA 
X X mx x/>onn 


Go AG ^ AGGa A 


TATATGAATA 
mxm va/inXn 


TGAa AnAGTA 
X oMo AllAu X A 


1 0(\ 


L.AAAiAV9l iu 


aTnTTaaTaa 

Aiul lAAlAA 


TGGAAATAGA 


X X XUrVV&wX X 


ATAPAATAGT" 


TGGAGAGAGA 


1 flfl 

XO V 


UlaAAl^AlovatjA 


I uAi X lljI^X i. 


r* a a TnG a GPT 


Gf*AGr*TAGAT 


GTGTATPAAr" 
o X o X m v^nn^ 


TGGAGATAAA 
1 oonV3/\ 1 nnn 




Al AAj. 1 1 Inn 




Tr;aaTTaaaT 


GAAAATGAGG 


TTAAGGATPA 


T AG APPT A AG 


inn 


(^riui ill iKs 


xUvjAl VsAxAA 


T» a aT a a a^^•p*p 
xAAlAAAui 1 


fzaaatzaf^Taa 

\3nnn\3nS3 X nn 


r"aTr*a'paToa 

UpAX ^A X a X VaA 


aaaa/^aT^2/2a 

AAAAWiXooA 




a rr^T T a T r* a r* 

Aovj 1 1 A 1 ^AVj 


a 1 Ai. AAl X 1 


1 XAljiaX X X lo 


TA aTGTTaTT 
InnX oX 1M.1 1 


TaTaTATapp 

X/IXAX AXrvVsw 


TapaPTaTTa 

X Mv3 Av> X A X X A 




1 1 KjnHn. 1 A 1 1 


lAi ILjAX lA^ 


T a a TTrT' a T a 

XAAX Xov3/\Xr\ 


z^aT^TPaaTa 


aacTa/^apar" 


aaaaaPApaa 


rl O U 


oAlAV^X lAAA 


/^aTTTT^aaa 


a a arzar^rr* 


1 tb^^A 111 1 


1 1 ou 1 O Cnno 


r'aTaTf'Tf^aT 
K^n 1 A 1 1 ^^*A 1 




oil 1 


VtMwo 1X1 X 


\j 1 Lr\X e\\jJ-\r\.\D 


V^kar 1 1 v^nn 1 O 1 


AGGPAA AAPA 
nnnn^n 


AGPPAAPPGT 


O W V 




TT/^aaTrz^^aa 


1 V3/vV\^oov«orl 


111 nn 1 1 1 1 


1 1 1 1 1 1 \.tn 1 


ATATTAAAAA 
/^ini 1 nnnnn 


660 


nnnxjx^ x i ^V9V9 


X nnv^oV7\^o i x 


ATPPPACGGA 


TAACCTGGTT 
X nn\^K^ X WW X X 


TTGAAAGTGA 

X X onnn w x 


GTGGACAATC 


720 


CGATGATCCT 


PTATAAATTT 
y^xnxnnnx x x 


Tf*T A a a T TO A 
X v« X nnn x x v^n 


AAAGAGGTAT 
nnn\3n\3^s x n x 


AT TGAGACCC 


TTGATCTGTA 

X A Wa A W A W A rX 


780 


TGAAAAAGAA 


CAGAACTCGT 


TAAtGcaAAT 


GGCAACAGTT 


TTTTTGACAA 


ATTTTATAAG 


840 


GTGCAGAACT 


TCTTTCCGTA 


TACTATTCCG 


ATTGTCCTAG 


ACAATGAGCC 


TCCTCGGATG 


900 


TGACGATCTT 


CGGCAGGAGC 


TAACAGTTAA 


AGTTAGACTG 


CCTCGCTAGC 


GTTAgcACAT 


960 


CCGAGCTCAT 


TATCAAGGAT 


TCGcTGCGCC 


AATCTCTGGT 


TACGGTAACC 


AACCGGTGTC 


1020 


TCGTAGCCAA 


GTGTACCGTG 


CAACCGAAGt 


GGnTTCCACC 


AATTGACATA 


GTCAAATAAC 


1080 


TCCAAATCCA 


ATTGTTGTAA 


GGTTTCAAAT 


GTGTATTGAT 


AGACAAATTC 


TACTTTCAAC 


1X4U 


GACTTATAAG 


TTGATTCAGC 


TACGGCATTA 


TCAAAAGGAC 


AGCCTTTATG 


ACTCAATGAT 


1200 


CGATTGATGT 


CAAAAGTCGT 


TAATAATTCA 


TCAATAGCTT 


GGTTATCAAA 


CTCTTTTCCA 


1260 


CGATCAGTAT 


GAAAAATCTC 


AACCTCTGTC 


AGAGGTTGTC 


TGATACGGCT 


AAATGCTTTT 


1320 


TTTACTAGAA 


CGGCATCTTT 


ATGTTCTCCA 


CAAGAATAGC 


CGAGAATTTC 


TCGATTGAAC 


1380 


AGATCCAAAA 


TGAAACAGAC 


ATAATTCCAT 


TTTTTCCCGA 


CTCGTACATA 


AGTCAAGTCT 


1440 


GTTACGATCG 


CTTCTAATGG 


GTTGTCTCTT 


AAGAATTTAC 


GATTCAATAC 


GTTTGTCGTT 


1500 


TTGGCTTCAT 


TGCAAGTAGA 


ATGATGTACT 


TTAAAATAAG 


CAACAGTATA 


GCTCGATTTT . 


1560 


AATCCTCTAT 


TTTTCATGAT 


TCTACTAATT 


TTTCGTCGGC 


TGATCTGAAT 


GCCTCGTTTT 


1620 


GATAAGGCTT 


TTTTTATTTT 


TCTTGAGCCG 


TAGgCtTTCG 


GCTGAGGATA 


AATTCTTCAG 


1680 
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CGACTACTTC 


TTCAAGTTCT 


GATTCGTCTT 


TCTTTGGTTT 


TGATTGATAA 


TAATAGGACT 


1740 


GACGTGATAG 


ACCTAATATT 


CTGCACATCG 


CTGATATAGG 


GTAAAGATGC 


TTATTCGCAT 


1800 


CGATTACTTG 


TCTCTTCGTC 


CGAATATCAG 


CGCTGCTTGC 


TTTAAAATAT 


CATTTTCCAT 


1860 


TTCTAATTGC 


TGGTTTCTTT 


TACGTAGTTC 


TAACAATTCT 


TTTTGTTCAG 


GCGTAAGATT 


1920 


ATCTTTTTCT 


TTGAATGAAC 


CACTCGTTTT 


AGATTGCTTT 


ACCCATTTGT 


CAAATGCTGA 


1980 


AGCCGTTAGT 


TCATATTCTC 


GAATGATTTC 


TACACGTGGC 


TTTCCAGCTA 


AGTAAATATT 


2040 


GACTATTTGT 


TGCTTGAATT 


CTTGTGAAAA 


AGTTCTTCGT 


GTTCTCTTAG 


ATATAAAAAT 


2100 


TCCTCCTGGT 


ATGtTTTCTT 


CTAGTCTACA 


CACCTTAATT 


TTTCTGTCTA 


GTTAATTGTA 


2160 


GCCTATCCAG 


TGCTTCTGCC 


GGAGAAAATC 


AGATCCGAGG 


ATGCTCATTG 


TCAAGGGCAA 


2220 


TCGGAGCCAT 


AACACCGCAG 


CATTCACAAC 


ACCTTATAAT 


TTTTGTCAAA 


AAAAGTGTTG 


2280 


CCATTCCnAT 


AGATGAAATT 


CTTCAATTGA 


GAGTGTAAAA 


TATTTTGTGT 


AAATAGAAAA 


2340 


AAGGAAGTCC 


CTTCTGTAGA 


ATAGAGTTAC 


CACAACACAT 


TCACAGAAAA 


GAGGACTTCC 


2400 


ATATGAACGA 


TTTTACTACA 


GAAATTCTAA 


AGACyCTAGC 


GAACnAAAGG 


CGATTTGaTG 


24 60 


AATTATTAGT 


TGCTTTAATT 


ATTTATAAAT 


AACTATAATG 


CTTTTATTAT 


AGTATCGTAA 


2520 


ATAGTTTATA 


TGAATACATT 


AATTGTAAAA 


ATGGTGGATG 


GCTCTGAACG 


ATTAACAATA 


2580 


TTTTTTTGTT 


TTGACTATCA 


TTTTATTGTT 


CAGAAATAAT 


TTATTGCACT 


TTTTGAGGTA 


2640 


AATAGTTTAG 


AATTATTTTA 


ATATAAGTAC 


TTCAGATTTA 


ATAAAGGAGA 


AATATCTAAT 


2700 


GAAAAAAATT 


GATTTAATAA 


ATATGATAGG 


TATGTTAATA 


GGGATATTAG 


TAAATATAGT 


2760 


AATTTTCACA 


GATTGGCTCG 


GGGTGTTATT 


TTCTAATCTT 


ATTCCCATAC 


TTATTATAGG 


2820 


CATTTGTGGA 


ATTATACTTT 


CAATATTAGA 


ACTTTTTGAA 


AGCAGAAATA 


CAATGAATAG 


2880 


AATATTTGCt 


TGTATTATAT 


TAATTGTTAA 


TCTATTGCCT 


ATGGTCTATT 


TTACTTTTTT 


2940 


ATATTTTGCA 


CTGGGATGAA 


ATAAATTAAC 


CAAAAGTTTC 


TTTTTCTTAT 


AATGTGTATC 


3000 


AATATTAAAA 


CTGAAAAAGA 


TATGGATAAT 


AGGATCAAAG 


CCCTCATTGG 


AACACTCAAT 


3060 


ATATTATCAG 


ATTTAAATTT 


CAAATAGATA 


ATTTCATTTT 


TTAGTAATAT 


CGAAAGATAC 


3120 


TTTTTTAATG 


ATGAGAGTGT 


AATGTAAACC 


GTGTAAGTAA 


CCATTAATCC 


TAGGCAGGTG 


3180 


GTTGcTTCGC 


GGTTTTTCTT 


TTTCTACAAT 


AAGATCAAGA 


GGTGATTACA 


CGATGGCTAG 


3240 


AAAGAAAAGA 


AATCCTGATG 


CCGAAAAGTT 


AGCTGAATCC 


ATTCTGAATG 


CCTATCAACC 


3300 


TGAATCTGTC 


GATGACATGC 


AAGATGCTTT 


GAAAGATGTG 


TTTGGGCCCC 


TTTTTGAAAA 


3360 


AATGCTTCAA 


GGAGAATTGA 


ATAATCATTT 


AGGTTATGAT 


GCCCATTCTA 


AAGAGCCTAA 


3420 


GGAACACGAT 


AACCGTCGAA 


ATGGCTATGG 


AACTAAAACG 


CTTAAAACCA 


GTTTTGGTGA 


3480 


AGTAGCTATT • 


GATGTTCCTA 


GAGACCGGGA 


AGCTTCCTTT 


GAACCAGAGT 


TAATTCCTAA 


3540 


GAGAAAACGA 


GATGTCTCCG 


ACATCGAAGG 


GAAGGTTCTT 


TCCATGTATG 


CACGAGGAAT 


3600 


GAGTCAACGA GATATTGCCG 


CAACCGTCGA 


AGCTATCTAT 


GGCTTTGATA 


TTTCCCATGA 


3660 



wo 98/50555 PCT/US98/08985 

807 

AATGATTTCA GATATCACTG ACGCTGTCCT TCCTGAATTG GAJ^GAATGGC AAGCCCGCCC 3720 

CC.TAGCAAAG TGTTATGCCT TTCTATTCGT TGATTGTATG TATGTTACTT TACGAGAAAA 3780 

TTATGAAGCC AAAGAATATG CTGTATACAC CATTCTTGGC TATGATCTCA AAGGAATAAG 3840 

GAGATTTTAG GA 3852 
(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8805 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOjOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 



ATTGGCTAAG 


CGTTTAAACA 


AATGCGTCTT 


CTAAATAAya 


GGTTTTAATG 


ACATGACTTA 


60 


nTTCyCCTTT 


TtGtCCATAA 


TTTCTTGTAA 


tCGTgCACGA 


TTTTGTTCAT 


TTCTTCTTGA 


120 


TTGACAGAAC 


GAATGTCAAA 


TTCAATGATG 


CCaTTGTTGG 


CTTGATATTC 


TCGCGTATAA 


180 


ATGGCTGGAC 


TTTCTGCTTT 


TAATGCTTGG 


ATGACTTCCT 


TAGCGGTTTT 


TCTGCCGTCC 


240 


ACTTTGACAC 


TGGCGCGATA 


AATGTCTCGG 


CCAGCGCCAT 


CTTGGACGAT 


TTTGGCGGTT 


300 


AAATCTGATA 


AGTTGTTGAT 


GGCTTCAACA 


AACGGTTTTA 


AGCGCTCTTG 


CATCGAAGCC 


360 


CCTGATTCAC 


TGCCATGTGC 


CAAGTATTCT 


TCCACTGCTT 


GCGTAAAGCC 


CAAGATATTG 


420 


TCTTTGCCAA 


TTTTCATTGC 


TCGACCAATG 


CCTTTGCCTT 


GTAGGCGCAC 


CCAGTCAATG 


480 


TATTCTTTTT 


TCCCGACCAC 


TAAACCGGCA 


CTTGGCCCTT 


CAATTGCTTT 


GGCGCCGCTG 


540 


TAAATCACCA 


AATCAGCGCC 


TGCTTCAGTG 


TATTTGAAAA 


GATCTTCTTC 


TGcTGCCGCG 


600 


TCAACAATTA 


GCGGCACCTT 


GTGGCGTTGC 


GCCACTTTGG 


CCGCTTCGGC 


CACCGTCAGC 


660 


ATACTTTTTT 


GCACGGTATG 


GTGGCTTTTG 


ATGTATAG7VA 


TGGCCGCTGT 


TTTTTCAGAA 


720 


ATCATCATTT 


CCACGTGTTC 


TGGACTGCAC 


ATGTTGgCGT AGCCAGCTTC CACCACTTGG 


780 


CCGCCACCTT 


GCGCTACCAT 


GACTTCTACA 


GGCGTGCCGT 


AGTCCACGTT 


GTGCCCTTTA 


840 


GGTAAAACAA 


TCTCCCGCTG 


TTCGATTTTT 


TCTGTATAGG 


GATGATACGC 


GTGGTACAAG 


900 


CTGCCTTTTC 


CAATCAAGGC 


AGCCACCGAT 


TGGGCAATCC 


CCGCCGAAGC 


CGAGGAAACG 


960 


ATTTGGGCAT 


CTTCTACCTT 


CAACAAGTTG 


GCTAAAAAGG 


CCCCTGTTTG 


CACACTAAGT 


1020 


TCACTCATTT 


CAAAGAAATG 


TTCGCCGCCG 


AATCGTTGAG 


CTGCTAGGAC 


TGCCTCCGAA 


1080 


ACTTTGGAGA 


CACCTAGAAT 


GGTCATTTTG 


CCAGAAGCAT 


T7UVTGACTTC 


TTTTAAGTGG 


1140 


AATTTTtCGT 


AACTAATTGT 


CATAGATTTG 


TCCTCCAATA 


ATTGTTTTGA 


TTGGGCGAAT 


1200 


TTGCTCTTTG 


GCTACTCGTG 


TCAATCCATT 


AGAATCAGTC 


TUVTGTTTTTT 


CTTCTGCTTG 


1260 


AATCGTGAAA 


ATCGTTAAAT 


CAGCATCTTT 


ACCTATTTCC 


AACGTGCCTT 


TTTGCGTTAA 


1320 


ATGGAAATTC 


TCAGCAGGAG 


CCTTGGTTAC 


CTTTTCAATA 


ATTTCTGGCC 


AATCATAGCC 


1380 


GACCACACGG 


AGTTTTTCCA 


TCGTTGTGGC 


TAAATCATAA 


ACAGGACCAT 


TTTCTCGGTT 


1440 
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ACGGATGTAG 


ATATCTGTGC 


TAATTGATGC 


AGCTTTCATT 


CCCTCGCGCA ACGCTGTTTC 


1500 


AGCTACGTGA 


AAATTGAAAC 


TATCCGTACC 


ATGCCCGATA 


TCAAATACGA 


CTCCTTTGTT 


1560 


GTACGCTTGC 


CAAGCGAAAT 


CTTTTATTTT 


ATCCGTTGCT 


TGGTCTAAGA 


TACCATTTTC 


1620 


TTTCCCATTA 


AAACAATGGG 


TTAGGACATC 


CCCTTTTTCC 


ATCAAGGCTA AAATTTCATC 


1680 


TAAATGAGGC 


GGTGCTGATC 


CAATGTGAAC 


CATTAAGGGA 


ATTTCTTGGT 


TCTCTTGCTG 


1740 


AATCTGTTTG 


GCCAATTCTA 


AAGGCGTAAT 


GCCGTTATCG 


CCAATAACGG 


TCCTACTCAT 


1800 


ACGGGCTTTA 


ATCCCTACAA 


CAAAATCTGG 


TAATTCTTGA 


ATCGCTTTTT 


TGACTAAACT 


1860 


CGCTTGTACT 


TTACTTAAAT 


CTGCGAGTTC 


GTCCTGAGCA 


ACGATGCCCC 


ATTTAGAAAT 


1920 


ATTGACTAAT 


CCAAAAACAT 


TTGTTTTTGC 


TTGcTGCGCT 


AAGTCATAAA ATTCATGAAT 


1980 


GTTTTCAGCA 


CCTGTTGTCC 


CAGCATCAAT 


CACTGTCGTA 


ACACCCTTTT 


TGACCCCAAT 


2040 


TTCATCTGGA 


TAATCATAAT 


AAAGAGCCAT 


TTTTTCAAAA 


CAATGAACGT 


GATCATCTAT 


2100 


CCAGCCTGCG 


GATACATAAG 


TACCTGGTTC 


TAAGTGGATA 


GTTTCTTTTG 


CAGAACCTGA 


2160 


AATAGTTGCT 


GCAACAGCAG 


CTATTTTTTT 


CTCTTTAATT 


GCAATTTCAA 


CAGGCATACC 


2220 


ATTTACTGTC 


TGTCCATTTT 


TTATAAGTAA 


ATCGTAGTCC 


ATATTCTTCC 


CTACTTTTCT 


2280 


ATAAATCTAA 


ATTTTGTCCA 


AGGCTTAATG 


AATTCTAATA 


ATACAAGTTC 


TTCTTCTATA 


2340 


ATACGCCCTA 


CTCGGTTTTT 


GCGTGAGTCT 


TGATGTGGTT 


CTAAAACGAC 


TTGTAATTCA 


2400 


TTTTTGTATT 


TTCCAAATGC 


ATCATTTCCC 


ACAACAACAT 


CTCCAGGTTG 


TAGCATTGCT 


2460 


TGATTATCAT 


GTGGGGGATT 


AACTTCATTT 


TTATATTTTT 


TACGCACTTC 


TGTGGAACGA 


2520 


ATCATCTGCT 


GAGTGATATC 


TCCACGGCGA 


AAATGTTGAT 


TTTTTGTTAC 


GATTTCTTTT 


2580 


TCAATTTCAG 


AAGTCGCTTC 


TTCAAAAACA 


ATGGTCAATT 


CTGTTTGATA ACGATTCAAC 


2640 


TGACCCAATG 


CTTCTAATTC 


TTCGTTGGAA 


GCATAAGCAT 


TTCCAATAAC 


AACATCATCA 


2700 


ATTAAATTTG 


TTGCAAATAA 


ATGTTTAGCT 


TGAACAGTGA 


CTGGCAACTG 


TCTATGCATT 


2760 


TCTAAAGTAG 


GTAGCCCATC 


ATTCACATCC 


CAAGGACCAA 


TTTCACCTAC 


CTGTGATGAA 


2820 


ATAAAAGCAG 


CTGTCCGAAT 


ACCTTCCTTT 


TTAAAACGGA 


TGCTACATTT 


TTCAAAGAAA 


2880 


TCATACGGCA 


ATGCAGTTCC 


CGCTTGTGGA 


TAAAAGTTAT 


GACAACCATA 


TAAGAAAGGT 


2940 


CGATTCGCTT 


CATAC6TCAA 


AATGTTATCA 


AGATAAGCGA 


CATCATTACT 


CATATTCAAC 


3000 


TCAATAGCCA 


CGCCGAATGG 


ATTAAACGTC 


AACATGGCTT 


CTTTATTTCC 


ATCAAACCCT 


3060 


GTATCCAACC 


GAATCCCATC 


AGCTCCTAAT 


TCGCTGAAAA 


AAGTCAAATC 


ATCATATGAA 


3120 


ATTTCCAATT 


CATCAAAGAT 


ACTTGGAACG 


ACATCTAAAA 


TTGTTTCAAA TCCTAAATTT 


3180 


TTCGCATAAC 


TAATTAATTC 


TTTAAATTTT 


TTCTGAACCG 


CTTCTTTGCC 


ATCGGTCACT 


3240 


TCTAACATAC 


TCATAAAAAT 


TCGAGTAAAC 


CCACATTCGC 


TGGCTTTTTT 


TAAATACGCT 


3300 


TTATCTTGAT 


TGATATCACT 


ATGATCTGGA 


TAAACTGATA 


CACCTAATGC 


TCTTTTCATC 


3360 


TTCGTTCTCA 


CACTCCGCTA 


TTTTCAAATT 


TATTTCGAAA 


TTTCCAAGCA 


ATCACCGAGC 


3420 
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TACACAAGAT 


TGTTGTAGAG 


CCAACGAGCA- ATACTGGTTG 


ATTTAGAAAT 


CCGCCAATTA 


3480 


TCATGGAAAG AGTAATACAA ACTGCCGTAA AAACCAACGT 


CTTTACCAAA 


CCAAATTGAT 


3540 


AAAu AAL. Au 1 








ACTAACGATA 


AAAAAGAAAA 


3600 


TGAACCAAAl 








CTGACTTAGA AGGGGCAGGA 


3660 


TATTCATATC 


AGAACACCCG 




ill i AAX ill 


TTGCTGCTTG 


TTCAGCTGCT 


3720 


AATTCTTCTG 


CTTCTTTATC 




mfpmmp K fp » n n 

i i i X WilAAA 


CTTTGAAAAA 


TGGATAGTAG 


3780 


ATAGCCATTG 


TAATTAAGAA 




AUIAAX AUXb 


^^^^^^ « ^« Ik 
CGGCCGAGAA 


ACTCCAATTG 


3840 


GTACTCATCC 


AAGCAGCAAT 


TGGTGCTbGA 




CTAAGCGTGC 


AGCCATCATC 


3900 


GGAATTATAT 


TAGCGACTGT 


TAAGAAATAA 


GAl AAbb lib 


TGGTAATTAA 


CGGTGCCACG 


3960 


ATAAACGGAA 


TACCTAAAAT 


AGGATTCATT 


ACbATAbbCG 


TTCCAAAAAT 


AACTGGTTCA 


4020 


TTAATATTAA ACAAGCCTGG 


TAAsAACsAT 


AAACGGCCAA 


GACTTTTCAG 


ATACGTTGAT 


4080 


TTAGAAAACA 


TCATTAAAAC 


AACTAAAGAA 


AGCGTAGTTC 


CAGCGCCACC 


AATCCAGATA 


414^ 


AACCATTGTA AAAACTGTTC 


TGTAAAAATA 


TTAGGTAACT 


GATGAACGTT 


TGCGCCATTG 


4200 


GTAAAAGCTT 


CAAGGTTTTC 


TGCAATCGAC 


ATATCCCAGA 


AAGGACGAAT 


AACTGGGCCC 


4260 


ATAATCGCTG 


GACCATGAAT 


CCCTAAAACC 


CAGAAGAAAC 


AGATTAAGAA 


GACTGTTAGT 


4320 


AAGCCGCCAA ATAAACTGTT 


ACCAGCAAGG 


ATACCTTTAA 


GTGGCATTAA 


TAACGTACTC 


4380 


AAGAAACCAT 


TCAAATCAAA 


ACCAATTACG 


TGGCGAATAA 


CCCAGAATAA 


TAAAAGAATA 


4440 


ACTGCTCCTG 


GAATTAACGC 


AATAAATGAA 


TTTGACACTT 


CAGGCGGTAC 


GCCATCTGGC 


4500 


ATCTTAATCA 


TAATATCTTT 


TTCAATAAAG AAGCGATAAA 


TTTCAACAGA 


m n It K ^f^^t •III 

TAACAGCGCT 


4560 


GTTACAATCG 


CACCAAATAA 


AGAAGCTGAA 


CCTAAATTTG 




ATAGCGCCGT 


4620 


GCTGTAATGA 


CGTTGTCAAC 


ATCTTCAAAG 


ACACGTGTTG 


GTGGTGCCGC 


TGTTACTAAA 


4680 


AAGGCCATTA 


ATGCTAAAAT 


TCCGCACGTC 


AAAGAGTCAA 


bCTTATAAQ* x 


X I 1 bbUCAA 1 


4740 


GATGAAGCAA 


TCCCAAATGT 


TGCATAAAGT 


GCAAGTAACC 


bWil bbi Al A 


bL>bV3oAAubA 


4800 


ATATCTAAAA 


TTGCTTGATA 


AGGTTCAATC 


AATTTTGTGT 


AlbbL»lUAAX 


TrT'ZXZlTaTTT 
XVjorlnXrlX i 1 


4860 


TGAAAAATTG 


TAAAGAAT6A 


GCCCACAATT 


GTTAAAGGCA 


TCGTTGCAAT 


TAATCCTTTT 


4920 


CTAATCGCTG 


TCATATGGCG 


TTGCGAACCA AATTTATTCG 


CAATCGGCAT 


TAACTTAACT 


4980 


TCCATCCACT 


GTACAAAACC 


ATTCATTTGT 


TAATTCCTCC 


GTTCAACTAA 


TAAArATAAT 


5040 


AATTATTTCG 


CATTTATAAT 


AACATAGTAA 


cATTATAATG 


TAAaCGCTTA 


TTCTTATTTG 


5100 


TGACAGAGAA AGTGCAAAAC 


AAAAGGCTTT 


TTCTTATTAT 


AAAGAAACTG 


GGaATATTGC 


5160 


ACCAAAAATC 


ATTGC6CCTA 


AAATTGCGCC 


CCCAGCGATT 


GGTTTTTTCC 


AAATGTAAAA 


5220 


ACCTAAAGCT 


CCTAATGTAG 


CACCAATCCC 


AATAGGAATC 


GAAGCACCCG 


CCGCACTCAA 


5280 


AATAATTAAC 


GGGCCTAAGA 


AGCGTCCTGA 


GGCATTACCA 


GCCCCCATCA 


TCACATCCGC 


5340 


ACCAAAAGTG 


GAATTTCCTT 


GATTGATAGT 


AAATTTCCGA 


ATCAACACAA 


TTACGCCACC 


5400 
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AACTGCTAAC CCTAGTAGCG CACCTGTTAA AAGTGCTAAG CCAAAGTTTT CTACAGGTGC 5460 

TTTAATACCC ATGCTGAGGA AAATTGCAGG TACGCCAATC CCGATCCCCG TTAAAATTGA 5520 

GCCACCTAAA TCTAAAATCC CGACTAAAGA ACCTTCTAAA ATTCGTGCAA ACAGGAAACT 5580 

TGCACCAAAT GCTGCGGCAG cGCCATaAGA ACCACCATCT TWVACCAGCTT TTAGCATCGC 5640 

AACAATTGAA ACTTCATTAA AGGCGCCCAC ACCATAAACT ACATACATAT GTGTTCCTGC 5700 

AAAAATTGCC GCTGACATAA TCCCGACTAA AATGGGAAAG CCCCAATCAG CATACCAAAA 5760 

ACTTTTGCCT TTTAAACCTT CTTGT^ATTTC TTGTTCTTGT TGTGCCCATT CATTTTTTTC 5820 

TTGCTGTTCC ATCTATGAGA CCTCCTTTTT TCTATTTAAC GGTGAAGACA TTGTGTAAAT 5880 

TATTTAACCA TTGAGGAATA GACAAGCGGA AACTTTCGAT CATTTCTAAG TCGAACCCTC 5940 

GGAAGAAGCC ACTTAAAATA AATAATAAGA TAATGACACC AAGCATGCCG CGTGTCATTC 6000 

GGGTCCACCC TAAATCATCG ACACCTTTAC CGATTAAGAT CCCTAATACA ACTCCTGGGA 6060 

CAGCATTGCC CATGACTAAT TGAGATAATC CGCCAAATAA TGTTCCCCAG AAACCCGTCC 6120 

GTTTTCCTGC ATCTAAAGCC GCTAGCCAGA ATAAAACAGG CATCACAGTA TTGATTAAAA 6180 

TCGTTGCTGC AGGTACCAAC ACCGCAACCG CTGTTACTTG TAGCGATTCT GGAATTGCTG 6240 

CAGCTGTACT ATTTAAGAAA GCCACAGTCG CCATTCCTAC AAAGACCCCG ACAATCCCCA 6300 

TTTTTTTGGG ATTATGCATC GTTTTAGTGA CATCTTTATC CTTTACCAAT AAAATTGCTG 6360 

CAGCCCAGTT AGGCACAATC CGGTGGGTCA CATCTTGTGT AAAGGCCCCT GCTCCGACGG 6420 

CCGAAGCCCA AGCATTAAAG AAAAAACCTA AACCAATUVGA AAAGTGTGAT GCTGGATCTC 6480 

CTTGTGCCGC GTTCATTTCT CCTAATGTTC TAAATGCCCC TAACCCTTGC GTTTGTGGTG 6540 

CATGAAACAT CCGAGCAGCG CCTGCGCCAG CTGCAAAACC TAGTAAACCA CCAATAAGCA 6600 

ATGACTTCAG CAAAATGATT ACAAATTCCA TTCCTTCTTC CTCCCTTTAT ACGCCTCTTT 6660 

TTGAAAGAAA CGGAACTACT ACTGCGTTCG GTTCTTTAAT TGTTTCCGTT AAAAAATCAA 6720 

CTTCTTCGAC TGCAATGGCT TGGATTTCCA CTTCTACAGC TAGTTCGACA TGATAAATCA 6780 

CTCTCTTTCT AGGAAAGAAG AAGAACAGAA ATTTTTCTGT ATAAACTTGT TCTTTTGCCT 6840 

GACGAATAGT TACTTCTTTC GGTTCAATCC GGATGAGGAC CTCTGTTGTT TCTTTTAATA 6900 

CTTTTTTTGA AACATCATTC AATGCACTTG CAAAAGCTTC ATTCTTCTTA TTACCTTTTC 6960 

CAGAAACAAT CACGGTTTGT GTTGACCGTT TTGTGGCATC CATAATTACC GCTCCTTTCT 7020 

AAGTTACTAC TTTCTATGCT CCGTGTTTCT TAATAAATGC TTCAGTTACT CGTTTACCT^ 7080 

GTTCTTCCTG ATCCATAAAA CCAAATCCAA GAACAGTTTT CCCATCGTTA ATTGCTGTTA 7140 

CCCCTTCTTC GATTGATCGC ATGCCATGTC TTTCGGGATA CCCGTATTTA GTAGCAGCCG 7200 

TTAACGCTCC TGCACCACCA CTTCCACAGA AAGAAATGCC TAAATCAGCC TTTTCTTGTT 7260 

GCATCACATC TCCTAAACGC ATATCAGCGC CCATTCCTGG AACAACAACC GCGCGACCAC 7320 

CGGCTTGCTC CACTCCTTTT GCAACATTTT GTCCTTTTCC TAAACGATCT GCGATAACTA 7380 
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CTGTAATCAT 


AACTTTTCCT 


CCTATTAATT 


GTTTTGTTTA 


GCAACTTCAA 


AATGAACAGC 


7440 


TAAAACATAC 


CATTCATCTT 


CTGATAACTG 


ACCAATTGCT 


TCTGTTGTTT 


CTTCTGCAAT 


7500 


TTTCAACGTT 


TCTGGCGTTA 


GTTCCGTAAA 


TAACGTCCGA 


TCAACACCAG 


CTAATGTCGT 


7560 


TCCTTCTCTT 


GAACGATTCA 


GCATCTCATT 


GAGATGGTTA 


ATCAAAATCG 


TCGACTGTAA 


7620 


TTCCGTTGGT 


TCAATTTGTG 


CTTCTGCTT^ 


TCGCGCACTG 


GTTTGTTTGA 


TAATCGATTG 


7680 


TAGCTTTTCT 


GGTTCAGCAC 


TTTGTCGAAT 


GATCTCCTGT 


GCTTCCATTG 


CTAGTTCCAT 


7740 


CCTCTTCCTC 


CTTTGGTTGT 


TCTAAATAAT 


TAAGTAATGC 


TGAAATTTCA 


GCATGATTGA 


7800 


CAGGTAACTC 


ATTTTTAGCA 


AATAAACATT 


CAATTTGCCG 


AATTTCTTCT 


TGCCTTTCAT 


7860 


GAGATACTTC 


TCTATTACAA 


GGAATCTCTT 


CGTATTGTTT 


ATTAAACGTA 


ATCCGATGTG 


7920 


TCATTAATAA 


AACGTGAAGT 


AAAAAGGCTT 


CTTGATATTC 


GCTCAGTAAC 


TTTGTTTGAA 


7980 


ACATTCGTTT 


TAACTCTTCA 


TAAATAATAT 


ACGCTTTTAC 


TAATAAATCT 


TGGCTTTGTA 


8040 


TCCGTTCATT 


TTTTTCAATT 


ACATCAGATT 


TCGTGGAGCG 


AGCGGTACCT 


TTCCCTTGAC 


8100 


GTTTCCCTAA 


TAAAATTTTA 


TCGACCTCTT 


CCCGAATCAA 


ACTTAAGTCT 


GTGTCCGTCA 


8160 


ACAATGGACT 


AACTTTAATA 


JVATGGGTATG 


TCATTTCTTC 


AATTGGGAAA 


ATACTCAATA 


8220 


CCAAATCGGG 


CTTTTCTGCT 


TGAATCACTT 


CAGTTGCATT 


TAAAACAGAA 


GCAAATCCTG 


8280 


CGAGTTCAAC 


GTTTGGTACT 


TCCTCTAAAA 


TTTTTTGTTC 


AATTAAACTG 


GTCACACCTA 


8340 


AGCCCGTCGA 


GCAAACATAC 


ACAATCCTAA 


CAACTTTAGC 


TTCCTGCTGT 


TGTCGTTCCT 


8400 


GAGCTGCCAA 


AAAATGCAAC 


GCAATATATG 


CGCAGAAAGA 


ATCGTTGACT 


AACAAAACAG 


8460 


AACGTTGAAT 


TAATTGTTTA 


CAGGCACTTT 


GAATAGCTAA 


AAACAACGCT 


GGGTGTTTTC 


8520 


CTTTGATTTC 


TTCAGAAAAC 


GGATTGTATT 


CATTAATAAA 


TAAATGTTTC 


TTTGTAAAAC 


8580 


GTAAGGACAG 


GTGTGCAAAC 


AAGTTCGTAA 


ATAATTGGCG 


ATCATTTACA AAAGGATAAT 


8640 


TCAGCTCTTT 


TGAAACGCTT 


TGAATCACTT 


GCTTTGTTAA 


TTGAACCATA 


TCTTGCCGTT 


8700 


GATGGGTTCC 


GAATAAATCA 


CTATAAATAT 


AAAAATATTC 


ATCCGCTAAT 


AAAGGCAAGT 


8760 


CATACCGTTC 


AAACACCTCT 


TGTAAAAGTA 


AAAGTGGTAt 


TCGTT 




8805 



(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

TATTTTTTGC • TTAATATAAT TAATGACATC TTTCTTTTGT TCTTTAGCCT CTGATATATA 60 

TTCCTTTAGT TCTTTATCCG TAATTGTTGA ACTGTTAATT CCTTGTTCAT ACAATTGTCG 120 

TCTAGTTTTT GAAACATCAT CTGTTGGATT CAATTCTCTA TTCTTTAAAT TCTCGGTTGT 180 
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TTCAATTGTT 


ATAGATGATG 


TAGAAGTTGG 


TGTTGTGTCC 


GGATTACTAC 


TATCTTTTTT 


240 


AGAAGACTGG 


GTAGTTTGCT 


GCTTAGTTGA 


TACTTCAGTA 


GATGAAGCGT 


CGCTATTTTT 


300 


TCCCCCAGTA 


TTATTACTAC 


ATGCCGTTAA 


AAATAAGGCT 


AACAATAGTA 


CAATAAGTTT 


360 


TTTTTTCATT 


ATATCCTCCT 


ATAAAATAAC 


GAAACATGGT 


CGGCCTTCTG 


TCCAATCAAT 


420 


TGGATCATTT 


GACGGGGTAT 


TCCAAAGTGC 


TTGTGTTGAC 


ACCCAACCTG 


AGTAAAACTG 


480 


GCGACCATAC 


GGATCATAGG 


TTAAAGTGTA 


ACCATTTTTG 


TACTCATGAA 


CGACAATAGC 


540 


ATGTCCAATC 


CCAACATTTT 


GATATAAAAT 


AACGGGATAT 


CCTTCAGAAA 


GATAGGCGTT 


600 


TAATGTAGCT 


ACATCATTAA 


TAACTTTGAT 


CGAACGGCCA 


TATGATTGAG 


CTGTTATCAC 


660 


TAAATCAGTT 


GCAGAAGCGC 


CAAAGTATCT 


TTGATTTAAC 


CCGCCATAAG 


ATGCCATTCT 


720 


TGTTGCAACA 


CTGACAGGAT 


TTACATTATA 


ACCGTAAGAG 


CCTCTTAAAA 


TCATAGATAA 


780 


ACTTGTTGGA 


ACACATCCTG 


cTGCGCCAAT 


GGTATAGTTT 


CCGATATAAG 


TATT7UVTCCA 


840 


TTGTGGGTCA 


CGTTGAGAAT 


AATAATAAGG 


TTTTTTCTTT 


AATCTTTCTG 


CACGTAATCt 


900 


GTTTGCTATC 


TCTGCTGGTG 


TGcGAGCTTC 


ATCGCCAACT 


AAAAAGGmTT 


TTCCgATcCT 


960 


TTAGGAGCAG 


TTTCrCCyTT 


TCTAACTAyT 


TTGATTTGrA 


TrGCTTCyAA 


ATGTTTAGAk 


1020 


GCAGCTTGAC 


TTCCAGCTGr 


GGCrCCATTA 


GCTGCCCAAT 


TyAACCAACC 


TTTATCTTGG 


1080 


ATATGGACAC 


GGTAATAGAC 


ATCATAATAT 


CTTGAAACAT 


CACCGGmCAG 


TTTAATTTTA 


1140 


ATGGCTTCTA 


ATTGTAAATT 


TTTTCCAGTT 


GTGCCAGAyA 


ATGCCCCATT 


rGCTTTATAA 


1200 


kCTTGCCATC 


CGATATTTTG 


GATGTGCGTA 


GAGTATTCTA 


TrTTTCCAGC 


TAGAGCAGCA 


1260 


TCACTCAGGT 


TGATTTTGAT 


TGCTTCCAGT 


TGTAAATTTT 


TCCCACTTGT 


ACCAGCTATT 


1320 


TCrCCATTCT 


TGACAACCCC 


TTGCCAACCA 


ATATTTTGCA 


CGTGTGTTTG 


ATArTTGACA 


1380 


kTTGGTTTGA 


TTTCATCTGG 


ACGTTTTGCT 


TCATTTCCAA 


CTAAAAAGGA 


TTTTCCaTTT 


1440 


CCTTTAGGAG 


CAGTTTCGCC 


CTTTCTAACT 


ACTTTGATTT 


GGATGGCTTC 


TAAATGTTTA 


1500 


GATGCAGCTT 


GACTTCCAGC 


TGGGGCGCCA 


TTAGCTGCCC 


AATTTAACCA 


ACCTTTATCT 


1560 


TGGATATGGA 


CACGGTAATA 


GACATCaTAA 


TATsTkGAAA 


CATCACCGGA 


CAGTTTAATT 


1620 


TTAATGGCTT 


CTTVATTGTAA 


ATTTTTTCCA 


GTTGTGCCAG 


ATAATGCCCC 


ATTGGCTTTA 


1680 


TAATCTTGCC 


ATcCGATATT 


TTGGATGTGC 


GTAGAGTATT 


CTATGTTTCC 


AGCTAGAGCA 


1740 


GCATCACTCa 


GGTTGATTTT 


GATTGCTTCC 


AGTTGTAAAT 


TTTTCCCACT 


TGTACCAGCT 


1800 


ATTTCGCCAT 


tCTTGACAAC 


CCCTTGCCAA 


CCAATATTTT 


GCACGTGTGT 


TTGATAAwTG 


1860 






1 \y\jn\^\s x X X i 


GCTTCATTTr 




1111 \w>v«n 


1920 


TTTCCTTCAG 


GTGCCGcTTC 


GCCCTTCTTC 


ACTAATTTAA 


TTTGAATGGC 


TTCTAATCTT 


1980 


TTAGCGGCGC 


TTTGACTACt 


GCGGACTCAC 


TGTTTTTTGC 


CCAATTCAAC 


CAACCTTTAT 


2040 


CTTCTATATG 


CACACGATAA 


TAAACATCAT 


ATGCGTTAGc 


tATTTCTCCA 


GTTAAGCGAA 


2100 


TTTGAATCGC 


TTCGAGCCGC 


AAAGACTTTC 


CGCTTGTACC 


AGACAATTGA 


TTATCTTTCA 


2160 
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TTCyACAGAT 
AGAGCGTCTG 
CTTGGGATAT 
GTGTGtCCTC 
TGTATCATTA 
GGAAGTGGAC 
AGTGACACCG 
AGTAGTTCCC 
CAACAAAACC 



CCAGcCAAAT 
CTAgkcCCAG 
GTGtCtGATA 
ATTACTTkGT 
ACAGTTGTTA 
GTTGTTTCTG 
CTACTCATTT 
TTAGAGCGTT 
GCTAACATAG 



CATATCCTTG CCACCCAATT TCTTGGACAT GTGTTCTATA 
CTGAATTTGA AATATTCATT TTAATTCCTX XAAGACGIAA 
AAATTTCCCc GTTTTTGaCA AcCCCTtGGC cAwCCGATGT 
ATTAATtCAG gAcTTTTyCT cTGCtTTGCC gGTGCTATAG 
GATGTTyCCA AAGTAGTTtC AGTAcGTCGC TACTATyTTC 
CTTCTGATGT AGAAACTTCA GTGGTGGAAT CGGAGGTTGA 
TTTCCTCTTG AGAATTTATG ATGCTTGGGT TCATTTCTTC 
GTCGCTCCGT TAGAGATGTT TCTGAAGCCA CCGTTTCTTC 
CCAAAGCGTA AGCAGAAATA GGGGTAACGA GAAATCCCGA 
AACTAGTTAT AACACTTTTT TTCATTAATT TACTCCTT 
(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 
GATTGGACGA AAGTAACTAA TAAATCTGTT TTTATAACTG GTGCTTCAGG CATGATTGGG 
ACCTTTTTAA TTGATGTGCT TATGTACCAA AATAAAACGA GAAATGCGAA TATTTCGATC 
TGGGCGATGG GACGAACATT GTCGCGTTTA GAAGAGCGAT TTACTAGTTA TCTAGAAGAT 
CCGTTATTCC ATATAGTTAT CGGAGATGTC ACAGAAGAAA TTCAGATAGC AGAAGCTTGT 
GACTATGTGT TGCATTGTGC TAGTAATACA CATCCTAAAT CATATGCCAG TGATCCAATT 
GGAACGATTA TGACCAATAT TGCTGGTACG CAGCATATTT TAGATTATGC GGTCAAAGCG 
AATAGTGAAA AAGTGTTATT TTTATCAACT GTTGAAATCT ATGGAGAAAA TCGTGGAGAT 
CTTGATAAAT TTACGGAAGA CTATTGCGGC TATATTGACT GTAACACGCT ACGGGCCGGT 
TATCCCGAAG GAAAAAGAGC AAGTGAATCA TTATGTCAAG CCTATATTCA GAAATATGGT 
ATTGATGTTG TTATTCCAAG AATTAGTCGT ACTTTCGGGC CTACGATGTt ACTTTCGGAT 
AGTAAAGcTT CTTCTCAATT TATTATGAAT GCAGTCCATA AAGAAAACAT TGTGTTAAAA 
AGTGCAGGGA CCCAACTATA TTCTTATGCA TATGTTGCAG ATATTGTATC TGCTCTTCTG 
TTTTTATTAG TTAAAGGGCA AAAAGGGGAA GCTTACAATG TTTCAAACGA ACATTGCGAT 
ATTACGTTAC GTACTTTTGC GGAAACATTA GCCAATGTGG CTGGAACAAA AGTGATTCAT 
GGAGAAGCCA CCGTCCAAGA AAAACAAGGG TTTTCTAAAG CAACAAAAGC ACTACTAGAC 
AATCAAAAAA TCTACGCATT AGGTTGGCGG CCACTGTACG ACAGTATGGA AGAACCATTA 
AAACATACAA TTAAAATTTT AGAAGGTAGC GGAAATTGAT ATGCCCAAAA TTAGTATTAT 
TGTTCCTGTA TACAATGTAG AAAAATATTT AGAAAAATGT GTACGCTCTA TTTTAGCTCA 



2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2738 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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AACGTTTACT GACTTTGAAT 


TAATTCTGGT 


GGACGATGGC 


TCTCCAGACA GTTCTGGAGC 


1140 


AATGTGTGAT 


CAGTTTGCTG 


AACAAGATCA ACGGGTAAAA 


GTTATCCATA 


AAGAAAATGG 


1200 


TGGGCTAAGC 


GATGCTCGTA ATGCTGGAAT 


TGAAATAGCA 


ACAGGTGAGT 


ATTTAGGTTT 


1260 


CGTAGATAGT 


GATGATTACA 


TTGCAGATGA TATGTATGAA 


CTATTATATA 


CAAATATCGT 


1320 


AAT^GAAGAT 


GCCGATTTAT 


CMTCTGTGG 


TATTTATGAT 


GTCTATGAGG 


GGAAAGAGCC 


1380 


AATTGTAAAA AGTTTAATAC AGGGAACTTT TTCCAGAGAA GAAGCATTAT 


TGTTAATTTT 


1440 


ACAAGGAAAT 


J\TTA 1 C rUv^is 


T A p a T^f*T/2T 




TATAAACGAA AACTCTTTGC 


1500 


GGATTTAAGA 


TATCCTAAAG 


GAAAG i Al V«A 




TTTATTATAG 


TAGATTTGTT 


1560 


GTCTGAATGT 


CAGAAAGTCT 






TATTACTATT 


ATCATCGAAT 


1620 


GGGCAGCATT 


AATACAGAAA 


CATTTTCAGA 


T2V2iaP2121TTT 
1 MMrtV-^Mri i. i. 1 


GAGTTTATAG 


AAGCTTGGGA 


1680 


GAAGAATGAA 


CTTAAATTGA 


AAGGTAAGGG 


AGCAGTTATA 


GAAGAAGCTG 


CCCATCAAAG 


1740 


AGTTTGTTTT 


GCGAATTTTT 


TAGTATTAGA 


TAAAATTTTG 


ATTAGTAATG 


CTCCGAAGAA 


1800 


AAAAGAAACG 


AAACAAATTG 


TTCGCTATCT 


TCGGGAAAAC 


TTTATTTTTA 


TTATGAAAAA 


1860 


TAAAGTTTTT 


ACAAAAAGTC 


GTAAACTATC 


AATGATTTTA 


CTTATGTTTG 


GTCTTCCTTT 


1920 


TTATAaAATA 


CCAATTAAAT 


TGAAAAGAAA 


ATAGTCAAGT 


CCAGACTCCT 


GTGTAAAATG 


1980 


CTATACAATG 


TTTTTACCAT 


TTCTACTTAT 


CAAAATTGAT 


GTATTTTCTT 


GAAGAATAAA 


2040 


TCCATTCATC 


ATGTAGGTCC 


ATAAGAACGG 


CTCCAATTAA 


GCGATTGGCT 


GATGTTTGAT 


2100 


TGGGGAAGAT 


GCGAATAATC 


TTTTCTCTTC 


TGCGTACTTC 


TTGATTCAGT 


CGTTCAATTA 


2160 


GATTGGTACT 


CTTTAGTCGA 


TTGTGGGAAT 


TTCCTTGTAC 


GGTATATTGA AAGGCGTCTT 


2220 


CGAATCCATC ATCCAATGAT GCGCAAgCTT 


TTGAATATTT 


TGGTTGATCG 


ATATAATCAT 


2280 


GAATCAATCG 


ATTTTTAGCC 


TCACGCGCTA AGTTAATATC 


TGTGAACTTA 


AAAATTCCTT 


2340 


TAACAGCTTC 


TCTGAAAGAT 


TTTGAATTTT 


TTTTAGGAA!r 


GGTGGTAAAG 


ATATTTCTTA 


2400 


GGAAGTGAAC 


TTGGCATCTT 


TGCCAACTTA 


CGTTGGTGAA 


GGATTTTCTA ATGGCAGAGA 


2460 


CTAATCCTTT 


GTGCGCATCA 


GAAATAACGA 


GTTCCGTACC 


TTGTAAACCG 


CGTTCTTTTA 


2520 


GGTATTCAAA 


AAATGTTGTC 


CAGGTCTCTT 


CGCTTTCGCC 


ACTTTGAATC 


ATGAAGCCGA 


2580 


TAATTTCACG 


GTCGCCATCT 


TTGGTTATTC 


CAATCGCTAT 


ATGACAGCTT 


TTTGAGAGTA 


2640 


CTCGATTTTC 


TTCTCGTACT 


TTTATATAGA 


GTACATCGGT 


CATTAAGTAA 


GGATAATTTT 


2700 


TTTCTGATAA 


TAAACGATTC 


TGCCACTCGT 


TAACCATAGG 


TTCTAGCTGT 


TCTGTTAAGC 


2760 


TAGAAACGAA 


GGACTTAGAG 


ACGGATTTAC 


CACAAAGTTC 


TTCCACAATT 


TTTGATACTT 


noon 


TACGAGTTGA 


AACGCCTGAT 


ACATACATTT 


CCAACATTGA 


AGCCATGAGG 


GCTTTTTCGT 


2880 


TTCGTTGATA ACGTTCAAAC 


ACTGTGGGTG 


AAAAATGGCC 


ATCACGTGTT 


CTGGGTACTT 


2940 


TTAATTCTAG 


CGTGCCTACA 


CGTGTCGTAA AGcTGCGCTC 


ATAATAGCCA 


TTTCGTTGAC 


3000 


TTTGTCGGTT 


TTCTGTTCGT 


TCATATTCTT 


TTGCTTGAAT 


ATATTCTGTT 


CGTTGATTTT 


3060 
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CCATTAGTTG ATTAAATACC GTTGTTAAAA TATTTTTAGA AACGTCATCC TTTACAGAAT 3120 

ATTCAATAAT GCTTTGAATC TCTTCGCTTT TCAGTGTAAA ATGTACTTGG GTCATGTAAA 3180 

AGTCCTCCTG GGTATGTTTT TGTCGTTAAA AACATTGTAC CGTAAAAGGA CTGTTATATG 3240 

GCCTTTTTAC TTTTACACAA TTATACGGAC TTTATCGACA AACTGATAAA TTAAAAGATT 3300 

CAGTAAATAA TTCCATCGAA GATATTATCA ATGAAAAGAG TTTAACTTTA AGTAATCAAG 3360 

TAAAAGAATT AGAAAAAGM TATGAAAGCA AAATACAAGA AGCAGAAACA ATAAAAGATG 3420 

AGAGTATATT AAAAGCAGCA ACTCACCACT ATGAAAAACA GAAACAAAAA ATTGAGGAGT 3480 

CTTTTACAGA AGAAATTAAC AAAGAAGTTG AAAAAACGAT CAATACTGTT GTGGAAGAAC 3540 

AAATAGAAAA AGTGGAAGAA AAGAAAAAGA AGACAACAGA AGATGATGTT CGAGATCATT 3600 

TACGAGGATT TGCCCGTACA "ATTCCTGCAT TCTTAATGGC ATACGGAGTT GAAGATACAA 3660 

GATTAGAAAA TTTTGAGGTT AATATTGATG AAGCCACATT CGAAGATTTG ACGAGTATTA 3720 

CAATAGAGGA ATTCAAGAGG CTCCGTGATG GATTTGAGTA TCCAGATAAT AATGGGGAAA 3780 

CTAAAGTTAT CCCAGGATTA TTCAATGAAG TTGTATTTAA TGCTAGTATC CAGGAATTTT 3840 

TGGAGACAAA AAACCGTTTG GCAAATTATT TTGATGATAG CTTGCAAGAG GACATTTTTG 3900 

ATTATATCCC ACCACAGAAA ACAAATCAGA TATTTACACC AAGACGTGTT GTAAAATCGA 3960 

TGGTAGATTC CTTAGAAAAA AGAATTCCAG GAATTTTTTC GGATAAAACA AGGAAATTTG 4020 

CAGACTTATA TGTTAAAAGC GGTATGTACA TTACAGAAAT AGTCAAGAGA TTAAATCATG 4080 

GATTAAAAGA TCAGATTCCA AATCAACAGG AACGTATTAA ATGGATTCTT GAAAATCAAG 4140 

TTTATGCATG TGCTCCAAGT AATATTATTT ATAATATTGC CAAGAACTAC ATATTGGGAG 4200 

AAATGGAGCA TGTTTCTAAT ACAAATATTA TTGAACTTGA TACACTGGCT TTGGCAAAGG 4260 

AAGCAAAGTT ATCAGAATGT ATTTATAAGG AATTCGGAGA TGTTAAGATG AAATTTGATG 4320 

TAATTATAGG AAATCCTCCA TATCAAGAAG AAAATACTGA AAGGAATAGA GATGACGCTA 4380 

TCTATCACTT ATTCTTAGAT GAGGCGTACA AGATCGCAGA GAAGGTTGTT ATGATTACGC 4440 

CAGCAAGATT TTTATTTAAT GTTGGATCGA CTCCAAAAAT TTGGAATGAA AAAATGCTAA 4 500 

ATGATAAACA CTTATCTGTT GAAATGtATG AACGGAAAAG TGaAAAAATC TTTCCTAATA 4560 

CaTCgATTAT TGGTGGAATC GCAGTCACTT ATCgTGATGC CTCAAAAATC CTAGGACCTA 4620 

TTGGAACTTT TACACCGTAT CAAGAATTAA ATTCAATATT GAGTAAAGTA AAAGAAATTA 4 680 

ACCAAGAAAG TAACTTTAGT TCACTGATGT TCGTACAAAA TAAATTTAAT TTAGATAAGC 4740 

TATATCGTGA TTTTCCAGAT TTTGAAACAC GTCTTGGAGG AAATGGACGT GAACGCCGTT 4800 

TAACATCTTC AATATTTAAT GTATTACCTG AAATTTTTTC AGATAAAAAA ACAAAAGAAC 4860 

ATAATCTGGC AATTTACGGT AGAGAAAaTA ATAAGCGAAC aTATAAATAT GTCTCTAAAG 4920 

AATATATTGA GGAACATCCA AATATGTATA AATGGAAGGT TTTTGTTCCA GCAGCAAATG 4980 

GTGCTTCGGG TACAATAGGA GATAGTCCAG CAAGAATTAT TAGTTTCCCG GTCTTAGGTG 5040 
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AACCTAATTC TGGTCATACC CAAACCTTTA TAAGTTTAGG AGCTTTTGAT ACGAAGTTTG 5100 

AAGGTGAAGC GTTGTTAAAA TACATCAAGT CAAAATTCGC CCGTGCGATG CTAGGTGTAT 5160 

TAAAAGTAAC GCAAAATAAT AAGACTCAAG AAACATGGTC AAAAGTGCCT GTATTAGATT 5220 

TCACAAAAGA TTCTGATATC GATTGGGAGA AACCCTTGCC TGAAATTGAT CA6AGTATAT 5280 

ATAT^AAAATT TAGTTTGAAT CAAAATGAGA TTGATTTTAT AGAAAACCAT ATAACGTACC 5340 

TTGATGAAGA AAGTTTAAAT AATTTATAAG TCAAGATAGA TACTTGTGAT TAAAAAAGTA 5400 

ATTGTATTCA ACTGGCATCT ATTTTTGGTT CAAATTGGTT CAAGTAATAT CAATATCT^T 54 60 

CTATTTCCAT ACATTTTTAT CTATATAACA TTTCCCAAAA CGTTGATATA AATGGATTAG 5520 

GATGTATTCA ACACACGTCG TATTTTTATA 5550 
(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 



AGAAGAAAAT 


AnCATCAAAT 


ACGCTTTCTC 


TAGATGGGTT 


AAAATGCTTG 


GAGTACACAT 


60 


TACGATACGA 


TTGTTGCTTC 


CATTGTGATT 


CCTTTTGAAG 


AAAAAGATGG 


CTnTGTGACA 


120 


GAACTATCTG 


CAGGCCAAGC 


CTTTAACCGC 


ATGATGGGAG 


ATCCTGACAA 


AGAGGTTCGT 


180 


CAACGTTTGT 


TTACAGCCTG 


GGAGAAAGCT 


TGGAGTGGCA 


MGCTTCAAT 


TTTAGCAGAC 


240 


ACCCTAAATC 


ACTTGGATGG 


GTTCCGTTTA 


TCTGATTACA 


AATTGCATGG 


CACTACAGAT 


300 


TATTTAGAAA 


ATCCCTTAAA 


CTATAATCGA 


ATGAGTCAAG 


AAACATTAAC 


GGTTATGTGG 


360 


GaAACCATCC 


AAAAAAATAA 


ACAACCATTC 


GTCGACTTTC 


TAACGCGCAA 


AGcGCAAtTA 


420 


TTTGGCAAAG 


AAAAAATGGA 


TTGGCAAGAC 


CAAGATGCGC 


CAATTATTTT 


AGGTGATTTA 


480 


GAAGAAAAAA 


CGTATTCCTT 


TGATCAAGCC 


GCTGCCTTTA 


TTTTAGAAAA 


TTTCCGTAAA 


540 


TTCAGTCCAA 


AAATGGCTGA 


TTTTGCCCAG 


ATGGCTTTTG 


AAAAAAGTTG 


GATTGAAGCT 


600 


GAAGATCGTC 


CAGGCAAACG 


TCCTGGGGGT 


TATTGTACAG 


AATTACCTGA AACACAAGAG 


660 


TCGCGTATTT 


TCATGACCTT 


TGGCAATTCA 


ATCAACGAAG 


TAGCTACTTT 


AGCCCATGAA 


720 


TTAGGTCACG 


CTTTCCATAG 


TAGCGTGATG 


TGGGATTTAC 


CTTCATTGAA 


TCGAGAGTAT 


780 


GCCATGAACG 


TAGCAGAAAC 


GGCTAGTACC 


TTTGCAGAAT 


TGATTGTTGC 


TGACGCCACC 


840 


TTAAAAGAGG 


CCAAATCAGA 


TGTTGAAAAG 


ATTAACCTTT 


TAGACACAAA 


AATGCaAAAT 


900 


GCCaTTGcCA 


TGTTTATGAA 


CATTCaTGCG 


CGtTTTATCT 


TTGAAAATAA 


TTTTTaTCaA 


960 


GCGCGCCAAA 


AAGGTTTAGT 


AGCGGAAGAA 


GATATTACTG 


AATTAATGTT 


AGCGGCGCAA 


1020 


AAAGAAAGCT 


ACAAAGATAG 


TCTTGGCAGT 


TATCATCCTC 


ATTTCTGGGC 


GAGCAAATTG 


1080 
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CACTTCTTTA TTGATGATGT 


GCCTTTCTAT 


AATTTCCCAT 


A-CACTTTrGG CTATCTATTC 


1140 


AGCATGGGTA- TCTATGCGTA 


TGCCAATCAA 


CAAGGCAGTT 


CTTTTGAAGA CCAATACATT 


1200 


GCGTTATTAA GAGACACTGC 


TTCGATGACA 


AGCGAAGAAT 


TAGCGAAAAA ACATTTAAAT 


1260 


GTTGACTTAA CAAAACCTGA TTTCTGGCAA GCCGGCATTG ATCAAGTCCT AAAAGATGTG 


1320 


GAACAATTCA TGACTTTAAC TGAAAACTAT 


GTAAACTAAG 


CAAAAAAGAA GCAGTAACCA 


1380 


ACATCAGGTT ACTGCTTCTC 


TTTTTTTCAA ATAGTTTCGA 


TACGCAAGGT AGCCCACAAG 


1440 


CATTAGTCCA AACGTAACAA ATTCAACTAC TGGCATGGCT 


AACCAAACAC CGGTTAATCC 


1500 


TAGTAATTTC ACC7VAAATAA 


ATAAAACTGG 


CACAATTAAA 


AAGAACCCAC GCAAAGAGGA 


1560 


AATCATTAGC GCCATTTTCG 


GTTTACGAAC 


TGCCGAAAAG 


TAAT7UVATCC CCATAAAATT 


1620 


CAATCCTGTA AACAAGAAGG 


AAATCGCATA 


TAAGCGGATG 


GCTGTCCTAG CAATTTGCGC 


1680 


CAACTGGTCG TTCCCTTCGC 


TATTAAAGAC 


CTCAATAATT 


TGTCCCGTAA ACACTTGACT 


1740 


GATTAAAAAG AACAGGAAGC 


CGGCGACCAA 


GTACGTAATA 


AAGCTAAGTT TAACAAATTT 


180O 


ACGCAACACC TTCGTTTCCC 


CTAAACCATA 


GTATCGGCTG 


AGTAATGGTT GCGCCCCTTG 


1860 


CCCAATCCCT GTAAAAATAG 


CAATCACAAT 


AATATTGAGA 


TTGGCAACAA TCGCATAAGC 


1920 


TGAAATCGCC ACATGACCAA 


CTAAGTTTAA 


CAAAACAATA 


TTAAATAAAA ACATGACAAA 


1980 


GGCCGAAGAA AATTCATTTA 


AAAAGGAGGA 


AAAACCAATC 


GACATAATTT CTCGAACATC 


2040 


CTGAAATTTC ACTTTAAACT 


TTTCAAAACG 


TAAAACTCGT 


TGCGGCGACC ACTTATGCAA 


2100 


CGTAATCAGG ATTAAGCTTG 


TTAACGGTGA 


CATCACCGTT 


GCAATGGCCG CACCTGCCAT 


2160 


ACCCCACCCA AATTGATAAA T7VAAGACATA ATCCAAAATA ATGTTGACAA TTCCCCCACT 


2220 


GACAAAAGCA ATCGTTGCCA 


AACGAGGATT 


ATGGTCATTT 


CTTAAAAAAG TAATCAGGCA 


2280 


ATTGTTTAAA ATAAATAATA ATGAACAGCT AAGGTAAATC 


ATATAATATT CCTTTGCCAA 


2340 


ACCAAACAAG ACACCGGTCC 


CCTGAAGCCC 


CCGAATAATT 


TGATCGGAAA AAATCAAGCT 


2400 


CACTAAAGTA AAGAGACTCC 


CTATAACGAA AACTAACCCA ATCGTTAAAC TAAAATACTG 


2460 


ATTGGCTTTT TTGATTTCTT 


TTTGCGCAAC 


CGTGGTCGAA 


AACAACGTAG CGCCGCCAAC 


2520 


GCCAAACATC CAACCTAATC 


CATTAATAAT 


ATTAACGAGT 


GGTAAGACAA TGTTAAGCCC 


2580 


CGCCAACGCC TCGGTGCCGA 


TCCCATTTGC AATGAAAAAT 


GTGTCCGCCA AAATAAAAAT 


2640 


TGATTGCGCC GCCATACTTA AAATTCCTAA CAGAACATTT 


TTATAAAACA ACGCAGTT^ 


2700 


ATGGTTtTGC TGTGtTTGCT 


CCATACCCAA ATCTCCTTCT 


AACAAAAAAA GGACTCAtTC 


2760 


TTTTTGCTTC AAGGTTACGC 


AAAAATCAGA 


ATCCTTCAAT 


TATTCAACCC AGCGATTAAA 


2820 


TAATCTGTCT ATTTATTGAT 


ACTTCTAGGG 


TAGCATTCTC 


TCTTCGAAAA AACAACCATC 


2880 


CTTTTATTTT TCTCATTTTA 


AACTTTCTTA 


AGCAGTTGCA 


GTTGTTTTTG AACCACTTGC 


2940 


CAATCTTCTT TCACGACTTC 


TTTTAAACGA 


TAATTATAAA 


AGACAGCCGC TGGATGGAAG 


3000 


AGTGGAATGA TTTGATAGGT 


TTTCTGCGTC 


CAATGATAGT 


TTTGTGATTG CTCATCAAAT 


3060 
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GTTTGAATGG 


GCGACTGAAT 


GACCGTTCCG 


TGCACTGCAG 


TGATTGTTGG 


CTTTGGTCCT 


3120 


AAAAGTCGTT 


GCAACCCAAT 


ATTCCCCAGT 


GTCAAAATTA. .GTTCAGGTTG 


GCAAACAGCG 


3180 


ATTTCCCAAT 


CAAGCAAAGG 


TGCAAAAATT 


TTGATTTCTT 


CTTTGGTTGG 


CGTTCGATTA 


3240 


GGAAATTTGG 


TTACTCTTTC 


TCCCGTTTTT 


GCAATCTGCT 


TCGTTTTTAC 


AGAAAAAGGC 


3300 


CGACTCCTAA 


CCGCACTGGT 


GATATACACC 


GTTTCACGAC 


TTAAGCCAAC 


TTGTTGTAAT 


3360 


TGTTCCATCA ACAATTGACC 


TGT^CGCCCT 


ATAAATGGTA 


CGTTTTCTTT 


AATTTCTTCT 


3420 


TTTCCAGGGG 


CTTCCCCAAT 


CAGCATCACT 


TTTGGCTGTT 


GCGGACCTTG 


ACCTTCCACA 


3480 


AAACCTTCTA ATGGTAAGTC 


TTTCGCTTTA 


GCTAAAACTG 


CTTTTTTCAG 


ATGTTCAGGA 


3540 


TAGTGCATTT 


ACCATCACGC 


CTCTCTTCTG 


GTGTTCTTCC 


TTCGCTATAT 


TGATTATAAA 


3600 


TAAGGCCCAA AAGAAACTCA AAAAAAGGAG CCTATTCATT TAAGCGCTGG 


TTTTTCTACA 


3660 


CACAAAGCTT 


TGTCTTTAGC 


TAAATAAAAA 


GGCCGTAAGT 


AACCAACTAA 


ATTAGACATT 


3720 


GGTAAAACCT 


CCGATGGTAA 


ATCGAGCTTT 


AATGTCTCCT 


TTAAATAGGC 


TTTGCGTGTC 


3780 


GCAATCCGTT 


GCCACATGTC 


TGGATAAGTT 


TGCTGAATCT 


CTTTTTGCAG 


TGTTTCATCA • 


3840 


GCCAAAGCCA 


CACATTCTTC 


TGCACTCACC 


CCTGTATAAC 


CAGGTACAGA 


AGGAATAATA 


3900 


TCCACTTG7UV 


AGAGCATGCC 


ACTTTCCAAG 


CGAATGGCTG 


AATCTGGATA 


AATAGGCGAC 


3960 


GACATCCATT 


CATCATCCGA 


AACTAAATGC 


CCAGGATTCA 


AATGCCAGTG 


ATAGATTTCT 


4020 


TTTGGTAACT 


GTTCTTCAAT 


CGCTTGGTAC 


ATTTCTCGCC 


CTAACAGTCC 


GATTCGAATG 


4080 


GTTTCAAGCC 


AGTGAACGAC 


AGCTTGAAAA 


TAGGGTTTTG 


ctACGCGCTC 


TAAATAATCA 


4140 


CGTTGCGCTT 


CTGGCAATTG 


CTGCTCGTTT 


TCAATAACAA 


AACCTGTTCG 


GCTAGACAAG 


4200 


CCGCCTTTGT 


AGCCTGTAGT 


TAATGATAAG 


GCATCACCTA 


ACTGAATTTC 


TTTGGCAGTC 


4260 


GGATACATGT 


TTGCATATTC 


AAAACGTTGA 


CCAGTTGCCG 


CAATGGTTAC 


TACCGTCGGC 


4320 


GTTTGACCTT 


CATCATTTAA 


CAAAGCCCCA 


ATGTCCGTTT 


CACGCTGTCC 


TATTTCAATT 


4380 


GCATTTAAGG 


CCTTTAACAT 


ACTGCGAGAA 


GCCAAATTCG 


CCCCATATTC 


ATAGTGCGCC 


4440 


AATTCGTTAG 


CATTATTTGT 


TGCGCGAGCC 


CCCTTTGCTC 


CAATCAATAA 


GTGGGTCCCA 


4500 


TTTAGTAAAC 


GAGCTTCTTT 


CGGCAGCGCT 


TTTTTCAGTG 


CTTCGACAAT 


AAAATGTGGC 


4560 


ACATCAAACA 


TAGTAGCAGG 


TTCTTGCTGT 


TGTGTCGTAA 


ACATTTTCCA 


CCCAACGATA 


4620 


CCTATTTTTT 


GAGCAGTTTC 


ATCGAGAAGC 


GTTTCAAAAA 


TCTGCGACAA 


CTTTTGTTCC 


4680 


CCAGCCATTG 


GTTGATTAGG 


TAGTGAAAAA 


GCTAGATAAT 


GAATTAAATC 


GGCTGAGATG 


4740 


CGGGCATGTT 


GGCACAGTTT 


TAAATTTTCA 


TTACCTAAAA 


TAAGTGTCGC 


CGCACCATCC 


4800 


TTATTTAACA 


CAAGAAGTCC 


TTCTTCAAAC 


CTCGGAATAA 


AGCCAGTTAA 


GTATTCAAAA 


4860 


TTACTGCCAT 


•GCTCTTTGTC 


TGCATAGAAG 


ACCAATTGAT 


CGAATTTTTC 


TGTTTCCATT 


4920 


CGTTGTAATA 


CTTTCTGCTT 


GCGCGCCATC 


ATCGTTTCAT 


TCGTCAAATA 


CGTTGGAAAA 


4980 


ACATTCTCAA AAACAGTAGG 


TTCAGCAATA 


TGTGTATAAT 


TAATTTTTCG 


CTTCATTATT 


5040 
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ATCCTCCATT ATTCATCTGC GACAATCAAC GTTACGCCTG CTTIGCGCTAA CTGCTCACAG 


5100 


AATTTTTCAT 


CAGGTCGTGT 


ATCTGTCACT 


AAACAATCAA 


TTTCTGTTAA 


TTCGCAAATT 


slio 


TTAAATAAAC 


CCGTTTTGTT 


TAATTTCGTT 


TGGTCTGCTA 


AGGCAATGAC 


TTGTTTGCCT 


5220 


TGTTGAATCA 


TTCGCCGCTT 


GACTTGACCT 


TCTTCTTCTG 


TGTGAATAAA 


TAATCCTTGT 


5280 


TCAGATAACG 


CAAACACACC 


TAATAATGTA 


TAGTCGGGGG 


CAAATTCCCC 


AACCTTAGCG 


5340 


ACTGTTTCTG 


CCCCAAACAA 


AAAGAGTTGT 


TGTTTATGCA 


GTTGCCCAGG 


TAATACTGTA 


5400 


ATTTTTGTCT 


GATCCATTTG 


CGCTAGAAGT 


AGCGCATGAC 


TGAGAGAGTT 


CGTTATCGCG 


5460 


TGAATCGGAC 


TATTTTCCAT 


CGCTCGAATA 


GCTGCTTCCA 


CCGTGGTTGA 


TGAATCAAAA 


5520 


AAAATGGACG 


CTTGTTTGGG 


AATATTTGCC 


CAAACTTTTT 


GGGCAATCGC 


TTGTTTTTTC 


5580 


TCTGGATGCT 


TCACTATTCG 


TTGAGCATAA 


TCGTAAATAG 


TTGCTTCAGA 


CTGCACTGAG 


5640 


ATAGCGCCAC 


CGTGCGTTCT 


CTTTAAATGT 


CCTTCCTTTT 


CTAAACGAAT 


TAAATCTCGC 


5700 


CGAATAGTAT 


CTTTTGA7VAC 


GGAGACTTGC 


TnCCGTAACG 


CTTCCATGGA AATACTTTTT 


5760 


TGTTCATTCA 


CTATTTTTTG 


AATCTTGGCT 


AAACGTTCTT 


CAATCAACAT 


GCTTCCTCAC 


5820 


CTCTTCTAAG 


TAAAGATAGC 


ACCTTTTGCA 


TCATTTTGCA AACAAATGCA AAACAATGCA 


5880 


AAAAAAGACA 


AAAAAACTAG 


TCCTTCATTT 


TCTAACG7VAG 


GGCTAGTTTG 


AGAGTGGTTT 


5940 


TAATAGGTAA 


ATTGAATAAT 


CATGCCGCTA 


GGATCTTCTA 


ACGCAAGTAA 


ATGATCTTTG 


6000 


GCAATAAAAT 


GTAACTTTGC 


TTCTTCCGCA 


TTGGCTTGGA 


CTATCTCTAA 


ATCTGCCGCA 


6060 


GTTGGTAAAC 


GGAAGGTGTA 


ATTTTCTAAG 


CCAAATTGAT 


TTTCTTGAAT 


CGTTGGCAGC 


6120 


TGTTTGCCGC 


TCCATGTATT 


TGTCCCAATA 


TGATGATGGT 


ACTGACCCGC 


TGCAAAAAAC 


6180 


TTCGCTTGTT 


GACCAAAGTT 


AGACTTTAAG 


CCAAACCCTA 


AGTTTTTATA 


GAACGTTTCA 


6240 


GTGGCATCTA AATCGGCCAC 


TTTCAAATGA ATGTGCCCAA 


TTTTAGAACT 


TCGGAAATTC 


6300 


CTAGCCAGTG 


ACCGTCAGCT 


TCTGCAGCTA AACTATCACC 


GTCTAATTCT 


TCTGTCACAC 


6360 


CGATAATTTC 


ACCATTTGGA 


CGAATATCCC 


AGACTTCTTC 


TGGTTTATCC 


CAATAAATTT 


6420 


CGATGCCATT 


ATTTTCTGGA 


TCACTTAAGT 


AAAGGGCTTC 


ACTATAGCCA 


TGATCTGctG 


6480 


CGCCAATTTG 


CACATTTTCC 


TGCAAAAGCC 


ACAGCAACAT 


ATTGCCTAAA 


TCTTTGCGGG 


6540 


TTGGTAGCAG 


GAAAGCAATA 


TGGTACAAGC 


CAGTGGTACC 


CTGATTTTCT 


TCCCCAGCTG 


6600 


TTAGTTCTTT 


TAAGACAAGA ATTGCATCGT 


TGGTGCCTTG 


TGGCGCTAGC 


CAAGCAGGTC 


6660 


CCTCTCCTTG 


TTTTAAAAGT 


TTTAAGCCAA 


TTGTTTTCGT 


ATAAAAGTTT 


ACCATTTTAT 


6720 


CTAAATCAGC 


CACTTTCAAA 


GTGACTTGTT 


GAATCCCAAT 


ATTACTTGGT 


AATGCTTGAA 


6780 


CCATTTGCTT 


TCACTCCTTT 


CCTCTTCGTT 


TTACTAAAGC 


AATGATTTAG 


TTGAATTCTT 


6840 


TAAAACGAGC 


-ACTATCTTCC 


ATGTATTCTT 


TTTCGcCAGc 


tGGCGCTTCA 


ATGGAACCAA 


6900 


TGACCATTTG 


TGCACGTAAA 


TTCCAGTTTG 


CAGGAATTGA 


CCATTCTGCA 


GCAACCGCTT 


6960 


CGTCAATAAC 


TGGATTGTAG 


TGTTGTAGGT 


TTGCGCCGAT 


ATTTTCTTGC 


GCTAAGGCAG 


7020 



wo 98/50555 PCTAJS98/08985 

820 



TCCAAACGTT CGCTTGTGTT AAACCAGTTG CTTGTTCAGA 


CCAAACAGGG 


AAGTTATCCG 


7080 


CATATAATGG GAAGCTATCT 


TGTAAGTTTT TCACGATATC 


TGTATCTTCA 


AAGAATAAAA 


7140 


TTGTGCCCGC ACCAGCTGCA 


AAACCTTGTA ATTTAGCTTG 


TGTGTTTGGA 


AATGCTTCTG 


7200 


CTGGAGTTAA TGGTTTTAAG 


GCGTCTTCAG TCATTGCCCA 


TAATTTTTTG 


TGAGCATCGC 


7260 


CGAATAAGAA CACAACACGT 


TGTGTTTGAG AGTTAAAAGC 


TGTTGGACTT 


TCTCTAACAA 


7320 


CTTCTTTTAC TAAAGCAGTT 


ACTTCTTCTT TTGAAATTGG 


TAAGTTTTCT 


CCTAAAGCAT 


7380 


AGATTGAACG ACGGTTTTTC 


ATCATTTCAG TAAATTTtGA 


CATAATAtGT 


TCaTCCTTtC 


7440 


GAATTtCCTC AGTAAATyCA TTaATTnTGT kTACATGCTT 


ACTATACTAC 


CTTACTAAAA 


7500 


GTAAGTAAAy CTTTTTGTCT 


TACTTTTTGT AAGTTTTTTT 


TATTTTTTAG 


TGAAAAaCCT 


7560 


TATTCTACCA AAGAAAAAAC 


AACCAGCTTT TTTCTAAGAA AAAGAAAAAT 


CTGGTTGTCG 


7620 


ATTAATAAAT GTAGAAGTCA 


ATTGAATCGT TTTCGGCCGT 


TTTTCTTGAA 


ATAAATGTGT 


7680 


GGCTTTTTCC CCTAATTCCA 


TTAATGGATG ACAGATTGTT 


GAAAAATTTA 


AAATTTCACT 


7740 


TAATAAAGTT TGTTCCTGTC 


CCACTAAAAA GAGTGGTCTT 


TCAATAACGG 


CTTGAATCCC 


7800 


TGCGGCAATT TCATCACCGT 


TGGTCACAAC CGCTTGGAGC 


GTTGGATCTT 


GCCCACAGTG 


7860 


ACGTCCTACT TCACACCCAT 


CTTCATAGGT TAAACAGTCC 


TTAAATAACC 


GATTTTCAGG 


7920 


AATCGCTCCA TAAATTTCCT 


GATATGCGTC AACTAAATAG 


CGTGCGCTTG 


TACTTTTTTC 


7980 


TGGTGCACGG TTAATCGTAA 


TTGCCACTTC ACTACATCCT 


TGTTGTTcAA 


TGCTnGGAAG 


8040 


ACCCGCCGAT AAACGGTTAA 


ACGATCGATA TAGATACTAG 


CATAACTAGG 


GTCTTCTATT 


8100 


TTnTCACAAA AAACAACTTG 


CTCTTTATAT GGAGCT^Ca 


CaCTTAArGG 


ATTTGCCTTT 


8160 


GaCAAAAcAA TCATGCCaTC 


aAACGCACCC. nGTGGCCcAT 


TTTTTCCmAG 


tAACTTTTTT 


8220 


nCAACTTCTT TATCaTAATC CgTTGGTAAT AAACTGACCA 


TCATCCCTTG 


TTGAAATGCT 


8280 


TGGTGGATAA TCCCACTTAA 


AATAGCATCA AAATAAGGTT 


GATTCGTATA 


AGGAACCATG 


8340 


ACACCTAATG TATTACTTTT 


CCCAAAGCTA AGATTTCTCG 


CTAATATATk 


GGGCTTATAG 


8400 


CCCaACTCAT CGATAATTGC TTGGATTyTC AGTCGTTTTT CTTcAGAAAC 


ATAAGGATGa 


8460 


TGATTAATCA CACGCGAAAC AGTCGTAACT GAAGATCCTG AAAGGCGCGC AATTTCTCGA 


8520 


ATTGTTGTCA TACTTTCACC 


TCATTACTTT ACAAAATTGA 


GTTAAATAAG 


GCACAATCCC 


8580 


TGTCCCGTAA TCAACAAAAC 


GATAAAAGGT CTCCTGCCAA 


GCAAAAGAAA 


TTTCCTGAAT 


8640 


ATCCTCCTGC TGTGTCAAGT 


GcCAtCCGTT ATCTGATGGT 


nACGGrATTT 


CTTGATGCCA 


8700 


AATCTTATAG TATTTTTTGG 


GACTATTTAA CTGCAGACTT 


TTGGCCGTCG 


TAGTCACsGr 


8760 


GACAAAATAG TAATCCATAC 


CAAGTCTATC GCTTATCTCC 


ACTATAACAT 


TCATTTTCAT 


8820 


GCCCTCCAAT ATTTTGATAC 


ACCTAGCTTA CCTTATGAAA 


CGCGTACCAG 


AGCAAGAGGA 


8880 


AAAATAAAAA AAGTGAGACC 


ATGTTATACT AGATGTATCA 


TATAAAAGCA 


AAGGATGAGT 


8940 


GCAATGGAAA CTGTTACAGT 


AGCAACAAAA AGCGACTTAC 


TAACAAATTT 


GAACAAGCAA 


9000 
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CAA.7TAACCG 
TTIL-JCCTATr 

TTATTACGrC 
CCGCGTACAA 
ACCATTGTCA 
TGAy^j^JGTG 
AGAAAGAAAT 
GTT7TGT7TT 
CGA7TCCTGA 
TTACCATTCG 
A.GT7ACCAGA 
AAAJiAATTAA 
GCGGCAAAAT 
GCGTACAA7A 
CATAAATGTG 
GC67TAGA7A 
TACGCCGAAG 
CGTGTGGATT 
AAA?TGAGTA 
AAAGCCAAAG 
ATCATTAAAT 
AATCCACCAA 
TAAACGCTGC 
CAACGATTTC 
TTT7ACCTTT 
TTTTGATCGT 
CGACCAATTT 
TTTCAGTTAG 
TGCTTAATTG 
CAGATTTtGT 
CTGCCAAArC 



821 

TCATTATCTC GGGGGCACTT GCACAAGAAA TTTACCACTT AAAAAAACAA 
AAACGGAA6A AATGGGCTTT TTAGTGGGCA GCCAAGGTGC TGGCTCATTT 
TAATCGAGAA GTTATCCTAT ATAAATTCGG CGCTATCGCC AGAAGAACAA 
ATGCGATTTT GCGTTTGTAT GTTATTAAAA AATTAACTCC AGAAACTACT 
7AAAACAGTT AGATTATTAA AATTTTATTT AATCAAGATA GACCAATTCC 
CGAGTCCCTT TCATTAATCA CTTGATATAA TTACCTTTAT TCAAGTTCCC 
TTATCGATTA ACCAATGGTA TATTAGAAGT AACAGATAAG TCATTTTATT 
7TTGGGAAAT GTTTGGGGAG GAATGTTTAT GTGTCAACCG CAGGAAAAAA 
7TCTGTTGCT AAGTTACGTT ATCGTTTTGA AGTACCAATG ATTATCATTG 
ATTGTTTTTC GCATTGGGGT TAGTTGCCTT ATTGATATTG AGTGATTTGC 
CTGGTTATTA ATTGTCTTAA TTGCACCAAT TGCGCCAGTC ATTGCTTTTT 
TTATATGTAC ATGGATACTG CTTCTAACGG AGTGGaAATT ACAGAAAATC 
AATTTATTCC TTGTATATTG ATTTGGCAAA AGAAATGGGC TTTGGCACTA 
GGATGCCACG TCTTTATTTA ATCAACGGGA ATGGTGTCTT AAACGCTTTT 
SCTCTTTACA CAGACGCTAT GTTGTCATTC ACAGTGATTT GTTGGATATC 
CAGGCGATTT TTCGCTGATT CGCTTTATTT TGGCACACGA GCTAGGACAC 
GGCACACTAA TTTATGGCGA CTAATGTTAT CCATTATTTT GAAACCAGTT 
AAAGTTTTAC AAGAACGCAA GAATATACCG CTGATCGCGC AGGTTTATAC 
AAGGTGCTTT AAGTATGATT TATTTATTTT CAGGAAAATA CATGGGCAGT 
TAGAAGAATA CTTTCACAGT ATTGATTTAC ATGATGATAC CATCTGGCTT 
ACTTCTTATC CGATCATCCT GTTGGCTTCC GTCGAATGCA AACATTGAAA 
ATACTGGCAA TTGGGACGTT CATGGGAAAT TTTTCTAGCA AGAGCCAACC 
AAAnGTGCTG GGACAAAAGA TTCTTTAACT TTTGTCCCAG CACGTTTATT 
TAAATGAACA AATTCTTCTT TGGTGATATT TACAAAGTAG TCTTCAAAGT 
TAATGCTTGC CAAATTGATG AATACTGATA GTCCACACCT AATAAAGCTG 
CGTTGTTTCT TTTTTGGCAA AATAATCACC ATAAATCGTT AACTCAATTA 
TTCAACAGTC AAACGTGCAT CCACGATGCC GCCTTTAAAC TTTTCTTCTT 
AAATTTAGGC GCTTCACCAA AAATCCAAGC TTCATTGCCA TACACTTCAG 
CCTAATTTCC TGTTGATCTG CTTTGGTTAG GACATATTCT TTCTCCGCAA 
AGATTCGACA TCAAAGAGCT CCATCAGTAA ACGATTGCGA AATTCCTCAA 
TTGATATTTT TCGTCTAAGT ACGGCCGTAA ATTGGTTACG CGTCCCCGAA 
CCCTTTTGAT TCAATTTTTT TCTTAGAAAC AGTTAAAACA CGTT6AACTT 
AACATCATAC ATCAAGGTAC CATGAGTATA CATCTTCCCC TTTTTTGTAT 



9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 
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ACATGGGVT- GCCAGAAAAT " TTrCCCi:.T CAATTAATAA ATCATTGCGA CCGCTAATCT 
CGGCCCCCGT.XGCGCCCAT? TrATGTA.-.TG CTTCAATAAt TGGTTTAGTG AACGCTTTAA 
AATCCCG^. AGCTCGATGT CCTTCCTSAA CAACAAAACT AAAGCTGAQA TTCCCGAGAT 
CATCATAAAC AGCACCACCG CCCGATAAAC GGCGAGTAAT AACGATTCCC TTCTCTCTAG 
CATAAGCTAA G7CAATTTCT TCATAAGCA? TTTGATTTCG GCCTAAAATA ACACAGGGTT 
CCTSAA7ATA AAACAACACT AAGGGCTCTT CAAAA6AGAG AGTGTTTAAT AGATAATCTT 
CCGTTGCTAA ATTTCCACGA KTATCTCCTG ACTGCATTAT GACATAACGC ATAAGGTCGC 
CCCACTTCTT TTTCTTCTAG TCTATCAirTT TCTTTCAAAA TTGGTTCACT AAACGTCTTC 
TCTGGCCATT C7CAP-AAGCG GATGATC^AC AAAAAAAC6A ACGTGGGACA ACAATCACTT 
T6GATGTTTG TTCG^-CsCTC AAAATTCTAAT AAACAGCAAT TGCAGAACTG TTTAGCTTTT 
TAATCTAACC AAGTTACTAC rTCTGAAACA TCTCGTCGGA CTGTTTCAAA GCCGGGTGTT 
TGAGCTTTGC CAATCGGAAT AATCAGTCCG TTCACTAAAT GATCCTCAAT ATTAAATAAC 
TTGCGAACAG CTTCrCTTTC APAACCAGAC ATCGCCACAG TATCATAGCC TCGATCTTTT 
GCAGAAA3CA TCAGCTGCAT CGCAACT;AG CCTTGGTCTA ACATTAGCCA CTCTCTTTTT 
TGTTCrrCGC TAACATCATA GCACCAAGAA ACATTTTCAT TAATTCCTTG ACGAATTTCT 
TCCGTCATCG T7CC7TGTGC GACAGCT7TA GCACTGATTT CATCTGCATT GGCCACTGTA 
TAAGCAGTCA AATCACCTAA TAAAATAATC ACCGCCGCTG CCCCTTTAAC TGCTGGTTGT 
TGCATCGAAA C7TCGTAAAG GGCCTCTTTT TGTTCTTTTT CTGTGACAAT TAAAAAACGC 
CAT6AATTTA AATTGGTACT ATTTGGCGCG AACATCGCTT CCTCAATTAT GGCCATTAAT 
TCCTCTTTAG GAATCTCAAC A7CTGGCATA AAATCCCGAA CTGAGCGGCG GCTGCGAACA 
ACATCTTGA7 ACATCATATT TCTTCTTCCT TCCTCTTGCC TTACTTGACA TAGTTATTAT 
TACTCACTAT GATGGAGAAA AAAAGTACGC ACTTTTTATT CACCTAGGAA GAAAAAAGTA 
ACCATTACGT GTCACAAGGA GRAACTArGA CAACTGATAA ACAAACCAGT ATCAACTTAG 
CCCTTTCCAC CATTAACGGC AAATGGAAAT TATCATTAAT GGATGAATTA TTCCAAGGAA 
CCAAgCGCAA CGGTGAATTG A7GCGTGCGC TCGACGGAAT CACTCAGCGC GTCCTGACAG 
ATCGCTTGAG AGAAATGGAA AAAGATGGCC TAGTCCATCG AGAATCCTTT AACGAACTAC 
CCCCGCGTGT GGAATATACG CTCACTCCGG AAGGATATGC TCTTTACGAT GCCCTCTCTT 
CACTTTGCCA TTGGGGCGAA ACATTTGCAC AAAAAAAGGC GAGACTTAAC AAATAAGTCT 
CGTCTTTTTT TATCTATTTT GGGTATTATG GGTAAACTTA AGATATTTTT GAAGTCCTTC 
TTGAAXrGA.A TAAACGGGCG AAAAACC?AA GCCTTTCAAA GCAGAACTAT CTGAAAGAGA 
A7ATTr:AXA- TCGCCATCTC GTTCCTTTTG ATAAGAAACT GGCAAAGAAA GTCCCACTAA 
TTGGTTCATT TCTTCAATTA ATTGATTAAG G6AAATGGCA ACTTCGGTAC CCACATTATA 
AACATG7CCC A^TGCTTCAC T7TTGGT7GC TACTAAATTT AATGCAGTTA GGACATCCTC 



11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 
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12660 

12720 
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12840 
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AATGTAGATA 


AAGTCACGTG 


TTTGCTGACC AXCTCCAAAT ATTTGAAATT 


GGCTCACTTG 


13020 


CCCTTGCTCT 


AATTGGATAT 


AACGATCCAT...CAAMTGGAC ^ AGGACACCAG 


AATAGGGTGA 


13080 


AGCAGGATTT 


TGATTGAGAC 


CATAGACGTT 


AAAGAAGCGA 


ACTGCGCTCG 


TGGGTACCCC 


13140 


GTGTAAATGG 


TATTCATTGA 


GAACATAACG 


TTCAGCAGCA 


AACTTATCAA 


TAGCATAGGG 


13200 


ACTCAAAGGG 


CAAATCACCG 


AAGTTTCTCT 


TTTAGGCAAC 


GTTGGCTCTG 


CACCGTAAAC 


13260 


GGCTGCTGAC 


GAAGCAAAAA 


CAAAACGTTT 


GAGCTTTTTT 


TGATATTTTT 


TTATTGATTC 


13320 


CAGTATTTTC 


AATACACTTA 


AAAAATTGAC 


CTCATGGGTC 


TCTAGAGGCT 


GTGCCACAGA 


13380 


ACTTGCCACG 


CTTGCCACGG 


CCGCAAGATG 


AAAGATATAC 


TCAAACGAAT 


GCTTTGAAAG 


13440 


TACTTCGTCC 


AATAATTGCT 


GATCCGTCAC 


ACTTCCTTTA 


ATAAAAACCA ACTGCTCAGT 


13500 


TTGTTGCAAA 


TTTGACACTC 


GCCCCATGGA 


CAAATCATCA 


ATCACAAAAA 


CTTGATACTG 


13560 


TTGACTATAA 


AAATTAGCCA 


AATTAGAACC 


AATAAAGCCA 


GCTCCTCCAG 


TAATTAATAG 


13620 


TTTCTTCACC 


TCTTTGCTTC 


CTCTCTTTTT 


TTTTAATTTA 


ATTCCACTTT 


TTTCGTGTTA 


13680 


TTTTCTCTTT 


GGAATAAAAA 


AATGCAATAT 


CGAATGAACA 


CATAACTTAC 


AATAGCCACA 


13740 


GCTAATAAGA 


GTTCAAAAGT 


AGACAAAATA 


TTTAACAGAC 


TCGTTAGGCC 


TACCTTTTCT 


13800 


GCAAATAAAG 


GTTTAACTTT 


GACTAATGCT 


GTCGCAGAAA 


TTACTAATGG 


GAATGTAAAG 


13860 


GCAGCATAAC 


TTGGATAGAA 


TGTCACTTGC 


ATAATTTTAT 


ATAGATTCAC 


TACTACAATC 


13920 


CCATAAATAC 


ATTGAGATAA 


AATTAGCAAG 


AAAACAACTA 


ATGGCATATA 


CAAATGACCA 


13980 


CCCATTGAAA 


TATAGCCAGC 


TAAACATAAC 


GAACCTGGCG 


CTGTCATAAT 


CGTCAATAAA 


14040 


GGAATGGTCG 


ATTCAGGCAT 


TTCTTTCGAC 


TTCGTAATTC 


TCTTGATTAA 


AATTGGTAAC 


14100 


AAGAGTAAAT 


AAAAGAACAA 


CGCTACCCAA 


AAAACAATTT 


TTCCCAACTC 


ATTGATAAAC 


14160 


AATTGAGAAG 


TATTAGGAAT 


AACGCCAATG 


CCGACAAACG 


TAATAAACCA 


GCTTGGATAA 


14220 


ATATGTTCCA 


GTTCGATTTT 


TGTTGGAAAA ATATGCACCG 


CTACAAAATA 


AATCATTAAA 


14280 


ATAaAATGTA 


AAATCAcTGC 


GCCAATCCAC 


AACACGTTAA 


TCCAAGCAGC 


ATTGGTAAAT 


14340 


ATCCGATCTA 


AGAAAACACA 


GATAACCATT 


AAAGCCATTG 


TAAATGTTGG 


TGATACCGAA 


14400 


GCAATAATTG 


GATCATCTAA 


GGTGCTTAAT 


GCATGTTTCA 


TTGTAAAAAT 


AATTTTTAAA 


14460 


ATAACTAAAA 


TCATTAAGAG 


ACTGCCAAGT 


ACACAATAAA 


TGGTCCCAAT 


TGTTGCATAT 


14520 


CCCAAGGAAT 


AAAGTAAATT 


CCCCAAAGAA ACCATCCCTA AAATTAGTCC 


GCAAATGGGA 


14580 


ATCGGGACTT 


TCCGCAAGAA 


TTTTTTCGCA 


CTTTCTACCA 


TTATGTAGTC 


ACACTCCGTC 


14640 


TTTATTTATT 


ATTATTGATA 


TTAATCAAAA 


GTATGTTATT 


ATACTAAATG 


AAATATTTAT 


14700 


ATTTGTAAAA 


TAAATATTTG 


CAATACCAAC 


TATTAAGATA 


GTAAATAGAA 


AGAGGCGATT 


14760 


TTATGTTCGA 


ATGGATCAAA 


ACATTTATTG 


CTGTGTATGA 


AaCCAAAAAT 


TTTAGTACGG 


14820 


CGGCCAAACA 


ATTGTATATT 


TCTCAGCCAA CTGTTTCTTT 


GCAAATCAAG 


AAATTAGAGC 


14880 


AACACTTTTC 


GATCAAGCTG 


TTCTATCGTA ATGGAAAACA 


ATCGGTTATT 


CCTACCAAAG 


14940 
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AAGCTGATTT TTTATATCCA AAAATGCTCT CGATTATTGA AAGCTTAACT ACTTCTTTTG 15000 

CGCAAGCTGC AGAAAAAGAG AATTTTAAAG AAGACTGCAT TATAGCCTCT TCAAATACAG J5QSD 

TTGCTATTTA TTTACTGCCA GATATTATGG AGAGTTTGGT TACCGCTTTT CCACTCATCA 15120 

ACTTTTCGAT TCAACTGATG AATTCAAACG AAGTCGTGGA TATGGTTCAA AATAATCATG 15180 

CCCACATTGG ACTAATTGAA AAACCCATTG AAACCAAAAA TCTACACAAA GAAATTGTTT 15240 

ATGAAGATCA ACTAGTTCTA GCAGGAGATG CCGCTTCAAA ATTTTGGCTC TTAAGAGAAA 15300 

AGAACTCTGG CTTACGTTTT TTCAATGAGC TTTATCTCAA CGAACACAGC ATCAACTTAC 15360 

CAATTATCGA ATTAAACAAC AACGAAGTCC TGTTACAATT GCTAAAAAAT AACATTGGGC 15420 

AATCAATTGT TTCTCGATTA TCTATTTCCG ATGAGATCCC TTGGCAACCA ATTGATCAGT 15480 

CCTTTGCTTC TCGAATUU^TT TTCATTGTCC AGACCGATCA CCATGCCAAT ACTTCCTTCT 15540 

CTGAAGTTTA TAACCGAATC GTTGAAAAAA TTCGTGAACG ACAAACCTAA AAAACAGGAG 15600 

AACGACTTAG CTTTCGCTAA ATCGTTTTCC TGTTTTTTAT TCATCAGTTT CCTCTACCAA 15660 

TTCAAATTCA AAAGGTTCTA ACTCAAATAG TTGGTGTAAA ATTTGCTGGG TTTCTTGGAA 15720 

CACTTGTTGT GGCTCTTTAC ACAAAAGAGT GCTTTGAATG GTCTGTTTTT TTGATTGAAA 15780 

AAGAATTCTC GTTTCCTCTT CCTTAAATAC AACTTGATGA GTTcCCTTTC AAACACCGTA 15840 

TATTCTTCGA ACATTTCCAT AACTAAtTTT TGACGAAACC CTTCTTTGAG TGCTAATAAA 15900 

GTTACTAAAT CTGCAACGGt TAGCGkTTCT AACGCAAnTG kTTCaTCyTC TACATGAACc 15960 

AAGATAATnC GtACACTTCA CATtCGGACA GGCATTGCAG CCGCCATTAA TCAAGTCTGA 16020 

GGTAGTTGTT AyTTTCACCT GTTTCATCTT AATCCTTCTT TCCTACTTTT CGAGAAAAAC 16080 

ACAAGGGGTC TCCTCAAAAA TATTCAATAA AATATGATTT ACTTCTTGAA AGACTTCTTC 16140 

GTGATTCCGG CATTGTTTTT CCATTACTAA ACGCAGATCA TTCACAACAA AAGTCAACGT 16200 

CGTCGCTTGT TGGGAAACAT GTATACAACG GTTGCCTTTA ACAAAACATT CTGTTTGTTC 16260 

TTCAATGAAT GGTTCTTTTT GATAGCCATA TGCTGAAGCG ACACATTCCA GGATTTCGTA 16320 

GATTTCTATT TTTGGAAGTT CATAGCTACA GCTACCGATA ACTAGTTGAA AACGGTTCGG 16380 

CTGTTTTTCA TTATTGCTTG CATAATACAC TGTTGTCGTT GTCATTTTGC CTCCTTATCT 16440 

CCTCTATTTT CCTACTCTTA TCATAACAAA AAAAGCACGC TTCCCTTATT TAAATTGTTA 16500 

ATAAAGACTA TTACAGATTG GTATAACCAC ACCTAAAAAG TTCGTGATAT AATTCACTAA 16560 

GAAAGAAATA TTACAGAAAG AAGAGATGAA CGATGAAAGT ATTTAACAAA AAAGTCGCAA 16620 

TTATTGGTAC TGGTTTTGTT GGCACAAGTA TTGCCTATTC CATGATCAAC CAAGGGATTG 16680 

CGAATGAATT AATCTTAGTT GATATTGACA AAGCCAAATC TGAAGGCGAn CAATTGACTT 16740 

ATTAGATGGT GTGkCyTgGG GkCmAGAAAA TGTAAACGTC TGGGCTGGCG ACTATCAAGA 16800 

CTGCCAAGAT GCCGATATCG TCGTGATTAC AGCTGGCGCT AaTCAAAAAC CTGGGCAAAG 16860 

TCGTCTAGAT TTGGTTTCAA TCAATGCAGa AATTATGAAA ACAATTGTTA ACAATATCAT 16920 
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GAAATCTGGT TTTGATGGAA TTTTAGTGAT TGCCTCAAAT CCTGTCGATG TACTGACTTA 16980 

TGTGGCCTGG CAAGCTTCTG GTTTACCTGT TTCAAGAGTA ATTGGAACTG GTACAACTTT 17040 

GGACACAACT CGTTTCCGCA AAGAACTGTC TCAACGTTTA GCGATTGATC CACGCAATGT 17100 

TCACGGCTAT ATTATTGGCG AACACGGGGA TTCTGAAGTG GCTGTCTGGT CTCATACCAT 17160 

GATTGGTACC AAACCTATTT TAGAAATTGT GGATACGACA GAGCGCTTAA CTAGTGACGA 17220 

TTTACCAATC ATTTCTGATA AAGTGAAAAA TACAGCTTAT GAAATTATCG ATCGCAAACA 17280 

AGCGACCTAT TATGGGATTG GTATGAGTAC TGCACGCATT GTTAAAGCCA TTTTAAATAA 17340 

TGAACAAGCT ATTTTACCTG TCTCAGCTTA CTTAGATGGG CAATATGGGC AACAAGATGT 17400 

ATTTACAGGG ATTCCTGCAG TCGTTGGCAA TCAAGGTGTG ACTGACATTA TTGAATTGAA 17460 

TCTGAATGCC GCTGAAAAAG AACTCTTCCA AAAATCAGTG ACACAATTAA AACAAGTGAT 17520 

GGCATCGTTA CAACCGAACG CTTAGTAATT TTTAACTAAA AATAACACTT CAACTAAATT 17580 

ATTCTCTACC AAAATAGATT CTAATTCCCC TTTATCGTTC TTTTTGGTAC CATGAAAGAA 17640 

GAACTTCAAC TAAAGCAGAA ATTAGGAGGA CAAAAAATGG TAATCC/U^GG AGATACGTTA 17700 

GAAAATAGCG CACGTCGTTT ATTGCAAGAA CGTGGCGTAA CAATGAATGA TTTAGCTGAA 17760 

CTTGTGTTAT TTTTACAAAA AGATTATATT GACAATTTAA CATTAGATAC TTGTTTAGAA 17820 

AGTGTTGATG CCGTGTTAAC AAAGCGTGAA GTACACAACG CGATTATTAC AGGTGTCCAA 17880 

CTGGATATTC TGGCAGAAGA AAACAAGCTA CTCTCACCGT TACAAG/UUIT TTTAACGGAA 17940 

GATGAAGGGC TTTATGGCAT TGATGAAATT ATGGCGCTCT CCATTGTAAA TGTTTATGGA 18000 

TCTATCGGTT TTACGAACTA TGGGTATATC GACAAGGTAA AACCAGGCAT TTTAAAAGAG 18060 

TTAAATGCGC ATGATGGCGA ACATGTCCAC ACATTCTTGG ATGATATCGT GGGCGCAATT 18120 

GCGGCTGCGG CTGCCAGTCG GTTGGCCCAC TCACAACCAG ATTTAAGTGA TATTACTAAA 18180 

TAAAAAACTT GCCTCTTCAT TTTTATGAGA AGAGGCAAGT TTTTTATTTG CTGGACTTCT 18240 

CACAAGCTTT CTACTTAACA GTAATTATTT GCCGTGTTAT GTTACAGTCA AGAATAAACA 18300 

GCTAAAGGAG GTTATCTAGA AATGAAAAAG TTTTTACGCG TATTTTTAAT TTTTATTGGG 18360 

GTATTCTTTC TTGCAAGTTG CGGTTCAAAA ATAGAGACAA AAACCTTTGT AGGTTCGCCA 18420 

CAAGATGGGA TTGATTCCAC CTTAACCTAT ACGTACXIAAG GAGATAAAGT CTTAACGCAA 18480 

ACGGCCAAAA ACATTGTTTC TTATGACAAA TTGGGGATTA CAAAAGAAGA GGCAAAAACA 18540 

GCGTTGGAAC CCGTTTCTAA ACAATATGAG GACATCAAAG GGCTTGATTA CAAACTGACC 18600 

TATGAGGACA AACAAGCCAT CGAAAAATTG ACGATTAATT ATGAAAAACT TGATTATGAT 18660 

AAAGCCAAAA AAGTGGATGG TATCCAGATT GATGGAGATT CTTCTAAGGG AATCAGCATG 18720 

AAGAAAAGCC AAGAATTGGT GGAGTCACAA GGTTATACAG AACAAAAATA ATAAAAGAAA 18780 

GCGTGCGACA ATGGGCATTG TCGCACGCTT TCTTTTTATT AATTACATT6 ATGTTGCCAT 18840 

CTTCACTGCT TGAATTGGGT CAAAATGATG GACAGAGTAA ACGACTGTAC CGGCTTGCCA 18900 
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AACTACTGAG TTTCGATTAT AGTCAGTGGT TCCTGCCACG TTCAGTGTGA TTTGACCAAC 18960 

GGTATTCGGT GCTGGTAATG TTTCTTGTTC CAAAATGTTG ACAACATTTT TCGCTAAATC 19020 

ATCAGGCGAT TCACCATTGA TATTTGAGGC CCGTACGACT AAACTCCAAT TGCCTTCTTG 19080 

CCAAGATAAG TAACTAGAAC CTGCCGCCCC TTGTTTATAC CCAGTAATAT TGTAACCTAA 19140 

ATCTACTGGT TGACCGCCAT TGTCAATGAT TTGGTTCACT GCATTTTTTG CTTCGGCATC 19200 

AGAGCCATAT GTTTTTTTCT CAAAAGAACT AATTGGCGTT AACTGATTGA CACGTGCATC 19260 

ATTCACTGGA ATCGCTTCTT TTTCTGCATA ATATAAAACA CGGAAATTGT TTTGATCCGC 19320 

TTGTGAAGTT GTCGCTGCAG TTAAAAAGTT ATTCGTTTGA CTCGTTAAAA TTGCCTGCGG 19380 

TAAGCCTTGA CCAGGAAACA TGTTTGCTAG CTGGTTAAGC ACTTGGTTGG CTTGGTCATC 19440 

TGGATTCGCA TTTTTCGCTG GTGATTGCTT CTCTGCTGAG CCATTATTTG CCGCTTCCTT 19500 

CGTAGAAGAA GCTGTTGCTT TCGAATCAGC TGTTGTTGTT CCTGACGGTT TCGCATCACT 19560 

AGATTTTGTT GTTGCTTTGC TACTTGATTC TTTTGTAGCA GATGCTTTTG TACTCTCGAC 19620 

TGTGGTACTT TCCGCAGCAC TGGAACGACT ATCCGTATTT TCTGTACTTT TTCCGCCGCC 19680 

GCATGCTGCT AAAGCAAGGG TTAAAAGAAA AGCACTAGAA ATAAGCCCTA TCTTTTTCAT 19740 

AAGAATTCCT CCTTGTTTTC CCTACCCTTA TTGTACCTTA ATCATGTGTA TCCGTCACTT 19800 

GTTTGTGTTT CGTTCTAAGT TGGTTTAAAT AACAGAAAGT GGACGGTAGG ATTCGT^CCT 198 60 

ACACCTCAAC ATTGAGCAAG AATAAACTAC TGTTAGTCTC TATGAGTTTA ACACCACGTT 19920 

CATCCACCTA TTTGATTGAT CCTTTGTATC ATAACTTAGA ATAATTAAAT TGCCTTTAAG 19980 

GAGCTTCAAA GATTATATGA AGAATTGTAA AAAATCTGTG ACGAAAAAAC GATTGAACCA 20040 

ATCAGTAGAT ACACTGAGGT TCAATCGTAA CCAATTGTTT AATTATCTGA AAAAACTCAC 20100 

AATTCAACGT TAGTCCCTGG TGCTTTTTTG TTTTCCCCTT GTTGTTTTTT TAAGAATATC 20160 

CCCAAAATGA CGGTGATAAT GGGACAAAGC AATGGGAAAA AGGCATATGG TAAGTATTCT 20220 

AGTGCACCAA CTTTTAACGT TCCCATAATG AAGGTACCAC TAACTCCCCA AGGAATTAAC 20280 

GAGTTGACTG CCGCCCCAGC ATCTGCCAAA GTACGAGTTA AATATTTTTT ATCAATACCT 20340 

AAACGAGTAA ATGAGGATTT AAATGTTTCC CCTGGTAAAA TAATTGACAA ATATTGTTCG 20400 

CCAACTAGTA AGTTAATTCC CACAGAACTT AAGGCAGTTA AAGCGATTAA TTTGGCTGGA 204 60 

TTATTCACAT AGCCTTTAAT TTTATCAATT AAAGTAGCCA CGATGTTAAA CTTAATCAAC 20520 

AAGCCGCCCA AACCTAATGC CAAAATAATC AACGCAGCAG AACCTAACAT GCTTTCAATC 20580 

CCACCACGAG ATAAAAGTGT ATCAATACTT TGATCCCCTG TATCTGCCAC ATAACCACTC 20640 

ATTAAGATGG TAGAAACTTT CGCTAAACTT AGGTGATGAT CATTTATAAA AGCTAAGATG 20700 

ACCGCCACTG TAGAGCCGAC TAGTAACGTT GGAATAGCCG GTACCTTTTT CCAGGCGAAT 207 60 

AAGAATAAAA CCGCTACTGG CAACAATGTA ATCGGTGAAa TCCAGAATCC TTGATGTAAC 20820 

GTTTGGACCA TTGCATCAAC CGATTGTAAA TCCGCTGAAC CAGATTGATG CCCCAAGAAG 20880 
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ATATAGCCGA CAATCGAAAT AATAAACGCC GGAATAACAG TGI^VCATCAT GTTTAGTATA 20940 

TGTTCAAAAA.CIG.XTCACGCC ACCAATTCCA GCAGCTAAGT TTGTTGTATC AGATAACGGT 21000 

GAAATGTTGT TCCCTAAAAA GGCACCTGAA ACAACTGCAC CAGCTGTCAT CGCATTGTCG 21060 

AATCCAAGAA TATGTCCAAT ACCTAGAAAG GCAATCCCCA TAGTTGAAAC AGTCGTAAAG 21120 

GAGCTCCCTA CCGTCACACC AACTAGTGCA CAGACCACAA AAACGGTTGG TAAGAAGAAC 21180 

TTCACTGAGA TAATTTTAAA ACCATAGACC ATAATTGTTG GAATCGTCCC AGATAAGATC 21240 

CAAGTAGCAA CTAAGACACC TATCAGCAAG AAAATAATAA TCGGAATGAT CCCTGGTTTG 21300 

ATCCCTTCCA CAATGCCCTC GTGGATCTCA TCCCAAGAAA ATCCTTTCCA TTTGCCATAA 21360 

AGCAACAACA ACATGAATAC AAAAAGAATC GGTATGTGAG GGGTTAATTT TTGACCAATA 21420 

ATTAAGTArc CGAGAATCGC TAACAAGGCA ATCAAAATGC CTAAAcTTTC AGCGAACGAA 21480 

ACAGTGGGTT TCGTTGATTT CATATGAAAA CCTCCTTTTA TGGAAAGCGA AAACTGAAGC 21540 

AATAATCGAA CTTGCTTCAG TTnCCAAACG ACAATTTACT ATAGTCATCA GGTGAAAAGG 21600 

CACTCAGCAA AATAATTGCT AGCCTTTTTA TTCACCTAAA TCTTCTTTTA CTTCCGTTGC 21660 

CCAACTTGCT TTTTTAAAGC GATTAmTGAT TAACGGAATC ACAAACATT^ CAATAGTGAT 21720 

TAATGCAACT AAGACTGCAT CAGTCACGGT TGTTGTACCT AATGTGGCGA AAATAACAAT 2178 0 

GATAACACTA AAAATTAACA TTGCTGCCAC TAAATAAGCT AAGCCATTAC CTTTTTTACC 21840 

AATTCGGTAA GGACGTAGTG TATCTGGTTG AGATTTACGT AATTTAATTA GGGAAATCGC 21900 

AATCAATACA TACACAATAC TATAAATAAT TGTTGTTGCA TTTGTTAAAG TTAAGAATAC 21960 

AGCGTTGACA TCATCCATTA CCCCATAAAG TAAAGCGAAT AATGAGATAA CCACAGATTG 22020 

TGtTAACACA ACGTTACGAG AAACGCCATA TTTATTTTCG CGATGGAAAC CAAATTTTGG 22080 

TGGCAATAAG CCATCACGAG CZAACTTGAAT AATTGTTTTT GATGGTCCAG TAACCCAAGC 22140 

AGATAATTGT AATAACACAC CTAAGAAGAC CATGAAACTA AAGATATTAC CAATAATTGT 22200 
TGGTAAACCT AAAACGTCAC AATAAAGCAA AATTGGTTGT GTAATGTTAG ACAATTCCAT 22260 
TTTGCCATTT GGAACAATAT TAGCAACAAA CATTGCATTA ATAACGTTTA ATT^AAACTAA 22320 
ACCAATTAAT GAAATGAAAA CGCCTTTTGT ATAATTTTTT GTTGCATCTT TCAAACGAGG 22380 
AATATAAACA GAAGACATTT CAATCCCAAC AAAGATAAAT GCAATCCCTG CTAGATATTT 22440 
AAATGTGTCC ATATTTTCTA AGTTTGGTAA AACTTTTGAC CAAGAGAAGG CACCTAAATA 22500 
GCCGCCAGGT GTTAAGCCTA CTTTAATCAT TGAAAGAAGA CCTAAAACAA ACATAACTAA 22560 
TACTGGGATG TAAACACCTA GCCAAACACC AATATTTCCG CCAACTTTCG CCATGTCAAA 22620 
TTTCAAGTTC AAGATTGTAA TAATCCAGTA AATCACTAAG ATACAACCTA ATACGAACCA 22680 
ATGATTTTCA CCTAAATGAA CATTaCCAAA TGTATTaCCT AGCAAAGGGC CTAATGTAGA 22740 
AGCAACCATT ACCATTCCTG GGAACATTTG CACCCATAGT AACCAAGCGA CAACAAAACC 22800 
CCATTTGCTA CCTAAGGCAG TTTTAACCCA AAGTTGAGGG CCACCTTCTT GAGGAAGCAT 22860 
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CGTAGATAAT TCACCAGATA TCAATGCGAT TGGACCAGCA AAQAaTAAGA CCGCGAAAAT 
TGTGTAAGAA ATTAATGTCC AACCAACAGC. AGCAAGTGXX GGGATACTAC GTACAGTTGC 
ACACAATGCC ATTGTTAAAC CGATAAAAGT ACCTAATTCA ATTTTTTTAA CTTGTTGTTT 
ATCCAATTTT GTACACTCCT TATAAGATTT TGGATAAGGA GCATGAATGA CTGTTATACG 
CTGAGTAAAA CAGTCATTCG ACTGCCTCTT ACCCAGACAA AGAATTGGTT AAGAGGTTAG 
CCATCTGATT GATTTATCTC TTAAATAAAT CAATCAGACG AACGTTCCCT CATTCGGTTA 
CCATTTCTTT TTTCCGTTTA AGATTCAATT ATTTTACGTC GTAAATTTGT TCTAATTTTT 
CTTGTAAAGC AGCTTGAATC TTAGGAGCGT AAACATCAAA TTCTTCTTTA TCGTTCATGT 
ATGGAGTCAT TACAGCAGCA CGTAACACAG TTACTTTACC TGCACGGTTC CATTCTTCGT 
CACTAAATCC CAAGCTGTTT ACAAATTTTA ATGGGcTGTT GCCATAATCT GGAATAGCAA 
AGTCAGTATG TGAAGTAATG AATTCGTTGT TGTAAATATT TCCTTTAACG TATGATGCAT 
AGTCATATAC ATCATGGTTT AATTTGTTCA TTGCTACTAA ATCATCGTTG CCTTTTTCTT 
TAAACACATA GTCAACCATG TTGAAGTCTG GATGAGTTAA TGTGTGGACT TCGATTTCTT 
TGTCGCCAAC TTTGAATGTT AAATCATTCA AGAAGTTATA GAAATGATGT GATCCTTCAA 
TTGAAGCACC AATTAATTTA CCATAACCTG CAACGTTTAA AGGTAAAACG TGATGTGCAG 
CCCATACAGA AGCAGCTGTT GCACCAGCTT TTGAACCTTC AAGGATATAA GCACCTAGTA 
ACGCAGGAAT ATCGGCACCT TTTTCAAATA CGTAAGTTGC GAAGTAAGAA ATAACGTCGC 
GCATACGGAT ATCTTGGATA ACGATACCAC CAGCTGAATA AGGGATATAA CCCATTTTAT 
GAGGGTCAAT TGTTACTGAT TCTGCTAATT CGATTGCTTT AtATGCATCA TACACTTCTC 
TTGAAATGTG TTCTTTTTTC TCTTTGAAGA CACCGTATTC TTCGTG7UVCA TCTTGTAAAT 
CTTCGTAAGG GATGAAGTTG TTGTCTTCGT CTAAGAAGAT GGCACGTCCA TAACCACCAT 
AAGCAGCATC AACGTGTACA TAATAGTAAA TACCGTCTTT CATTAATTCA TCGCGTAAAG 
CAATAATTTT ATCGATTGAG TCAACGGCAC CTTCTTCAGT AGAACCAACT ACACCGACAA 
CACCTAGCAC TGGAATTTGT TCTTCTGCTA AGCCACGAAC GATTTTTTCT AATTCGTTGA 
TATCCATACG GTAGTTATGG TCAACTGGTA CTGGGATAAC TTGATCTAAA CCGATACCGA 
TAATATCAGC AGCTTTTAAC CAAGAATAGT GTTTAGTTTG TGGAACTAAC CATTTACCGA 
TTGCTTGTAA ATGTTTACCT GAACGAGCTG AATGAGCTTT GATTTCATCA ATTTCATCTT 
CAGCTGATTC TAATAAGTCC ATAATTTCTT TTGTTGGCAT GTTCAATAGT TCCCAATCTG 
ATTTGCCAGC AACTAATTCT GGTTTTACTT CTTTCATAGC AAATGGTAAT GATTTAATGT 
TACGGGCATA CCATAAGCCT TCTAAGTTAG CTAAAGAACC ATCAGCAACG ATGTGTCCCC 
AACCATTTTTGTAGCTCATT AAGTGAGCAA ATTCATGTCC TACTTCTTCT TCCATTTGAG 
AAGTTGCTGG AGAAGATTCA TAGGCAACGT TGTTCCCGTT CCATAGCATT GCAAAGTTGT 
AAGCTAATAG AGAAGGCATT AATGTTTCTG AGTTCATATG TCCCCAATAT CTACCTGCTG 



22920 

22980 

23040 

23100 

23160 

23220 

23280 

23340 

23400 

23460 

23520 

23580 

23640 

23700 

23760 

23820 

23880 

23940 

24000 

24060 

24120 

24180 

24240 

24300 

24360 

24420 

24480 

24540 

24600 

24660 

24720 

24780 

24840 
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TATGCCATGG AACTGAATGT GTACGCATGC GAGAAGAGAT TTCATTCAAT ACATCTTTCA 


24900 


TATGGTTGAC 


AGTTTTTTCG 


TAGCTTTCAG 


ATGTGCGTTC 


TTGAGAAGAG 


ATAACTGGCA. 


. . 24561) 


TGTCTTGTGG 


CATGTAGTTT 


TGACGCCATC 


CTAAATGTTC 


ATCTACTAAG 


TCGATCAACA 


25020 


AGTCTTTATA 


TAATTGTCCG 


TTTTCGGCTT 


TGTCCCCAAT 


GAATAGTGCA 


TTAAGGTTCA 


25080 


TTTCGCCTTT 


TGCTAATTTT 


TCGTTTkTCA TACGAAAATT 


CCTCCTTACA 


AATTTTAATG 


25140 


AATTAGATTT 


TGCATTCACA 


ACTTACACCC 


AAACGGCTAA 


AGTGTAAAGA 


AGAaAAATAA 


25200 


CAATGGtTAT 


CATAGCTCTT 


TAATTTTATA 


ATTTTTTCAC 


AAGGTGTCTT 


GTCaGaACCA 


25260 


TTATATaAAA AAATAAGAGT GGGaTTgTGT 


GtACAATCCC 


ATCTTATTAG 


GTGTATTAAC 


25320 


GGTTTCACCA CTACCTGTAA GAGnAAATAG AATTTTAACT TTGATC7UUVT AATCACACGT 


25380 


CTTCTCTTCG 


TGAAATTATT 


ATCAATTTCT 


AATCCATTAC 


TTTTGCTAAA AAGTAATTTT 


25440 


TCTtCCmTTT 


CCGTACAACA 


ACGAATTTtC 


CATCGAAATG 


TTGTGTTGGA 


TCGACTGCAA 


25500 


AATCTAAATC 


GGTAACTCGG 


TCACCGTTAA 


TACTTATAGC 


TCCTCCGCTC 


ACATCTTCGC 


25560 


GAGCTTGACG 


TTTTGATGGT 


TCGATTTCTG 


TGTCMCTAA 


CCAATCTACG 


ATGTTTTTAG 


25620 


CATCTTTGGT 


AATTTCAACA 


GTTGGCACAT 


GTTCTAACCC 


TTGCTCGATT 


TCCTCAATCG 


25680 


TTAAGTCTTT 


AATGTTGCCT 


GAGAAAAGAG 


CTTCTGAAAT 


TTTTTGCGCT 


TCTTCTAATG 


25740 


CTGCATCATC 


GTGAACAAAT 


CGTGTCACTT 


CTTCAGCTAA 


GCGTCTTTGT 


GCTTCACGTT 


25800 


TACCTGGTTC 


TTTTTCAACT 


TTTTcAGTAA 


CGCATCGATT 


TCTTCTTTAT 


CTAAGAAAGT 


25860 


AAAGAATTTC 


AAGTATTTGA 


TGACATCACG 


ATCATCTTGG 


TTTAACCAGA 


ATTGGTAGAA 


25920 


TTCAAATGGT 


GAGGTTTTCT 


TAGGATCTAA 


CCAGATAGCG 


CCACCCGCTG 


TTTTCCCAAA 


25980 


TTTTGTACCA 


TCTGCTTTTA 


GCATTAAAGG 


AATGGTTAAC 


CCGAATACTT 


TGGCTTCTGG 


26040 


TCCTTCTTTT 


TTACGAATCA AATCTAATCC 


TGCAGTGATA 


TTGCCCCATT 


GATCAGCGCC 


26100 


ACCAATTTGC 


AATTGAATAT 


TATGTTTTTT 


ATGCAATGTG 


TAAAAGTCAA 


TTGATTGAAG 


26160 


AATTTGGTAT 


GTGAATTCTG 


TAAAGGAAAT 


GCCACTTTCT 


AAACGACTAG 


CGACAATGTC 


26220 


TTTTGCCAAC 


ATCGTGTTGA 


CATTAAAGTT 


TTTCCCGTAA 


TCTCTTAAAA AATCTAATAA 


26280 


.AGATAGTTCT 


GATAACCAAT 


CGTAGTTGTT 


CACCATTGTT 


ACCTCAGCAT 


CTTTACCAAA 


26340 


TAATTTTTTC 


ATTTGGTTTG 


AAAGACTGTC 


CACATTATGT 


TGCACAGCTT 


CCATCGTTTG 


26400 


TAAGACACGT 


TCGGTTGTTC 


GTCCACTTGG 


GTCACCAATT 


GTTCCAGTTC 


CGCCACCAAT 


26460 


TAAAATGTAT 


GGGTGATGAC 


CTGCTAATTG 


GAATCGTTTC 


ATCATCATAA AAGGAATTAA 


26520 


ATGTCCAATA 


TGCATGCTAT 


CTCCAGTTGG 


ATCGACACCG 


CAATATAGCG 


AAATGCTCGT 


26580 


ATTTTCTGTA 


AGTTCTCTTA 


GTCCTTCTTC 


GTTTGTTTGT 


TGATTGATTG 


CATCACGCCA 


26640 


TGCTAGCTCG 


• TCAATGATAT 


TCATGTTATC 


ACCCTCCATA 


TTTATAATTA 


TCTCTTTATA 


26700 


GGAAATCGTT 


CATTTAACGA 


AACAAACGAT 


TAACACACTT 


GTTTTTACAC 


GATATCCCGT 


26760 


GTTTATTTCT 


ATGTATACAC 


GGGTTCACGA AATTTTCTTT 


GGTCTGCCAC 


GTATTCAAAA 


26820 
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AAAGAATACA AAAAATCCCT AAGTCGTCGA AACGACTTAG GGACGAATTT CTCGTGGTAC 26880 

CACCCAAATT TATACATAGT TTAGTTTATT TAAAATACTA TATACCTCTT AACAAATAAC 26940 

GCTGTTACCC GTGCCCTGTA GGCAGAAGCT CAAGAATGTA ATTCGATAAA TAACGTGTAC 27000 

TAGTTTCCAG CGCCACTAGC TTTCTGTAAC AGTGGTTATT TCTCTACTGC TTTTCTGTCA 27060 

TTGCTTTGTA TTTAACTTGT ACATGTAGCA TATTCCTTTT CGTTTTTTTT gTCaACACTC 27120 

TTkGCTTAAt TTTTkTTACT TTTATAAAAG ATGTCCCTTT TTCACAAAAA TAAAATAACG 27180 

AATTCAAAAT AATATTTATT TGTTTTAAAA GCAACTACAG ATAAAAAAAT AACGAAAAAA 27240 

TTTAAAAAAT ATTCTTTTAA ATTTTTTTGA CTTTTTTTGk GATTTTCTAT ACTTTTATAA 27300 

AATTCCCGAr GAAAAATGAC GAACTTCGGA TCAGAAGTTC GTCaTTTTGT GAAAaaCAGT 27360 

TTTAAAGAAT TGGTGAAAAT AATCGTGAAA TTTGTTGTTT • AAAAATCAAC CATTTTGACA 27420 

TGTCGCGAAC AACTTCTGGA GTCATTGTTG TACATTTTTT TCGATCTTCT AAGAAAATTG 27480 

CTTCTAATTG ATCAATGATT TTTTTATCAT ATAACACAGC ATTTGCTTCA AAGTTTAATT 27540 

TGTAACTTCG GATATCTTGA TTGGCTGAAC CAACTGTGCA GACTTCATCA TCCATGATCA 27600 

TTGTTTTCGC ATGTAAG/^ CCACCGTTAT AAATTAAAAT TTCAATATTT TCTTTCATCA 27660 

ATAAGCGGCC GTAATACTGT GTTGCTCGAT AAATAAACGG ATGGTCGGGC TTATCTGGAA 27720 

TCATAATTTT AACATCTACT CCTGAAGCCG CAGCAACCTT TAAAGCAGCC AAGACACTAT 27780 

CATCAGGAAC TAAATACGGC GTTTGAATCC AGACTCTTTT CTTGGCAGAA GTAATCAATT 27840 

TAATAAAGGC CAATTTAATT TGTTCACGGT CACTGTTAGG ACCACTGGCA ATCATCTGAA 27900 

TGGATGTATT TGCCTCTGGC ACCAATGCTT CAAGTTTAAA GAAATAATCC AATTGATACG 27960 

CCACACGATT TTTCTCGGGG GAAGAGACGT TCCAATCCAT TAAAAAACGC ATTTGCAGTA 28020 

ATGAGGCTGC TGGCCCTTGA ATCCGTAAAT GTGTATCCCG CCAATAGCCA AACTTTTTAG 28080 

TCGTTCCGGC ATATTGATCG GCAACATTAA AGCCGCCAAT GTAGCCTACT TTTCCGTCAA 28140 

TAACAACAAT TTTCCGGTGA TCATGATAAT TCAAACGAAA CTTCAAAAGT GCTTTTTGAG 28200 

AAGTAATAAA CGTTTGGACA AAACCACCGT TTTTTTTCAA TTCATTCAAA TGATGAACCT 28260 

TGGTGCCCTT TGAGCCAAAG GCATCGTAAA GCAACCGCAC TTCAACCCCT TCAGCCGCCT 28320 

TTTCTTCTAA TAAGTTCAAA ATTTTGGTGC CGATATGATC CGTTACAAAG GCATAATATT 28380 

CAATATGAAT GGAGTGCTGA GCTTTTTTTA TATCCGCCAT CAACGCATTG AATTTTTCCG 28440 

TACCATCCGT AAAAATTTCA ACATCATTTT TCTTAGTCAG AGGCATCCTA TTTAACGAGG 28500 

AAAAAAAGTC TGTAAACTGT TGTTGCTCCT CCGTCGGCAT CTTTTGACTA TAAAGTTCAA 28560 

AACTTTCTTC TTGGAAATTC TGGAAATTCT CTAATTCCCG TAAATCACTT TGTTGCAAAT 28620 

AAAATTTCTT TTJcGTCGGkT AAACCTCGAC CAAAGrATAA ATATAAGAAA AAACCAAAAA 28680 

CGGGTAGTGG 28690 
(2) INFORMATION FOR SEQ ID NO: 139: 
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(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 614 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 



AAATTATTGA 


AAAATCACCG 


AAGCAAACTC 


ATACAAAATA 


TACAAAATGG 


CAAACAGAGT 


60 


CCATTTATAA 


ACAAAAhATA 


AAAAGTCGTT 


TAGACTTTTT 


ATTAGAACAA 


AGTTCTTCAA 


120 


TTAATGACTT 


CCTAGAAAAA 


GCAGAAGCTT 


TGAATTTATC 


GGCAGACTTT 


TCTAATAAGT 


180 


GGACGACCTA 


TCGATTGCTT 


GACGAACCAC 


AAATAAAAAA 


TACTCGTAGT 


CGAAGCTTGT 


240 


CAAAAAGTGA 


CCCCACTAGA 


TATAnTTATG 


AAAAGATTAT 


TGAACGATTG 


A7VAGAAAATA 


300 


AAAACGTTCT AACAGTGGAA GAATGTGTCA AAAGtACGAg AAAAAAATGA-ACAAGAAAAA 


360 


AATAATTTTG 


ACTATCAaTT 


TACmATAGAA 


CCaTGGCAAC 


TGTCTCACAA 


AACAGAaAGA 


420 


GGATATTATA 


TAAATATCGA 


TTATGGCTAT 


GGCAATTCTG 


GAAAGCTCTT 


TGTAGGAGGG 


480 


TATAAGGTTG 


ATCCATTAGC 


CAACGGTAAT 


TACAATATCT 


ATATTAAAAG 


AAATGAGTAT 


540 


TTTTATTTTA 


TGAGCGACAA 


GGATGCTACT 


AAAAGTAAAT 


ACTTAACAGG 


GGCTTCGTTG 


600 


ATTAAACAAT 


TACGACTGTA 


TAATGGACAA 


ACACCACTGA 


AAAGAGAACC 


TGTGATGCAA 


660 


ACCATTGATG 


AATTAGTTAG 


TGCCATTAAC 


TTTTTAGCGG 


CGAATGAAAT 


CGAGGACACA 


720 


AGACAACTAA 


AATTATTAGA 


AGAAAAATTA 


GAAGTTGCTT 


TTCTTGAAGC 


AGAAAAAACT 


780 


TTAGAGACTC 


TTGACGAGAA 


GATGTTAGAG 


CTGCACCAAT 


TAAGTAACCT 


CTTATTAGAA 


840 


AATGAACTGC 


AGGGAGATCC 


AGAGTTAATT 


CAGAAAAAAT 


TGAAAACATT 


GTTACCGGAC 


900 


GCAACGCTTG 


CGGAATTTTC 


TTATGAAGAT 


GTTCGAGGAG 


AAATAGAAGC 


AATTAAAACC 


960 


AGTCAAACTT 


TGTTAGAGAG 


TAAACTGGAA 


CGAACTCGAA 


ATGAAATTAA 


TCAGCTTCAT 


1020 


GAAATTCAAG 


CCGTACAGGA 


AAAAGAGCCA 


GAGAAGCAAA 


CCGACATTAA 


GCCCAAACTA 


1080 


TAAACTATCT 


ATTGGTGAAC 


GGAGAGGAGA 


ATTTTATGTT 


AGTTTTTAAA 


CGTATTTTGT 


1140 


CTGTAATACT 


ATTTTTACCT 


TTTTTTATTC 


TTCATTTACT 


TTTCCGATAT 


ATTGGAATTT 


1200 


TAACATTGCT 


TACTAGTGGA 


TTTACCACCA 


CTTTATTATT 


GATTATTGCC 


CGAGTTTTGG 


1260 


CTTtTCTTTT 


TTTAATAACA CCAATTGTGA 


TCTTAGCAAC 


AGAAGAACTT 


AGAATCGAAT 


1320 


GGGCAAGTTG 


GGTTGGTTGG 


CTTTTATTTG 


GATTACTTGT 


TCTGCTTTTT 


CATTATCTAG 


1380 


CTGCCTTGAA TGAAGAATTA GGAGGCTGGT TTTTAISATTT TTCTGATTGG TTAAGCAAGG 


1440 


TTCCTAGCAA ACTGTGGGCT 


GAATAATTGC 


TCCTATGCTA AAACGCCTTA ACTTAAAAAA 


1500 


GCCAAGGCGT 


TTTATGGTAT 


AATATACGAG 


ATAGTAAAGG 


TTTATGACGG 


TGGCTAATCT 


1560 


CTGAAAAAGG 


GGTGATTGCC 


TATGTTGATT 


TTTGGCTATA 


CTCCTAGAAA GGACTAGCTT 


1620 


TGACTATTTT 


TGAAGCGTTA ATGTTTGCGA 


TTACTTTTGC 


GACATTAATT 




1680 


GTGATAAAAA CAACAaAAAA TAACCGTCAA CTTTAGCCGA GTTTGGTTAT TTTTtGTAAA 


1740 
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AATATTTAAC 


TATCGGTCAC 


CGTCTTAAAC 


GGTGCTATCA 


TGAGGGCGTA TGACTAGTAC 


1800 


GTCCTCTTTA 


CATGTTTATT 


ATAACACATT 


CTCTACAAGA 






1860 


TTTTTTTCAG 


CGCGCCTTAA 


CGGCGGTGGC 


GGTGGGAGTT 




A A A AAA APfSP 


1920 


TCtTTTCTTG 


ATTTCTAATC 


AAGAAGATAC 


TCGCTAAAAG 




/^n* Ti Tf ''k'0'i"pA 


1980 


TATTATGATG 


ATAAGGCGTG 


CTGTAAATTC 


TCTTGCCATT 




mfn^^m m mvn A 

1 ioi 1 i lAiV^ 


2040 


TTTTCTGGTA 


CTTsCTGACG 


GTTACATwtn 


CCTTGTATTT 






2100 


TATTCTAATG 


TGAAGTAAGG 


CATATCTGGC 


TTTTCTTTTC 


m fp p TV tp TV 1\ A 




2160 


GTTTTTCCTT 


GCGTATGCCG 


TTCGATATAG 


TGTTGACTGC 




r'/^<^T A A a T A 


2220 


CTTCCTTCTT 


GAATGATTTC 


TTGTAATGAA 


TGTGGTGTGA 






2280 


TCAGCTTCTA 


ACGCTTTAAT 


TTGGTTTTTC 


CGTTCTTGAT 


AC rCaC iTTl c 


TTCAATCTCA 


2340 


CTTTTCAAAA 


GGTTTAGCGT 


GTTCATAGCG 


TTATCGTGAG 


CTACTTGTAA 


GTTTTCAGGA 


2400 


TACAATACAC 


TATCGTTATT 


AAGTGGGGTG 


TTCAACTCTT 


CTAACATGTG 


TAAATAGTCC 


2460 


ATATAATAGT 


CAAAACGCTC 


CCCTTTTCGT 


ATAAACCATT 


TTTGAAACTT 


TGTTAAGTTC 


2520 


ACTTCTTTCG 


GTAGTTGCTT 


AATTTGCGAA 


TAATGCACGT 


Ik ^MMniiiiw/*»Hmk rffi 

ACTTTTCTAT 


TTCGTTTATC 


2580 


ATAGGCGCAT 


TTCGCTCTTT 


TAACGTTTTT 


TCTAACATGA 


}k tpni^ Ik mr« AAA 

ATTCATGAAA 


GGTTGGATTT 


2640 


CTTGTTTTTA 


AtTTTTGtTT 


ATtCGCTTTT 


AGCCATTTCT 


TAGTAATTAC 


TCTCATATCT 


2700 


ACTTTATGAA 


AATACAGTCC AAAAACGTTG 


CGATAGTCGT 


CAAAAATGAT 


TTCATTCGCT 


2760 


AAGGTTGTGG 


CACCTATCTT 


TTGTAAAAAC 


TCAATTTCGT 


TTCGGTATTT 


GTAAATATGA 


2820 


GCTAGTTCTT 


GACGTTCTAT 


TTCAGGTAAC 


TGTAAGTACT 


TTAATTCCGA 


TACAGTGGTT 


2880 


AGTTTTTCTT 


GCCAGTTATT 


TGGATAGAAT 


ACTGTATTTG 


TATAAGCACT 


ACTCATAAAA 


2940 


CCATAATTTG 


CTAACAAGCC 


CTCATGCCAA 


TGTCCGTTCT 


CATAATGGGC TTTAACATGG 


3000 


GTATCGTTTG 


AAAACCGTTC 


AAGGTTTGAC 


GGCTCATAGA 


TAAGGTTTTC 


TATTCCATCT 


3060 


GTAATCGTCT 


GCCAATAACT 


ATGGGTTTGA 


ATTTCAATTC 


GTTTTGCACG 


AACGAGAATG 


3120 


ATTCCAAAGG 


AGTAAAATTT 


CGTTGGAAAC 


GATAATCGTG 


AGTATTTTGT 


TAATCGTTTT 


3180 


TTTATGATAG 


GGcAATTCTT 


GCGAGAACTC 


GCAAGAATTG 


TCTCCTTTTT 


GTTTTGCCAT 


3240 


TCATACGTGG 


GTATTTGTGC 


AGTACACCAG 


TTAAAAAAGG 


CTTTGGGTGG 


GGTTAATGGT 


3300 


TTTGGTGTTT 


TTTTATTCAT 


ACGCACTCGA ACAAATCAAT 


TTGTTCGTCC 


ACCTCTTTTC 


3360 


TTTCTGATTT 


TGTTTCTGAT 


TTTTTGGTTG 


GTTTCTTTTT 


TGTTGTGGTC 


GTTACTTTAC 


3420 


CAACTGCTTT 


GGTTTCTGGT 


AATTTCTCTA ATAAGAAATA 


CTCTTTTATC 


CAACTGAAAA 


3480 


CCACTTCGTC 


CGCCACCATG 


GCGCTTGATT 


GGTTTTTGGC 


TTGTTCTTGG 


GCTTTTCCTA 


3540 


TGCAGTAAGT 


CATAGCACCT TTTAACGTTC TATCTTyTTT 


GAGAATACCT 


AGAAAAAGCG 


3600 


CTTCGTCTGT 


TTGTCGGCAT 


AACCAGTTAT 


GAACAACTTC 


TAGTGTACTG 


TCATGGTCGG 


3660 


TTTCTAGTTC 


TTTTArCATT 


TTCTCAAAAG 


CTTGTTCTTG 


CGTGCTTTCT 


TGAGCTGTAA 


3720 
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AGAAAAATCG TTCGTAAGAA wkTTTGrCGC -=^-TCACTAG CCAyTlcrTAk TTrTTTCCTT 3780 

GGTCTATyGT TTCGTCAATy GTTAGAAAmA STGAyGArTA GTkwCCGTtG GggTCTTTwA 3840 

TCGyAACrAT yGCTTGTTCw GGTAATTTTT a----5GTAAGy CArTAACGTT TCGACTGGAT 3900 

AAAGAGCGAC CTCATAATTA G3ATAAAAAC .--PJUUU^CTG AACGTTTTTT CCGTCTTTTT 3960 

CAATGTGTTC TTTTCTGAAA GCGACACTGA lArAGCCTTG TATGTCGTCA AAAATTGTCG 4020 

TTGCTAGTGT TGGTTTGTAT TGTTTGAGTG C37GTGT7AA TTCATTTACA GTTTGTATAT 4080 

TTTGAAAGTC CATAACGTTC CCTCGT7TCT TTTAAGCTTC CCCTCTTACA TATGTAAGAG 4140 

GGGAGCAAAT TTTCGCTTGT CAAAGG7CAG TC3GTACGTT ATACTTGAAC TACGGTTAGT 4200 

AATTCAAGTA AAGTCCTATT AGAAATCTTT T3?CGTTAGC AAAAGGTTTC TTTTTTTGTC 4260 

TTTTTTTCGT GGAAATACTT TGATAATGGT 7CAAAATCTT TCATATCTCC ATTGCAATTC 4320 

ATAGTACTTA CTAGCCACTC TCCAGACTGA CGTTGTGCAT ATAAATAGCC GTTGTTATCA 4380 

GAGCCACGAT AAGGTGTAAA GGTTACTTTC CCT7CAACGA GTTCCGTCCA ACCGTCAATT 4440 

CGGTCAAGGT CAGTTAGCGT «GGTATTGr T3T7CTCTTA TTTCTTCTAA TGATTGTGCA 4500 

AGATACATTT CTGTTTCTGG TCGAACATAT A3ACAAAACT CGTGCCAACC ACAACAAGTT 4560 

AGTTCCCGAC AGCCGACATT GTCGTATTCG TAAAGTACGT TTTCTTTCAC ACCTAAACAG 4620 

GACAGGCCTT TTTTCTGATT aATAACTTTT TSAATTTTTT CAGGGTCTTG GTAATGTTCA 4680 

TATAAAACAG CACCTACTCC CTCTATATAA CCGTCCCAGT GACAATATAT GCCTTGATAC 4740 

GTTCCGTCTG CTTGTTCTTT AAAAATTGCT GrTCGTGTTC CCATAAAAGC TCTCCTCTTT 4800 

CGTTGTTTTA AATGATAAAC TCTGTTGCTT C3TGATTATC TTCTGTCAAG ATAATTTCTT 4860 

GGTCGTAATA CATGTTAGAA ACCAATTCAT GrGCCTGTTC TTCGCTTTCA GCAAATACAG 4920 

AAACAGCTAC CTCTAAGGTT TCAACAATTG irACACAGTA TTTTTTCTTT TTTTCTGCCA 4980 

TATTTATACG TCCTTCCATT CAATGTGATT TAAAAAGAAT GCTAACTGAT TCAAACGATT 5040 

ATTGGcATTT TTCAGAcAAC CATAGTrTAG TAACAATGTA GGCGTATCAT TTTCAAGAAT 5100 

GGCTTTATTT AACTTCTGAT ATTCTT7ACG TSCTTGCTCT AGCTGATTTT TTAGCTGTGG 5160 

GTCGTTCATT TTTACTACTT CCTTTCTGGT ArtCTATTAT TTTTGGTAAT CTCTATCAGT 5220 

GTTTTTTGCG CCTCTAACGC AGTTAAFJVTT GGTAAGCAAT CTCCTAAAGT CATACCGTTA 5280 

CCGACAATTA CAAAAGGTTG AAGTAACTGC CCTACCGTCA TTTTTTCGTC TGAATGAAAA 5340 

AGAGTAAATT CTTTTTCtAA AATATTTTCG CCTTGTAACA AACATTCAAT ATACACTTTG 5400 

CTAAAGTCGT AAAAACTTTC TATTGAAAAT AATTTTTGAA GATAATATTC TACAGTTTCA 5460 

AAATCAATGC AATCAAAATA CAAATTATTT 7CATCAATCA AAAAACATTG AGAGCGTAAT 5520 

GTGTTAAATA AATCTAAACA TTCTTCTAAA TCTAGCGATT TTAAATCGTT ATTTAAATAA 5580 

CGATTAATGG ATAAGCTATT AATGAAATTT TCTTTTTTGT TCATTGGTTC TCTCCTTTAT 5640 

TTTTTTGTTT TTGAGCAACC ATTTTTGTCT G3CGTGAGGG AGGGAAGCTG TTCGCTTCTT 5700 
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TGGCTCATGC TAGGACGTGG GACACTATCC TCCTTTCGTT TTTAGATTTG ATTGTTCATT 5760 

TTCGCCAGTA TGTTTCGGTC GGTTGCTGTG CCGAGAAAGA ATCATAGACA CCAAGAAAAG 5820 

ACAAGGGCGA ACGGAGTGAG TTCATATACC CTTGTCTTTT CGACTGGCTA TGATACTGTA 5880 

ACAAACAGCG ACTGACCGGG ACATACGACA ATGAGCGGTC ATAAAACGAT TGAGAGGACA 5940 

CACGTCCGTA AAAgCATGAG CCAAAGAGGC GAAATTTTAA TGCACTTAAA AGAACTAGAC 6000 

GCAAAGCGGC TAGTTCTTTC TTTTGTTTTG CCTTGCTTTA GCAgGCATTT CTTTCGAGCG 6060 

GCGGTGGCGG TTGGAGCGwG ATAGwAATAA ACACTAAAcA aAGTAATnAG GTATGAAAAA 6120 

TGGAAAGACG ATTTTTCAGT GAATAG 6146 



(2) INFORMATION FOR SEQ ID NO: 140: 

(ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 656 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 



TTTTTGTCAT GATTTGTCAA TACTATGCTT CAGTCGCTTC GCAAGGTTTT GGGACGGAGC 60 

TAAGAAATCA ATTAATGAAG AAAATTAATC AGCTTTCACA CAAAGAATTG AATAGTTTTG 120 

GTACAGATAC CCTCATCACC CGGATCACAA ACGATATCAA CCAGCTTCAA TTAGCTTTAG 180 

CGATGTTTAT TCGGTTGGTC ATTCGGACAC CTTTTTTAAG TATCGGTTCT GTGGTGATGG 240 

CTTTTTACAT TGACGTGCAG ATGGGCTTTC TTTTCCTATT ACTTTTACCA ATTTTTAGCC 300 

TTATTCTCTT TATTATCATT AAAGTGACTG TGCCTTTATA TCAAAAAGTC CAAGAATATT 360 

TGGATCGGTT AAACCGTCAA ATCAGTCAAA ACTTAAGCGG TGTCCGTGTG ATCCGTGCGT 420 

TTGCTAGAAA GGAAACAGAG CAACGACATG TTGATAAAGC TTCAGATGAT TTAGGCGATA 480 

TTTACATTCG TGTATC6AAT GTsTCTGCTT TATTAACGCC TTTAACCACT TTGATTATGA 540 

ATGTTGGAAT TTTATTTTTA CTTTATTTTA GTGGCTTAAA AGTTTCTTTT GGTTCCTTAC 600 

AACAAGGGGA AGTTTTAGCA TTGATCAATT ATATGAATCA AATGATGCTC GCTTAA 656 



(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2776 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

CGGAAGATTG ACAAAAGATC CTGACTTACG TTACACCGCT AGTGGTTCAG CTGTTGCGAC 60 

CTTTACACTT GCTGTAAACC GTAACTTTAC GAATCAAAAT GGTGATCGTG AAGCAGACTT 120 

TATCAACTGT GTGATTTGGC GTAAACCCGC AGAAACAATG GCTAATTATG CCCGTAAAGG 180 
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TACATTATTA 


GGTGTTGTCG 


GAAGAATTCA- 


AACTCGTAAC 


TAGGAGAACC 


AACAAGGTCA 


240 


ACGTGTCTAC 


GTAACTGAAG 


TGGTTTGTGA 


AAACTTCCAA 


TTGTTAGAAT 


CTCGTTCTGC 


300 


TTCAGAACAA 


AGAGGAACTG 


GCGGCGGTAG 


CTTTAATAAC 


AACGAAAATG 


GTTATCAATC 


360 


ACAAAATCGT 


AGCTTTGGTA ATAACAATGC 


CAGTTCTGGA 


TTTAATAACA 


ACAACAATAG 


420 


TTTCAATCCA 


TCATCTTCTC 


AGTCGCAAAA 


CAATAACGGT 


ATGCCTGATT 


TCgATAAAGA 


480 


TTCTGATCCA 


TTTGGTGGCT 


CAGGTTCATC 


TATCGACATT 


TCAGATGATG 


ATTTACCATT 


540 


CTAAACGAAT 


TTGATAGGAG 


GGAACAAGAA 


TGGCACAACA 


AAGAAGAGGC 


GGACGTAAAC 


600 


GTCGGAAAGT 


CGACTATATC 


GCAGCAAACC 


ACATCGAATA 


CATCGACTAT 


AAAGATACTG 


660 


AATTGTTAAA 


ACGATTCATT 


TCAGAACGTG 


GCAAAATTTT 


ACCACGTCGT 


GTTACAGGTA 


720 


CTGGCGGTAA 


AAACCAACGT 


AAATTAACGA 


TTGCTATTAA 


ACGCGCACGT 


ATCATGGGAT 


780 


TATTACCATT 


CGTTAGTGAC 


GAACAATAAT 


TTTATGACAA 


TTTCCnGAAC 


CAAAGAGCTT 


840 


GCTCTTTGGT 


TCTTTTTTAT 


CTTGACTAGT 


GTTTCACGTG 


AAACATAACA 


AGGTTCGCAG 


900 


AATGAAGTGT 


TTTGTGATAA 


AATAAGGGAT 


GAAATAACTA 


GTGAATCACC 


AGAGCAATAT 


960 


GCTCGTTGGA 


ACAAGTCGCC 


TCAAAGGTCT 


GCAGAAAGAT 


TGGAGGCAAA 


AAAATGCAAA 


1020 


AGAAGAGAAT 


TCAAAAAAAC 


GGTTTCTTAA 


TTGTTGTGGG 


TCTTCTCTTA 


GTAGAATTTC 


1080 


TCCTCTATTT 


CTTACTAACA AATAAATGGC 


TGCTATTGGC 


GGTAATTATC 


GCATTAGATA 


1140 


TCTTTCTCTT 


AGTGGTTATT 


CGGCTGTTGA 


TTAGAGATGT 


AGAAATTACG 


AACGTAGAAA 


1200 


AGATTCAAGA 


AGCAAGTTCC 


ATTGCTGAAC 


AATCGTTGGA 


TTATGTTGTA 


AATGAAGTAC 


1260 


CTGTGGGAAT 


TATTACGTAT 


AACGGGGAAA 


CACGCGCGGT 


AGAATGGCTT 


AATCCTTATG 


1320 


CTGCTTCTAT 


TTTTAATAAA 


GACAATCAGC 


TAACGTTAAC 


CGCTAGCCAA 


GTTACGTCTT 


1380 


ATTTAGAATT 


AGCAGAACGA 


AACCAAGATA 


TTTTTACGAT 


TGACGAAAAT 


ACCTATCGCT 


1440 


TTAGCGTCAA 


TAAAGAACAA 


CATACAATTA 


CTTTTGAAGA 


TATCACTAAA 


GAAAGTAATT 


1500 


TGTATCAAGA 


AAAAGTCGAA 


ATGCAAACGG 


CTATTGGCAT 


TGTGTCTGTC 


GATAATTATG 


1560 


ATGATGTCAC 


CGATACAATG 


GATGAGAAAG 


AAATTTCTTA 


TTTGAATAGT 


TTCATTACGA 


1620 


CGATGGTTTC 


TGATTGGATG 


GACCAATACA 


AAGTTTTTTA 


TAAGCGAATC 


AACGCAGAAC 


1680 


GTTATTTTTT 


CATTGCCCAA 


XGGGAAGATA 


TTCAAAAAAT 


GATGGACGAA 


AAATTTTCTA 


1740 


TTTTGGATAC 


GATCCGTAAG 


GAATCAGCTA ACCATGAAGT 


AGCCATTACG 


TTAAGTATGG 


1800 


GGATTGCTTA 


TGGGGGCCCA 


ACCTTAGATC 


AAACCGGGAC 


CrACGGCTCAA ACAAACCTAG 


1860 


ATACAGCTTT 


AGTTCGTGGT 


GGTGATCAAG 


TGGTTGTAAA 


AGAAGCCAAA 


GATGAAGCGA 


1920 


AgCCGTTATT 


TTTTGGTGGA 


AAAACGGCAG 


TAACGACGAA 


ACGTTCCCAA 


GTACGTTCTC 


1980 


GCGCAATGAG 


CATGGCAATT 


AAGGGAATTA 


TTGCGGAATC 


TGCTGACATC 


TATATTATGG 


2040 


GCCATCGTTA 


TCCAGATATG 


GATGCGTTAG 


GTTCAGCATT 


TGGTGTTGCT 


CGTTTAGCCT 


2100 


CGTTTAATAA 


TCGAAAAGCG 


TGGATTGTTT 


TAGATGAAAA 


TGAAATCATT 


CCCGATGTCA 


2160 
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AAAGAGTGTT AGAGGCGATT AAAGAGTACC CAGAATTAGA AGAGCGCATT ATTAGTCCTA 2220 

AAGAGGCCAT GAAGCGCAAG AAAGAAAGTA GCTTATTAGT TATGGTAGAT TACCATAAAC 2280 

CGTCTCTATC GATCTCACAA GAGCTCTATG AGCGTTTTGA TAAAGTAGTA ATCATTGATC 2340 

ACCATCGACG AGGAGACGAA TTTCCAGCAA AACCCTTGCT TTCTTATATT GAATCTTCTG 2400 

CCTCTTCTGC TTCAGAATTA GTCACAGAAT TGATCGAATA TCAAAGTAAT AGCGCAAATA 24 60 

AACTGCAGGC CTTTGT^GCA ACCATGATGT TGGCGGGAAT TGTGGTTGAT ACGAAAAGTT 2520 

TCAATACACG AACGACGGCG CGAACATTTG ATGTGGCTAG TTATTTACGA ACTTGTGGAG 2580 

CAGACTCATC TTTAGTACAA TATCTATTAA GTTCTGATCT TACAAGCTAT CTGGAAATGA 2640 

ACAATTTAAT CTCTAAAAGC GAATATGTCA CAAAAGATAC CGTCGTTGTT • GCAGGGAGTG 2700 

AAGACAAAGA ATATGATAGT GTCACAGCTG CCAAAACAGC GGACACATTA CTTTCTATGG 2760 

CAGGGATTAA TGCAGC 2776 
(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7337 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 



TCCATTCCAG 


CTATTTCTTT 


CTACATCTGT 


AATATGTCTA 


ATGTTATCTT 


GATCATGCGC 


60 


ATCAAACTCA 


GATTTAGTTG 


CCTGCTTTTG 


ATTTAGTACA 


TTTCCtwAAC 


CaACCTGakT 


120 


AGCTGTTACC 


GCatGaGGAT 


TACTTATATT 


TGATTTATGG 


GCATTAAAAT 


CTTCAATATG 


180 


TGTAGCACTA 


TCTTGCAATT 


CTTTTAAAGC 


TCCGTGGGTT 


CTGCTAAAGA 


ACCAATTAAA 


240 


ATAATCAGCT 


GGTGGTTTTT 


GTGAAGCTTT 


CCAACCATCG 


CTAGTTAAGC 


TTTCGGGTGG 


300 


TTTAATACCT 


GGTGCTAACC 


AAACAGGCAA 


TTCCTTTGTA AATTTCATTT 


GCATTCTCCT 


360 


CTCTTTCTAC 


AGCGGCAGTT 


TATAATCATT 


TTCTGGAATA AAGACTCCTC 


CCAAAGTTCC 


420 


TCCGATATTA 


CCATCAATAT 


CAGCGAATCC 


TTCTTTACTA ATCTCTATAT 


CAGTCGTACT 


480 


TGAAAATGAA 


AAAGTTCCCT 


CCAAATTAAC 


GTAGGCTACT 


CTAATGCCTC 


CAGCACTGAT 


540 


ACTCTCGACA 


ATTTGTAAAA 


ATTGACTAAT 


ACTTAGACCC 


GAACTATTCA AGTAGTCAAG 


600 


AGGAGCTTTT 


TTTATGATTA 


CACAAGCAGG 


TTCTTTTTCA 


TCTACCGTCT 


CATTAGCTGA 


660 


TATAATATGG 


ATATCACTAG 


GATGACAATT 


TAAAGATGTC 


GCAATTGCAT 


GTAACATTTT 


720 


GTCGATTGAT 


CCATCGCTTG 


TAtTTCTAGC 


TACCTTTCCA 


CGAATCAATA 


CTCTATATAT 


780 


TTCATCAGTC 


GTTTTtCCTC 


TCGCTTGTCC 


AACATTAGCA 


CCTAATTCAT 


CTAGTCCTTT 


840 


CCCTCGTGCT 


TCATCTATCG 


ATCGCCAGTT 


TGCTACTTTA 


TTCAGCAAAT 


CAGTCAAGTC 


900 


TTTAATTTCA 


AACTCAATAA 


TTTCAAGAAG 


TTTTGAAATA 


TTTGAATTTT 


CGCGATTGAA 


960 
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TAGATCtGGT 


AAATAATCTC 


TAAGCTTTTT GGTCACTAGT AATCACCACA TTCTCTGTTT 


1020 


TACATTCAGC , 


.TGACTCATTT 


GGATTCAGAT 


TAATGTCTTT 


GACTTCGGTA 


GTTTCAGTAG 


1080 


ATAATCCAAT 


TTTTACATCA 


GCGACAACAA 


CGCCTGGnAT 


TTGATAAATT 


AATGGaTAAA 


1140 


TATAAGAaAA 


TCTGACTACT 


TCACCCaTAG 


tCaGATkGTw 


AATGTAATTA 


TTTACTATAA 


1200 


CTTTTATTtC 


TTCTTGTCCG 


kTTTTTTCAA ATTTTGAATC 


TACTTGAATA GAAATATTCA 


1260 


CAAAAATAGG 


AACAGATTTT 


GCATAATCAA 


ATTTAACAAT 


ATGACTAAAT 


CCTCCTAAGT 


1320 


CTTTTACTTC AACTTCCTGA TTACCAACAG TGTCAATCCC AGCGGCAACA CTTTTAAAAA 


1380 


TTGCCTGCCC 


AATATCGTCT 


TTCACTCCAC 


CTAAAATATG 


CACATGTACC 


GACTTGGGTG 


1440 


GATTACCATA 


CGAATCAGTT 


TCCATTGTTT 


TATTTTCAAC 


AACACTAGCG 


GTCCGTACCC 


1500 


CACTAACTTC 


TAGCAAAGCA 


GTTAGAATTC 


CGTTTATTGG 


CGGTCCAGGA 


TTCCCACGAA 


1560 


CAGAAATACC 


AATTCGGTCG 


CGATAGGCTT 


TATCTGTTTC 


ACGTTCTGCC 


CCACCCTCAG 


1620 


CCCTAGCAGG 


GTTATTAACA 


GATGATATTT 


CTTCAGTAGG 


TTCCACTTGT 


ACTATAATAG 


1680 


TATTAGCTGG 


TACGTTTGAA 


CTAGCATTCT 


CTTCTAAAGA AATTGCGCGG 


CCTTTCCCAA 


1740 


ATCCATTATC 


ATCAATTTTG 


ACTATATCAA 


TCATCTGAAA 


CATAACTTTA 


TTTTCTGTTG 


1800 


AAAAACGTAC 


GCCTTCATTA 


ATGATGTAAC 


CAGGTTTTCC 


TGAGAACTCC 


AGCTCAACCA 


1860 


TTGCAACTGT 


AGCTGGATTT 


CTATAGATAC 


CACTATTCGC 


ACCTAAACGA 


TCCAATGAAA 


1920 


CCCCTGTAGC 


TTGACTTATG 


AAGCCACTAT 


AATAAACTCT 


TTCAGTTAGT 


TCATGAGATA 


1980 


TAGATAAAAA 


CCATGCAACG 


ATACGAATAA 


TGATACCCAG 


AACAGAATGT 


CCTGAAACAT 


2040 


TAACATCCGA 


ACCAAACAAG 


GCTTTTGCCT 


TTTCAGACAT 


ATCATAGAGT 


ATTTCATCAT 


2100 


ATGTTTTTCT 


TTTAAATCCA 


TTTTCATCAA 


GCACGTATTA 


CCACCTCCAC 


TTCATCACCT 


2160 


AATGTTGATA 


GCATCTCCAC 


TGTAATATTC 


AACTGTCTAT 


TGTTTCTTAC 


AATTTCTATA 


2220 


TTTTCAATAC 


TATTAATTCT AGGTTCTTGA 


TCTAAAATCG 


CTTCTGTAAT 


ATCTTGTTTC 


2280 


AAGTAATCTT 


CATTGTAATT 


TTTTCCAAAC 


ATATTATCAC 


TTTCTAAACC 


AACGGATGTA 


2340 


TCTAATTTAA ATTCTTCTAG 


TCGTATTGAT 


AAGATCATGA ATACACTTTG 


TGCAAGTTCT 


2400 


AAATCTCCTT 


CAACTAGTAA 


AATTCCATAG 


TCAATAAATG ATAAGTCTCC 


ATTGACTATT 


2460 


TTCAAATCCC 


TCATTAAGCA ATCACCCCCA 


TAATAACAGC 


GTCATTTTGA CTATGCATTC 


2520 


TATCAGATGA 


TAATGAATAA 


TCAGCAGAAC 


TTCTATAATT 


ATCTATGTCA CGATCACAAA 


2580 


AAACTACTAC 


TACAACCGCT 


CCCTGTGAGA 


TATCAGACTG 


ACAATGCTTA 


GTGATCAACG 


2640 


CGTTTAGTAT 


TAATGCTCTT 


TTGCCTCCAT 


CTGATTTTAA 


TGCCATTGGC 


TGAACATCAG 


2700 


CTCTTGCACC 


TTTTACAGTG 


ACAACTCGAC 


ATAGTTGCAT 


AACATTAATT 


TCTTTTAAAA 


2760 


TTCTATTTTT 


AAATGATCGA AAAAAAGCTA 


AGTCAGTTTC 


TTTCATTAGG 


TTACCACCTC 


2820 


ACACGTTGTA 


ACAAATC6AC 


TACCATCATA 


GGAGTGCTGG 


CCATTTTTTA 


CATAAAAATT 


2880 


TCCyTTTACA TTTTTtGAAT CGATATAGAC TGCTGTACCT GTAGTGATTT TATGTTGCAA 


2940 
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AAGACACTCA 


ACAGACCAAC 


CAGTATAATC 


ATGACTTTCA 


ACTTTTGTTG 


GTTGGTTAAT 


3000 


AAGTCCCGTT 


TCAGGACTTA ATCTGTATCT, 


CTCXTTATTX XCATCTCGAA 


AATTTTTAAT 


3060 


TACAAGTTGA 


CCTCTTTTAA 


AATATAGTGA 


TGTGTCACAA 


GCTTTGGAAA 


TCTCTTCTAA 


3120 


AACCATCATT 


GCTTGGCCAT 


CAGCTGTGTA 


GCCAGACCCA 


TATATTTTAT 


TATTTTTTAA 


3180 


CTTAATTTCT 


GACAAAGGAA 


TATTTGCTTC 


CCTAGCAACT 


CGATTtATAA 


TAGTAJcGrGC 


3240 


ATCCGTyCCA TTATTAAAAG TAATATTTAC 


TTCTTTTkGT 


yCCGAGTAAc 


TTTACCTyCT 


3300 


AAAAAAGTAA ATGTTGTTGC TCTATCTACc CGTTTAGAAC AGAGGGTTTT ACTTCTGCTA 


3360 


TCGTTCCTGA 


CGTAATCACT 


CCATTTGAAG 


TACCTGCGTA 


TCCTGCGTGG 


ATATAGACTG 


3420 


GATTTCCTTT 


CTTAATAAAA 


TCAATACTTT 


TTTTATTTAA 


ATTGTATATT 


GTCACACTCA 


3480 


CTTCAGATGG 


ATTAGGTGAG 


TCAGAAAAAG 


GTGCTGTAAA GTGAATTTCT 


AACCGATCCA 


3540 


ACCTGCCTGA 


ATCAGCTCTT 


AGTAGAACTC 


GATTCTTCCC 


ATTCTTGTCG 


TGTATTTCAA 


3600 


TTTGTAATAA 


TCGTTGCCAT 


TGCGTATTTC 


CCATTTAATT 


ATCTTCCCTT 


TCCAAACTTG 


3660 


GAACATTATA 


ATTCGGAGGT 


AAATCATCTA 


TATAAAGGAA 


AACCTGAATA 


CCAAAATTTT 


3720 


CAAAAGTTAT 


TTCTTTTGAA 


GCTCCCGACT 


CATCCATCGG 


CACCAAATCT 


GCTGAAGGTA 


3780 


AACGAGTATC 


TACAATGTCT 


TGCCAAAGTC 


TTTCATCGAT 


TACCATACGC 


TCACCAATAG 


3840 


CTATCGGTGT 


ATGATCTATG 


TCATATAGAT 


CTACAGTAAA 


AAATTTTTCT 


GTTTGATTAT 


3900 


AGTCCACCTC 


AAAAATGTAA 


TTTGTATTAC 


CTAAAGGTAT 


TTCGAATTTT 


TCAGGTAATG 


3960 


AATATTTATC 


AATAGGAATA 


TATGCTCTTA 


AAGACATTTA 


GTCACCTCAT 


TTCACACGCG 


4020 


CACGAGCGCC 


TATGGGTATA 


AATCTATCTG 


GCCACTTGTT 


CCAATCTCTT 


AATTGCTGAA 


4080 


TAGACGTACC 


ATATTGTTGC 


CACCAACCCC AATAAGTATT 


TCCTGGCTGA 


ACTGTCACAT 


4140 


ATACCGCATT 


ACTTGGTTTC 


GAAGGTTGTT 


TTGCAACTGG 


CGGTTGTGGA 


ATCTTCTCCC 


4200 


AAATAGTCTT 


AGCTACACGT 


ATGGGTTGTA 


ACGTAATTGT 


TATAGTGAAA 


CCATTTTCCA 


4260 


CTGTGTCATA 


CCCTTTACTG ATATCTTGTA 


TAACTGCATT 


TTTAAAATAT 


GATCTGCCTC 


4320 


GATAAACAAT CCAACGAACA TCTTTTTGCC ATGCAACTAA TGTATTGTAA GCTTGCTCTG 


4380 


CAGCGTtACC 


ATTTTTAGCG 


AGAATCCAGC 


CACTAATTGT 


GACTGGACCG 


CCTGTATACA 


4440 


TCATATTGTC 


AGTGATTGGT 


GCTCCTGaTT 


CAACAGGATA 


TTGAGATACA 


TTAGCAGCAC 


4500 


TGGTTACAGT 


CTCACTGaCG 


TTTACAATCT 


CAACTTTAGA 


TTTACCACTC 


TGaATGTATC 


4560 


CCATTAAACA 


ACCTCTCCTG 


TTCCTAGAAT 


ATTAAGTAAT 


TTTGCATACT 


CATCTTCTAA 


4620 


TGCTTGTCTT 


ACAGCCTCTT 


TAATATCACC 


AACAACAGAT 


TTATCAGAAT 


TATTCCCAAC 


4680 


ATTGATAGTA 


ATATTAGGAC 


TAAAATTAAC 


AGATGGAGAT 


CCTTGATTAA 


AGAGCGCTTT 


4740 


AGTTTTTTCA 


•TGAGGATGTA 


CAGTTCCAGC 


TGTATCTGCT 


TCGAACAATT 


CTGGTCCATT 


4800 


CTCTCCTACT 


AAAACTGTTT 


CACCTTTTGA 


AGGGCGGCCT 


CCCTTTGCAT 


AAGCCCTTAA 


4860 


TCTATGACCT 


GTTGGTCCCC 


ATCCAGAATG 


TCCATACGGC 


AAATCTGTTC 


TCCAATTAGA 


4 920 
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ATTATTGAAA AATGCCATTA GTTGATGGAA ACCATTCATT AT^TTTTCAT ATCCACGTAC 4980 

TTTATATGCA TCAAAAGTTT GAGGAATGTA CTGAAGCAAT CCTCGAGCTG GGTTTCCACT. 51340 

AGCTGTATTT ACATCCCAAA CAGCAGAACT TTGAATAATA CTTTGGTTAC CACTAGACTC 5100 

CCTCTGTATT TGAGCTAAAA TGCCGTTTAC TTCTGCATCA CTTACTTGTT GATTCATTTG 5160 

TTTTGCAGCT TTTCTAACTT GAGGACCCCA TCCTGCAGCA CCTACCGCCA CACCACCAGC 5220 

AGCTCCGCCA TAAGTATCAG GATTTACGTG TTGGCCATTT GGTCCGCCTT TTCTAAGTtC 5280 

ATAATGAACA TGAGGACCAG TTGACCAACC AGrAGrACCT ACATCACCGA CGATTTGACC 5340 

AGCTTTAACT TTATCGCCCA TCTTAGCTCT TATGCGGCTC ATATGTCCAT ACATAGcCCA 5400 

TAGaTTATCm GcAACTTTaA TTCCTACGTG CTCACCTAAT CCAaTGGnAA GAAGATTGAA 54 60 

CCCaATCAAC CAAACCaGGA TA^TGAGCTG GaATAGGCGT cCAGTTGGAG CAGCATAATC 5520 

AATACCGTGT GAAAATCCCC ATAGAGGCCT GGTCGCTTAC CATAATCAGA AGTACGAACA 5580 

AATGGTGATC CAAAATGTGG AGCAAAAGCA CCTGAGCCAA AAGAGACACC ATCTTCCGAA 5640 

TCAAAAAAAC CTTTGAAAGC TTCAAGTTTA TTTTTCACCC ATTCGGCACT GGTGTCTTTT 5700 

AGTTTGTTCA TGACACCAtG TCCCAAACCT tGGATATTTT TACCCTTTTT ATAAGkGTTA 5760 

TGcTTATTAA ATAAGcCAGT CACTTTTCCa ATCGGGTCTG ATAGCCAATC TTTTGcTGTT 5820 

TCAGCTATAT CTTTAGTTTT TTCTATAGCT TTCGTTCCTA CTTCTTTTGC TTTATCTACT 5880 

GCATTAGATC CAAAGTCTTT TAATTTATCA ACAGTATTTG CACCGAAATC CTTAGCACTA 5940 

TCAACAAAAT CAGAAAGCCC ATTTCCTTTA CCTTTATGGA AACCTGGTAA AACTGTTCCT 6000 

TGCCCCATAC CACCATTTAA TACAGCTTTT GTATCTGAAT GGTTTAAAAT CCGTTCACCC 6060 

GATGTTACAT TAGTAACTTC TGGACCATTT GATCCTAATA AACGCATTGA AGAAGTGCTC 6120 

TTGTTATATG CTAATTCAAC ACCTTCTTCA CCAACTAGAG CTTGACCACT GTATGATGCA 6180 

CCACGAGATC CTGTGCTAAA AGAGCGACCA TGCTTGTTTT TAGTAGAATT AGGTGTATTC 6240 

GGTTTCCATT CTGGTATTTT AGGAATACTA AAGAAATCTA ATACGACATT AATACCACCT 6300 

GTAACAGCAT TTACTTTATC GGCGAGTCCT GTTTTAAAAT TATCCCAAGT TTCTAATGCT 6360 

CCAGACACAA CTTTATCCAT AGCTTCACCA AAGGCCTTAC CAAAACTTTT AGCGCCTTCA 6420 

ACTGTGCCGT CCCAAATTTC TTTAAATTTC TTTTTGGAAT CCTTTTTAAA GACTTCCCAT 6480 

TTAGATAATG TTTGTCCAGT TTCCCAGTCC ACTTGTTCAA GATGACCTTC CGCTTGCtTT 6540 

TTAGCCTGCG TTACAACATC TTCATGTTGT TTTTCTGCTT CTTTGACTAC ACCGTCTCTT 6600 

TGTTCTTGGG CTTTTTTTAA TGCATCATCA TATTGTTGTT GAGTAATCGT ACCATTGACA 6660 

TATCTTTCTT CATCTAAAAT GCGCTTTGTT TCTTCATATT TTTCATTTGC AGATTTAATT 6720 

GTACCTTCTT TTGCTTTATA TGAAGCTGCT ACAGCATCAG CAGCCTGTTG CGCGCTAATC 6780 

TCACCTGATG CATTTTTCAA ATTACCCATA ATGATTTTTT GCTCTTTAGC TGAATCTGAT 6840 

AATGCAACAA CAGCTTGATT TTTCATATCT TCATGAATCG ACTTATTAGC AGCAGCATGT 6900 
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TTTTCTTCAA CAGCACGTAC TGCAGCAGCT GAGGTTTTTT CTAACGACTC AATTTCTTTT 6960 

TTATCGTTTT CTGATAGTTC TCGATTTTCT AATCTTGCTT TTTCTTTGAT TTCATTGATA 7020 

CGATTTGTGT AATACTCTGT AGCAGTAATA GATTCATCAT ATTCTTGTTT TTCTAATTTC 7080 

TTGAAATCTT GAACTTTTTC AGAGAACATA TTTGTTCGTA CTTTTGCTAA TTCATCAGCA 7140 

GCTGCCTTGG CTCCTTGTAC ATCCTTTtCA TTAAGAATTC CCATACCAGC TAATTTATCA 7200 

AGATTGCCAT TAGATTTATC TkTCTkTTTA TkCAGCTGTT TAGTAACTTG ATTCTGCATG 7260 

TCTTCTAAAG CTTTATATCG TTTAGTAACT TCTTCCATTG GTAAGAACTC GTCCTTCAAT 7320 

TTTACCTGTA GTGTAnT ^337 
(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1160 t)ase pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 



ATAAGGAGGA 


AATCTGATAT ATGAAAACAA AACGCTTTCA 


GCGCACGGCG GCGATGCTGC 


60 


TGTGCATTTT 


TATCTGCGCG TGGACAATGT TCCCCCTGAT 


GGGTGCGGCC TTTGCCkCsG 


120 


AGAATGGGCT 


GCATGACACC GCTGAACCGG AAATCACCTG 


CTTCTGTGAT ATTCACTGCA 


180 


CGGCGGATGC 


CCTCAATGCA GACTGTCCGG TTTGTACACA 


GGATGTGAAC GCCTGCACCG 


240 


GCACGGAACC 


GGAAGCGCCC ACCGAAACAG AAACACCCAC 


CGAGCCGGAA ATCACCTGCT 


300 


CCTGCAATAT 


CCGCTGCACG GCAGATACCC TCAACGCAGA 


CTGCCCGGTT TGTACGCAGG 


360 


ATGTAGCTGC 


CTGCACCGGC GCAGAGCCGG AAATTGCCTG 


TTTCTGCGAT ATTCGCTGCA 


420 


CGGCGGACAC 


CCTCAACGCA GACTGCCCCA TCTGTGTGCA 


GGAGGCAGCC GCCTGCATTG 


480 


GCGCGGACCG 


GAACCCGTCT GTACTTGTGA AACGCGGTGT 


GCGGCAGACG CACCTGACAC 


540 


GGATTGTCCC 


ATCTGTACGC AGGATGTAAC CGGCTGTACC 


GGCAAAGAGC CGGAACCGCC 


600 


TGCCAAACTG 


GTCTGTATCT GCGCCGCCCA CTGTGAACCC 


GGCGCGGTCA ACGCCGACTG 


660 


CCCCGCCTGC 


CTGTTGAACC TGGCAGACTG TATTGTGGAA 


GCCCCCACGC CCGTCCCGGT 


720 


CTGCGTCTGC 


GAAAACCGCT GTGAACTTGG TGCGGTCAAC 


GCCGACTGCC CGATcTGCAG 


780 


AACCGACCTG GCCGGATGCA CCGGcAAAGC CCCCGcGCCA ATyCCTGCAT CCAATCTCTC 


840 


CATCAAAATT 


ACACCGCCCT CCGGCTGGGC AACCGGCAGT 


GCTGCCGCCG AGTTTCGCAT 


900 


TACCGATGAA 


GCCGGGAACG GCTTTGCGCT GGTGCAGGTG 


AAAATCGAGA AAAACGGGCA 


960 


ATGGCGGGAT GTGACGGACA GCTTGAAGCA GAACGAAAAC 


CGCTGGTATG GGGAAATCGA 


1020 


CCTTTCGGAG 


AACTGCACCG TCTATGTCTG CGCcAACGGC CATGATGGGA AgTGTATGAG 


1080 


AAtCCCGCTA TATTGAGTGT TTCGACCGmA CggcCTGCTT 


ACGCGcCGGC ATkGAcGGcG 


1140 


GtTsTcCGTG 


CGGAACAAGC 




1160 
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(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12860 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

TTTACnTGAA CCAATTTCCA ATGTGGATGG GATGATTAnG CCAAACGTTC TGGCCATTnC 60 

CCTGGCCATT TTTCAGGACC TTCTACCACA TATTGTGGAT ATTTTGCCAT ATATTCCGCT 120 

GCTCCGGGTC TATTTCTTAC ACGCATATTT AAAATTCCTT TCTTTTAAAA ACAGACTGGT 180 

GAAATTCGCC AGCCTGTGTG GATGTTAAAT GAATTAGTTT AAATACTAAA CATTCGTTTT 240 

AATTGTAAAA TTTCTGCAGT CATTGCTCTT CTTTCGCCCT CTTGGTGATG ACGAACAATC 300 

TCTTGTAAAA AGCTAAATAA ACCATACCAA TAAATTCGTT GAACCGTACT TTCATTCGGG 360 

ATTAAACCAT AAGATAGTAA CCATTGATTC CAACCGGAAC GAGGGACATA ATGACTCAAA 420 

ATCATTCCTA AATCTACCGC TGGGTCCGCG ACCATCACAG AATCCCAATC TACTAAATAG 480 

AGATAATTAT TCGAGACTAA CCAATTCCGA TGATTCACAT CGCCGTGAAC AGCTACATAA 540 

CTAGTTTCTG GATATTCTGG TAAATGATTA TATAAATAAG TGATTACTTC TTTTAAATAA 600 

CTGTTTCGTT TTAATTCTTT AGGTAGTTGA CGCTTATACT CTTTCAACAT TGTTAGCGGG 660 

GTACAAACTT CGCCACCAAT TTTTTTCAAC ATGTCTTTCA GCAAATTGGA ATGATGCAAT 720 

TGGTATAACA CGTCAATCAC GTCATTGCGT TTGCCAATTT CGTCCGCATG TAATAAACGT 780 

CCTTCCAGCC ATTCTTGGGC TGTTAGTACG TCGCCATTCG CTGTCCGCTT TGTCCAAATC 840 

ATCTTCGGGG CTAACCCTTC TTTAGACAAT GCTGCAAGAA GTGGAGAAGT GTTTCGTTTA 900 

ATGAACACTT yCTCGGTTTC TTIcTATTCCC ATGTATGTTT GACCAGTTGC GCCCTTTATC 960 

GGCTGCAGAC GCCAGTCTTT ATCCAACTGA AATTCCATGG AAATTATCTC CTATCTCAAC 1020 

ACAATTTTAT TGCAATTTTT ACAGCAATTA CCATTTTAGT CGCCAAACCT TGCTTCGTCA 1080 

AGCTTCTATG TGGTTCTTTT GTAAATTTTG CTGGGTTTcT CATCTTTATA ACTTATTCAA 1140 

GCGCATTGGA ACATACATTT TAGTGAAAAT CACAACTTCT ACAATTAAGG CACCCAGAAT 1200 

GATCAAGCTA TTTTGCAAAC TGCCTAAACG AAGGCAGACA ATCCCACCAA AAATAATGGC 1260 

CGCAACAGAT AATGCGCCGC TAATTAACCG GCGAATCGCT GCTTGTTTTT TTTCTATTGA 1320 

AATTGGATAT AGCTGTGTCA TCACCATATA ATCAAATT6C GTGTACATGG GAATCAACTG 1380 

GAACCCGATT AAATAGACAA ACAATAAAGC AACTACCAAC GCAATCCAGA ATTCTTGTAA 14 40 

AAAGAAGAGT AGCACGCCAC CAACCAAAAT CAAACGAAAC AATAAGCCGC TATATTCGCT 1500 

ACCTCGTAAA AAGCTCCGAA TGAATAAGTA AGCATACGTG TTCTCAGATG TTTTTTTGAC 1560 

TACTCCGAGG AGAGGATCTA AGTATTTACG GCGTTTCACT GTACTACTCA CTTCAGGAAT 1620 
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ATCTGTAAAC 
CATTTTJICC 
AACGGCCAGA 
CATAATCAAG 
AAGATGATAG 
GACCAAAAAT 
CAATAGGCCA 
TCGTTTCAGA 
TTCTGTCAGA 
CGGTCTGCCC 
TAATCCGCCT 
GTACTTGGAC 
ACCCCACCTT 
AATTAAATTC 
GCACCACAAA 
TCGCTGCCCC 
GATCTAAGCC 
AAAGTACCAT 
TATTGTCTAA 
GAATATCATA 
AAGAAGGTGT 
ATGTTTCCCC 
TGATGGTGGT 
TGACGTCAAA 
CAATTGTTAA 
AGCGAACCAA 
TATACACGAA 
TCAACTTGTG 
TGTATTTTTT 
AAAGTCTACG 
AAACAACATG 
CGTATTCCTA 
ATTTTAAATA 



AGCTGAATGA 
.CAATCCAAGA 
ATGATGCCAC 
AGACTACTGA 
AGCCCATAGC 
AAAAAGAAGG 
CTTCCTA;iAA 
TAAGCAGCGA 
GTAGCCAACT 
CAAACAAAAT 
AATAAAAACA 
ATTTTTTTGA 
TTCTTCCTTC 
TGCACGTAAT 
ACGATCACAA 
TTGCTTACGC 
TAGAAAAGGT 
GACTTTTTGT 
ACGGAACGTT 
AGCCAAGGCC 
TTCCGGAATA 
GTCAATCATA 
ACTTTTCCCC 
ATTAATATCT 
GCTCATTTCT 
GAATTCAAAA 
ATGAAACGAA 
GTAAAGTAAA 
GTAAAATTAT 
CGTTTTTAGA 
TTGCTGATAT 
AAGTAGCACG 
ACAATAAGGA 



ATTTATAAAT 
AACTCTGTTT 
CAATCACACC 
TCAACCAAAC 
TTTGTAAACT 
TTTGAAACGA 
AGCTGACTAC 
ATTGTTTTTC 
TACCAACTTG 
CACGAGGTAA 
TACAGACTAA 
AATGCCGTGA 
GTCAAGGCAA 
TCAGCCATTG 
TATTTTTCAG 
ATCGTATCCA 
TCATCAATAA 
TTCATGCCTT 
TTTAGTAATG 
GTTACTTCGA 
TACCCGATTT 
ATTTTGCCTT 
GCACCATTCA 
TTTAAGACAG 
TTTGCCTCCA 
ATTACTCCCT 
CTTTTTAGCG 
GGAAAGAACC 
CAACGGCGAA 
CATTAGACAA 
TTTTGAATAT 
AGCATTAGAA 
AGTGGCTTAT 



842 
TCTATGCATT 

CATCTTATTT 
AACCCAAGGC 
AAACCACCAA 
AAGATGTGTA 
CCAACCTGTT 
AATCGGTAAA 
TTTTGGTAAT 
AATGAGTAGC 
TTCTTTTAAC 
AACAAAATGA 
CAAACGTTGA 
GATAAATATC 
TACCGTTTGC 
CTGTTGCTAA 
TTAATTCTAA 
TATATAAACT 
TTGAAAAATT 
GTTCTGCTCG 
TATGCTCTtT 
TTTTACGATA 
TTTGCGGCGT 
AGCCAATTAA 
GAATATGGCC 
ACCTTTTCTC 
TAATTAAATG 
AACTCACGTA 
TAGAGAAAAG 
ATCCCAAGTT 
GTGACCAAAG 
AATGAC6TTT 
AAAGCTTTTC 
CAATCCGTTT 



CTGTTTTGTT 
TGCCAAATAC 
GTCCAGTAAA 
CTACGGTAAG 
TAAATCATAC 
GTGACTACAA 
AGTAAAGAGT 
AAAAACACCT 
AGCCAGCCCA 
AACTGGGAAT 
TCATTTAAAA 
CTAAAAAATT 
ATCTAAAGAA 
CCGTAATTTC 
AATATGCGTG 
TAAAGCATGA 
CGGTTCAATT 
AGCAGGAAAC 
TTTGAACGCT 
CAATGTTAAT 
TTCTTCAGGG 
AAGTAACCCA 
ACCAACCATT 
GTAACCTCCT 
TCTACCATTA 
GATTTTTTTC 
CTTTTTCATC 
AGGAATTTTT 
ATAAAGTCTA 
GGCATACCTT 
TAGCAAGCGA 
CAGAAATGGA 
TCCATTCCCA 



CTTTTTGGAT 
TCGATAAACA 
TAGCTCCCGT 
TGGTTGACGA 
AAACTAGCAT 
TCAACGGCAT 
AGCGCAgCGC 
TGTCTGGCTC 
ATAGAATTAA 
AGTACAGACC 
TATAGCGCAA 
CGCCCATGTT 
GACTCTGGTA 
CCTTCGTGTA 
GACATTAAAA 
ATTGCTAAAG 
AAGAATGCAC 
CATTCCAATT 
TCTTCTAATG 
TCTTCATATA 
GCTTGTTGCA 
ATAATATTTT 
TCACCAGACT 
GTTAAATGTT 
TTAAACCCTA 
TTTCTCTTAT 
CaCAGAGAAA 
CATGAATGAT 
TGAAGACGAG 
AATGATTCCA 
CGTCTTTGCT 
AGGATTAAAC 
CGTTCATTTG 



1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 
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ATTCCCCGTT ATTCAAAAGA AGATGATTTC TCTATTCATT TT^GCAACCA TCAAGAGGAC 3660 

TATTCCGCCG AAGCGATGCA AGAAATTGCT 6AGACCATT.G . CAAAGCAGGT GAACTAAATG 3720 

AAAAAATTTA TTAAAGGTCT CTTTTTTGGC GCAgCTGCTG GTACCATTGG CGGCTTGTTG 3780 

GCTGCTCCAA GAAGCGGGAA AGAAACACGC CAACATTTAA TCAATGAGCT AGAGGACTAT 3840 

CGTTCACTGA AAAATCAAGT CACGAATGAC TGGGATCAGG TGCAACGAAA TTTGGCAGTT 3900 

GTCGAAGAAA ACGTTCCACT GGCAACTGAA TTTTCAAAAG ACCTGCAACA AGAAATCACT 3960 

GATTTCAAAT TCCAAGCGGA ACCTCGGATT GCTCAAATTA AAGAACAAAT TGCTAAAATC 4020 

ACGGCTGAAT TACCTGACAC ACAAACCAAC AAGCAAAAGT AAAGTCTTTT ATGTTAATTA 4080 

TTTCTTTTGT TATTAAAAAA ATAAAAAGTG TGACGTAAAT ACGAATTTTG TTTGAAACTA 4140 

TGCCTGAGAC TCTTTCTTTT TCAAAATGAG GTAtGGTATA GTTATAACAG AwATAAAACT 4200 

ArAAAAAACA GGAGTGCATA AGAGAATGAa GAaAAAACTA ATCTTAGCTG CAGCGGGCGC 4260 

AATGGCCGTT TTTAGTTTAG CAGCGTGTTC AAGCGGTTCA AAAGATATCG CAACAATGAA 4320 

AGGTTCAACA ATTACTGTTG ATGATTTTTA TAACCAAATT AAAGAACAAA GCACTAGCCA 4380 

ACAAGCGTTT AGCCAAATGG TTATTTATAA AGTCTTTGAA GAAAAATATG GCGACAAAGT 4440 

AACTGACAAA GAyaTTCAAA AAAACTTTGA CGAAGCCAAA GAACAAGTAG AAGCACAAGG 4500 

CGGAAAGTTC TCTGATGCAT TAAAACAAGC TGGTTTAACT GAAAAAACAT TCAAGAAACA 4560 

GTTAAAACAA AGAGCAGCCT aTGATGCAGG TCTAAAAGCC CACTTAAAAA TTACAGATGA 4620 

AGACTTAAAA ACAGCTTGGG CAAGTTTCCA TCCAGAAGTA GAAGCACAAA TTATCCaAGT 4680 

TGCTTCAGAA GATGATGCCA AAGCTGTCAA GAAAGAAATC ACTGACGGCG GCGATTTCAC 4740 

AAAAATTGCT AAAGAAAAAT CAACAGATAC TGCTACGAAA AAAGATGGCG GTAAAATTAA 4800 

ATTTGATTCA CAAGCAACAA CTGTTCCTGC CGAAGTTAM GAAGCTGCCT TCAAATTAAA 4860 

AGATGGCGAA GTGTCAGAAC CAATTGCTGC AACAAATATG CAAACCTACC AAACAACCTA 4920 

CTATGTAGTG AAAATGACGA AAAACAAAGC AAAAGGCAAT GACATGAAAC CTTATGAAAA 4980 

AGAGATCAAG AAAATTGCTG AAGAAACAAA ATTAGCCGAT CAAACATTTG TTTCGAAAGT^ 5040 

CATTAGTGAC GAATTAAAAG CGGCCAATGT GAAAATTAAA GATGATGCCT TCAAGAACGC 5100 

TTTAGCAGGC TACATGCAAA CTGAATCTTC AAGCGCTTCT TCAGAGAAAA AAGAATCAAA 5160 

ATCAAGTGAT TCTAAAACAA GCGATACCAA AACAAGCGAC TCTGAAAAAG CAACAGATTC 5220 

TTCAAGCAAA ACAACAGAAT CTTCTTCTAA ATAAGCATAA AAAAGAGAGT GGTCGATTGA 5280 

CCACTCCCTT TTTTTATTTA TCTTTTGGAA CATAGAAACT GCGGTTATCT AAACCAAAGA 5340 

TTCGTTCCGT ATACTCCCCa GGCTGTGTTT GACGAATTGC CCCTAGCATt CATTTGAATG 5400 

GACGCATCCA TGGTATCAAT TTGATGTAAG ATTTCCGCTT CCATAATTCT TGGTCGCACA 5460 

GGTGAGCCAT ATTCTAACAA TCCATGGTGG GCTAAGACCA TGTGCCGAAG AACGACAACA 5520 

TCTTCTTCTC GATCATCAAT TTTCAAAGCC AAACACGCCT TGGTAATTTC CTCATCCACC 5580 
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AAAACTAAAT GACCAATTAA GTTCCCTGCC ACCGTATATT CTQTCGTAAT AGCACCAGAT 5640 

AGTTCTAAAA CTTTTCCTAA GTCATGTAAA ATAATCCCCG CGTATAATAG TGACGTGTTA 5700 

ATTTCAGTAT ATTCCTTAGC AATTGCTTTG CCTAGGCGCA ACATTGAGAC AGTGTGGTAA 5760 

GCTAAGCCAT TCGCAAAA(3C ATGATGGTTC CGTTTGGCTG CAGGAAATTC ATAAAATTCT 5820 

TGTTGGTATT TCGTCATGAG ATAGCGCACA ATACGATTCC AATGAGCATT GGTAATTTCA 5880 

AAAATCACTT GATTGATTTC CTCGACCATC TCTTCTTTTT TCAAAGGCGC TCGTTCCATA 5940 

TATAAACTTG GCTGGCTTGG TTCCTCTGCT GTCGCTAAAC GCATGTGTAA AATTTTTACT 6000 

TGGGGTTTAT TTTGATAAAC TTCTCGtTTC CCATTCAGTA AAACAACGTT TCCCGCAGTA 6060 

AATCGTGTAA TTTCATCCTC AGAGGCGTCC CAATATTTtC CGTCAATC6T GCCAGATGTG 6120 

TCTTGAAAGG TAAAGGCAAT AAACTTTTTC CCATTTTTAG CTACACGTAC ATCGGCATTT 6180 
TTAATTAGAA CAAATGATTC AAAAAGTTCT TCTACTGTTA ATTCACGTAA TTTTTTCACT . 6240 

TGACGCTTCC TCCTTCTAAC GCAATTACTC TTTGTTGTAA GTCCTGATAA TAGCTTACCA 6300 

TTTCTTGGTC AGAAGAAAAA CAAATGACTT GCTGTTTTTG ACCAAATTTA GTAAATAATT 6360 

CAGCTAATTG G6CTTTTCTT TGATGATCAT AGTGTAACCA GCCATCATCA ATGATAATTG 6420 

GACAAATCAG TTGCTCGCCT TGGACTGCTA AAAAGGCAAA ACGAACAGCC ATCATCAATT 6480 
GATCTTTAGT 6CCAGTTGAT AATTCAT6TA AATAAAACAT TTCTTGTTCT TTCGTCACCG 6540 
CAACTAGTTG ACCTTCGGCT ACTTGGATTG ATTGATAACG CTGATTTGTT AAAAGTGCAA 6600 
AGTA6CTTGA TGCATACTGT AATAAGCTGG GCAATTGTTG TTCCGAAAGC TCTGTTAATA 6660 
GGTCCATTAG CAGTTGTCCC GCTAGTTGAT ACCCAGACCA ACGTTGAGCT AGTTCTTTGA 6720 
TTTCCGCTTT TAAAATTGCT TGTCGTTGAT ACAACTCATC TAAGGTACCA TCCGTCATTA 6780 
ATTGTTGTTT TTCATAAATC AGCGCTTGAT ACTGTTCTTG TATTTTATCT AACTGGCTTT 6840 
CTTCCAATGC TTGTTGctGC GCCAGTTGTA AAGAACGGTC TTTTAAC6TT GCTTCTGTTA 6900 
CCGCTTCTGG AAACA6ATTA CCTAATTGCG TCGCTAATTC TTGCCCACGA CCAACTTGGG 6960 
CTTGCTGAGT GGTCAGCTGT TGAATCCTAC TTGGAACATC AGCGATTGAT AGAATCCGAT 7020 
AACGAATTAG TAAAGGCTGA ATTCGTGTTA ATGCTTCTTG CTGTTTA6TT TTCAACTCTC 7080 
GTAACTGCTG TTTGAGATAG CCATCTGCTT GATACTCTTG AGCAAAGCGA ACTTTCTCCA 7140 
TTTCTTCAGC AAACCCTTCT AAAAAGTGTA ACTTTTCGGA AAGGCTCTTT CCTTGCAAGG 7200 
GTAAATGCGT CATCAATGGG AAGATTTGTT CTTCAAATAC TTGGCGTTTT TGTTGATTTT 7260 
CCTTCAATTG ACTCATGAGT TCTTCATTTG TTTGAACAAG CAATAAATAG CGCGTAATCA 7320 
ATTCCCGCTG ATTCATTAAA GTATCAATTT GTTGCATTTT TCCTAATTGA TGGACTTGGG 7380 
CTT6CTCCGC TACTGCTTGT TCAAGTTGTT CCACTTGCAG TTGTGCCTCA CGTTCTTGAT 7440 
TCAGCGCCTG TTGGAACTGT TCATTTAAAT AATCTAGTTG CGAAAGTTTT GTCTGCCACT 7500 
CTTCTTTCAC TTCATCAGAG GAAGCTTCTT TTCTAGTGGC GTAgcTAAGC ACTTCTTTCG 7560 
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CGATAAAACT 
CGAGTAATAC 
TTTGGGCACG 
GGTCTTCCAG 
GAGCTTGCGT 
AAAGCTGTGG 
GACGATTTTG 
CATAGCGTTG 
CTTCTGGAAC 
CTTTGTCTAG 
CTAAAACAAC 
GaCGTTgcTg 
GTGtCACTCG 
GcTGCGCTTC 
TGAGTAATGG 
TTCGGTAAGT 
GTTCCTCTTC 
CTTGAAATAA 
TTTGTTGTTG 
GAACTTCCCC 
TAGGTGCGTA 
TAAATTGATA 
ATTCAAATGT 
ATTTCATAGT 
CAAGCTCTCT 
TAACAATTCC 
CGTTAATAAG 
CAGTATTAGG 
GTAGCTTAAT 
TTCTTTCAGT 
ACTGGTTTCA 
TTCATTCGTC 
AAGTGTGACC 



AACTCCCACG 
AAGGATACTT 
CGTCGCTTCA 
ATTTTCAATT 
ATACGCAACT 

tggtc6ttca 
ctcaaaggtt 
attcaac?;at 
atcgagcgct 

GTACATATAT 
GTTCTCTGTT 
TTCCgCAACC 
CTGTTGTTCT 
TTTCTCTTGA 
CTGCTGTCCT 
CAAAAGTTGT 
TGTCAACTTG 
CTTTTTCGTT 
ATAATAAATG 
AAACACAGGA 
ATCGCGTTTT 
CAATGTGGAC 
AATTTTCTGT 
GGGTCCTCCT 
AATCGCCACT 
TTGGTGGTAT 
TTGGGACTAG 
CGATAAACAA 
AGTTCACCAT 
GTCAACGCCA 
AGATAAGACA 
CAATCTATTT 
ACTGCAACCC 



AGAACTAGGC 
CCTAGCAATA 
CCAAAAAATG 
TGAGAAGCCA 
TGTTGTTGGC 
GGCGACCATG 
TGGGTCATCC 
TGTTGAATCA 
TCTGACTGAA 
TCTTGAAAgG 
TCTTcAGGTA 
TGTAACAAGT 
GTTTCGCGAA 
ATTTGCTCTT 
TTACCTTTAA 
TGACTACCAG 
TCACTCGTAA 
AGCGGATGTA 
ATTGCTTGCC 
TGTGCCAACC 
CTTTTGCCCT 
TTGCCAACTT 
TGTTGCCATT 
GAATAGTAAA 
GCTCATCCAT 
CTTCAAAAAT 
CGGCTAAATA 
AAAAATCACC 
TGACAATGGC 
TCAATTGCTG 
AAACGCTTTG 
CTGCGACTTT 
CTTGTAAAGA 



845 
CCCATTTTAA 
ACCAAAGTGG 
CGGCATGCGC 
AGAATGCTTT 
GCCACTTTTG 
ACCATTCGTT 
ATTCCGATTC 
ATGCTTCTTG 
GAGATAACTC 
TTTCTTCTAA 
GCGACTGCCA 
cCTGTTGCGT 
TCGTTTCTTC 
TTAGTTCTTG 
AGATTTTTTG 
AAACACCAAT 
TTAATTGTTC 
ACATTTGCTG 
CTTTATTTTT 
ACAAACGACC 
TTGTGGGAAA 
CATTTGCGCC 
TTCCAAAGCC 
GTCTTCATTA 
TGAAAAAATT 
AACTGGATCT 
GATCGGTTCT 
TTGCGTTTTT 
TTGCACCACG 
TTGCTGATGC 
GGTGTCACGG 
TTCAAATTGA 
CTGTTCTTTT 



AGQGCTTGGC 
TTGACTTGTT 
TTTTTCAAAA 
CTGTTCTTGT 
AACGGTTTCG 
CTCTAAAAGG 
CGTCATCAAT 
TTCTAGGTAA 
TTTGGTCAAA 
TTGaACTTGG 
TTCTTCGTAC 
TTCTTTTAAT 
TAATTGTTGA 
ATAAGCCGCC 
TGCTTCTTTA 
GGCTAACAAG 
TTGTTGGAAA 
TAAGGTTTTT 
TTCTTTAAAA 
ACCATACGAG 
CCCAAACAGC 
GTACAGCAAT 
CACAATCTCT 
ATTTGCTCAT 
CCAGCAAATT 
AAGTATGTTT 
TTCGTTGGTT 
TGTAAAAGCT 
GTCACATCCA 
GCCTGCCACT 
CATTGCGCAA 
ACAGTCGCTT 
TTCGTGTGTC 



AAAAAGAAAC 
TTCTGTGCTT 
TCTGTTAACT 
CCTTTTTGAA 
TTGTCATAAA 
ACCATTTCTT 
TGTTGCATAT 
AAAGCATATC 
CGTTGTCGTT 
TCTTCTTTTA 
aAGGGAAAGT 
TGCTGTTGCA 
AAAGTTTGTT 
AACTTCTGAT 
AAGTACGCAT 
GATGTTTGCA 
GTAAAAACCT 
TCATCCCCCA 
CGCTCGACTT 
CCACCATTTT 
ATTGCTTGAA 
TGATTACCAG 
ACTGCGAGTA 
GTGCTAGGCG 
CTGGATAACG 
TTTCTAATGC 
CCTCTGCTTG 
GCTGCTGTAA 
AATGTTGCGT 
TCGTCAGCAA 
GAGATACTTC 
GATTTTGGGA 
CTTGAGGTGT 



7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 
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TCCAGGATAA ACAATTAGCG GTTGTTCACT AACAATTTGC GGCTGATGAA TGTGGCCCAA 9600 

TGCCCAATAA TTATAGCCAG TGGCTTTTAA ATCCTTCCAT GTAAACGGAG. CATAAXTTTQ 9660 

TTGCGCTTTC CGACTGGTGT CTCCATGATA AATCCCAATA TGAATATCAG ATTCTGCTTG 9720 

TTTGATAGGA AATTCTAGTG CTTTATTTTC ATCAATCCAG GGATGCTCAT AACTAAACCC 9780 

TGAAACAGCC ACTTGTTCGC CCGCTTTCGT TTCAAAATAC AGTGTCTCAA CCATTTCTTT 9840 

TTCAAATAAA AAGACATTGT CTGGAAAAGA AAACCAATAG CGGTCTTTTT TATAGTAATC 9900 

ATGATTCCCA AATGTTAGAA TCACAGGGAT TTCTGCTTGT TTCAACTGTT CTAATGCAGC 9960 

CATCAATTGG GCTTGCATTG AAATAGAGGT CTGACTTTGA TGAAACGTAT CTCCTGCAAA 10020 

AATAACCAAG TCCACCGCTT CTCGAATGGC AACTGTTACG ATATTCATTA ACAATTGTTG 10080 

ATTAGCTTCT CTTAaGCGcT TCGCTAAGGC TGTCGGGATC CCAGTTAACC CTTCAAAAGA 10140 

CCGATCCATG TGTAAATCAG CTGTATGCAA TAGTTTCATC GTTTCCCCTC CTTTTCTCCT 10200 

TCCTTTATCA TATCATTTTC TGCCAGCTTG GTCAGTAAAA ACATACGAAA ATTTGTTCGT 10260 

,j»Y^YTTATTT TCAGTAAAAG AAAAACTCTG ACAAATTGTC TTATAGACAA CTTGTGAGAG 10320 

TTCAATTACG TAAATTAATC GTTATACATA TCACGCACAG GAGTCATGAT AATTCGGTTT 10380 

AAATCATTGA CAACTAAGCT AAACGCTTGT TCTTTTTGCA TTAATTCGTT GATAACTGCT 10440 

GTTTCTTGCA CTTTCGTTGC AATGCTTTGT GCTTTTTCAG CATCTTCATC CGTGAATTCT 10500 

TCGCCGCGCA TTTGTTTTTC TTGTAGTCCT TGTTGGAATC CTTGGAATTC TTTAAACAAT 10560 

GTGTAAGCTG CTTCGTCTGC TTTTACTTCT TCATACGCTG CTTGTAATGC TTTAAATTCG 10620 

TCTAACTCAC GTAGTTCACG TTCAATCTGA TTGGCTGTAT CATAAATGTT ACTCAATGGA 10680 

ATCCCTCTTT TCTCAATAGT TTCATCTTTA TTGTACCATT TCTCTTAGGT TTTCCCAACT 10740 

AATTTCCTAA TAAGCCTCCG AAAATTCCTT CAATCTTGTC TTTGACTTTT TTCGCGCCGT 10800 

CTTTAATTTG TCCCCCAACT TCATTCAATG TATCTCCGAA ATTGCTAGCG CCGTCTTGAA 10860 

CAGAATTTCC AAAGTCTTTC AAGTTATCCC GCCAATTAGA ATTGGCACCA CCAGTTAAAC 10920 

TATCGTTGCT ATCAACAACT TTCCCACCTG TGGCATAAGC ATCCGCGACA GGAAATTGTG 10980 

CTTGTTTGAC TTGTGGCAAG ATGCCACTGG CTTGACTGTT GAACACCTGT GAGGCATACG llt)40 

TAGCGCTACT GCCTTCCAAA TAATGGGTTT TACTGGTTTC TTGGAAGCCT AACCAAGTAG 11100 

CAATGACAAC TTCTGGTGTA TAACCAATCA CCCACTGATC GTTCGTTTTA CTACTATCGA 11160 

AGTTGGTTTC TGTCGTACCT GTTTTACGAG CCATTACGTA GCCTGCTGGT TGTGCGTTGA 11220 

CCCCGCTCCC AGAACTAAAG ACCCCTAGTA ACATACTTGT AATCCCGTCT GCAACATCTT 11280 

TGGTAATGAC TTGTTCTTTC TTGACTTTAG TATTATCAAC TACAATTGCG CCAGTTGAGT 11340 
CGACGATTTT TGTAATTAAA TGCGTTTCAG TTTTATAGCC TTCATTGGCA AAGGCACTAT 11400 
AAGCTCCCGC CATAGTTAGT GGCGAAACAC CCGTTTGTAA TCCTCCAAGG GCTAAACCAT 11460 
AGTACCGATC TTTTTCTGAT AACGGAATGC CAAATTTCTC TGTTTTTTCA TAGCCTTTAT 11520 
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CTAAACCAAT TTGATGTAAT AACCATACCG CTGGTAAATT 
GATACATAGG.AACTTCTCCA CTGTACGTGC GGCTATAGTT 
GTGGTTTATC TTCTAAAACA GAGTCTGGTT TATAGCCAGC 
CAGAAATTGG TTTTAAGGAC GAGCCTGGTG AACGTTTTGT 
CTCGATAAAC GTGTTCCCCT CGACCGCCCA CCAGTGCTTG 
CAATGGCTAC AGAGCCACTT TGAACCATGG CACCGTCTTC 
CATCATTTTT ATAAGTTGCA TCCATCGCAT CTTGATACCC 
TCTTGTATCC TTTATTTAAA ATATCTTCTT CTTTAAATTT 
CAATTACCGC ATCAAAATAA TATGGATATT TGTAACTATT 
CATTTAATAG ACTCGCTAAG TTAACACTCG CTTCTTGATT 
TTTTATTGTC GACCATTAGT TGTAAAACGG TATTCCGTCG 
AATCAATTGG GTTATAAATT CCTGGCCCTT TTAACATGCC 
CCACTGTCAC TTGGGACGCA TCCACGCCAA AGTATTTACG 
AGACACCATT ACCAAAATAA GAATTATTTA AATACATTGC 
ATTTCTTTTC GATTTCAATC GCTAAGAATA ATTCTCTGGC 
CTTGCGTTAA GTAAGCATTT TTGGCTAATT GCTGCGTAAT 
CACCGATTTT CCCGAAGGTC AATTTTCCAA CAACCGCACG 
ACCCTTTATG TTGATAAAAC CCGCGATCTT CAGTTGAAAT 
GAGAAATATT ATCCAGTTCC ACAAAAGTCC CTTTTTGCCC 
TTCCTGATTC ATCATAAACG CGTGTTGATT CACTTAGCCC 
TTGCGGACTT GGCTAAGTAA AATAAATAAA TACTGGCACC 
TTAACAAGAA AATTTTATTG ATTTGATACT TTTTCCAAAT 
ATTGTATTAA ATAAGGCTTT 
(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9086 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
AACGCTAAAA CCTTTAATAA ATAAAGGTTT TAGCnTTTTA ATAAAGAGCC ACCTGCCGGG 
CTTGAACCGG CGACCCCCAC CTTACCATGG TGGTGCTCTA CCAACTGAGC TAAGGCGGCA 
CTTTCAAAGG CTAAACTAAC TCGAGGATTG CAACGAATGA AACCCATAGA GTTCATTTTC 
AAAGGCTACA CTGCGACGAG TAATCGTCAA AGTTCCTTTT GAAAAATCGT ACAAATCTCG 
TGCACAAGAA ATAGAATAAC GTATAGACGA AAGTTTGTCA ATTGCTCTTT CGTAAAAAAA 



TAAGCTTTCT 
TTGCGCTGGA 
CTCTAAAGCT 
TTGCGTTGCA 
CACGCCGCCG 
TGCATTGGGT 
CTGATTTAAA 
ATAACGATTG 
TTCGTCGCCA 
GGCTTCTTCT 
CGCTGTGGCA 
CGCTAGCGTT 
AGCTGCATCT 
CAAGATATCT 
TTTCCTATCT 
TGTACTTCCG 
GGCAATCCCt 
AACCGCGTCT 
GAATAACTTT 
TGATTTTAGG 
TAAAGCGACT 
CCGCTTTCTG 



CCTAAGGCTT 
TAATAATCTT 
GGTGTATAGA 
AAGTTAAAGC 
GTTTTCGGAT 
GGATUVTAGTG 
GAAGTGTAAA 
ACCGCTTCAT 
ACATAGGTGT 
TGAGATAGTT 
TTGTCAATGT 
GCGGCTTCCC 
TGCACGCCCC 
TTTTTCGAAT 
AACGTTTGTT 
CCACCACCGC 
TTGATGGAAT 
TGAATATAAG 
CCGACTTCTT 
GTCTCAACGT 
ACCAAACCAA 
GCTTGATGGA 



11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12860 



60 
120 
180 
240 
300 
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GAAAAATGAT TTTTTTCTAA GAAAAACTGT TGACAATCTC TTTTTTCTCT AGTAAAGTGT 360 

AAATCAACAT CACAGATAAC TATTCATTCG TTGAAGAGAA GAGTAAATTT TGATGACCTT 420 

CAAAAGAGAG TCCTGATTGG TGAmAAGGGA CAAGGTTCAG AAAAGTTGAm GATGGCCTCT 480 

GAGAATAAAT TGCTGATAAG TAAGCAATTT CGGCTAAGCA CCCGTTACCA TGTGCACCAA 540 

GTTAATAAAG ACTTGGATAA GAGATGAGAT ATAAAGAATC AGTTATATTT CAAAAAAAGG 600 

TGGTACCGCG AATCATTCGC CCTTTACTAG CAAAACGCTA GTGAAGGGCT TTTTGCTGTG 660 

AATCACTACG ACTGGCGCGA AGACAGAGTA GATGATGAAC AGTACAAGAT GACCGAAGGC 720 

AAGTTGTGTC ATTTGCACGA ATCACTACGA CTGGCGCGAA gACAGAGTAG ATGATGAACA 780 

GTACAAAATG ACCGAAGGCA GGTTGTATCA TTTGCACGAA TCACTACGAC TGGCGCGAAG 840 

ACAGAGCAGA TGATGTCTGC GATTACTCGA CGCAGGATGG CCTTTGTGAG TGATGAACAG 900 

TACAAGGTGA CCGAAGGCAA GTTGTGTCAT TTGCACGAAA GTGTTCCTAC GATTCTCGTC 960 

ACAGTCAAAT CCTGTTGAGC ACTACGACTG ATGCAAAGAG AGAGAGTAGC TATTATCTGC 1020 

GATTACTCGA TGCTTGAAGG GGTAGTAACT CTAGTCTGTT TGATTTTGAA AATAAACGAA 1080 

AATCAGTTTA AAATGAATAG AAATAAAAAA GCAGTGATAT AGTAAGTACA AGACTTGATT 1140 

TGTCTAACAG AGAGCCTAGG TAGCTGAGAA TAGGTGACAA AGACGTTTTG GAAAATGGCT 1200 

ATTGAGCAAG CGCTCCGAGT GTTAAACACA AGGCGGCGAC GGGGGCTCCC GTTaTCGAGC 1260 

TAGGGTATCG AACATATTGT TCCGTACTTG AAAAGGGAAT TTGTGTGAGC AAATTCTGAA 1320 

CATGAGGTGG TAACACGCAA AAAAGsGTsC TCAACGATAC AACTCGAGTT GTGTCGTTGA 1380 

GGATGCTTTT TTCTTTTTTA TGAAAAAATG CAGGTTGGTA AGAATGAAGG GGGATAAAGA 1440 

ATGATTGAAC TAAAAAATAT CAGTGTGACA TTTCAACAAA AAAAACAAGA AATTCAGGCT 1500 

GTCCAAGATG TTTCTTTGAC AATTGATAAA GGGGATATCT ACGGAATCGT TGGTTATTCC 1560 

GGTGCTGGAA AAAGTACGCT AGTACGGGTA ATTAATTTAT TGCAACGACC AACAGCGGGC 1620 

ACCGTTATTA TTAACAAAGA AAATATTTTA ACTTTTTCTA AAAAGGAATT ACGACAACAA 1680 

CGAAAAAAGA TTGGAATGAT TTTTCAACAT TTTAATTTGA TGAAAGAGCG GACTATATTT 1740 

AGTAATATCG ATTTTTCATT AAAATATTCT GGGCTAAGTA AAAGTGAACG GCGTCAAAAA 1800 

ATCAGCCATT TATTAGAGCT AGTTGGACTA AGTGAAAAAC GGGACGCTTA CCCCAGTCAA 1860 

CTATCAGGAG GGCAAAAACA GCGGGTTGCG ATTGCCCGCG CTTTAGCAAA TGATCCTGAA 1920 

ATTTTACTTT GTGATGAAGC GACGAGTGCG TTAGATCCAA AAACAACGGG ACAGATTTTA 1980 

GCCTTGTTAA AAAAGTTAAA TCAAGAACTG AACTTGACGA TTGTCTTAAT CACCCATGAA 2040 

ATGCAAGTGG TTAAAGAAAT TTGTAATAAA GTCGCGGTGA TGGAAAATGG CTGCGTTGTC 2100 

GAATCAAACG ATATTGTTTC GATTTTTAGT CAACCCCAAC AACCTTTAAC CAAAGATTTT 2160 

ATCCGGACAG CAACTCATAT TGACCAAGCA TTAACAACAA TTCTTGAACA TCCTAAATTA 2220 

GCTGATTTAG ATAAAAATCA AGAATTAATT GAATTTTCTT ATGTGGGAGA TCAAACGAAT 2280 
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GAGCCATTAA TTGCACAATT GTATAGCCAA TATCAGGTGT ATACGAATAT CCTTTACGGC 
AATGTCGAAA TTGTTCAAAA TGTTCCTATT GGCCATTTAA TTGTGGTGCT TTCTGGTGAT 
GAGGCCCAGC GTCAACAAGC CCTTACTTAT TTAGCGAAAC AAGGCGTGCG AACCAATGTT 
TTGAAAACCT ATCAGCAAAC GAAGCAAAAG CAGAATTTAC AGGTGATTTA AAAAAGGAGT 
GAACTGAGAA TGTATCAATT ATTTGAGAAA TATTTTCCGA ATGTTGTCCA ATTAAAACAA 
GAGTTTCTTC AAAGTACATG GGAAACATTA TACATGGTTT TTTGGACAGC ATTGATTGCT 
GGCGTGTTAG GAGTCTTGTT GGGTGTCGTG CTTGTTAGTA CTGGCCCCAG TGGTGTTTTG 
AAAAATCCAC CCCTGTACAG TGTCTTAGAA AAAATTATTA ATGTTTGCCG CTCTATTCCT 
TTCATTATTA TGCTCGCACT GATTCAACCA TTAACACGAA TTTTGGCAGG AACGACAATT 
GGTACAACCG CAGCGTTAGT CCCATTAGTT ATTGGCGTAA TCCCGTTCTT CGCGCGCCAA 
ATTGAAAATG CGTTATTAGA AGTGGATCCT GGCGTTATTG AAGCGGCAGA AGCCATGGGG 
ACGAGTCCCT TAGGGATTAT TTTTAGGGTT TATCTAATTG AAGGGTTACC AAGTATTATT 
CGTGTTTCAG CGGTGACAAT TATTAATTTG ATTGGATTAA CAGCCATGGC AGGAGCGATT 
GGAGCCGGTG GTCTGGGCAA CTTAGCGATT ACTCGAGGAT ACAATCGGTT TCAAACCGAT 
GTGACATTTA TGGCCACGTT AATTATTTTA ATTATGGTAT TTATCAGTCA AGCCATTAGT 
AATCAATTAA TCAAAAAAAC ATCACATTAG AAAAAAGGAG AAATGACAAA TGAAAAAATT 
TAGTAAATTA ATTGGACTTA TTGGGGTATT AGCTTTTACG ATTGCAGGTT GTGCATCGGG 
GTCTGTGAAG GATACTAAGA CAGAAACCGT TAAACTAGGG GTTGTAGGAA CAAAAAATGA 
TGAATGGGAA TCGGTCAAAG ACCGTTTGAA AAAGAAAAAT ATTGATTTAC AATTGGTAGA 
ATTTACAGAC TATACGCAAC CAAACGCAGC ATTAGCAGAA AAAGAAATTG ATTTAAATGC 
CTTTCAGCAT CAAATCTTTT TAGACAATTA CAATAAAGAG CATGGAACGA AATTAGTATC 
aATTGGCaAT ACAGTCAATG CaCCATTGGG AATTTACGCT AATAAATTGA AAGATATCAC 
GAAAATTAAA GACGGCGGAG AAATTGCTAT TCCTAATGAC CCAACGAATG GCGGGCGGGC 
GTTAATTTTA TTACAAACTG CAGGACTGAT AAAAGTAGAT CCTGCGAAAC AGCAACTACC 
GACTGTCAGT GATATTACTG AAAATAAACG CCAATTGAAA ATAACTGAAT TAGATGcTAC 
GCAAACAGCG CGCGCTTTAC AAGATGTC6A TGCTTCAGTG ATTAATAGCG GCATGGCTGT 
CGATGCTGGG TATACACCAG ATAAAGATGC TATTTTCTTA GAACCTGTAA ACGAAAAAGC 
GAAACCTTAT GTGAACATTG TCGTGGCCCG AGAAGAAGAT CAAGAGAATA AACTTTATCA 
AAAAGTTGTA GAAGAATATC AACAAGAAGA AACGAAAAAG GTCATTGCAG AAACATCAAA 
AGGCGCCAAT GTTCCAGCCT GGGAAACATT TGGTAAAAAA TAAAGGAGGC ATTTATAATG 
AGTACAACAA CGATTCAAAC AATCCAA6AA GCTATTGCTA CAGAAAAGGA ATGGATAATC 
CATTTAAGAC GTCATTTTCA TCAATATCCT GAAGCAAGTT TAAAAGAATA TGAAACGATT 
AAGCGAATTA AAGAAGAACT ACTAGCCTTA GCTATTCCTT TTGTAGAAGT AGGGGAAACG 



2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000. 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 
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GGTGTTTTAG 


CAACCATTGA AGGAGGTCTT 


GGCGCTGGCA 


AAACGATTTT 


GTTACGTGCA 


4320 


GATATTGATG 


CGTTGGAATT 


(3CCAGATGCA ACAGGTGCTG 


CCTATGCTTC 


TAAAAATCCA 


4380 


GGACTCAATC 


ATGCTTGTGG 


ACACGATGGT 




CATTGCTAGG 


TGCAGCTAAA 


4440 


GTGCTCAAAA 


AACATCAGGA 


TACCTTTTCA 


GGAACGATTA 


AACTGGCCTT 


CCAGCCAGCT 


4500 


GAAGAAATTG 


GTGCAGGCGC 


TCGACAATTT 


GTGGAAGGAA 


ATTATTTAGA 


AGCAATCGAC 


4560 


CAAGTGTTTG 


GGaTTCATTT 


AGATTCCAGT 


GTGCCGGTCG 


GAAAATTAGT 


CGCTACCAAA 


4620 


GGCGCCACCA 


ACGCCTCTTG 


TGATATTTTT 


AAAAtTGAAG 


TCAGTGGTCA 


AAGTAGTCAC 


4680 


GTCGCCCAAC 


CACAGAATGG 


CCGGGATGCy 


stTTTAGCAG 


CGGCCaGTAT 


CGTTGTGGAA 


4740 


TTACAAAAAA 


TTGTAGCTCG 


CGAGATTGAT 


CCTTTAGATT 


CTGTCGTAGT 


AGGAATTGGC 


4800 


GTTTTACAAG 


CAGGAACACG 


CTATAATATT 


GTAGCAAACC 


AGGCAACCAT 


TGAAGGTACT 


4860 


GTTCGAACAT 


TTAGTCAGGA 


AACGCGCCAA 


TTTGTTTTAC 


AACGAGTCGA 


AGAAATTGCC 


4920 


CATGAAATTG 


CCCAGTCTAA 


TCGCACAGAA 


ATTGCTGATT 


TTTCTGTTTA 


TGCGGCAGCG 


4980 


AATCCATTAA 


TTAATGAAGA ACAAGCAACC 


AACCGTGCAC 


AGCAGGTAGC 


CAGTGAAATT 


5040 


GTTGGTTTTG 


AAAATGTTGT 


GACCGATCAT 


CCTAAAAGTT 


TAGGGGCGGA 


TGATTTTGCT 


5100 


GATTACTTAG 


CGGTAATTCC 


TGGTATCTAT 


GGACGGGTCG 


GTTCACGAAA 


CCCTGAAAAT 


5160 


CCTGCTACTC 


ATTTTGGACA 


CCATCATGAA 


CAATTTGATA 


TTGATGAACG 


TGCCTTACTT 


5220 


TTAGCAGCAG 


AGTACCACGT 


ACGCTATGCA 


TTAAATTATT 


TATCAGAATA 


ACAGGGATAG 


5280 


GGAGCCGACA 


AAATGAT'^GT 


TTGTGGGCTC 


CTTTCGTTAT 


AAACCAACGT 


GGCTTGCTTT 


5340 


CTTGTCATAC 


TATGGTAGAA 


TACAAGCATA AGAAAGGTGA 


TTCACATGTT 


TTTTGTTTTA 


5400 


ACTGATAAAT 


TAACAAGAAA 


AGTAGCTCCC 


GATAGTTTAA 


GAACAGATGA 


AGCACTGTTG 


5460 


TGGAGCCGAG 


CAATTAATTT 


CTTCATCGCT 


TGATAGGACA 


TTTTTTGACT 


AGAAAAGCTA 


5520 


GTCTTATTTG 


AGTGTACGTA 


CAAGCGGTTG 


TCGGGATTTC 


TGTCAGGATG 


GACGACGTGC 


5580 


AGACGTGGTT 


CTTTTTTTGT 


TGTCCCAAAA 


TAATCAGTTA 


ACAAGAAAGA 


AGTGACGAAA 


5640 


ATGAGTATTT 


TAACTATTGA ACATTTAACG 


CATCGATTTG 


GCGAAAAGGT 


CTTGTATGAA 


5700 


GAGGCTTCAT 


TGCAAGTGAA 


TAAAGGGGAT 


CATTTAGGCT 


TAACTGGCCA AAATGGGGTC 


5760 


GGCAAATCCA 


CCTTAATTAA AATTTTAACG 


GGGGAA6TAT 


TGCCaGACGA AGGAACGATT 


5820 


CAGTGGCAAA 


AAAATTGCAA 


GATTGGGTAC 


TTGGATCAGC 


ACGTTTCTGT 


AGAACAATCA 


C O O A 

5880 


CTAACGATGG 


TTGATTTTTT 


GAAACAAGCC 


TTTCAAGAAC 


TTTTTGATAA 


AGAAGCGAAA 


5940 


CTGACAAAGC 


TTTATGAAGA ATACAGTCAA ACGGCTTCGG 


AAAAACTTTT 


AGAACAAGCA 


6000 


GGCAAGTTAC 


AAACAGATTT 


AGATGAAAGT 


AATTTTTACC 


AAATCGACAC 


GATTATTCAG 


6060 


GATTTAGCCA ATGGGTTAGG ACTACAAGCA ATTGGTTTGG 


ATAAAAAGTT 


AGGGGAGCTA 


6120 


AGCGGTGGTC 


AACGTTCAAA 


AGTGATTTTA 


GCAAAATTAT 


TGTTAGAGGC 


CCCTGATGTG 


6180 


TTACTTTTAG 


ATGAACCCAC 


CAACTATTTA GATGATACAC 


ATATTCAATG 


GCTTGTTCGC 


6240 
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TATCTGAATA ATTTTGAAGG 


AAGCTTTCTA 


TTAGTCTCCC 


ATGATTATCA ATTTTTAAAT 


6300 


GAAGTGACGA ATTGCATCGC AGACATTGAG 


TTTGGCAAGT 


TAACCAAATA* 


TACTGGAAAT 


6360 


GT Ao AAAAA 1 


L. 1 i 1 1 oUAvA 




AATAAACAAA 


CCTATTTGAA 


ACAGTATCAG 


6420 


GCCCAACAAb 


AAAAAAl Kin 




GCCTATATTC 


GTAAATACAA 


Af^T A AAT 


6480 


CGAGCAACGA 


TGGCTAAAACj 


f*f*f* A o R ft a ft ft 


CAATTGGACC 




n X X X v^VrfVj 


6540 


CCTGGTTCCT 


TGACTAAGCC 


ft ft TTr* ft ft 


TTTCCTTATC 


nnwUO 1. X X 


TGfAAfGfTAA 


6600 


GCACTAACGA 


CCCAGAAGTT 


AGTTGTCGGC 


TATCGGGAAC 


\^x. X X o X X norv 


APfGTTAGAT 


6660 


TTAATGGTTC 


ATGTCGGTGA 


AAAAGTCGCA 


TTGAAAGGCT 


X X /AiA X X 


TnnrAAATPA 


6720 


ACATTAATTA 


AAACGTTGAC 


GAAAGTGATT 


CCTTCATTAG 


ATGGAGAATT 


TCATTATCCG 


6780 


CTGAATACAA 


AAATTGCTTA 


TTTTACCCAA 


GACTTAGCGT 


GGCCTAATGA 


GCAGTTAACG 


6840 


CCACTAGACT 


ATTTATCAGA 


TCGTTTTCCA 


GATACAACGA 


TTAAAGAGCG 


AAGAAGTCAT 


6900 


TTGGCTAGAG 


CAGGTTTACC 


AGATAAGTTA 


GCAATGCAGT 


CGCTAGCCCT 


GTTAAGTGGT 


6960 


GGTGAGCAAA 


CAAAaGTAAA ACTAGCGGAA 


CTAATGATGC 


AAACAAGTAA 


TTTGTTATTT 


7020 


TTAGATGAGC 


CAACCAATCA 


TATTGATGAA 


GCGGCCAAAA AAAGTTTACA AGAAGCAATT 


7080 


CACGTTTACC 


CAGGAACGGT 


TTTTCTGGTT 


TCCCATGAAG 


CAGATTTTTA 


TGAGGAAATT 


7140 


GTGGATCGA6 


TAATTGATAT 


TGAGGAATTA GTTAAATAAA GAAGAAAGAA CCAAGTTGAG 


7200 


GGGGAGTCAA 


CTTGGTTCTT 


TCTTCTTTAT 


TATTTATTTA 


CTCATTAAGC 


CATCATGGAT 


7260 


TTTTGTTAAA 


TTCCAGTTCA 


TCATGCTGTA 


GTACGTATCG 


CCTTCTGTTC 


CTTCTTTGGC 


7320 


AAGAGAGTCT 


GTGAAAAGTG 


TATCGTAAAT 


TGGTCGTTTC 


ACTTCTTTTG 


AGACCCGTTC 


7380 


CATACTACGT 


TTATCGACAC 


TGGTTTCAAC 


AAATAACACA 


GGTGCTTTTG 


ATTTCTTAAT 


7440 


GGTATCAATA ATCGTGGTCA TTTGTTCAGG TGTyCCTTGA CTTTCTGTGT 


TAATTTCCCA 


7500 


AATATAAGCG 


GCATTTAAAT 


CATAAGCTTT 


GGAGAAATAT 


TTAAAGGCAC 


CTTCACTTGT 


7560 


AACTAATAAT 


TTTTTATCAT 


CAGGAATATC 


AGCAAATTTA 


GCTTTGGCTT 


CCTCATGTAG 


7620 


TTTGCTAAGT 


TTTTCGGTAT 


AATTTTTCGC 


GTTTTCTGTA 


TAGAAATCTT 


TATTTTTTGG 


7680 


ATCTTTTTCT 


ACTAACACGT 


CACGAATGTT 


TTCTACATAT 


TTAATGCCaT 


TTTCAATGTC 


7740 


TAACCAAGCA 


TGTGGATCTT 


CTGTTTGTTC 


tTGACCGGCA 


CTTGTTAAAT 


ATTGTGGCGT 


7800 


AACATTTTTG 


CTTGTAGAAA AGTAATCTTT 


ATTCTCAACT 


TTTTTGGCCG 


TTTTCATTAA 


•7 O ^ A 

7860 


TTTGTTAAAC 


CAGCCATTTC 


CGCCTGTTTC 


TAAGTTCAAG 


CCGTTAAAGA ATAAAATGTC 


79Z0 


CGCTTCAGAA 


GCTTTCGCAA 


TGTCTTCTGG 


TAACGGTTCA 


TATTCGTGAG 


GGTCTGTCCC 


7980 


AATTGGCACA ATACTATGCA GCTCAATTTT 


GTCTTGCCCA 


ACATTTTTCA 


CTAAATCAGA 


8040 


TAGGATCGAG 


TTCGTTGTCA 


CAATTGCTAA 


TTTTTCTTTC 


TTTTCAGCGG 


CTTGATTCCC 


8100 


GCAAGCAGCT AACGTTAACC CTGCTAAAAG 


TGTTAAAAAA AATAAACTAA ATTTTTTCAT 


8160 


TCGTTGATTC 


CTCCATTTCT 


TTCTCACGGT 


TTACAAATAC 


TAGGCCTTTC 


TTTGGTGAGA 


8220 
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AAAAGAAAGC 


CAATAAAAAG 


AATAAGGCGG 


CGGTTAAAAC 


AATCGTCGCA 


CCAGAAGCTA 


8280 


AGTTATAAGA 


ATAACTGAAG 


AATAGGCCAA 


TCACTGAACT 


TAAAATCCCA 


AaaGTaGAGG 


8340 


CTAaGCCAaT 


CATtGTCGGT 


AAGTGATTTG 


TTAaTwGATA 


gCTGTTGCTG CTGGCGTAAT 


8400 


CAGCtGGCAT 


GACTAAAATA 


GTTCCGACGG 


TTTGTAAmGA 






84 60 


GTAAAaACAT 


CAATGCATAG 


TGGAAAAATT 


GAaTATTTAA 




KiAbUk^X iv7 


8520 


TGGGATCAAA 


AGAAGTGAGT 


TGCAACTCTT 


TATAAAACAA 


TGCCAUAAAA 


A 1 L> AAAAL* AA 


8580 


TCACACCGAC 


GACACAAGTA 


ATTAAAATAT 


CGGTATCTGC 


Va ACLbU I AA 1 


Av^Al 1 iC^AA 


8640 


ACAGAATATG 


ATATAAATCG 


GTGGAACTTT 


TTGCAAAAGA 


GATAAAAATA 


ATCCCTAAAG 


8700 


CAAAAAAAGA 


ACTAAAGACA 


ACACCAATGG 


CAGTATCATT 


TTTTAGTGGA 


CTTTTTTGGG 


8760 


TAATAAAGCC 


GATCGATAAT 


GCAGCTAATA 


AGCCAAAAAT 


AGAAGCGCCA 


AAAATATAAT 


8820 


TGAAACCAAA 


CATmTAAGAA 


ACAGCGACCC 


CTGGTAAAAC 


AGCATGTGAA 


ATCGCATCAC 


8880 


CCATCAAAGA 


CATCCCACGT 


AAAATAATGA 


ATGAACCAAT 


TACCCCTGAA 


ATGAGTCCGA 


8940 


CAATTATAGA 


CGTCAAAAGG 


GCATTTTGTA 


AAAATTGATA 


ACGGAATAAA 


CCATCAATAA 


9000 


ATGCAGCAAT 


CATTTTGCAA 


CCCCCTGTAT 


TAATAAATCA 


CCTAACAAAT 


CACCGTATGC 


9060 


TTCTTGGAGC 


GTTTCTGCAG 


TAAATG 








9086 



(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8546 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 



TGCGCTTGCA 


ATAGTGCAAT 


GTACTAATCG 


TTTATTTTCC 


ATGTTCATCT 


GATAATCAGC 


60 


CCCCTAACAT 


TTTGGACACG 


CTTTCATTAT 


TACATTAAAG 


GGGCGTTCCA 


AAATAAATCT 


120 


TTTATTTTTG 


GAACGtCCAT 


CTCTAAATCA 


TTCTCTTAAA 


AACTGTAATA 


GCTTACGACG 


180 


ACAAGTCAGA 


ATTATTCATC 


CTCATCATCT 


ACTGCAACAT 


TTAGCAGCAA 


TTCGTTGACT 


240 


TTCACAATAC 


TTTCTGCTGA 


TTCCTTAACA 


TAGTCACCTT 


CGATCGCTAA 


TAAACTTGCA 


300 


TTTACATAGC 


TGATAACCAT 


CGGTAAATTC ATCCCAGTAT 


ATAAGTCAAA 


CGAAGCACCT 


360 


TGCATAATTT 


TTTTGGATAA 


AATATTACAA GGTGTCCCCC 


CTAGTAGATC 


AGCAAAGACA 


420 


ACAAAATCTT 


CTAGCGATTC 


CGTAATCGCT 


TCAAATTTTT 


CTAAAAAGTC 


AGCTTCACCT 


480 


tCTTCTGGTA 


ATAAAGCAAC 


AGTGGnAATT 


GTTTCTTGCG 


GTCCCATAAT 


CATCTCCGCA 


540 


CTTTTTTTCA 


ATTCCTCGCA 


AAAACGGCCA 


TGACTTACTA 


AAACTAAATG 


TTCACTCATC 


600 


GATTTTCyCC 


TGTCTTACTT 


ATTTTGAAwT 


AACACCTAAC 


GCAGACAaTG 


CGACACAGAC 


660 


AaTTAATACG 


ATAAAAATGG 


CTTTAGTAGA CGTCATATTC 


TTTTTACCTA 


GCATCCAATA 


720 



wo 98/50555 PCT/US98/08985 

853 



T^CGCCACCG ACAATTCCTG 


CAGGAAGTAG 


ACGTGGTAAA 


ATCATATCCA AGTTATTTTG 


780 


CATGTTAAGT 


GTTACATCCC 


CAATACTTGG 


TGCCCATGAT 


AGTTTGACAT 


TAACCATCGT 


840 


TGCCACTAAA GCGCCAACCA TGAAGACCCC TAGTAAAGTT 


GCAGCGTCTG 


TTAAGGCAGT 


900 


TAAGCGGTGT 


iVjL*/\iAul iut 


1 1 AV^wAMAuA 




CGATACGCAA 


ATTCCAATTG 


960 


TTTCCAACGG 


AAG ACC A 1 L.A 


L.GGOAA1 1 IG 


n r* li a r* a a T p 

AGUAAv^AA i U 


PAGATAAAGA 


TACCAGTTGG 


1020 


GTTACCATTA 


ATCGCCArGT 


TGGCaGCTAG 


CGGCCGAAAA 




TTAAAGPTGP 

X X fVV\w^ X WW 


1080 


GAAAATTGAA 


TCCCCA^M i.G 


cTGcAAACGG 


T CCCATTAAA 


PPnrSTTTTPa 


aappTCa aap 


1140 


TGTTTGtTTT 


CCAGCAATGc 


CTTCTTTTTC 


TTCaATGGcC 


A A ATPPATPP 




1200 


TGTATTAAAG 


AAGTTTGAAG 


TATTGAAAAA 


TTGCGTGTGT 


ol 1 i l^/\i ir\ 


PTTPTTTTaa 


1260 


TTCAGGCGTA 


TCATCCCCAT 


AAATTTTACG 


CAATTGCGGC 


A A A A T/^^TAT* 

AAAA 1 CG 1 A 1 


AL« AAo 1 AAl^^ 


1320 


TGATCCTTGC 


ATTCTTTCGT 


AGTTCCAACC 


TAATTGGAAA 


/~»fTi7\ A Ar^ A A^/^ 

G i AAAGAAGG 


1 ilrflls^oAl i 


1380 


AATTTGTTTA 


AAATCTTCTT 


TCGTCAATTT 


ATAATTAGAT 


i IGGIGAIGI 


*P/^ A A TPTPTP 


1440 


CTTCGTCTGA 


ATCATTCGCT 


GCATGTTCTT 


TTTTGGCTGC 


I GGCGCAG 1 G 


V9(jAAiL»AX X X 


1500 


GACCATTTTT 


GTAGCTAATT 


GCGGCTAAAG 


CGAAACCAAT 


fA A A/^/*'^A/**T 

i AAAGGGAG 1 


GGIAAGAX XG 


1560 


GTAGTGCGTT 


AAATACACCA 


CTGAAGTCTT 


TCACAACGCT 


pppnapa^l 2kT 


f^TTPPT a AAA 


1620 


GTTGCATGTT 


TGTAAAGATT 


GTTCCTAGTA AAGCAGTTAC 


a PT n a pp n 
AG 1 AAAAGGA 


apaaTTaaaT 

AuAAX X A/viX 


1680 


AAGGG7VAATG 


TTTTTTCACT 


GGTAAGTAAC 


GTAGTAAAAT 


T/i p a a a a ppn 


aPPflPTGGTA 


1740 


AGACCGCCCC 


AGCAACAGAC 


AAGCCATCAC 


CTAACCATTT 


p a a a 'vcnr^c*'^ 


TTTa/inapan 


1800 


ATACAACTTT 


TTGTACTAAA 


CCACCACCAA 


AAGCAAGTGC 


TaapaapaPT 


ric a aTP ATTP 


1860 


GAGAAAGAGA 


CCAAGGTAAG 


GCACCCATTA AGAAGTTACG 


•PTPaaTTPPT 


•p'p/^'pa^'p'ppa 

X X \9 X AV9 X X k«n 


1920 


TATCTTCGAC 


CATTTTATCA 


ATCCGATGTG 


CAAAGTAGGT 


aTVPPpaaaa 
Al 1 GGGAAAA 


ppapppaaaa 


1980 


TGTCTAATTG 


T^TCATTAAA 


CTAGCAACTG 


GTACTGCAAT 


appapppaTT 


/^PTTnTTPAG 


2040 


GGTlCAr ICL- 


TAAAGACACA 


GAAAATGCTG 


TGGCAAGAAT 




TTAGPATCAA 


2100 


rn »P rp fp n 
111 IGGAGGC- 


ACCCCCGAAA 


GTCCCAACTC 


CTAATACAGT 


TAATTf^PATK 


PTTCCACCAA 


2160 


TAATAAGTCC 


TGCTTTAATA 


TCCCCCATCA 


CTAATCCTGC 


AAAAAGACCT 


GCAAACACTG 


2220 


GCGCACTTAA 


TGAAGAATAA 


ATTTGTAATT 


CATCTAAAAT 


TTGGTATCCT 


GCGTATAATG 


2280 


TTAATAATAA 


GATCTGCCAC 


CATAAGATAT 


TCATTTTTTG 


TTCCCTCCTA 


TTTTGTTATC 


2340 


AAACTCATAA 


AGTCTTCCGC 


TTTGTCACTT 


GGCACCATTT 


GAGCAATCAG 


ATGGACACCT 


2400 


TTAGCATTTA 


ACTCTTTAAA AACCTCAATA 


TCTTGATCGA 


CCACATTGAT 


TGACTTAGTA 


2460 


ATCGACCGTG 


TTTCGTTTGT 


TTGAGACATG 


TTCCCAACAT 


TGATTTCTTT 


AATCGGCACC 


2520 


CCAAGTTCTA 


CTAGTTTTAA 


TAAACGATCT 


GGTTTCCGTA 


CCACAATTAA 


TAAACGTTGT 


2580 


GAGTCGTATT TTCCTGCTAA AATATTTTGA GCAGCTTTTT CAACTGGTAA CACGCTTAGT 


2640 


TTCACGCCAG 


CTGGTGTTGC 


TAATTTCAAG 


CCACTTTTTT 


CAATGGCATT 


CCCTGCAACT 


2700 
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TCATCATCAA 


CGACCATAAT 


TCTGGAAATA 


TTCAATTTTG 


TCGCCCATAA 


ATTTGCCACT 


2760 


XGTCCATGAA 


TCAAACGTCC 


ATCAATTCTT 


ACGCCAATAA 


TACTCATTGT 


AATCCCTCTT 


2820 


TCATTTCTTT 


ATATAGTGGT 


TCTTTTACTA 


ACTGAATTTC 


TGGTTGGTAC 


GTTCCTTCCG 


2880 


TTTCAAAAAT 


GACGATCTCA 


TTTTGCCCTT 


CTTTTAATAA 


TCCTTTGGGA 


ATATATAAGG 


2940 


AAAGCGTCGG 


ACCAACATTC 


CAGAAACGAC 


CAAGATTGGT 


TTGGTTAACA 


AAAACAATTC 


3000 


CTTTGCCAAA 


TTTAGTUU^CA 


TCAATAAATG 


TATCTTTCAC 


TTCTGCTAAC 


TCCACATGAT 


3060 


ACTGGTAAAA 


GCTTGGTTGA 


TCAGGCTGCC 


ATTCTCTTGA 


ATAGTCTACC 


TGTTCACAGC 


3120 


TTGTCATAGG 


TAAACAATAT 


TGTTGCCATT 


GCGTCATAAA 


ATGTAAATCT 


GCCATGACAC 


3180 


CCGTTCGAAT 


CCCTTTTTTC 


TGAGTATCTG 


CAAATAGTTT 


ATGCCCGTAA 


TTGACACGAC 


3240 


CCATATTTTC 


CATTAAAATA 


TCGATTTGAT 


TATTCTCTTG 


CGGCAGTGTA 


ACGTAAATAT 


3300 


CTTCGCCAAT 


TTCCGTTTGA 


TATTGCGTTG 


CTTGATGTAC 


CTGATTGACA 


AATAATTGCA 


3360 


GACGATCACG 


ACCGTCAATC 


ACTCTTAGCT 


TTTCTTCTGC 


AGCATCTTTT 


TCAATGCTTG 


3420 


tTCGGTAAAG 


AAGATAGCCg 


GTATTTTGTC 


CTAATTGTTC 


CATGGTTTGC 


GGATAAACGC 


3480 


TGACCACTGG 


CTGACTAATC 


GTTTCTAAAG 


TCGCAAACAA 


ACTCACTTTG 


TTGGTTAAAG 


3540 


GAATTGCTGT 


TTGCGCAAAA 


GATTCTTTTA 


CTAATGGTTC 


AGCTTGTGGT 


AACGCAGGGT 


3600 


ACTCTTCGTG 


AAGCATTTTT 


TGTAACGCAA 


AATACTTTTC 


GGTTGGATTC 


CCTTGTTCAT 


3660 


CAAGTGGCGC 


ATCATAATCA 


TAGGAAGTGA 


TTTGTGGTAA 


ATCAATGGTT 


CCTCGTGCGG 


3720 


AGCAGCCATT 


CATAAACCCA 


AAATTGGTGC 


CGCCATGAAA 


CATATATAAG 


TTTATACTGC 


3780 


CTAATGCTAA 


TGCTTCTCTA 


ACTGATTCAG 


cx:aattcttg 


CGGATCTCGT 


TTAATAATTG 


3840 


GCTCTTTCCA 


ACGATTAAAC 


CAGCCATCCC 


AAAATTCCAT 


ACACATTAGC 


GGCCATTTTT 


3900 


TACCATGTTC 


TTCAAAAAAG 


GCTTGCATCA 


TACCGAAATT 


TTCTTTGGCT 


TTTGAGCCAA 


3960 


/VATTTCCGGT 


TACTAAAATA 


TCATCTTCAA 


tcatgctgcc 


TGCCCGCAAT 


GTTGCGCGCC 


4020 


ATGGCCCGTC 


AGAAGTAAAA 


AATGGTGCTG 


TCACGCCGCG 


TGCAATCATC 


AAATCGCGAA 


4080 


TCGCCCGTAA 


ATACGCTTTT 


TCTTCGCCAA 


ATGAACCATA 


TTCGTTTTCG 


ATTTGAATCA 


4140 


TTAAAATATT 


GCCACCGTTA 


GCAAGTTGAT 


GCGGAACAAT 


TTTTTCCATT 


AAAACATCAT 


4200 


AATATTCCGC 


AACATGCTTC 


A6ATAGGTTG 


GATTATTCGA 


ACGCATCCGT 


CCTGGTTCAT 


4260 


TTAATAGCCA 


GGCTGGAAAA 


CCGCCAAACT 


CCCATTCTGC 


ACAAATATAA 


GGAGACGGCC 


4320 


GCACAATCGC 


ATATAGTCCA 


AGTTCTTGTG 


CTAATTTTAA 


AAAGCGCTCC 


AAATCTAATA 


4380 


TTCCCTCAAA 


ATGAAATGTT 


CCTTTTTGCG 


GTTCATGTAA 


ATTCCATGGT 


ACATATGTCT 


4440 


CAACTGTATT 


GAAGCCTAAA 


GCTTTCAAGT 


TATAAAGGGA 


ATGATGCCAA 


TCTGAGGGAT 


4500 


CTACACGAAA 


ATAATGAATG 


GCGCCTGATA 


AAATTTTAAA 


TGATTGACCA 


TTTAAAAGAA 


4560 


ACTCCTCTTT 


AATTTCAAAT 


CTGTCCACAT 


TGTCACCCTT 


TCTAGTAAAC 


GCTTTCAATT 


4620 


GCTGGATAAT 


TGCTAAATAA 


AGAAAGCTCT 


TTATTTAGCA 


ATTTCGACTA 


CTTGGATGTT 


4680 
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CATAAGATGT 


TTCTTCACGT 


TCTCTACGTC 


TACATGCCCA 


GTCATTCTTT 


CTTGGGCATT 


4740 


AGCCATTCCA 


GCCGCCATGC 


CCCATTTTAG 


CAGTTCGGCA 


GCTGGGGCAT 


CCTTGGCTAG 


4800 


TCCATATGCA 


AGACCAGCAA 


TGGTTGCATC 


TCCTG/VACCA 


ACAGGATTTT 


TwGcTtGAaT 


4860 


CGTTGGAATT 


TTTACCACGA 


TAGAATTGAT 


CGTGATGTTT 


GGCAATTGCA 


CCATCTTTTC 


4920 


CTAAAGATAT 


GACAATCCAT 


TCAATTCCCG 


CAAACATTGG 


TTTTGTTAAA 


GCTGTTTGTA 


4980 


CAGCTGCTAA 


TGGATTTTCT 


GAAAAATCTT 


GCCCTAGCAA 


TCCTTCTAGT 


TCCTCTAAAT 


5040 


TAGGTTTAAT 


TAAATACGGT 


TTCCATGGTC 


CTTGAAGAAC 


TTGCCTTAAA 


CTATCACCAG 


5100 


AAGTATCTAG 


TAGCACTTTA 


ACCTCTTGTG 


CGTGAnCTTT 


TTGAACTAAT 


TCTTGATAAA 


5160 


AATCTGATGG 


TAATCCTTTC 


GCTAAACTGC 


CAGAAATTGT 


GACAATTTCT 


GCTTGCTTAA 


5220 


TTAATTGATC 


GAAATTTTCA 


AGGAAGTTAC 


TGATTTCTTC 


TGGGGAAACA 


GTTGGTCCAG 


5280 


CTTCTAAAAT 


TTCAGTTTGA 


TTCCCCTCGT 


GCAAAATCGC 


AATCGAATCA 


CGTGTTTCTT 


5340 


CTTTGATTGA 


CGTAAATGCT 


TGGGGGATGT 


TTGCTTTCTT 


TAACTCATTT 


GCAATAAACG 


5400 


CCCCATGAAA 


ACCACCAAGA 


ACGCCGGTAG 


CGATAACATC 


TCCACCTAAA 


TCATGAATAA 


5460 


CGCGAGTGAC 


ATTTAGTCCT 


TTGCCACCAG 


GGGTTTTGGT 


CACCTGACTG 


GTCCGATTAA 


5520 


CGGTATCAAG 


CTTTAGATGA 


TCTAATAGGT 


AGGAGATGTC 


AATTGACGGA 


TTCATTGTTA 


5580 


CTGTTACTAT 


CACAGCTATG 


CCTCCCAATT 


ACCCTTCGTT 


TACTTGAACT 


TTCGCATGCC 


5640 


AAGAGCTGGC 


GGTTGCTGCG 


ATGACTTCAT 


TTAAACTTTC 


AATGTTTTCT 


CGGCCTTCTG 


5700 


TTTTTAACCA 


ATCACAAGCC 


GCAGTTTCTC 


CTGCTTCAAC 


GAAAGGTTTC 


ACGCCATTTT 


5760 


TCCACGTAGC 


ACGGCCACAT 


AAAACACCAT 


TAAATGTTGA 


ACCAGCTTCT 


TTAGCGAAAC 


5820 


GTAGTGTTTC 


TTGGAATAGC 


TCTGTTGAGA 


CACCGGCACT 


TAAAAAGATA 


AATGGTAAAT 


5880 


CAGTGGCTTG 


ACTTTGTTCT 


AAGAAATAAT 


TCGCCGCTTC 


CTCTTTTGTA 


TACGCTGTTT 


5940 


CAGCCGGCGC 


AAAACCTTCT 


ACAAAATTCA 


TATTAACTGG 


AACTTCCACT 


TTCAATACAT 


6000 


CTACTTTATA 


TTGGGGTTTA 


GAAAATTCTT 


TCATCATTTC 


ATTCACTTTA 


TGCGGTTTAA 


6060 


CTTTTGCATA 


TTCTAAAGAA 


GTCGCATCTG 


CTATTTGTGC 


ATCATAAGAA 


ACTAATTCTA 


6120 


AGTAAAATGG 


TAGGTCTTCT 


TCTGCACATT 


CACrrCCTAA 


GCGTTCAATG 


AATACATGTT 


6180 


TTTGATGATT 


GATTTCTGGA 


TCCTCATCCA 


CATCATAGTA 


TAGTAAAAAT 


TTAATCGCAT 


6240 


CTGCGCCTTC 


TTCTTTCAAG 


CGTAAAACAG 


ACCAATCTGC 


TAGTAAATCA 


GGCAATCGTC 


6300 


CCGGAGTTGT 


TGCATCATAG 


CCCGTTTTTT 


CATAAGCGAG 


TAACAATCCA 


GCCTCGGTAT 


6360 


CGCGTGCTTT 


TGCAGCAGGC 


AATCCATATT 


CTGGaTCCAA 


TAAAATGGcA 


GACGCATAAG 


6420 


GTGTCAATTC 


TTTCGAGACT 


AATTCTTTAA 


ACGTTTCAAT 


TTGTGCATCA 


GTTGGCTCTA 


6480 


CATCTAGCGC 


TTTGATGATT 


TTTTTCAAGG 


CACCTCGTTG 


ATCAATGGCT 


AATGCGCTAA 


6540 


TGATTCCTTC 


TTGGGTTGAC 


AAACGATCCA 


TTGCCGCTTT 


TTTTCCAGCT 


GTTAACGTAA 


6600 


GCATAGTTCC 


TCCTCCAAAT 


GTTGTATTAT 


TTATAGTCAT 


AAATCGTTAC 


GCCTTTAACC 


6660 
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ACACGATTCA 


CTGTACCTGT 


CGGTGAAGGG 


GTATCCGGTT 


TATTTCCAAC 


TTTAATAGAT 


. 6720 


GTTAAAAGTG 


CGACTGTTTG 


GGCTACCATG 


ATATCTGCnA 


ATGCnAAATA 


GCCATCAGGT 


6780 


AATAGAGCAG 


TTTCTGCGGa 


ATATTCAATm 


GyTGAACCtG 


AGAAATTACG 


TTTGCCAGGT 


6840 


TGAGCAATCG 


CAAAAACACC 


CGCCGCAATT 


TCGTCACCGT 


GAACTTCTTC 


TAAGATATCC 


6900 


AAGTCATAAT 


CTCTTGTGTA 


TGGTGTATTA 


TTaACAAAAC 


CAAACATTAT 


TGTTTTTTCA 


6960 


TTAATAAATG 


ATTTTGGTCC 


ATGACGGAAG 


CCCATAGAAG 


AGTCAAAAAT 


TGTGGCAACT 


7020 


TTGCCGGCTG 


TTAGTTCTAA 


AATTTTCAAT 


TGCGCTTCTC 


TTGTCAAACC 


AGCTAAACTT 


7080 


CCAGAACCAA 


CATAAACAAT 


GCGTTCAAAA 


TCAAGGTTAA 


CAATTTCATT 


TAATTCTTCT 


7140 


TCACGGGATA 


GCACTTCTTC 


ACCTAATTTA 


ATTAGAGCGG 


TTACATAATC 


TTGTTTTTGC 


7200 


TCAAAGGCTG 


TTTGATCAAA 


GAGTAGTAAT 


GTACCCAAAG 


TCATGCAAGA 


AAAGCTACCA 


7260 


GTCATCGCAA 


AACCACCATC 


ATTGGAACGT 


GTTGGCATTA 


AGAATAAAAA 


GCTGTTTTCT 


7320 


TCTTTTTGAC 


TTATTTGTGC 


TAGTTGTCCT 


TCTTCGGCAC 


AAGTAATCGC 


TAAATGATGA 


7380 


ATTGTTTCGA 


CCACTTGATT 


AGCTACTTCA 


ACTGCCGCTA 


AACTTTCAGG 


ACTATTTCCA 


7440 


CTTCTTGCAA 


ACGAAACTAA 


CAATGTTGGT 


TCGTCTTTAA 


TCAAATAATC 


TTCTGGTTTG 


7500 


GCCACAATAT 


CCGTTGTCCC 


AATACTTTCG 


AATGAAAAGG 


CTTGTGTGTC 


GCCGTGTGCT 


7560 


CGGAGATAAG 


GTACAACCGT 


ATCACCGACA 


TATTGCGAAG 


TCCCTGCTCC 


TGTAAAAATT 


7620 


ACTCTTGTGC 


GTTTTCCTTG 


CGCTTTGGCC 


TGGACTTCTT 


ttaaa;\agtt 


TTCCAAAGCT 


7680 


GTTTGATTTT 


CATGATAAAG 


TGTCACGGTT 


TCTTGCCAAA 


GCTCAGGTTG 


CTGACGAATT 


7740 


TCACGTGTAG 


TGATTTCCGC 


TCCTAGTTGT 


TCTAGTTCTT 


CTTTTTCTGC 


TGTAAACATA 


7800 


CTTAACTCTC 


CTAACTATTC 


CGAATATGGC 


GTATCTGATA 


ATGAAATTGa 


TCTGCTCGCG 


7860 


CCACACTCAA 


CGTAAATTCA 


ATAATTTCAT 


TTTTCATGTT 


ATATGTCTGG 


CGTGCTAAAT 


7920 


GAAGGACTGG 


CGCTCCTTCC 


GGaATCaTCA 


ATAATTTGGc 


ATCTTCTTTT 


GACGCAATGC 


7980 


TAGCGTATAA 


TTCTTCATCA 


GCTAAACGAA 


CGGTTTGCTT 


AAAATCTTCC 


GAAAATAGAT 


8040 


CATATAATGG 


TTTGCTACGT 


AGTAATTGAT 


CCGTGAGTGA 


TAAGAAAAAT 


TTCACAGGCA 


8100 


AGTATGTGTC 


TTCAACCATT 


AATGGCTCGT 


TATCAGCAAT 


CCGCAACCGA 


CTTAATTTAA 


8160 


AAACTGCCTC 


TCCTAACGAA 


AGATTCAAAT 


GCTGACAAAT 


AAATTTATCC 


GCTTCCAATT 


8220 


TTTCGAAGGA 


TAATATACGG 


GTATGTGGTT 


TTCGACCCAG 


CCCTTTCATT 


TGTTCTGT7VA 


8280 


AACTATATGC 


GCCAGCTAAA 


TCAGCGGCCT 


CTTTTTTTAT 


ATCTGAAACA 


AATGTTCCTT 


8340 


TGCCATGACG 


ACGATAGATG 


GAACCTCTAT 


TTTCTAGTTC 


TTGTAGCGCT 


AACCGGACCG 


8400 


TCGTCCGACT 


CACTCCATAT 


TGTGCAGTCA 


ATTCACGTTC 


AGAGGGCAAC 


TTGTCATGAG 


8460 


GAATCATGCA 


TGTTTCGATT 


CGTTCCTGCA 


GCAAATCAAC 


AAGTTGATGA 


TACAAAGCTT 


8520 


TGTTTTTAAA 


GGCATTTTCC 


ATAACT 








8546 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3493 base pairs 

(B) Ty?S: nucleic acid 

(C) STRA>:OEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQOEHCS DESCRIPTION: SEQ ID HO: 147: 
CGACACATTG TGGGACA7TG CTGCACGTTT TGGAACGACA GTTGATAATT TAATGTCACT 60 

AAATGGATTA ACATTGGATT CTGTTTTATC GGTGGGTCAA ACCATCCAAA TCGGTTAATT 120 

TTGCAATTAT TTAGTkGAAG AAAGACCTmA CAAAAGCAGT AGTTTTTGTA AGkTCTTTTA 180 

TCGTGAAAAT AGGTAGGGCT AGCAGCAATT ATTTGATGAG CCTAATCCGT TTrTTTATTT 240 

AATTTTTCAC AAACTAARAT TCGTTTTTCT GGTAGAAATC GTTTTTTTGA GCTATGCTIA 300 

GAGAAAGAAT TATTATATTG TTTGTAAAAT TGGAGGAT6T AGGATTATGA CTAAAGGTTA 360 

TGTAAAATCA GTAACAGAGT TAATTGGTCA AAGCCCAGTT GTGAAATTAA AACGGATGGT 420 

ACCAGAAGGA GCTGCAGATG TTTTTGTTAA ACTAGAGTTC TTTAATCCTG GTGGCAGTGT 480 

GAAAGATCGA ATTGCTTTAA GCATGATCCA ACAAGCCGAA GCAGATGGTC GCCTAAAACC 540 

TGGACAAACG ATTAT7GRSC CGACTAGTGG TAATACAGGC ATTGG6TTAG CAATGGTAGG 600 

TGCTGCGTTG GGATATCCAG TTAAGATTGT GATGCCAGAT ACTATGAGTA TTGAACGCCG 660 

CAAATTAATG CAAGCATATG GGGCTGATCT ATTATTAACC CCTGGTGCCG AAGGAATGAA 720 

AGGGGCAATT GCAAAAGCTA CAGCATTAGC AGAAGAACAC GGGTACTTCA TGCCTTTGCA 780 
ATTTAATAAT CCA6CTAATC CAATGGTACA TGAACAAAAA ACAGGAAAAG AAATTGTTGA 840 
TGTCTTTGGT AAACGTGGCT TAGATGCGTT TGTTTCTGGT GTTGGCACTG GAGGAACCGT 900 
TACAGGAGTT GGCCATGAAT TGAAACGGAT TTTTCCAGAT ATTGAAATTG TTGCAGTAGA 960 

ACCAACAGAG TCGCCTGTTT TAGAAGGTGG CGAACCAGGT CCACATAAAA TCCAAGGAAT 1020 

AGGCGCTGGT TTTGTCCCAG AAGTTTTAGA CACCACCGTT TATCAAAAAG TTGCCGCTGT 1080 

TTCTAGTGAA GACGCATTAG AAACTGCTCG TTTAATGGGG CCAAAAGAAG GTATCCTTGT 1140 

GGGGATTTCA GCAGGGGCAG CAATTAAAGC TGCCATTGAT TTGGCA6TCG AATTAGGCGC 1200 

AGGCAAACGT GTCTTAGCGC TGGTTCCGGA TAACGGTGAA CGTTATCTTT CGACAGCTCT 1260 

TTACGAATTT CCAGAATAAA AAACACGCCA AAAAACGAGC ATTTCTCTTA GTTTAGTAAA 1320 

ATGCTCGTTT TTTCCAC6AG AATCCTACAA TAATTCTTGT CAATCGTAAT AAGAAATAGT 1380 

ATTATTTAAB AAAGATAGAC AGGGGAGGGA TACAAAGATG GACAACGTAT TGGTTAAAAA 1440 

TGCACTTGCT GAATTAAAAG AAGCCAATAT CCGAATTACT CCGCAACGTT ATGCTATCTT 1500 

GGAATATTTA ATCGAAAATC ATACACACCC AACAGCTGAT GAAATTTATC GCGCACTAGA 1560 

AGATCATTTT CCAAATATGA GTGTAGCAAC GGTTTACAAC AATCTACGTT TATTTACTGa 1620 

AATCGGTTTC GTTCAAGaAA TGRGTTATGG CGATGcATCT AGTCGTrTTG ATTTTAGTTC 1680 
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GAAGAAACAT 


TATCACGTGA 


TTTGTCAAAA 


ATGTGGTAAA 


ATCGTTGATT 


TTCATTATCC 


1740 


AGGGTTAGAG 


GACGTTGAAA 


TGGCCGCTAG 


TAAATTAACA 


GGCTTCGAAA 


TTAATGAACA 


1800 


TCGTTTAGAA 


TTATATGGAT 


TATGTCCAGA 


TTGCCAACAA 


GCACAACAGG 


AGAATGTGTA 


1860 


AAAAATTACA 


TACTGTTAAA 


AATTTATCAA 


AAGATAAGTC 


ACTTGTCTAG 


GCAAATTTCA 


1920 


CAAGCGTAAG 


AGTTTTCACT 


CTTATGCTTG 


TTTTTTTGTG 


TGGTTATTGA 


CTTTTAAATT 


1980 


CTTAAAAACA 


AAAAATTAAC 


TAGCTATTAT 


TAAGAATTAG 


CAACAGTTTG 


TGAATTTTTT 


2040 


TATATGCGAT 


TCACAAGCAT 


AGAACTCGAA 


TAAGCTTGAT 


TCCTCTGATA 


TATTTTAAAC 


2100 


AGACAGAGGG 


AGACGTCACC 


TCTGCAAAAC 


AAAGATTGTG 


AAAGAAGGGA 


TTTACAACAT 


2160 


GAAAGTCGTA 


GTCGTAGGAT 


GTACCCATGC 


TGGAACTTCA 


GCAGTTAAAA 


GTATTTTA6C 


2220 


AAATCACCCG 


GAGGCGGAAG 


TCACTGTATA 


TG7\AAGAAAT 


GACAATATTT 


CCTTCCTATC 


2280 


TTGCGGAATT 


GCCTTGTACG 


TTGGAGGCGT 


AGTTAAAAAT 


GCTGCTGATT 


TATTTTATTC 


2340 


AAATCCAGAA 


GAACTGGCTT 


CTTTAGGAGC 


AACAGTGAAG 


ATGGAACACA 


ATGTTGAAGA 


2400 


AATCAACGTG 


GACGACAAAA 


CTGTTACAGC 


GAAAAACTTA 


CAAACGGGCG 


CAACTGAGAC 


2460 


CGTTTCTTAT 


GATAAATTAG 


TAATGACAAC 


TGGTTCTTGG 


CCAATTATTC 


CACCAATTCC 


2520 


AGGAATTGAT 


GCTGAAAATA 


TTTTATTATG 


TAAAAACTAT 


TCACAAGCCA 


ACGTGaTCAT 


2580 


CGAAAAAGCG 


AArGATGCCA 


AACGTGTCGT 


TGTCGTTGGT 


GGCGGTTATA 


TTGGTATTGA 


2640 


ATTAGTAGAA 


GCTTTTGTAG 


AATCAGGTAA 


ACAAGTTACT 


T.TAGTTGACG 


GGTTAGACCG 


2700 


CATTTTAAAT 


AAATAT7TAG 


ATAAGCCATT 


CACAGATGTT 


CTAGAAAAAG 


AATTAGTTGA 


2760 


TCGTGGCGTT 


AATTTAGCTT 


TAGGTGAAAA 


TGTCCAACAA 


TTCGTAGCAG 


ATGAGCAAGG 


2820 


AaAAGTAGCA 


AAAGTCATTA 


CACCAAGTCA 


AGAATTCGAA 


GCAgATATGG 


TGATTATGTG 


2880 


TGTTGGTTTC 


CGTCCAAATA 


CAGAATTATT 


AAAAGATAAA 


GTTGACATGT 


TACCAAATGG 


2940 


TGCAATCGAA 


GTCAATGAGT 


ATATGCAAAC 


AAGCAATCCT 


GATATTTTTG 


CTGCGGGTGA 


3000 


TAGTGCAGTT 


GTTCACTACA 


ACCCAAGTCA 


AACAAAAAAT 


TATATCCCAT 


TAGCAACGAA 


3060 


TGCTGTTCGT 


CAAGGTATGC 


TAGTAGGTCG 


TAATTTGACT 


GAACAAAAAT 


TGGCTTATCG 


3120 


TGGCACACAG 


GGAACTTCTG 


GTTTATATTT 


ATTCGGTTGG 


AAAATTGGTT 


CAACAGGTGT 


3180 


GACAAAAGAA 


AGTGCCAAAT 


TAAACGGACT 


AGATGTCGAA 


GCGACTGTTT 


TCGAAGATAA 


3240 


CTATCCITCCA 




PAAPAAPTCA 


ATVAAGTTPTG 


ATGCAATTAG 


TCTATGAAAA 


3300 


AGGAACACAA 


CGCATTGTTG 


GTGGTCAATT 


GATGTCTAAA 


TATGATATCA 


CACAATCTGC 


3360 


CAATACTTtA 


TCATTAGCGG 


TCCAAAATAA 


AATGACgTCG 


AAgATTaGCG 


ATTTCTGATT 


3420 


TCTTCTTCCA 


ACCACATTTT 


GATCGCCCTg 


GGAcTACTTA 


AACTTATagc 


CCAAGCCGCT 


3480 


TTAGAAAATA 


TGT 










3493 



(2) INFORKATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1299 base pairs 
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(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
[D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

CTTCTTTGGA GATTAGGACT TTCTCAATAT CTTTTTCTAT CATAGTGCGC TCCTTTCAGC 60 

ATTACTCCTT TGTCTTAAAA AGGAGTCTGT AGTGTATTTT ATCAGTTTCT AAAGGAATAC 120 

TCAAATAGGA ATTTGCAAAT TTTGGTACCC AAATGATTTC CTCTTGCTCA TCCGTAATGA 180 

CCCAAGCACT CTCTCTCGCT TCATTTGGCA CTTTTTGATC AATAAAAACT CTGCTTAATT 240 

TTTTCGTAAA CCCTGGTTTT AATGTAATCC GATCACCACT TTGGCGATGA CGAACGGTCA 300 

AAGGTGTGGC TGTGGTCAAC GGGATCGCTA ACAACTGACC CGTCCACTGA TTAATCTCTT 360 

CAGGAAGAGG AAACGGTTTG TCTGTCGCAA TTAATCCTAG CCACTCGGTT TCCGATAGAT 420 

AAATCCCCGC ACCAGGTACT AATAAATGTG TAACTTCTTC TCGCAGTGCA GCCTGTTTTT 480 

GTTCTAAGCA TAGCCAATCA TAACTTTTAG CTAGTTGCCA TTGCTGTTCT AACATAACCG 540 

AGCCTTGCGG TTGCCGTTCA TCATTCAAAA CTTTTAAAAT TTGTGTCATC TGGCGTTGGT 600 

TTAACGAAAC CCCTAATGGC ACTAATGTTT TTTGGaAAAA CGCGGTTAAA AAGAAGTGCT 660 

GGTACGCAGG CGTTTCTTTT AATAATCTCT TCAAAGGAAT GCGCCAGCCT TGTTTGGTTG 720 

GTTCTACGGC TTCAGATAAT TGAGGCTCAA TTTGTTCTTG AATAAATTCT TGGGCAAAAC 780 

GTTGTTCATT ACTAAAAGAA GCAATCTGAT CTAAGAATTG CGGATTTTCC TGCTTTAATA 840 

GCGGCAAAAC CTGATTACGG AGCCGATTTC TCAAATATTC ATTGGTTTGG TTCGTTTCGT 900 

CCTCAAAATA AACGAATGCT TGGCGCTGTG CAAATTGATA CAATTCTTCT TTTGGATAAA 960 

TGAGTAACGG ACGAATCAAT TCCCCTGTCG CAAAAGGACG TCTCTCTTTA ATGCCTGCGG 1020 

AATGGGAAAA ATTAGTTCCT CGAATCAGCT TCATTAAAAT CGTTTCCGCT TGATCGTCTA 1080 

GATGATGTGC CGTCATTAAG ACCGCCGCAC CTTCAATTTC CATGACTTCT TtAAAAAATt 114 0 

CATAACGAAA TGTGCGCGCT TTGCTTCTAA ATTtCGTTGC TTTcAgGGTC TcCCAACGTG 1200 

TTcATAATAC GTaGcgCTCG CTGCTGACAA TAGGTAGCGA GATATTGGCG CTTCCTGTGC 1260 

CGACGCTTCT CGCAATTGAA GGATTGACAT GTGGCAAnA 1299 
(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4503 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

TTACTAGTCT GTTGGAATAC ATTATGATAG TGTAAGATAA ACGGACTATT GGAAAGTCAC 60 

AGCAGAGGAG TTTGAACATG TACACATTTA AAATCGGTAT TCCGGCAGAA GTACACGATA 120 
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CATTTGTTAA 


AAATCACCCA 


TTATGTAATT 


TACTACAGTC 


ATCTTCATGG 


GCAAAAGTCA 


180 


AGGATAACTG 


GGGTTCTGAA 


ATT6TTGGCG 


"TTTATGAAAA 


GGATACATTA 


GTTGCATCAA 


240 


GTTTAGTTTT 


AATCAAACCA 


TTACCAGCCG 


6TTTTACGAT 


GTTATATACA 


CCAAGAGGTC 


300 


CAGTAATGGA 


CTATACAAAT 


GAACGTTTGG 


TCAGCTACTT 


TATGGCTGAA 


TTGAAAAAAT 


360 


TTGGCAAGAA 


AAAACGAGCT 


TT AT TT AT AA 


AAATGGATCC 


AGCGGTTCAT 


TATCAAGATT 


S ^ V 


TTCATTTGGG 


AGAAGAACAC 


CAACCGCATG 


CTGAAGCGAC 


ATCTATTATT 


GAGACCTTGA 


M O VI 


AAGAAGCAGG 


TGCCAAATAT 


CAAGGACTGA 


CAATGGATAT 


GGGGGCCACC 


ATCCAGCCTC 




GCTTCCAAGC 


GAATATTTAT 


CGTGAAGATT 


TTAGTGAGGA 


ACAATTGTPA 


AAAAGTACCA 




AGAAAATGAT 


TAAAACAGCT 


GAAAAAAAAG 


GGGTCGTTGT 


GCAGCAAGGG 


rATGTTGAPT 
\*t\ X w X X X 




TTGTTGATGA 


ATTTGAAAAA 


GTCATTCAAT 


CAACCATGGA 


ACGCCAACAT 


ATTTPTTTKP 

X X X X X X O w 




GG/^TAGTGA 


TTACTTTAAG 


AAGCTGTTGA 


ACATTTACCC 


AGAAGACTCC 


TTTATTATGP 

X X Xn>X XAXU^ 


780 


TGGCGCAAGT 


CAATTTAAAG 


GAACGCTTAG 


ATAGCACGCG 


AC AAC GCT AT 


GACAAAAATP 


840 


AAAAAGACTT 


GAGTAATTTG 


AAAGAAAA.TC 


AAGTGAAAAA 


GCGrPATAAT 


TTARAA^^AAT 




TAGATGCTTC 


CTTGACTCGG 


GAACTAAAAG 


7y\.TTAGAAGA 

**** X X **XJ*XXi^J£x 


AAATATrGrA 


ciAAnpTt^nTf: 




AAATCGTCAC 


GGTTGCTGGC 


\3\^ X X X X \J 


X V3nv«n XXX OV3 


Tf PAAPAAnT 

X V'V^/vlV^ivVo X 


nAAATPPTTT 
unrvilv«^.^x 1 1 




ATGCTGGCTT 


AGATGATCGT 


TATAAAPGTT 




rl X M X k9 1 \jn\* X. 






CCATTCAAGA 


ATGTTTTAAC 

X \9 X X X X w 


X W X Wi*V9 


AAAnTGTAAT 


t\ X \9\S\J\^\3\3\^ X 


1 A0/vl\30 1 AO 


114 0 


TTTAAATGAT 


GGC vT AATT A 


AGTTTAAACr 


GAAfTTCAAr 


PPmAPf^ATTA 


ATRAATTTAT 




TGGGGAATTC 


GATTTACCGG 


TTAATAAATT 


ATTAT TTAAA 

.r»A A A A Af^aar^ 


GCCAGTGAAT 


ATGCCTATAA 
t\x xnxnn 


1260 


ATTACGA7WVA 


CAAAAAAAAT 


AAATAAAAAA 


CACAGGGACT 


AAAAATCAAA 


PTTGATTTTT 

X X X X X X X 


1320 


AGTCCCTGTG 


TTTTTTTCTA 


TAGTTTTTGC 

A m^%^ A A A A A 


GTTGCATTCC 

A A^TX^akA a ^^^^ 


GTTTCATTAA 

^9 XXX X X 


GCGAACAAAA 


1380 


CGCTAGCAAT 


TGCTGAAGCA 


GCTACACCAG 


CAACAA6AGC 


TTCTGGCACA 


CCAGAACTGA 


1440 


AAATCGTCGC 


AATAATAGCT 


AGGTAAACAG 


CAGTTCCAGC 


TTTACCAATA 


ATTTCAGCAT 

• • AAA X^#»NJ^^ J* A 


1500 


AAGCTGAATT 


AAAAAGAAAA 


TAAATCAT6T 


TCATGACTAA 


AATGGTATTC 

• ••• A %#X# A*» A A 


GTTGCTGAGC 


1560 


CTAAAACACC 


AGCCACAAAT 


AAGGAGACAG 


GTTGCGCTTT 


TTCCTTCGTT 


AATTTATTTA 


1620 


GCCATTTGTA 


AGCGAAATAA 


GGAACGACCC 


CAATTAAAAT 


CCGCGGAATG 


AGCACAATTA 


1680 


AAAGTGCCTT 


CCAACTACCA 


TGATCGGTAC 


CGATCACAGG 


AATAAATGGA 


GAAAAAACAA 


1740 


ACGACAAAGG 


TGTTGGTGCT 


GTCGTATTTT 


TAATTAAGCT 


AATGATCCCA 


AACGTGCCGC 


1800 


CCAACATACT 


ACCGATTTTT 


GGCCCTAAAA 


CAATGGAACC 


AATAATTACA 


GGAATATGCA 


1860 


TCGTGGTCGA 


ATTTAAAGGA 


CCTAATGTGA 


TAAATCCTAG 


TGGGGTTAAA 


GCCAGTAAAA 


1920 


GTAAGATAGC 


TAAGAACATT 


GCTGTTAACG 


TAAATTTTTT 


CGTGTTATTC 


ATAAATTACT 


1980 


CCTTGTTATC 


AGATGCTAAA 


ATTTTCATCA 


CTGTTTGAAG 


AATATCTTCA 


ACAGTGGCAA 


2040 


GTGCGCCAC6 


ACCATAATCG 


CCACAAGCTA 


ATAGAGCTTC 


ACGTGGGATT 


ATTTCTTGAT 


2100 
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AGCCGACTTC TTTTAAAATG CCAATATTTC GTTGGACAAT 
TATTCATCGC GGGCGCAATA ATTTTTGGTG TTTCAGGATA 
ATAATAAGTC GTCGGCAATC CCATGGGCTA ATTTCCCAAT 
CTACTAAAAA TAAATCCGCT TTTTTAGCTA ACTCAATGTG 
TTTCCATCAT GACATCCGTA TGCACTGGAT TCTTAGAAAG 
TGATGaACGC TGTGCTACTT TTAGTCATTA AAATTTCCAC 
GTTGACTTGT AATATCTGCT GCTTTATATG CAGAAATGCT 
TAGTTTTCAT TGTTGTTCTC TCCATTTCTT TTCAACATTT 
CCCTTGTTTT GAATAAGCTT CTTGGACAAT GCCCGTTTCG 
TTCTGTTTCA TGAACGTTCA TCAAGTCATT GGCT7WVGACA 
CAAAGCGGCT TGTGCAACTT GGACAAGTTC TTCTTGTGAA 
GCCCACTAAA ACAGTTTGGG GTTGCTTGTC ACGAATCATT 
CTTTAGAAAA ATTAATAATC GGTCCGTATC AGATGAAATT 
ATCAGCAATC TGATTCAATG TGTTTTGAAC GATAGTTACC 
CAATGTTTGT TCGCTTAATT GTTGAGCAAA aGAATCAATA 
GGTTTCCGTC GTAAAATCAC TGACTGCCAT TGCGTGAACA 
AGCAAAGAGT TGTTCTAACG TAGTGGCTAA TTCAACCGTT 
TAAGTCCAAT TGTTGGGTTG GACGTAATGC ATGTTTGGTT 
TCCAGCAGCT AAAAaACTTT CAGCAATTGC TTTTCCTAGT 
AATCGAACGA ACATTATCAA TCGGTTCTGA CGTTCCGCCA 
AATCAAAACG CCTCCTTAAG AAATTCGGAC AATGGTCCGT 
TAATGCTTCA TUUU^CAGGGG GAAGTTCTTC TAAAGTAATC 
AGTGATATCT AAATCAGTGG CAAGACGTTG CCAGACTATT 
ATTaACCGAA TCAATGCCCA ATwAATGAAC GCCCCGTAAA 
CTCTAATGTA ATTCCAGCAG CGCGGCCACA AATGGCTGAA 
TGGTAAAACA GCAGAAGTAA TTTCCCCACC AGTGGTGTCA 
TTTCATCAAT GCTTTAACAG GAGTTGCTAA GAAATCTTCT 
ACCTAACTCT TTTAAAAGCT CAAAAGAATC TGGCTTTCTA 
GTAGCCTAAT TGATGcAACA TCGCAwTAGA CaATGTCGCA 
TACAATAATG GGGGCTTCTT TGTCGTCGCG TAAGCCaTCG 
TAATCCGGCC -GTAAAGCCAG CAGTTCCGAA GATCATTGCT 
AGGAAGTGGA ACTACCCAAT CGCCAGGAAC ACGCACATAT 
ACTAACCCCT AATTGATAGC CTGTTACGAT CACTTTATCG 
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GGGGTTTTGG 


TACATATACG 


2160 


AAGTGCCAAA 




2220 


TGTATTGGCC 


GAAGCAGGTG 


2280 


ATTAATTTTA 


CTGGGATCAA 


2340 


TGATTGTAAG 


GTCAGCGGAG 


2400 


ATTGTAGCCA 


AGCTTAGCTA 


2460 


GCCAGAGACG 


CCTAATAAAA 


2520 


TTAACGATCA 


TTGAACCAAT 


2580 


TTAATTAATA 


ATCCTTTGTG 


2640 


AAGTCACACC 


GATTTTTAAC 


2700 


ACGTTGACAA 


GTAATTTAAA 


2760 


TGGATGACTT 


TAGGATTTTT 


2820 


TTTGTTGCTG 


TTTGAGGAAT 


2880 


AAGGCTTCTT 


TGGTCTTTGG 


2940 


AATTGTTCTT 


CCGTTTGCGC 


3000 


ATCGCATCAA 


ATTGCTTTTC 


3060 


GTTTCTATTT 


CTTTAATTGA 


3120 


GTTACGTAGG 


TTACGGTGTG 


3180 


CCGCCAGAAG 


AATGATTGGT 


3240 


GCAGTAACTA 


AAACATCCAT 


3300 


CCAATATGTT 


GGCCTGCTTG 


3360 


TCTTGATACA 


CAGCTTTTTC 


3420 


TTTCGTTTGT 


CCATACCAAC 


3480 


ATAAAGGGTA 


AAACGGTTGT 


3540 


CGGCCATCAT 


AACGAAGTTG 


3600" 


ATTGCAAAAG 


CAAAACGCTG 


3660 


ACTAATAAGC 


ATTCACTAAC 


3720 


GTTAATGCAG 


TGATTGAAGT 


3780 








TCTTCTAAAG 


CTGTGACAGA 


3900 


TCTTTGAGAC 


TCATTTTTTC 


3960 


TCAGAATAAC 


CACCAGTATG 


4020 


CCTTCTTTGA 


AaCGAGGATC 


4080 
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TTCCGATGTG ACAACAGTGC CACTGACATC AATTCCTGGA ATCATTGGAT AATTGCGAAT 4140 

CACACCGCCG TCTTTTTTTG AAGCTAAAGA GTCTTTAAAG TTAACTGAGG AGTAGGCTGC 4200 

TTTGATTACT ACAGTGCCTT CTGATAAATC GTCTAAAGTA ATGGTTTCTA GTTGACCAAT 4260 

AACAGATGTA TCTTGTTCTC TTAACGTAAA TGCTTGAAAT GAATTCATAT AAAAACTCTC 4320 

CTTCCAATTA TGTAAAAAAG CAACTCGTCT TATTTTATCA TATTCTGATT TTTTCGTATC 4380 

ATTTTTCTTC TTATAAACAG TAAGTTATAC AGAAAAGAAA GATTGTGTGA CAATTAGACT 4440 

AGAAAAATAA AGAAAAATGG TTGAAATTTT TCAAAAAGTT CGTATACTAA AAAAGGTAAA 4500 

ACT 4503 
(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:" 11739 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 



GCGTTAGGTC 


AAGTAAGTCT 


TGGmTCGTTT 


TTGATGAcGT 


CTTTTTAtGG 


TGTTTTTmaG 


60 


TACTmGTTTA 


TCGTATTTTC 


TATTACGCAA 


AAATCAATCC 


AATTTATTTT 


TATTATTGAT 


120 


GATTGGCATG 


ATTTTAGGTA 


CTTTTTTCAG 


TAGTATTAGT 


ACGTTTTTAC 


AAGTGATTAT 


180 


GGATCCTAAC 


GAATACGACT 


TACTCCAAGG 


AAAGTTGTTT 


GCCAGTTTTG 


GTAACGTCAA 


240 


TAGTCAGTAT 


CTCATTTTCG 


TTGGTGTGGC 


GGTACTCATA 


ATTTGTGGGA 


TACTTTGGCG 


300 


GG7\AAGTTAC 


CGTTTAGACG 


TTTTACATTT 


AGGGAACCAT 


CaAGCACAGA 


ATTTGGGGAT 


360 


TGCTGTGAGT 


CAGTTTCAAT 


TAGTGTTGTT 


GTGTTTGATC 


AGTAGTCTGG 


TCGGCTTATC 


420 


AACAGCATTA 


GTGGGACCGA 


TTACATTTTT 


AGGTTTTATC 


GTGGCCAATA 


TGAGTTATCA 


480 


ATTCATGAAA 


ACCTATCGCC 


ACCGAGAACT 


GTTTGTGGCA 


GGGAGTTTAA 


TAGCAGTCTT 


540 


TTTACTGGTG 


TTTGGTCAAT 


TGATGGTTGA 


ACAGGTCTTT 


CATTTGAATA 


CCCCGCTCAG 


600 


TGTGGTGATT 


CAATTTGTTG 


GGGGCGTTTA 


TTTTATATGG 


AAAATTATTG 


CAGAAAGGAA 


660 


GCAACGCACA 


TGATCCAGAT 


GAGTGATGTA 


TCAAAAAAAT 


ATGGCGATAA ATTTGTCGTT 


720 


TCAGACATTG 


ATTTACCTAT 


TTCTGAAGGC 


AAATTAACGG 


CCTTTATTGG 


ACCAAATGGC 


780 


GCTGGGAAAA 


GCACGCTGCT 


AGCCATGATG 


AGTCGCTTAA 


TTCCCAAAGA 


TACCGGGGAA 


840 


ATTTATCTGG 


ACAAGCAAGA 


AGTAAAAACC 


TGGAAGCAAT 


CGGCTTTTTC 


ACAAAAAATT 


900 


GCCCTTTTAA 


AACAAGCCAA 


CGGTGTACAA 


CTTAAATTAA 


CTGTGCGAGA ACTGGTCAAT 


960 


TTTGGTCGTT 


TTCCTTATTC 


CAAAGGTCGC 


TTAAAAAGCG 


CTGACCACGA 


AAAAGTAACA 


1020 


GAAGCACTGG 


AACAATTGGG 


CTTAACTGAA 


ATGGCGGAAG 


AATACATTGA 


TACCCTTTCT 


1080 


GGGGGCCAAT 


TACAACGGGC 


GTATATTGCC 


ATGATTTTAG 


CCCAAGATAC 


CGATTATATT 


1140 


TTATTGGATG 


AGCCCTTAAA 


TAACCTCGAC 


ATGAATTATG 


CGGTTCAAAT 


GATGCAGATT 


1200 
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CTTAAACGCT 


TAGTGGATGA 


ACTAGGAAAA 


ACGATTCTGA 


TTGTTCTACA 


TGATATTAAT 


. .1260 


TTTGCCGCAA 


GCTATGCAGA 


TGAAATTGTC 


GCCATGAAAG 


GCGGAAAGCT 


CTATGCGCAT 


1320 


GGCGCAACAG 


AAGAAGTTAT 


TCAAACAAGT 


ATTTTT^TG 


ATCTTTATGA 


AATGAATATC 


1380 


CGGATTTGTG 


AAATTGAAGG 


AAAACGCTTT 


TGTCTTTATC 


ACTAAATCGG 


GCAATTTAAA 


1440 


AATAACGTTT 


TTGGAGGA/^A 


AGAATGAAAA 


AGAAATTTTT 


AGCAATGATG 


GCAGTTTCAA 


1500 


TGATGGGACT 


GTTAATGTTA 


AGTGCTTGTC 


AAACAAATAA 


AAAAACAGCA 


GATTCTGCAA 


1560 


CAACAGAAAC 


AACAGCTAAA 


ACGGAAGTCA 


CAGTCAAAGA 


CACCAATGGT 


CAATTAACCG 


1620 


TTCCCAAAAA 


TCCTAAGAAA 


GTCGTTGTTT 


TTGATAATGG 


TTCCTTGGAT 


ACAATGGATG 


1680 


CACTAGGTGT 


CGGTGACCGC 


GTGGTAGGTG 


CGCCAACTAA 


AAATATCCCT 


GCGTATTTGA 


1740 


AAAAATACCA 


AAAAGTTGAA 


TCAGCAGGCG 


GCATTAAAGA 


ACCAGATTTA 


GAAAAAATCA 


1800 


ATCAACTAAA 


ACCAGACTTA 


ATTATTATTT 


CTGGTCGTCA 


ACAAGATTAT 


CAAGAACAAT 


1860 


TAAAAGCCAT 


TGCGCCAACC 


ATTTACTTAG 


CTGTAGATGC 


CAAAAATCCT 


TGGGCATCAA 


1920 


CGAAACAAAA 


TATCGAAACG 


TTAGGCACTA 


TTTTTGATAA 


AGAAGAGGTA 


GCTAAAGAAA 


1980 


AAATAACTGG 


CTTAGAAAAA 


GAAATTGCTG 


ACGTGAA7UUV 


ACAAGCAGAA 


GCTAGCGCGA 


2040 


ATAATGCGCT 


TGTTGTGTTA 


GTTAACGAAG 


GACAACTTTC 


CGCTTACGGA 


AAAGGCTCTC 


2100 


GTTTCGGTTT 


AATTCATGAT 


ACATTTGGCT 


TCAAAGCAGC 


AGACGATAAG 


ATTGAAGCTT 


2160 


CCACTCATGG 


GCAAAGTGTT 


TCTTACGAAT 


ATGTTTTAGA 


AAAAAATCCT 


GGGATTCTCT 


2220 


TTGTGGTAGA 


TCGCACCAAA 


GCAATTGGTG 


GCGACGATTC 


AAAAGATAAC 


GTCGCTGCAA 


2280 


ACGAATTGAT 


TCAAAAAACC 


GATGCTGGTA 


AAAATGATAA 


AGTCATTATG 


CTTCAACCAG 


2340 


ATGTTTGGTA 


TCTAAGCGGT 


GGTGGATTAG 


AATCAATGCA 


TTTGATGATA 


GAAGATGTTA 


2400 


AAAAAGGATT 


AGAGTAAAAA 


AArGCCTAGG 


ACAAAAATCA 


CTTTGGATTT 


TTGTCCTAGG 


2460 


CTTAAAAGCT 


GATAAACGGC 


GGGAACAAAA 


GCAACTCCTT 


CGGAAATAAG 


CTGAAATTCT 


2520 


CCAAAAATTA 


AAGAACAATT 


TTCGGAAATT 


TCTTCTTATT 


TCTCGGAGCT 


AAACACTTCT 


2580 


GTCCCGACCT 


TCTTGATGAA 


AAACAAAAAA 


GTTTTAAATT 


AAAAATGTGG 


TTGACGGCCT 


2640 


CGAAAATGTT 


TGCTAAACTA 


TCCGTAAATC 


AGTATGATTG 


ATGAAACAAA 


TAAAAGAACA 


2700 


AAATGTGATG 


AGAAGGAAAG 


TACTTTTTAA 


AATAGCTTGC 


AGAGAGCCTC 


GTTTGGT6AG 


2760 


AAGAGGTAGT 


GACGTTAAAG 


G6GAAAATGG 


CCTTTGAACA 


GGTTGGTTGA 


AGCCGTTTGC 


2820 


TTAAACCAAT 


TCAGGTGCGC 


CTGTTATAGC 


GCTACAGTAT 


GATTGTACTG 


GAAACGAGCG 


2880 


GATCTTTTTT 


AGCGAAGAGA 


TCAACAAAGG 


TGGTACCGCG 


ATTGCATAAG 


CATTCGTCCT 


2940 


TTTTAGGACG 


AATGCTTTTT 


TTATTTGTTT 


AAAAAATATG 


AGGGGGCAAC 


AACATGAAAT 


3000 


TGGGGAAAAA 


AGTAGTAGGT 


TTGATTGCAA 


CAGGGTTTCT 


TTTAGCCGCA 


TGTGGCGGAA 


3060 


CCAAAGAAGC 


GGCAGAGAAA 


GTAGATTCGG 


GAAATTTAGC 


AGCTGAACAA 


AAAATCAGTA 


3120 


TTAGTTCACC 


TGCACCAATC 


TCAACATTGG 


ATACAACACA 


AACAACAGAT 


AAAAATACCT 


3180 
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TTACAATGGC 


ACAACATTTA 


TTTGAAGGCC 


TTTATCGGTT 


TGATGATGAT 


AGTGCCACGG 


-3240 


TGCCAGCTCT 


AGCTAAAGAT 


GTCAAGATTA 


GTGACGATGG 


GCGCAAGTAC 


CACTTTACCT 


3300 


TGCGGGAGGG 


GATTAAGTGG 


AGCAACGGCG 


AGCCAATCAC 


GGCCCAAGAT 


TTTGTTTATT 


3360 


CTTGGAAAAA 


ACTGGTGACA 


CCAGCGACGA 


TTGGACCGAA 


TGCCTATTTA 


CTAGACAGTG 


3420 


TTAAAAATAG 


TTTTGAAATA 


CGCAACGGTG 


AAAAGTCAGT 


CGATGAATTA 


GGGATTTCAG 


3480 


CCCCGAATGA 


CAAAGAATTC 


ATTGTTGAAT 


TAAAACAGGC 


CCAACCTTCC 


TTCTTAGCAG 


3540 


TCGTTTCGAT 


TGCTTGGTTA 


GCGCCACAAA 


ATCAAAAATT 


TGTCGAAGCG 


CAAGGCAAAG 


3600 


ATTACGCCTT 


GGATAGTGAA 


CATTTACTTT 


ATAGCGGGCC 


ATTTACGCTA 


GCCAATTGGG 


3660 


ATGCGACTTC 


AGATACTTGG 


ACATTGAAAA 


AAAATCCAGA 


ATACTATGAT 


GCGGATCAAG 


3720 


TGAAACTGGA 


AGAAGTTGCG 


GTTAGCACAA 


TCAAAGAAGA 


TAATACTGGG 


ATTAACTTAT 


3780 


ATCAAGTGAA 


TGAACTAGAC 


TTAGTTCGCA 


TTAACGGACA 


ATATGTTCAA 


CAATATCAAG 


3840 


ATGATCCAGG 


CTATGTCAGT 


CATCCAGATG 


TGGCCAACTA 


CTTCTTAGAT 


TTCAACAAAA 


3900 


AAGAAGGAAC 


GCCATTAGCG 


AATGTTCATT 


TACGAAAAGC 


GATTGGCCAA 


GCAATTGATA 


3960 


AAGAAGCCTT 


AACACAAAGT 


GTCTTAAACG 


ATGGGTCAAA 


ACCCCTTAAC 


GGATTGATTC 


4020 


CAAGTAAACT 


TTATGCGAAT 


CCAGAAACGG 


ATGAAGATTT 


CCGAGCTTAC 


AGTGGCGAAT 


4080 


ATTTGAAAAA 


TGACGTCAAA 


AAAGCTCAAG 


CTGAATGGAC 


GAAAGCCCAA 


GCGGATGTCG 


4140 


GTAAAAAAGT 


GAAACTTTCA 


TTGCTGgCGG 


CAGACACAGA 


TCAAGGAAAA 


CGAATTGCTG 


4200 


AATATGTTCA 


AAGTCAGTTG 


C7VAGAAAATC 


TGCCAGGTTT 


AGAAATTACC 


ATTTCATCGC 


4260 


AACCAAGTAA 


TAATGTGAAC 


CAATCGCGAC 


GTGAAAAAAA 


TTATGAGTTG 


TCTCTTTCAG 


4320 


GATGGATTGC 


CGGCAGTAGT 


GAATTAGACT 


CTTACTTTAA 


CTTATATGCA 


GGAGAATCAA 


4380 


GTTACAATTA 


CGGCAATTAT 


CATAATGCCA 


AATACGACCA 


ATTGGTAGAA 


GAGGCACGAA 


4440 


CGATTAATGC 


CAATAATCCA 


GAGAAACAGT 


TTGCAGAATA 


CAAAGAAGCG 


GAAGACATCT 


4500 


TGTTGAACCA 


AGATGCTGCC 


CAAGTACCGC 


TGTATCAAAG 


TGCCTCAAAT 


TATCTAATCA 


4560 


ATCCTAAATT 


GAAAGGCaTT 


AGTTATCACT 


TGTATGGGGA 


TTATTTCCAC 


TTGCGCAATG 


4620 


CCTATTTAAC 


AGAATGATTC 


AGGAAGAAGG 


AGTAAAAAAT 


GGAAAAATTA 


GTCGaACGAT 


4680 


TTTTAACTTA 


CGTGAAAGTT 


AACACCCGTT 


CAGATGCCAA 


CAGTCAAACC 


ACACCGACGA 


4740 


CGGTCGGTCA 


AGTGGTTTTA 


GCCAAAATGA 


TTGAAACAGA 


ACTTCATGAA 


TTAGGCTTGG 


4800 


CCGATGTGCA 


TTATAACGAA 


CAAAACGGCT 


TTTTAACGGC 


ACGGTTGCCT 


GGTAACCAAC 


4860 


CTGCAGCGAA 


GAGTATCGGC 


TTGATTGCAC 


ATCTGGATAC 


GGCGGATTTT 


TCAGCAGAAA 


4920 


ACATCCGTCC 


CCAAGTCATC 


ACTAATTATG 


ATGGTCAAGA 


TGTCCTCTTG 


AATCAAGAAC 


4980 


AAGGCATCGT 


TTTATCAGTT 


GCTGAATTTC 


CCAATTTAAA 


AGAGTATCAA 


GGGGAAACGT 


5040 


TGATTACTAC 


AGACGGAACC 


ACACTTTTAG 


GAGCAGATGA 


CAAAGCGGGA 


ATTGTCGAGA 


5100 


TATTAGCTGC 


TGTTGAGTAT 


TTTTTGGCAC 


ATCCAGAAGT 


AGCGCGGGGG 


GATGTTTGGC 


5160 
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TCGCTTCGGT 


CCAGATGAAG 


AGATTGGACG 


CGGGGCCGAT 


CAATTTGATG 


CGCCGAACTT 


5220 


CCCAGTCGCT 


TTTGCGTATA 


CGATTGACAG 


TGGCCGCGTG 


GGGCATTTCG 


AGTATGAAAC 


5280 


GTTTAACGCT 


GCCCAAGCTG 


TGATAACAAT 


TGAGGGAACC 


AGTGTCCATC 


CGGGAACGGC 


5340 


CTATGGTTCG 


TTAGTAAATG 


CCATTAAATT 


AGGCGAAC7VA 


CTTGATCAAT 


CACTACCGCA 


5400 


AAAGGAAGTG 


CCAGAACAAA 


CACGTGGAAA 


TGAAGGATTT 


TATTTACTCA 


ATAAATTTAC 


5460 


AGGAAGTATT 


GAAAAAGCAG 


AACTTGTCTA 


TATCATTCGG 


GATCATGATC 


AAGAAAACTT 


5520 


TCAAGCGAGA 


AAGCAATTTT 


TGGAAAAACA 


AGTCCAACGT 


TTAAATGCGC 


TGGCGGATAA 


5580 


ACCCAGACTA 


ACAATTACTT 


TCCAAGACCA 


ATACTATAAT 


ATGAAGGAAA 


TCATCGAAAA 


5640 


AGATTGGACT 


CCTGTAGAAT 


TGGCCGTCCA 


AGCAATGGAA 


AGTTGTGACA 


TTGAACCAAT 


5700 


CATCACACCA 


TTTCGTGGCG 


GGACAGATGG 


TTCTAAGATT 


TCTTTCATGG 


GCATTCCCAC 


5760 


GCCTAATCTA 


TTTACAGGTG 


GCGAGAATTT 


TCATGGGCAA 


TATGAATTCA 


TTACCGTTGA 


5820 


ATCAATGGCC 


AAAGCTGTTC 


AGACGATCAT 


TGCCATCATT 


CGCTTAAACG 


CTAATTAAAG 


5880 


AAACACGCAA 


GGGAAATGCC 


GATTTTCCTT 


GCGTGTTTCG 


CTGTTTTTCG 


GTAAAGTATG 


5940 


TAGAAAAGTA 


GGGAGTTTTG 


CTCATGAATT 


ATCCaTTGAT 


TTTTGCAGAA 


AACCAACGAA 


6000 


TCGAAACCGA 


ACGCTTGATT 


TTAAGGCCCG 


TCACATTAGC 


CGATGCAGAG 


GATATGTACG 


6060 


i\ATATGCTTC 


AGATGAGGAA 


ACGGTCCGCT 


ATGTGTTTTT 


GAAAAATCAA 


ACAATCGCTG 


6120 


AAACAAGACA 


AAACATTGCT 


AAATTTTTTA 


TGGGGGAACC 


TTTAGGAAAA 


TATGGGATTG 


6180 


AAGTGAAAGA 


AACGGGCAAA 


ATGATTGGCA 


CGATTGATTT 


GCGGGTCAAT 


GAAACCAATA 


6240 


ATATTGGAGA 


ATTAGGCTAT 


GTCTTGAACC 


GTACCTTTTG 


GGGCAATGGC 


TACATGCCTG 


6300 


AAGCCGCGAC 


GGCGTTAGTT 


GAACTGGGTT 


TTGCAAAAAT 


GAAACTCATG 


AGAATCTTTG 


6360 


CTTTACATGA 


TCAAGACAAT 


CCAGCTTCTG 


GTCGTGTCAT 


GGAGAAAATT 


GGTTTTACCT 


6420 


ACGAAGGAAC 


ACTACCCAAT 


GCTCGAATTA 


GTAAAGGGAA 


AATTGTCACA 


GATGTTTATC 


6480 


GTGGGATGAC 


ACTGGAAACA 


TGGCAAAATC 


GACAAAAATA 


GAAGCGGAAG 


ACAAGTTAAG 


6540 


AGAGGAATTA 


GAGAAATGTT 


GATGTTTACA 


GAAAAAGAAT 


TCGCAGCCTT 


TGAAGTTGCT 


6600 


GGATTAGATG 


AACGAATGGC 


GGTTATCCGT 


GCGCAAATTC 


AACCTATTTT 


TCAAGAATTA 


6660 


GACACGTATT 


TTGCCGAACA 


ACTAGCGCCT 


GAACTAGGAA 


CGGAGTTGTT 


TGTCCATATT 


6720 


GCGCAACATC 


GCCGTCGCAC 


CGTGTATCCG 


CCAGAAAATA 


CTTGGTCAGC 


ACTTAGTCCG 


6780 


AATAAGCGCG 


GGTATAAAAT 


GCAGCCGCAT 


TTTCAATTAG 


GCATTTGGGG 


CGACTATGTG 


6840 


TTTATGTGGT 


TGTCATTCAT 


TGATAATCCT 


AAAAATGAAA 


AGCAGATTGC 


GCAACCTTTT 


6900 


TAGAGAATCA 


GCAATTATTC 


CAAGCATTAC 


CAGAAGATAC 


GTATGTyTCC 


TTGGATCATA 


6960 


CGGTTCCTCA 


AATTACTCCG 


CTGCCAGAAA 


CGGATTTAGA 


GAAAGCCTTA 


ACTCGTTTTA 


7020 


GAGATGTAAA 


AAAAGGTGAG 


TTTGAAATTG 


GGCGCATCAT 


TCCCAAAGAT 


AGTGACTTGT 


7080 


GGCAAAACCC 


TGAAAAAGCA 


CGAGCTTATA 


TGTTGGCAAC 


CTATCAACAA 


CTCCTGCCAT 


7140 
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TGTATCAATT 


AGCTGTTGCC 


CAATAAGAGA 


AACAGACAGT 


TATTTTGTGT 


TTTAGTGTTA 


7200 


TTTTATCGAA 


TTATCCAATC 


GTCAAAAATG 


AGAaAAGCTC 


ATAATGTTAG 


TTTTTKATTAC 


7260 


T7UVCATTGTG 


AGCTTTTCTG 


TAGTTTTTTT 


GAAAAAAATT 


AAGGGATGAC 


TGGCTGATTT 


7320 


CTATGGAAAA 


GTACGTTTTG 


ATAGCCTTTT 


TTGCTTAAAT 


AACAAGAGTA 


GCAACCCTTG 


7380 


TTTAATAAGG 


GATTTTCGCG 


AATAGATGTT 


GTCAAATGAA 


AAAAAATGAT 


GTTTTGAACA 


7440 


AAGAGTTCTT 


TGCTATTATA 


AAGGTATGAA 


AAAGGTTGTT 


TACGCATTCT 


TTGTATAGAA 


7500 


GAATGTGAAT 


GAGTTAAAAG 


GAGGTTTTTA 


AATGAAAAAA 


AAGCGTTATT 


TAATAATTGC 


7560 


GTGTTTACTA 


TTTTCCCCTA 


GTTTTTTTAT 


AAATGTTGAA 


GCATCTGAGG 


GTGGTTCTAG 


7620 


TTCGGTGGGA 


ATTGAATTTT 


ACCAAAATCC 


GGCAACACCC 


GCTCCTAAAG 


ATGCCCCACC 


7680 


GAAAACAGAT 


GAGCCAGCTG 


CGGATCCCAA 


GGAACCAGCT 


GGTCCTCTGC 


AAGGAGATCA 


7740 


ACGAAgTGGT 


GGTTCGACAC 


AGACCACCAC 


AGCTGGCTCG 


CAGCTCCCTC 


GTACAGGAAG 


7800 


CAAGAGTCAG 


GCAAACCTGA 


GCATTCTTGG 


TCTTGTCTTG 


ATTGGTCTTG 


TCGGAATGGT 


7860 


CCAGAGAAAG 


AAGGGACGAC 


ATGAAGCAAA 


CTAAGTGGCA 


ACGATTAGCA 


ACCATTGGCT 


7920 


TGTGTAGTTC 


TTTAGTGATT 


AACGCCTTTT 


CTGGCGCGAC 


CGCAGTTGCG 


GAAACAATCA 


7980 


CGATTGAAAG 


TAGTCCGACA 


GTAGCAAGTA 


GCGCTAAGGA 


AGCAACATCA 


GCAAGTAGCG 


8040 


CAACGCCAGA 


AAGTACGGAA 


AGCAGTCAAG 


AAACGACTGA 


AACGAGTCGG 


GAAGAAGT6A 


8100 


CACAGGAAAC 


AGTGGAACAA 


GAAGATACAA 


CGGAAATAGC 


TCAAGAAGAA 


AATCTGCTGG 


8160 


AACCATCTGG 


TATTCAAGCG 


AAAATTATGC 


CTAGAGCATT 


TGTCTGGGCA 


GCTACAGGAA 


8220 


CCACGAATAG 


AACCACGGTT 


CGTCTTAACG 


ATGTAGTTGA 


TTATCGAATG 


AAACTGGAAA 


8280 


ATATTACAAC 


AGATGATACA 


AATTCAGATA 


TTGTTGGTCC 


GATTCGAATT 


GAATCAAAAT 


8340 


TGGCAGCTGG 


AATGACTAGA 


CCAACGTCAG 


TTATTATCAA 


AGTTGGCCCC 


GAGGAACAAA 


8400 


ATGTAGTTAT 


AGGAGAAGGC 


TCTCATAATG 


CCAATAGTGG 


TGGCGAATAT 


TTTGTTTGGA 


8460 


CAGAGTCGAC 


TAGGACGGTC 


ACTGCATTTA 


TCAATCGTCT 


ACACGGGCGA 


ACTTCTGGAA 


8520 


ATACTGGGTA 


TATCAAAACA 


TTGTATTTTC 


AGACAAGAAT 


TACTTCGGGT 


ACACACAACG 


8580 


AAAAAAAATA 


TATCGATTCC 


ACGATTCGAT 


TTAATGGATT 


TGGCGCAGTG 


AATAGACGAA 


8640 


ATGAAATGAC 


GTATGTGGAT 


AGATTTTCTG 


6TAACTTAGG 


TTTGAAAGGG 


AATTTAAAAT 


8700 


TTTACGATGA 


GGATGGAGTG 


GCTAGTAAGG 


AACAAATGAG 


TCTTTCCATA 


AACCCAGTAT 


8760 


TATTTAAATC 


AACCAATTCG 


AGAGTAGAAG 


GGACGCTGTT 


AAGATATCCT 


TTAACTATGA 


8820 


CAAATGCTGG 


TAGTGGGCTT 


TATAGGTTTG 


ATACAGGACT 


AAAAAATATA 


GGTACTAGTA 


■ 8880 


ACGGGGGTTC 


GTGGTTTTTC 


TTTAGTAACG 


ATATTGCGAT 


TAAwTTTGCT 


ACAGCTACCA 


8940 


CCAATAGAAT 


TGATAAAATT 


GTGTTAGCAC 


CACCTAAAGC 


AACACTTCCA 


GACAGCCGAA 


9000 


CAGGTGTGAT 


GACTTCCAGA 


GCGTACACAT 


CAACAACTTA 


TACGAGAAGT 


AACTATGATA 


9060 


TTTCaCaTAT 


AaCTTTACTA 


AAGCGACTGG 


TAAGACATGG 


CGTATAACGA 


ATCGTAGTTC 


9120 
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TAATAGCGGA GTAAGCACCA ATATTATGTT TAGTAACAAT CTTGTAGGAA CAACAGGACA 9180 

GTTCAATAAA GGGAATTCTG GTGCTACTAT TAATTATTAT ATTTATTACA AAAAGCTTTA 9240 

TGAGAATTTT GTTAATAGTA ACGGACAAAA AATTACACCA CCATCAGGAT TCACAC7VAGG 9300 

AAAGCGAACG GTTATTAATA GTGAAGCGTA CACTTTTAAA CAAAGTGGCA CCTTGCCAGA 9360 

TACCTATCAA GCAGACGGCA AAACCTATAA GTTCAAAGGT TGGTACAAAG GCATVAACCAA 9420 
GCCAAACACC TTGACCACCA CCAAAGCACC AAGCTATGCG GTGACTTATA ACAATGATGA 9480 
TGATTTGACG GTGGTGTATG AGGAGATTGA GTTTTTTGAC TTCCCAGCGC TGACCTATCA 9540 
ATTCGGGTTC GTGGACGAGT CTGGCAAACG AGTAGATGCT TCAACAATTG GGTTGACTTA 9600 
TGACA7VATGG CGTGGGGAGT TGTTAACAAA TAATGTCAAT GACTGGAAAA CAGTCAGTCT 9660 
TGAAAAA6GT CAAGTTGCAC CAACCAAAAA CAATTTGAAG GAAATTGCTT ATCCTGCGCA 9720 
ATCTCTTGAA ATACTGAGTG ATCGAAGTAC ACAATATAGT GCGGCTAATT TAACCTTTAC 9780 
ACTGCCCAAA TACTATGGGA AAATCAGTGT TTATAACAAG AGTGGTACTT TTGATGCAGG 9840 
CTACCTGTTT CCAAATATCC AAAATAATAA TAGTTCAGAA CCATCTGTTG AAGGCTTACA 9900 
CAAAAATCTT TCTAGCTGGT TTGAACTAAA GAAAAGTGGG AACCAATCGT TTGTTTTTAA 9960 

CAAACCGATT CCAGCAGCAC CAATCGATGT TCAGGTGCCT GCTTACTTGC GAGAGATTGT 10020 

CTATAATCCC GATCTGATAT CAAATACAGC CGTCTATTAT GTAATAGATA AACCTGTCTA 10080 

CTACTATTTA ACCAACCGCA AAGTCACCGA AAACTTCGTA GACACAAGCG GTGCCAAAAT 10140 

TACACCACCA ACAGGTTTCA CCCAAGGAAA ACAAACAGCG ATTACCAGTG ACCCATACAC 10200 

CTTCAAACAA GCAGGTACCT TACCTGATAC CTACACAACA GGCGGTAAGA CCTACAAGTT 10260 

CAAAGGCTGG TAGAAAGGGA AAACCAAAAC CGAGCCTTTA ACCACGACGA AAGCACCAAG 10320 

TTATGGGGTA ACCTATAATG ATAATGATGA TTTGAATGTG GTGTATGAAG AAGTAACAGC 10380 

TATTTATCCA GGGTTCTATG CGCAATTTGT TGATGAACAA GGCAAAGCTT TTACTAATTC 10440 

GCTAACTCTT TCTGAAAATT ATACTGAGTT TCTCAGAAAG ACAGGCACTA CTGTTTTTGA 10500 

AACGAAGGGC GCCTTTTACC CAATGGTAGG TAAGCGCGAT GCAACAGTGA CAAATAAGTA 10560 

TAAGATTGAA ACAACGAGTA GTGTTGCAGT ACCAAATAAT TATTGGACAG CCATGCCTGC 10620 

TGACATTGCT AATAGAGGGT ACGCATTAAA TAACTTGGAT ATTACCAATC AAATGAAATA 10680 

TGTTGACAAG ATAGAAACTA CTGACACGCA CACTACAATG TACGATTATA CTGGAGTTTC 10740 

GGACATTGGG GGCAAACCAA ACAATGCTTC TCAGACAGAA TTAAATCCTA TT6ATAAAAC 10800 

ATTTGTTAAG CAAACAGATA ACACTTTTGC GATAAAATCA ATTTGGGGAA ATGGaACGTT 10860 

TTCGCATATA GTTCCCAAAA TGTTATTGTT AGCAGAAAGT GGCAGTTTTT CAAGATTATT 10920 

CGGGTTTGAT GGTACTCCAG ACTACAAGCA AACCGTAAAC TACAAAGTCA CCCGTAAACA 10980 

AGTCACCGAA AACTTTGTTG ATGCCAACGG TGCCAAAATC ACCGCACCAA CAGGCTTCAC 11040 

CCAAGGCAAC AAAATACCCA TGACTAGCAA CACCTTCAAG TACACAGCCG CAAAAGCCTT 11100 
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GCCAGCAACT TATACCGCAG GTGGCAAGAC CTATACTTTC CAAGGCTGGT ACAAAGGCAA 11160 

AACGAAGCCT AATACTTTAA CAACCAGCAC TACGCCAACA TATAACACAA CTTTTGATGG 11220 

C7UVTGACGAC ATGACGGCGA TTTATAAGGA AGAAGTGCCC A7VAGCAAGTG TCGCTTTAAC 11280 

GCGCACTACC GCTGAAACCG TGACTAGCGG CGGCAATGTC ACCTGGCGCG CCACAATTAC 11340 

CAATACAAGC CAAGCACCAC TAACCACGGC GACCATCAAG AAATCAACCG CTTGGACAAC 11400 

TGGCTTAGCA GCACCAACCG CCATGATTGT CACGCCAGCT GGCGGAACCG CAAAAACAGT 114 60 

TCCCGTCACA GCGACTACGT GGACCAACGG TGTTAGCTTA GGCACTGACA TTCCCGTAGG 11520 

AAAATCTGCC ACGGTTCAGT TCACTACGAA AGCGACGGGG ACAGCGGGGC AAGTCTTGCG 11580 

CGCTGGGATT ACCACCAGTG GGAATTACAG TGGTGTTTCT ACTTCGGCCA* CCGTACGTGT 11640 

GAAAGACAAT GATCAAGCAA TTGTAACACC AACGGCTGAA GGCTTCATCA GTGTTCCAAC 11700 

CTTCAATTTT GGTCAAGTTG GGGTAGCGGG AAGCACGCA 11739 
(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4045 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 



GAATTTCTTC 


AGTATCCTAA AACAAGAAGT 


GTATTATGGA 


AAAATCTATC 


AAaGTCAAaT 


60 


GAATTGrTaG 


AAGCAaTTGA 


GAACTACATT 


TATTATTACA 


ATCATCACCG 


T^TCAAGG/VA 


120 


AAACTTAACT 


GGAAAAGCCC 


AGTAGAATTT 


CGACAATTCA 


ATCAAAAAAC 


TGCA7MKMJK 


180 


TAGAGTGGAA 


AAATCCACTC 


TATAAAAAGT 


CTAACTTTTT 


GGGGTCGCTA CAACAATACG 


240 


TTTGTTTTTT 


AATCCTTCAG 


TTTATAACAT 


TATAAGAGAA 


AAGC6AGAAG 


AATCATCCTC 


300 


CTTCTCGCTT 


TCTTGTTACC 


CTAAACCAAC 


ACGTTCAAAA 


ATCGTGTCAA 


CATTTTTTAA 


360 


ATGATAATGG 


TAATCAAAGG 


CATCATCTAG 


CTCTTCTTTT 


GATAAGACAG 


ATGTAATTTT 


420 


TTCATCCGCA 


TCTAGCAACG 


GACGGAACGC 


CGTTTGATGA 


TCCCATGCGT 


AAGCTGTTTT 


480 


TGGtTGGACC 


AGATCATACG 


CTTCTTCACG 


TGTCATGCCA 


TGATCAATCA 


ATTTTAACAA 


540 


GACACGTTGG 


CTGTAAATTA 


GACCATAGGT 


CGCATCCATG 


TTACGTTTCA 


TATTTTCTGG 


600 


GAAGACCGTT 


AAGTTTTTGA CAATATTACC 


AAAACGATTC 


AACATGTAAT 


TTAGTAAAAT 


660 


CGTCGTATCT 


GGAATAATAA 


TCCGTTCAGC 


AGACGAGTGA 


GAAATATCTC 


GTTCATGCCA 


720 


TAAACTCACA 


TTTTCAAAGG 


CAGTAATGAC 


ATGACCACGA 


ATCACACGTG 


CCAAACCAGT 


780 


CATATTTTCA 


• GAACCAATTG 


GATTCCGTTT 


ATGCGGCATT 


GCCGAAGATC 


CTTTTTGTCC 


840 


TTTTGCAAAA 


AATTCTTCTA CTTCACGTGT 


TTCTGATTTT 


TGTAGACCAC 


GAATTTCTGT 


900 


TGCAAATTTT 


TCAATACTTG 


TCGCAATTAA 


TGCCATGGcA 


GAAACATATT 


CTGCATGTAA 


960 
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ATCACGTGGT 


AAAACTTGTG 

FWmrW* X X w X \J 


TCGAAATCTC 


TTGAGCACGG 


ATCCCCAAAT 


GTTCGCAAAC 


1020 




APAAATGGTG 


rAATATTGGc 


AAAGGTACCT 


ACGGCTCCAC 


TGATTTTACC 


1080 


rCZCTTCGACA 

V»V9V# X X VrVSnVvJn 


rrTTTGGCTG 


CATGTTCAAA 


ACGTTCAATG 


TTGCGTTTCA 


TTTCAGAATA 


1140 




AATTTPAAAP 


PAAATGTAGT 


CGGTTCTGCA 


TGAACGCCAT 


GCGTTCTTCC 


1200 




Gi AlGi I lAl 


aPTPTTTfl^^P 


TTTGGPTPPG 

XXX V9\3V« X ^.^V^U 


APAATPTrrG 


TAAATGTTTG 

X rW^X ^7 XXX \3 


1260 


TAAAiUi 1 lU 


ppTaaaaTaT 


PATTPGPPTG 


TTTCAATAGA 


TAAPPATAGG 


CAGTATCCAC 


1320 


AACATCTGTto 


pT^sp TT a a pp 
GXGGX lAAGG 


p a 1* a ^T^^T a p 


PPATTTTPGP 


TPTTPAPPAA 


GCGTTTCTGA 

X X X ^ X wn 


1380 


AACCGCACGA 


GTAAAAGGGA 


paapaTPaT/i 
GAAGAIGAXG 


aPI^Ti^TTTPT 
AGG X G X 1 1 X 




PTAAAATAPG 


1440 


TGCAaCATCA 


A A trtf^ A Tt^/^ A n* 

AATGATGCAT 


r TTG X GGGAX 


X X X X XGXAGA 


XG X XG X X XAG 


fi a aTT TPCPP 

GAAX X X G\3\«V^ 


1 snn 


AAGTTCT^'^CC 


^A A/^/^TT*^ A T 

GAAGGl XGAl 


GGGGGAAAA X 


X X GAAG X 1 G X 


apppaTPPPT 

Avj^GAl uL^V^ 1 


TnTaf^PCfSTT 
1 V3 1 AuLfV^la 1 1 


1 sen 


CTTATCTGTC 


CAAATTGGGG 


GGAl i XGGGG 


IGl IGIAIAA 


ppaTpaaTpa 

GGAX GAAi GA 


TTPTf^PPTTa 


X 0£ V 


TCCCCTCTTT 


AGTCGTCAAl 


fprp#prp/-» A A AAA 

1111 G AAAAA 


T A ATT A A 

XAAX lAGGHG 


PA A ATPTTTP 
G AAA 1 ^ 1 1 1 o 


TTTPTTT a aT 
1 1 1 1 1 X AAl 


X QO 


TTCTTCCAG T 


G n. 1 GA I GGA 


1 1 1 G 1 1 1 GGl 


TA aTaTaPTa 

1 AfVl A 1 Ao 1 A 


aTaTPr^pppa 

A 1 A ILj L- k-- A 


TpppTppTTr: 


X f *s u 


CGCTTTTGCT 


1 GWG i G 1 GGG 


AT A A*I*aPTP 
GAl AAl AGX G 


GAAATGCCAA 


TCTGGATGTT 


paTTaaTaaT 


X O V/ V 


GTCGATCGx x 


XGAlAGGGi X 


paTTTPPaap 
GAX X XGGAAG 


CACATTGACC 


GTGACTGCTT 


paf^apaapan 


18 60 


TTGAATCGl i 


GGGAX TGGGG 


A ATTapaTaa 

AAX XAGAlnA 


ACCACGAATA 


TGCGTATCAT 


a TTf^riPT AAA 
AX X\3V3x«lArm 


X 7£ \/ 


AGAAGAAGUA 


TGTAX XGAAA 


A ATTPPPa/^P 
AAX XGGGAGG 


ATCTTCTGGA 


CGAGGTAAAA 


X X X X AX X xtlX^ 


1 980 


ATAAATCCCG 


CCTGTTTc TG 


*P A fP A A A A ^ A 1* 

T AT AAAAGA 1 


TTCAATTGAA AGGGTGCCGA 


pTaaaPPAAP 


9040 


TGCTTTAGCA 


Afn^A^^fpn^A ^ 
ATCACG TTAG 


GAATTGGT X G 


CAACTCACCA 


CTGAcTtCGT 


PATPaaTTPn 
GAX GAAI lOv 


91 00 

£ XU V 


GGCTGGTGCA 


ATCGTTTCAT 


Gl AGC r X AGG 


CTCTTTTTTA 


ATTGTTTCCA 


PaaPTP^AAA 


91 fiO 


TGTCGCAAAT 


TGGGGAGGAG 


^A*n»APpapa 
GAX XAGGAGA 


AATCAGAACA 


GATAATTCTT 


TTTP^ZTAAGTJ 


2220 


AAl v#v#AAGv« 1 


Tppa arzapap 

1 GGAAGAGAG 


afZ/2TTPPTTP 
nGGl x\..V«X 


TCTCAGAAGA 


TTCATGGATG 


GPGPTAAGTr 


2280 


1 G AAG i AG i. A 


•p^T a T ap a T 
XGXAGX AGAX 


aap^apPTTP 


CCCTCGCGTA 


TTCTTCAAGA 


TACACGGATA 


2340 


appaaTfZPpa 

AwGAAi l9W»GA 


T^'a A'PTP^*'!**!* 
IGAnX XGGX 1 




AGTCGGACTA ACAATCGTTG 


CATAAGGTGC 


2400 


ZVATPAPAATA 


X Xnx xnx\>x X 


PPAARAAAPT 


TTTTTCCATT 


AAGCGATCTT 


GCGTAATCGC 


2460 


TAAAATflTPT 


GTTPPTTflPG 

O X X \^\^ X X 




CAAATGTTGA 


ATCGCTGTTA 


ACGCATCAAC 


2520 


ATrTGPATGT 


TPAAATTPAT 
X ^nrv\ X X Vi#n x 


APGTAATCAP 


TTGGGCACGT 


TTTGCCAATT 


TCTCTAACGC 


2580 


TTCAGGATCA 


TTATATGPTC 


PTAATAATTG 

X inkn X X X 


CCAATCGGCA 


ACTTGTGCAG 


CAGGACAATC 


2640 


AATTTGTGGA 


TCTAGCACAC 


CCACTCGAAA 


CCCCATTTCT 


CTTGCACTAA 


GGGCCATCAT 


2700 


TTTTGCTAAA 


TGTCCGCCAC 


CAATGATACC 


AATCACTTGA 


CCAGGTAATA 


GTGGTTTATC 


2760 


CAAGTTCATC 


ACAACTTTCC 


ATCACAGTTT 


TATCCTTAAA ATCCTTCTGC 


GGTTTCTCGT 


2820 


CGCAGTCTAG 


CCTTTAAGAG 


TGGCCCTnTT 


TACATGaTAC 


mAGGrAAAAG 


GcTCAACTTC 


2880 


AAGTGCCAGC 


AGTGTTTCCA 


TTAGGTTTTC 


TCATCAGAAT 


TTATTAACAT 


ACTTTAGGaT 


2940 
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AAAACAGCGG 


TAACTTACTA 


CTCGTCTCTT 


TGTAACCAAG 


CTTTTTTCTC 


TTTTTCTCTT 


3000 


CAAATTGATG 


AAAACATTAA 


ATTTTTCTAA 


ACGAT.GATGA. 


.GGTCAAAAAA 


GTATGAAAGA 


3060 


AAAGCCACCT 


GACCTTATCT 


ATCATACTTT 


TTTGAAACTA 


TTCGTTACTT 


TCTGAGGTTT 


3120 


CTTTTAAACC 


ATTGAATAAA 


AGATTTAAAA 


TAATTGCGGT 


AATGCTGCTC 


ATGACAATTC 


3180 


CATTACCAGT 


AAACATGCTA 


ACAGTTTCTG 


GTACTTTATT 


AAAGAGCGTC 


GGCATAATAT 


3240 


TAAAGCCTAA 


ACCAAAGCCA 


ATCGAGATGG 


CAATAATTAA 


TAAATTTTTG 


TCATTTGAGT 


3300 


AGTCAACTTT 


TGACAACATG 


CGCATTCCTT 


GGACAGCCAC 


CATTCCAAAC 


ATAACCAGCA 


3360 


TTCCGCCGCC 


TAAAACCGGC 


TCTGGGATAA 


TTTGGGCCAT 


CGCACCAATT 


TTTGGTAATA 


3420 


AGCCTAAAAT 


AATTAAGAAA 


AAAGCAGAAA 


AATAAATTGG 


TCGACGTGTT 


TTGATCCCAG 


3480 


ATAATTGAAC 


TAACCCGACA 


TTTTGAGAGA 


AACCTGTATA 


AGGAAACGTA 


TTAAAAATCC 


3540 


CCCCTAAAAT 


GACGGCTAAA 


CCTTCTGCAT 


GATACCCACG 


ACGTAAATCA 


TCTTCGCCAA 


3600 


TTTTCTTCCC 


TGTAATATCT 


CCTAAGGCAA 


AATAAACACC 


TGTTGATTCA 


ACCaTACTAA 


3660 


CAATtGaAAT 


AATAATCATC 


AACACAATAG 


AAG/^GATC 


AAACGTTGGT 


TTGCCAAAAT 


3720 


AAAATGGCTG 


TgGAAAaTGa 


aACCACGTAG 


CTTGACCAAC 


AGGAGACAAa 


TCCACTAATC 


3780 


CCAGAAACGC 


AGCTAAAATT 


GTGCCACCGA 




1 AAUAL-VjviOA 






TAAAGCCACG 


TCCCCAAACT 


TGCACAACAA 


TGATTAAGGC 


AATTGTTACA 


AATGCGAGGA 


3900 


GCAAGTTTTT 


AGGATCACCA 


AAACTTTTGT 


CCGTTGCTAA 


GCCGCCACCC 


ATTTTTTCAA 


3960 


CAGCAACAGG 


AATTAAGGTC 


AAACCAATAA 


CTGTAATGAC 


TGTGCCGGTG 


ACTAAAGGCG 


4020 


GAAATAGACG 


TTTAATTTTT 


GAGAA 








4045 



(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

nAnnAATATA CTATAAAGTC CAAAGAAAGC GCTTTAATTC GATGGAGAAG GTGGTTTAGT 60 

GAAAAGATTT TCATTTTTTT TACTAATTTT ACTTGCTTTA ACAGGTTGTA AATCCGGTGA 120 

AAAAGAATTT GATGAAGAAT CTCTTCAAAA TCTAAAGGAA ACGkCACAGT CTTAyTCAGA 180 

AACAGAATTA CAAAATGGTG ACGTTCGTTT AAATGAATAT ATTTCTTTGA AAGGGGAGAT 240 

TGTTGAGAGT GACAGTCGTT CCAGTTTAAT AAAAAAAGGT GATCGTTTTA TTTTGAAAAG 300 

TGGTTCTAGT AAATATCAAG TTTyTAATGA GCAAAAGAAA AAATTGAAGA TTGGTGACGA 360 

AGTGACAGTT TACGGAGAAT ATTACGGCTT TTTGAAAGGG ACATTAATTG AAAGTGAGGA 420 

GAATCATGAT TCAGCCACGA ATTAGACAGA ATCCTTTTGA AACATACGAA GAAGTTGCAG 480 

AAGGATTTGA AGATTTATTA GCGCCACTGG AATTGTTTTT TGATCGAGAG TATCAAGGTC 540 
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ATCTGGATTT 


AGGGACACAC 


GGAACGGTTT 


ATTCTAAAGG 


GACACGTGAT 


GCGGAGGCAT 


600 


TTTTACGACC 


ATTATGGGGA 


CTAGGTCCTT 


ATGTAACGCA 


AAATGAAAGT 


GAGTATTTGA 


660 


ATGACTTTCT 


AACGGGAATT 


ATCGAAGGAA CCGATCCTGA 


GAGTTCATCT 


TATTGGGGaA 


720 


AAACAAAAGA 


TTATGATCAA 


TTGATTGTTG 


AGATGGcAGC 


ATTAAGTACA 


TTTCTTTTGt 


780 


TAAACAAAGA 


AAAAACGTGG 


GACCAATTAa 


CAAAAGAACA 


GCAAAACAAT 


TTACACAGTT 


840 


GGCTGATACA 


AGCCAATGAA 


AACATTATTC 


CGCCAAATAA 


TTGGCATTTC 


TTTAGAGTGT 


900 


TGCTGAtTac 


GtGAAcATgT 


GAAgCCGTAT 


cnAAAGAaCA 


GAtCGCTGTT 


GATTTGGCTG 


960 


TAATnGATCG 


TTTTTAtGtC 


GGTAATGGTt 


GGTATTATGA 


CGGAGTCGAG 


ACACAAGTTG 


1020 


ATTATTATGT 


TTCTTtCGCT 


ATTCATTATT ATAGTTTATT 


GTATTGTCGT 


TTTGCCCCGG 


1080 


AAGATACCGC 


AAGAGTGGCT 


ATTATGAAGG 


AACGAGCAAC 


TTTGTTCGCG 


CAAGAGTTTA 


1140 


AATATTGGTT 


TACTCAGCCA 


GGAGAAGCGA 


TTCCCTTTGG 


ACGTAGCTTA ACGTATCGTT 


1200 


TTGCTCAAGT 


GTCATTTTTT 


AGTGCTTTAG 


TTTTTGCAGA 


TGTTGAAGCA 


TTGCCGTGGG 


1260 


GTGAGATTAA 


AGGTCTAATC 


AGCCGACATC 


TTCATCAATG 


GATGAACAAA 


GATATTTTTA 


1320 


CAACAGATGG 


ATTATTGTCG 


GTAGGTTATG 


ATTATCAAAA 


TATGGTCTTT 


GCCGAAGnTA 


1380 


TaATGGTCCT 


GGTTCACCTT 


ATTGGGCCTT 


tAAAACATTC 


ATTTtGTTAG 


CGGTTCCAAA 


1440 


AGATCATCCT 


TATTGGCAgC 


AGAAaCACaG 


CCtATTAGCT 


TtCCaGAAAA ACACTTACCT 


1500 


cACCgAAAGT CGGAACTACT 


ATCAaGTAAA 


TGAT6CAGGG 


ACACATGGTT 


TAATGTTTCC 


1560 


AGCGGGACAA 


TTTATTAATT 


ATCAAGCTCa 


TGCGCATGAT 


AAATATAGTA AATTTGTTTA 


1620 


TTCTTCCCAT 


TTTGGTTTCA 


GTACTATAAA 


GAGTGATTAT 


TGGTATTACG 


AAGGTGCTTA 


1680 


TGATAATTGC 


TTGGCTTTAG 


CTGAAGACGA 


CCACTATTTC 


CGCACGAAAG 


GATTAGATGA 


1740 


TCAGTACGAA 


ATTCTTGATG 


ATCGTATCAT 


TCATCAATGG 


CATCCTTGGT 


CCGATGTAGC 


1800 


T^TCAAAACG 


ACAATTGTAC 


CATTAGAAGG 


CCAACATTTA 


CGGATCCACG 


AAATTGAGAC 


1860 


ACAACGTGCA 


TTAGTGGCTT 


ATGAAGGTGG 


CTTCTCAATC 


CCGCTCTTCG 


ATGAAAAGGT 


1920 


CACTTGTGTT 


TCTGATCAAA 


TGGCTGAGGT 


AAAAAATGCA 


AAAGGTGTGT 


CAAAAGTAGA 


1980 


AAATATCAaC 


GGCTTTTCTG 


AAGCyGCGAT TATTCGAACA 


GAACCGAATA CAAATTTACT 


2040 


TTATCCGTTA 


ACGGA6TTGC 


CATATTTGAA AGCAAACTTG 


TCGAAAGGGA AACATCTTTT 


2100 


AATTTCATTG 


GTGACAGGCG 


TACTACCGAA 


TGAACAAATT 


GAGCCAGTTA 


AAGTTAGGTT 


2160 


AAAAGAAAAT 


CAGCTTTTAG 


TTGAAGAAAA 


AGTTGTAATT 


TTGGGGAATT 


AGGGGGAGAA 


2220 


ATACACAATG 


ACAGAAAATG 


GACAGATTGA GTGGTTGCCG 


AAGCAAATCA 


GTTTTGCTTT 


2280 


AAACAAAGTT 


AGAGGAAATT 


TAGAGACATT 


TAAAGAATTA 


GTTCCACCCG 


CGGCAAGTTT 


2340 


AAATCAAATT 


TATTATCCTG 


AAGAAAaTAT 


TGACTGGACG 


GCCAGCTTTT 


GGCCAGGAAT 


2400 


GTTATTTTTA 


GCAAAGGAAT 


TAACAAATTC 


T^CAGAGTTC 


GATGAGGTGA 


TTGCCACACA 


2460 


GATGGCTTCT 


TTTCAACATC 


GCTTGGATGA 


ACAAATTGAG 


CTGGAAACAC 


ATGATATAGG 


2520 
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CTTTTTGTAT 


ATTTTAACAG 


CCATTGCTGA 


TTATCAAGTC 


AATGGTCATG 


AGGCCTCAAA 


. 2580 


AGAAATGGCC 


TTGCAAGCTG 


CGGACTTATT 


AATGAAACGT 


TATTCACCAA 


AAGCCAAGAT 


2640 


TATTCAAGCT 


TGGGGGAATT 


TAGACGATCC 


AGAGCAACGA 


GGTAGAATGA 


TTATTGATTG 


2700 


TTTAATGAAC 


TTACCATTAT 


TATACTTTGC 


AGCAAAAATG 


ACTGGCAAGC 


AAGAGTATTA 


2760 


TGAAGCTGCC 


TACAATCATG 


CGAAACAAAC 


ACAAAAATAT 


ATTGTGCGAG 


AAAATCACAC 


2820 


TACGTTCCAT 


ACGTATTnTT 


TTGATACGGA 


AACAGGCGAA 


CTTGTACGGA 


AAGACACAGC 


2880 


AAGGATACTC 


AGATGATTCT 


TGTTGGGCTC 


GCGGCCAAGC 


CTGGGGAATT 


TACGGCTTTA 


2940 


CGTTAAGTTA 


TTTATATACA 


GGAGATAGTA 


GTTTTCTGGA 


AACAGCAAAA 


AATGTTGCTG 


3000 


ATTACTTTAT 


TCmAGAATTA 


CCAGAGGrCa 


AAATCTGTTA 


CTGGGATTTA ATTTTCAATG 


3060 


AAGGGAGTGA 


AGAAGAACGT 


GATAGCTCAT 


CTGCTGCAAT 


TGCCGCATGT 


GGCTTACTAG 


3120 


AATTATCTCG 


TCAGCTTCCA 


TTAAATGATG 


AAAAACATGG 


CTATTATGAA 


AAGGTTGCGC 


3180 


TAGAACTATT 


ACAAGCATTA 


GCGGAAAAGT 


ATACAACGGT 


TTTACAACCA 


GAGTCGAACG 


3240 


GCCTATTGCT 


TCATGGTGTC 


TATGATAAAA 


AAACAAATAC 


AGGAGTCGAT 


GAATGCATGA 


3300 


TTTGGGGCGA 


CTACTTTTAT 


TTGGAAGCGT 


TAACTCGTTT 


AGCGAAAAGC 


TGGTATTCTT 


3360 


TTTGGTAATT 


CATGGAGGAA 


AACTCCTAAA 


AAATAATAAA 


TAGCGGAGGT 


AGTACAAATG 


3420 


AAATTTTGGA 


AAAAAGGCTT 


AACAGCGGCA 


GCGCTGTTAG 


CAGTGGCGGC 


AGTAACTTTA 


3480 


ACAGCATGTG 


GTGGTTCAAG 


TGAAAAGAAA 


GCAACTGAAA 


AGAGTGAAGA 


TGGCAAAACA 


3540 


AAATTAACAG 


TAACTACTTG 


GAATTATGAC 


ACGACCCCAG 


AATTTGAGAA 


ATTATTCAGA 


3600 


GCTTTTGAAG 


CGGAAAkTCC 


TGATATCACT 


ATTGAACCGG 


TGGACATTGC 


TTCAGATGAT 


3660 


TATGACACAA 


AAGTAACAAC 


GATGCTTTCA 


TCAGGAGATA -CGACGGATAT 


TTTAACCATG 


3720 


AAAAACTTAC 


TTTCATATTC 


TAATTAcgCG 


CTACGCAATC 


AATTGGTGGA 


TTTAACCGAT 


3780 


CACGTTAAAG 


ATTTAGATAT 


CGAACCTGCC 


AAAGCAAGTT 


ACGAGATGTA 


TGAAATCGAT 


3840 


GGTAAAACCT 


ATGCTCAGCC 


TTACCGTACA 


GATTTCTGGG 


TATTGTATTA 


CAAT/\AAAAA 


3900 


ATGTTTGATG 


AAGCCGGAAT 


TGCCTATCCC 


GATAACTTAA 


CTTGGGATGA ATATGAAGCG 


3960 


TTAgCGAAAA 


AATTATCTAA 


ACCAGAAGAA 


CAAGTATATG 


GTGCCTATCA ACATACTTGG 


4020 


CGCTCAACCG 


TTCAAGCGAT 


TGCTGCTGCT 


CAAAACAATG 


CCAATTTGAT 


TGAACCAAAA 


4080 


TACAATTATA 


TGGAAACTTA 


TTATGATCGC 


GCATTGAGAA 


TGCAAAAAGA 


TCAATCACAA 


4140 


ATGGATTTTG 


GAACAGCAAA 


ATCAACAAAA 


GTAACGTATC 


AATCACAATT 


TGAAAATTCA 


4200 


AAAGCGGCGA 


TGATGTACAT 


GGGTAGCTGG 


TACATGGGGA 


CTTTATTAAC 


AAACATTGAT 


4260 


GATGGCAAAA 


CAAATGTCGA 


ATGGGGGATT 


GCCGAAATAC 


CACAACAAGA 


AAAAGGCAAA 


4320 


GCAACTACCT 


TTGGCTCACC 


GACAAGTTTT 


GCAATTAATA AAAACAGTAA AAAACAAAAA 


4380 


GCTGCTCAAA 


AATTCTTAGA 


CTTTGCTTCA 


GGTAAAGAAG 


GTGCAAAACT 


TTTAGCAGAA 


4440 


GTAGGGGTGG 


TTCCTTCTTA 


TAAAACAGAT 


GAAATTGATA AAATCTACTT 


TGCAAGAAAA 


4500 
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GGAaTGCCTT CAGACGAGTC TCACAAAAAG CCTTTAACCC AGATACAATT AATTTAGAAT 4560 

TCCCATCTGA TAAAAACGGT GCCGCAATTG ATAAAGTATT ACAAGAmGmA CATGATTTAA 4620 

TTATGGTCGG CGmCGAAAAA CCAAAAGATG GTATTGCTAA CATGGAAAAA CGTGTCAAAG 4680 

CAGAGATAGA CTAATTATTA TTTTCTGCAA AAACACAAAA GATTGATGAA AAAAATAAGC 4740 

TGAAATGAGT AACCTACAAT GAATTTCAGC TTATTTTTTT AAATAAAGAA TGTAGCTTGA 4800 

GTTTGGAAGG AGGGCCTGTT AAGTTCACTT AACAGAATAC AATTATGGAA ATTACTTCAA 4860 

AAACAAAAAC AGCAAATAAA TTGAGACGAA AAAATACGTG GACTGCGTTA TCTTTTATCG 4920 

CACCAAACTT TATTGGCTTT TTCCTCTTCA CTTTAATCCC GGTTATCTGT TCACTGATTT 4980 

TAGCTTTTAT GAGTTGGGAT TCTTTTTCAA CCCCTGAATT TGTcGGAATG AAAAATTTCA 5040 

CCAAAATGAT TCATGATGAT ACGTTTTGGA TTTCGTTAAA ACAAACGTTT ATCTATACGA 5100 

TTGGTGTTGT TCCGTTAACA TTGATTTGCT CATTAGGACT AGCGATTTTG TTGAATCGTA 5160 

AAATTAGAGG GATGAAATTT TTTAGAACGG CTTTTTTCTT TCCTTATGTA ACGTCATTAG 5220 

TTGCGATTGC GGTCGTTTGG AATATGCTAT TTCATCCAAC GATGGGACCA ATTAATCAAT 5280 

TTTTGAAATT ATTTATTGAA AATCCTCCTG GCTGGACATC TAGTTCTACT TGGGCTTTAC 5340 

CAGCAATTAT CATCGTTAGT GTGTGGCGCT TTATGGGGTA TTATATGATT TTGTATTTAG 5400 

CTGGTTTACA AAGTGTGCCA AGAGAATTAT ATGAAGCAGC GTCAATGGAT GGCGCGGGTA 5460 

AATGGAATCA ATTTTTAAAT GTTACCTTAC CTTCACTACG ACCAACGACC TTTTTCGTGA 5520 

CTATCATGCT TGTAATTAAC TGTTTCAAAG TGTTTGACTT AGTTCAAGTT ATGACTGGTG 5580 

GTGGACCAGG ACGTGCAACG AATGTGTTAG TTTACGAAAT TTACAATGAA GCCTTTGTCA 5640 

AATTTAATTT TGGTTATGCA TCAGCCATTG CGATGGTCTT ATTCATCATC GTTCTAGCGA 5700 

TTACTGTGGC CCAATTTAAA TGGAATCAAC ATCAAGAGAA AGTTTAGGAG GAGAAAAGAT 5760 

GGAAGCAACT TTTTCAAAAA AAGAAAATAC TCGAAAAGCA AAAACGaAAA AAGCAGAAAA 5820 

AGTCACACCT GGTGGGATCC TTGTAATCAT TCTCTTAACT TTACTTGCGT • TAATTACTTt 5880 

GGTGCCTTTC ATTTGGATGG TTTCAGCATC ATTTAAAnCn AACAATGAAG TTTTCACCAT 5940 

TCCAATTCAA TGGATTCCTA AAAGTTGGCA TCCAGAAAAC TATTCCGTTA TTTGGGAACG 6000 

CATTCCGTTG TTAACTTTCT TTAAAAATAC ATTATTTTTA AGTATTGTGA TTACGATTAT 6060 

TCAACTATTT AGTTCAAGTT TTGCAGCATA TGGCTTTTCA AAAATGAATT TTAGAGGAAG 6120 

AGATACACTT TTTATTACGT ACATCGCAAC AATTGCTATT CCTTGGCAAT CGTATATGAT 6180 

TCCACAATTT ATTATGATGC GTCAATTAGG TTTGACTGaC ACGTTATGGT CACTTGTTTT 6240 

aTTACAAGCA TTTAATGCAT TTGGGGTTTT TTTACTGAAA CAATATTACA GCAGTATTCC 6300 

AGATTCACTA TGTGAAAGCG CGCGGATTGA TGGGTTAAGT GAGTGGGGAA TTTATTGGAA 6360 
AATTATTTTA CCATTAACTA AACCAGCTTT AGCTaGTCTC ACGATTATTA CTTTkGTGAA 6420 
TACATGGAAT GACtATATGG GaCCTTTCAT TTATTTATCT TCTACTGAGA ATAAAACGAT 6480 
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TCAGTTAGGT TTGAAAATGT TCGTCGGTTT ATTTGATGCT GAATATGCCT 
CGCATCAGTG GTTtCTATTT TACCAGTAGG GATTGTCTTT TTAGCAATGC 
TGTTGAAGGT ATTGCTACAT CTGGAATGAA GAATTAGGAG GAAGTTTGAA 
GCAAACCTTT GATAATAATA TCTATATGAA AATTTTTCGC TGGCTTTACA 
CGGTAATCTA GGGCTTCTCT TAGTGAACAT TCCTTTTTTC ATAGCGGTGA 
CATTGATCCA CGAAACCTCC CACTTTTTGT AGTAACGTTG TTGCCGATGG 
GATTGCACTT TTAGGTTTAA TCGATACTTT TAAAGAAGAA AAGGAGCTGG 
AACTTTTTTC CAAAAGTTTC GTCAGTTTGG GTTAAGAGGC TTTCTTATCA 
ATTAGGAAGT AGCATCATCA GTGTGACAGA TATATTCTTT TTTGCCAAAA 
TAAATGGTTT ATTCCATTGA TGGTCCTTCT GCTAATTTTT GGGTTAGCTA 
TGCTTGGTAT TTTCAAGTAC GAAACCCGCA AGCTTCTTTT AAAGATGTGT 
TGTTTACTAT GGACTGCGGA AATGGTATGT CAGCTGTCTA AATGTTTTTT 
AATGTTTGCA ATGATGTTTT TGAAGCCACA GTTTGGGTTT GTTGTAACCC 
CCTAGGGATT ATTTATTTAA ATACTGGGAA ATTGCATGAA AAGCAAAAAA 
ATTTTTACAT AATAAGTAAT CTCGATTTTC AGGAAAATGA ATGAAGGTGG 
TTTGTTCTTG CCTTGTTTAT TAAAGTGGAG GGAGCGAACG AATGAAGAAA 
TTTTAAGTAG CTTTTTGATA GGTGCTATTT TTGTGCTCGT TATTTTTACT 
AAAGTATGAT TAATAACgAa GCmATCCAAG TCgArGAATT AATTGTTwAT 
ATATGAATCG TATA 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1229 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



TAATCATGGC 

AAAAATACTT 

TGTTTACTAA 

TTTTATTTAT 

TTAGCTTAGA 

GGACAGGGAT 

ATCCATTTAA 

GCTTGCTTGG 

CAACGATTGG 

TTATGCTGAA 

TTCGGATTTC 

TACTATTCTC 

CTGTCCTGTT 

AGAATAAATA 

GGTTGCTTAT 

AAAATAATTA 

ACCAGCCAGC 

GGCGATAAAA 



6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7574 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

TGTnTAATGT TCGTGTTTTA TnTTTTAATA ACCTTTTTCG AAAAAATATA CGAACGTTCG 60 

GGTTTAAAAA CGAACAAAAA GATGAAAATG TGTTGAAATA TACGTTTTCC TTGCGTATAC 120 

TGATGAAGTG AAAAAACGAA CGAGGAGTGT ACGAATGAAC AAGAAAGTTC CTTATTTAAT 180 

TGTGGCACCG GGGCTAGTAC TATTATTATT TTTCTTAATG ATTCCATTAA TTACTAGCAT 240 

TTTGCCAACC ATTTTTACTG ATCATGGCTT GACACTAAAT CAGTATGTTA CTTTTTTTAA 300 

AGATGACTAT AATGTTTCAA TTTTCTGGC6 AACGATAAGA GTTTCGTTAA TTGTGACGGG 360 

TATTTCAATC GTGTTGGGCA TTCCAACGGC ATACTTTATT GCTGGAGTTT CCAAAAAATG 420 

GCGTGGCTTT TTAATGGCTA TGACCTTGTT TCCATTATTA ACCAATTCAG TTATTCGAAG 480 
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TTTTGCTTGG ATTAACATTT TAGGAAAAAA CGGTGTAGTG AATACGTTGT TATTAAAAAC 540 

•AGGCTTAATT GAGCAACCGC TGAATTTATT GTATACAGAA TTTGCAATTA TTATCGGTTC 600 

AGTGTATTTA TTTTTACCAA CAATGATTAT GACCTTAGTT GGGGTTATGG AAAACATTGA 660 

AGGAGAAATG CTAGAAGCGG CTGAGACGCT AGGCGCCAAT CCAATGACGG CTTTTCGGAA 720 

AATTGTTTTA CCTTTATCAA TTCCAGGCAC CATTGTAGGA AGTATATTAG TGTTTACAGG 780 

AACTTTAACG GCTTATACGA CGCCACAGCT GTTAGGTGGT AACCAAAAAA TGATGATGTC 840 

GACCTTCTTA TATCAAAAGG CTAATACTTT AGGTGATTGG CAATCGGCGA GTGTTCTTGC 900 

ATTTATTATG ATTTTAACGA CATTAATTGT GATGAAAGGC CTAGACATGG TCGCTAAGAA 960 

AGTAGATCGG AGAGAAGCAA ATsATGCGTA AsAAAAAGGA CTATCTGTTA TTGCAATTCT 1020 

AGTTTTTsTG TTTTtATTTT TACCGTTATG TTTGATTGTG GTGcnTCTTT tGGAaCAGCC 1080 

GCAGCGATTC AATTTCCGAT TAAAGGATTA CTTTAGATTG GTACGCCAAA GCATTACAAT 1140 

CGGAAACGTT TATGGACAGT TTAAGTTAAG CTTGTTGAAT GGGGTATAGC GACTGTTTTA 1200 

GCCATTAATA GTAGGGATCC GGCCTCCnA ^229 
(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9749 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

CTTGGTTATC TGGTATGGCC ATCCTCTGTG TGAGCAAAAA TTATGGAGAT CTATTnGTAA 60 

TATTATTTGT GTGACTCCGT ACCCTCTCCT TGCTGTGGCT GATAAGGGCA TCTATGTGGT 120 

TGAAGCACGA AAGrAGGgtT mCGTGTATTg rTaATCCTGT AGcgTATTaA CaATwACCCa 180 

cATTTGaTTa CCaATTATTT AATTTAAACA ATTATCGTGT ACTTTCAACT AGAACTCCaA 240 

AAAATAATTT TTCAGATCAA aTAGAGTTAG ATATTTATAG TAGAGGAATG GGTGGTATTG 300 

GGTTGCCAGG AGATTTATCA TCAGTATCTA GATTTGTAAA AGCAACTTTT ACTAAGTTAA 360 

ATTCTGTATC AAGAAGTTCA GAATATGAAA GTATTAGCCA ATTTTTTCAT ATTTTAAGTT 420 

CTGTCGAACA ACAAAAAGGA TTGTGTGATG TTGGTGATGA AAAATATGAG TATACGATTT 480 

ATTCTTCATG TTGTAACCTG GAAAAGGGAA TTTATTACTA TCGTACGTAT GACAATAGTC 540 

AAATTACTGC TGTGGATATG AATAAGGAAA ATTTAGAGAA GGATAGCTTA ATTGTTTATC 600 

CAATGGTGGA AACACAACAA ATTAACTATG CTAATTAATT TGTTGTAATT CTTGTTTTTA 660 

TAAGTATTTT- CGAAATGAGT TTAAATAAAC GGGaATAATA GAaGAGTAGA TTTaAGTTAT 720 

TGATAACTAC TAGGTAATTG ATGATAAATT AAmCAAATTA GATGCTTTAA GGAAAGAT6A 780 

CAAAAAAGTA CAGTATTACA TGAAAAAGTG GATTTTTGAG AAAAAAAGGG TTTAACAATT 840 
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AATATCAAAA 
ATAAAAAAAT 
AGTAACTACG 
ATGtGaTAGG 
AAaTGaAAAA 
GTAATGTACC 
ATCCGTTTGT 
AAACGCGCCC 
AGTAGACACC 
AATGGTCTCC 
CCGTCGTTTA 
GTGCTCTGTC 
GGTGCTTTAT 
GTTAAGCCGT 
TAGGTTTCGT 
TAGAGGACTT 
CCACGATCAT 
CCAACAGCGA 
CTGTCCTCTC 
AACAGAACCT 
ACCGATAGCA 
TAAACTTAAA 
AGTTGTTTGA 
ACGAAAACCC 
CGATTATTCA 
ACGTTCAAAA 
CAAAAACAGG 
GATCAGCATT 
GTTTTACACT 
TGTTCACTTA 
GCCGGCGCGA 
CTAATCGACG 
TATAATGGCG 



ACAACTATTC 
GAATAGTAAA 
ATTGTCCGCA 
KGtATTAGtA 
AGTATGtGCT 
AATTTTTACA 
TATAGCGTTT 
TCTCTGAACT 
GAAAATCCGA 
ATGCCCTTAA 
ATAGGTCGAT 
TTAGTATATA 
CGGTCACAAT 
CCGCATCAAT 
CCATTTTCCA 
TGCTGTACTC 
ATAACAATTC 
CAATAATGAC 
TGTCTTTTTT 
TTTTTTGCAA 
ATTTCGTGAA 
ACATGTATTT 
TTTATATTTT 
AGTTAATACA 
ATTAGCAGAC 
AGACGGAACA 
AGAAACAGAA 
CATTTTTAAA 
CTCCTACTTG 
TTATAAATTA 
TGGGTTCCCG 
GCTGGCAGGA 
AAGAAATTAC 



TTATAAAACG 
CACTTGTAAT 
TAATTTTTTt 
tTTATTTAaa 
ATTTTAATGA 
GCAGAAATAA 
GTCGATAAAG 
CCAAATATCT 
AGAGCGTTCC 
TCGTGGAAGA 
GGTCTTGTTC 
AACCCACACT 
TGCTTGCGTA 
TGCACGATAA 
TGAATAGAAG 
TTGCACCCAA 
CTGaACTTCA 
GTCTTTTTTG 
CTCAATTTTA 
TAGAATTACT 
AAATAATTGT 
TATCTTCTTT 
AAATTATCAA 
GATACAGGCG 
GGAACAACAA 
GTAACGGTTA 
AATATTGCAT 
AAGArAaTCT 
AAGTtTCCGA 
AGTtTGGAGG 
GTTCGGACTG 
GCATATCAAT 
TGTTAAAGTC 



876 
AATAGTTGTT 
TTTTTAATAA 
GAAGgTGgTA 
AGrCtAAAAA 
TTTCTGCGTC 
ATTATGCGCA 
TGAAGGGTTC 
TAGGCTGGTA 
ATTTCTTCGG 
GGCTGTACGG 
TATTAAATTG 
CTGTAACTTT 
TCCCGTTTCT 
AGATAATGCC 
GATTGTCTAT 
CGATAAATCG 
CGATAGCTTA 
AATTGTTTGC 
CACTAAAATA 
AATGGGAAAA 
TATTAAGAAT 
CATAGGATCC 
TAAAAGAAAA 
TTGTGATTGT 
AGAAAGTTGA 
AAGGTTCAGA 
TGTCTGTCCT 
AATTAAGTAT 
ATGAAAACTC 
CTTTTATTTA 
ATGTTGAAAC 
GCCATCAAAG 
TCAATTGTTA 



TTTTTGTATT 
GAAgAATAGC 
AAAAAAAtAT 
AtGtGaAGGA 
AGTTTTAGGA 
TGAAAATGAA 
TGTTGCAAAG 
TTCCCATTAA 
TTCTTTTTAT 
AGACTTTGAT 
TTAAGATACT 
CTAAAGGCGG 
TTCGTAACCA 
AACGTCCCTT 
TTTTCTTCTT 
TAGTATGACA 
GATTGTAACG 
CTTTAAAATG 
GATTTTTTGG 
CAAATTAATG 
AAAACTATAC 
TCCCGTTATT 
GCTAGGAAAA 
CGCAGTTGAA 
AGCTAAAGAA 
CGGAAGAATG 
AGGTTCATTA 
TTATAAAAGC 
TATAATGAGA 
TGAAAATTGC 
AAGGTGGTAA 
AAAACGGATT 
ATCAGAATGA 



CATATTTTCT 
TTGTTATAAG. 
AGrGkTATAA 
GkTTGaATCg 
ATGGTGCCCA 
TTACAAATGA 
TTTGAAAATC 
CCTTGATTTC 
ATATTCCTCG 
TUU^TTTATT 
TCACAGTTCG 
AGCAAGaGAA 
GATATCTAAG 
AATTTTGATA 
CCAAAGATCA 
AACATTTATT 
CaGGTAGTAA 
ATTCATTACT 
AAAACTTTGC 
CGATAACCAG 
TAGATATGAA 
GTCATACAAT 
AAGCATTCAA 
GATAGTAAAC 
ATTGGAGCAG 
AAAGTTTTAC 
ATGGTATTAG 
ATGGTCGCAA 
AATGTGAAAA 
AATTGCAGGG 
CGATGTTTTG 
GAAGGCTAAT 
GGTGCCCACT 



900 
-.560 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
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GGTGAGCAAT 


TTGATTTGAT 


TATCTTATTC 


ACGAAGGCAA 


TGCAGTTGGA 


AAAAATGCTG 


2880 


CAGGATGTTA AACCATTAAT 


TGCTGATCAC 


ACAGAAGTTT 


TGTGTCTTTT 


AAACGGTATC 


2940 


GGTCATGAAG ACGTAATCGA AAAATTCGTA CCGATGGAAA 


AAATCTTTAT 


CGGCAACACC 


3000 


ATGTGGACTG 


CTGGTCTAGA 


AGGTCCCGGC 


AAGGCTAAAT 


TATTTGGCAG 


CGGGTCTGTT 


3060 


GAATTGCAAA ATCTAGGTAT 


TGGCCAAGAA GAATCAGCGA AAAAATTGGC 


GGAAACTTTG 


3120 


TCCGCGTCTG 


GTTTAAATGC 


CAAATATTCC 


GACAACATTC 


ATTATTCAAT 


TTATCGTAAA 


3180 


GCTTGTGTCA ATGGTACGAT 


GAATGGCTTA 


TGCACGATTT 


TAGACGTCAA 


TATGGCGGGG 


3240 


CTTGGTGCAA 


CCAAACCTGC 


TCATGACATG 


GTAGTGACTA 


TTGTTAACGA 


ATTTGCAGCT 


3300 


GTTGCTGCCA 


AAGAAAAC6T 


CAACTTAGAC 


ATTCCCGAAG 


TAATCGAACA 


TGTAGAAACT 


3360 


TGTTTTGACC 


CGACTACGAT 


TGGTATGCAT 


TTCCCATCGA 


TGCACCAAGA 


CTTGATAAAA 


3420 


AATAACCGTT 


TGACCGAGAT 


TGACTACATC 


AACGGAGCCA 


TTTCTCGTAA 


AGGTAAAAAA 


3480 


TATGGAGTAG 


TAACTCCTTA 


CTGTGATTTC 


TTAACGCAAC 


TGGTTCATAG 


TAAAGAAGAA 


3540 


ATTCTAGGAG 


CGAAATAGTA 


ATCATTAGAA 


AGAGGATGAG 


AAAATGGAAG 


AAAAATTAAC 


3600 


TCCAAAAATA 


TTTTTAAATA 


AAGTATTAGC 


GGGTACCGCA 


ACGGGGATCA 


TCGTAGGGTT 


3660 


GATTCCTAAC 


GCGGTATTGG 


CAGCAATTTT 


GAAATTATTT 


GGTGAAAATA 


CCTTAGCAGT 


3720 


AACGATCACA 


CAAATGGCAG 


TTATTTTCCA 


ACTAGCTACT 


CCACTATTGA 


TTGGCGCTTT 


3780 


GATTGCGGTG 


CAGTTTGGCT 


TCAAACCAAT 


GCAAATGTTA 


GTAGTTGCCG 


GTGCAGCTTT 


3840 


CGTTGGCTCC 


GGTGTTGTGA 


AATTCAATCC 


TGACACAGCT 


ACTTATATCG 


GTGCTGGTAC 


3900 


CGGAGACATC 


ATCAATACCA 


TGATTACTGC 


ATCAATCGCT 


GTCGGCATGA 


TCTTGTTGAT 


3960 


CGGCGAAAAA 


TTTGGGTCCG 


TGGCAATTGT 


TGCAACACCA 


ATCGTTGTTG 


GTATCGGCGC 


4020 


TGGTCTAATC GGCTATTATT TATATCCTTA CGTAACCAAG ATTACTGCCG CTATCGGTGA 


4080 


CTTAATTAAT 


ACCTTTACTA 


CTTTGCAACC 


AATTTTGATG 


TCAATCTTGA 


TTGCATGTTC 


4140 


CTTTGCTTTC 


TTGATTATCT 


CGCCGATCTC 


TACAGTTGCC 


ATTGGTATGG 


CAATTCAATT 


4200 


GAACGGTGTT 


TCTGCTGGTG 


CAGCCGCTAT 


GGGTGTTGCC 


GCAACAACAG 


TTGTCTTGGT 


4260 


AGTTAACTCT 


TGGAAAGTCA 


ACAAACCAGG 


GGTTACTTTG 


GCCATCGCAT 


TGGGTGCTAT 


4320 


GAAAATGATG 


ATGCCAAACT 


TGTTTAGGAA ACCAATTATC 


TTGGTGCCAT 


GTCTATTCAC 


4380 


CGCTATCATC 


TCAGCAATTC 


CAGTAGCGTT 


ATTCTCAGTC 


TCAGGGACTC 


CTGCTTCTGC 


4440 


TGGTTTCGGT 


TTGGTAGGGT 


TGGTAGGTCC 


GTTGGCTTCA 


TTGGACGCCG 


GTTTGAGTAT 


4500 


TATTCTACTA 


TTAATCAGCT 


GGTTTGTTGT 


ACCAATCGTA 


GCTGCTTTCG 


TAGGTCAAAT 


4560 


CTTATTTGAA 


AAAATCTTGA 


AATTGTATGA 


TCGTAAAGAT 


GTTTTCGAAT 


TTTTAGGATA 


4620 


GTCAACTAGA- AAAAGTGTGA GCCTTGTGCT 


CACGCTTTTT 


TGTGCAATTG 


AAGTTGGTGA 


4680 


ATCTCAGCGA AAAGAGAAAG GTACCTTAAA TAGTGTGTTT CTGGTCAATA GCACAATGAA 


4740 


ATATCTAAAT 


ACATTATGAA 


CTGCTTTTCG 


TTTTATGAAG AGCAGTTTGT 


GCTATTTGAG 


4800 
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AAATTGCTCA 


ATTTAATCAT 


TAAGTAGTTT 


CATCAATGGT 


GACTATGCTA 


CTGCAATTCT 


4860 


TGTTGTACCA 


ACCGGGAGCG 


TCAATAATTT 


TGTGTAAATA 


ACTCGTCCTT 


CTGCAAAATA 


4 920 


ATGGGTTACT 


CAGTAAACAT 


TGAAGCTAAT 


GTATCCGTCA 


CTTGTTGGAA 


GCCTTTATGG 


4980 


CTTCTATTTA 


AGAATTTTTG 


ATTGTATGTA 


TCAAAGATGG 


ATACTAGGAA 


ACGCTCTAGT 


5040 


GATTCTTCAT 


TTTGAAACTG 


CTCTTTTCTG 


CGGCTGTATT 


TCTTGATTTG 


TTTATTGAAA 


5100 


GATTCGATTA 


AATTAGTTGA 


GTAAATGCTT 


CGGCGGATAC 


CAGGCGGAAA 


CTCATAAAAA 


5160 


GTCAATAAAT 


CTTGATTTTT TAtyCAnkGC 


TGCGTCACTT 


TAGGATAAGT 


TTTCTGCCAC 


5220 


TTCTCAATCA 


TACTCCCTAA 


AAAGGTATTT 


GCCTCTTCCT 


TTGAGCTAGC 


TTGATAAACA 


5280 


GCCTTAAAAT 


CATCACAGAT 


TTCTTTTCGA 


TCTTTGaCAC 


GTACTTTATG 


AGCAATATTA 


5340 


CGGTAAACGT 


GAAcACAGCA ATGTTGGTAT 


TTAGCtTTAG 


GATAAATCTG 


GTGAATCGTA 


5400 


TCTTTCATGC 


CTTTTAAGCC 


ATCGGTAATA 


AACAGTAAGA 


CATCATGAAC 


CCCTCTGGAG 


5460 


TAAATGTCCT 


GTAACAACTC 


ATTCCAAACG 


TAGGTCGATT 


CAGTCGGAGC 


AATCGCATAA 


5520 


CTTAGTACTT 


CTTTAGTGCC 


GTCTTCTCGT 


ATACCAATGG 


CAATATAGAT 


TGCTTCTTTG 


5580 


GAGACGGTTT 


GCCGTTTTAA 


AGAAATATAA 


GTAGCATCCA 


TGAAAATAGC 


GACATACTTA 


5640 


TCATTTAGAG 


TTCTAACTTT 


AAATGCATTT 


ACTTCTTCAG 


TCTGCGATTT 


CCGACATGGT 


5700 


AATACCTTTT 


CGGAAGAGGT 


GAATGACGGT 


CTCCTCTAAG 


GTATCATTCG 


TCCGTCTATA 


5760 


AGCAGGAACA 


GTCTGTTGCT 


TGAACTCGCC 


GTTGCGGTCG 


CGCGGAATTC 


TGTGAATGTG 


5820 


TTGTGGTAAC 


GGTTCTGTTG 


CAAAGTTTTA 


AATCTACTAT 


CAAATAAGGT 


AGAATAATAG 


5880 


AAAAAGATAG 


CAGGAGGAAT 


GACGGTGAAT 


CATTTTAAAG 


GAAAGCAATT 


TCAGTAGGAT 


5940 


GTGATTATTG 


TAGCCGTGGG 


CTACTATCTT 


CGTTATAACC 


TTAGCTATCG 


TGAAGTTCAA 


6000 


GAAATCTTAT 


ATGATCGTGG 


CATTAACGTT 


TCTCATACGA 


CGATTTATCG 


TTGGGTGCAA 


6060 


GAATATGGCA 


AACTACTCTA 


TCAAATTTGG 


AAAAAGAAAA 


ATAAAmAATC 


CTTTTATTCA 


6120 


TGGAAAATGG 


ATGAAACGTA 


CATCAAAATT 


AAAGGaAAAT 


GGCATTATTT 


GTATCGAGCC 


6180 


ATCGATGCAG 


ATGGTTTaAC 


CTTGGaTATT 


TGGTTACGTA 


AAAAACGGGA 


CACACAAGCA 


6240 


GCCTATGCTT 


TTCTTAAGCG 


GTTAGTGAAG 


CAGTTTGATG 


AACCGAAGtT 


GtAGTCACAG 


6300 


ATAAAGCCCC 


CTCTATTACA 


AGTGCCTTTA 


AGAAACTAAA 


AGaATACGGC 


TTTTATCAAG 


6360 


GGACAGAACA 


TCGTACCATT 


AAATACCTGA 


ATAATTTGAT 


TGAACAAGAC 


CATCGTCCAG 


6420 


TAAAGAGACG 


CAATAAATTC 


TATCGAAGTT 


TACGCACTGC 


CTCTACCACG 


ATTAAAGGCA 


6480 




TCGAGGATTA 


TATAAGAAAA 




AGGCACTCTC 


TTCKGGTTTT 

X X VxV7s3V7 M. A X ■I. 


6540 


CGGTCTGTAC 


TGAAATCAAG 


GTATTATTGG 


GAATCCCAGC 


TTAAATCATA 


GATACCGTAA 


6600 


GGgATTTTAT 


TCTTTATTTA 


AAACTTTGCA 


ACAGAACCAA 


TAATTTGATC 


TACTCATTTC 


6660 


TAATTCTTGG 


GAGAATTTTT 


CTTTTGATTT 


GGTACCTCTC 


ATCTTTTTTA 


AGCTTTCTCT 


6720 


AAATGCTAGT 


TCCAAGTTTC 


TCCCTCCTGT 


TTATGTAAAT 


ATTTTGCACG 


TTTTCCATTT 


6780 
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AATTATATCG 


TTACACCTTG 


TAAATCACAA 


GTGATTAATC 


ACAAATCACT 


TGTGATTAGT 


6840 


GAXTGT.TAT.T . 


GGTGTTGCCT 


AGAACGATTC 


TAAGCACATG 


TTTAATTTTA 


AAGTGGGCAA 


6900 


TTATCTTTAA 


AATGTCCTAC 


AATGATTCAA 


TAAAAGCATG 


GCTGCTAGTG 


AAACTTGCGA 


6960 


CCATGCTTTT 


AAAAATACTT 


AATGGGGTCC 


CGAGCGCTTT 


AGCTCCTTGG 


AAGCTGTCAG 


7020 


TAGTATACCy AATAATTTAT 


CTACATtCCC 


yTTAGTAACG TGkAACTTTC CAAATTTACa 


7080 


AAAGCGACTC AtAGmmTtAT 


TTCCTCCCGT 


TAAATAATAG 


ATAACTATTA 


AAAATAGACA 


7140 


ATACTTGCTC ATAAGTAACG 


GTACTTAAAT 


TGTyyACTTT GGCGTGTTTC ATTGCTTGAT 


7200 


GAAACTGATT 


TTTAGTAAAC 


AGTTGACGAT 


ATTCTCGATT 


GACCCATTTT 


GAAACAAAGT 


7260 


ACGTATATAG 


CTTCCAATAT 


TTATCTGGAA 


CATCTGTGGT 


ATGGCGGGTA 


AGTTTTATTA 


7320 


AGACACTGTT 


TACTTTTGGT 


TTAGGATGAA 


AGCATTCCGC 


TGGCAGCTTA 


AGCAATTGCT 


7380 


GAATCGAGAC 


TTGAGTGTGC 


AAGAGCAACC 


CTAGTGTTCG 


GTGAATATCC 


AAGGTACGCT 


7440 


TGTAGAATCC 


TTCTTCAACA 


ATCAGATAGA 


TGTCAGACGC 


ATGGCTTTCA 


AAAACCACTT 


7500 


TTTTAATAAT 


TTGTGTGCTT 


AAATGGTAAG 


GAATATTCCC 


AACAATTTTA 


TACCTCTGTT 


7560 


TGTTAGGGAA 


TTGAAACTGT 


AGAATATCTT 


GGTGAATTAA 


AGTGACACGA 


ATGTTCAGTT 


7620 


TTAATTTTTC 


TGACGATAAG 


TTGAATAGAT 


GACTGTCTAA 


tTCAATAGaC 


GTTACCTGTT 


7680 


TACTTATTTT 


AGCCAGTTTC 


GTCGTTAAAT 


GCCCTTTACC 


TGTTCCAATT 


TCGTAAACGG 


7740 


TAwCGGTTTC 


TTTTAAATTC 


AATTGTTTTA 


TTATTTGGTT 


GAGTACTTTT 


TCACTCGTTA 


7800 


AAAAGTTTTG 


AGAATATTTT 


ATATTTTTGT 


TCcATGTAAT 


CACTCCTGAA 


GTGATTACAT 


7860 


CTATAAATAA ATACAGAAGT 


TAAACGATTT 


GTTTGTAATT 


TTAGTTATCT 


GTTTAAAAAG 


7920 


TCATAAGATT 


AGTCACTGGT 


AGGAATTAAT 


CTTUVCGTATT 


TATTTATCTG 


CGTAATCACT 


7980 


GTTTTTAGGG 


TTCTGTTGCA 


AAGTTTTAAA 


TAAAGAATAA AATCCCTTAC 


GGTATCTATG 


8040 


ATTTAAGCTG 


GGATTCCCAA 


TAATACCTTG 


ATTTCAGTAC AGACCGAAAA CCCGAAGAGA 


8100 


GTGCCTTCTT 


TTCGGGTTTT 


CTTATATAAT 


CCTCGAATGG 


CTTCCATGCC 


TTTAATCGTG 


8160 


GTAGAGGCAG 


TGCGTAAACT 


TCGATAGAAT 


TTATTGCGTC 


TCTTTACTGG 


ACGATGGTCT 


8220 


TGTTCAATCA AATTATTCAG 


GTATTTAATG 


GTACGATGTT 


CTGTCCCTTG 


ATAAAAGCCG 


8280 


TATTCTTTTA GTTTCTTAAA 


GGCACTTGTA 


ATAGAGGGGG 


CTTTATCTGT 


GACTACAACn 


8340 


TTCGGTTCAT 


CAAACTGCTT 


CACTAACXTGC 


TTAAGAAAAG 


CATAGGCTGC 


TTGTGTGTCC 


8400 


CGTTTTTTAC 


GTAACCAAAT 


ATCCAAGGTT 


AAACCATCTG 


CATCGATGGC 


TCGATACAAA 


8460 


TAATGCCATT 


TTCCTTTAAT 


TTTGATGTAC 


GTTTCATCCA 


TTTTCCATGA 


ATAAAAGGAT 


8520 


TkTTTATTTT 


TCTTTTTCCA 


AATTTGATAG 


AGTAGTTTGC 


CATATTCTTG 


CACCCAACGA 


8580 


TAAATCGTCG 


TATGAGAAAC 


GTTAATGCCA 


CGATCATATA 


AGATTTCTTG 


AACTTCACGA 


8640 


TAGCTAAGGT 


TATAACGAAG 


ATA6TAGCCC 


ACGGCTACAA 


TAATCACATC 


CTGCTGAAAT 


8700 


T6CTTTCCTT 


TAAAATGATT 


CATCGTCATT 


CCTCCTGCTA 


TCTTTTTCTA 


TTATTCTACC 


8760 
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TTATTTGATA 


GTAGATTTAA 


AACTTTGCaA 


CAGAACCGTT 


CATGGGTATA 


GCATGAATTG 


8820 


TTTAACTGTC 


CTCTAGGATT 


CCCGTCCTAT 


QGCAGTQGaX. 


TCTTATTTTG 


TTAATTGTCT 


8880 


TGTAAGTTGC 


CTTTATATCG 


CACTAAGTTT 


AAAAAAGTAA 


AAAGATTTAT 


CTCTCAGATA 


8940 


ATTGCTATAT 


CTyVGGTAACT 


ACGAGATTCT 


AAACTAATTA 


TCCTACAAGT 


GTGACAACAA 


9000 


ATAAAGTTAT 


CAATGGAATA 


ATTTTTTTCA 


TAGCTCGTTT 


GCTCATTCTA 


ATAGCTCTCC 


9060 


TCTTATTTAA 


ACTTAACTTA 


CAAATTAATA 


ATAGCATAGA 


TTAAAATAAA 


AATAAACAAA 


9120 


AATGCAAAAA 


TTTGCGCTTT 


TTCATCAAAA 


AAATTTATTA 


CATAATAAAA 


AAACCAAAAA 


9180 


ATCATTCCAC 


GAATGCACTT 


TTTTGCGCCT 


CGGAAGGAGT 


ATTTCAAAAT 


GTTTCTTTAC 


9240 


GAACTATTCA 


AAGAACAGCG 


AAAACTCTAT 


AATTTTACTC 


AAGAAGAATT 


TTATGAAGGA 


9300 


ATATTCAAAA 


AAAGAGCTGC 


CTCTTCTTTT 


GAGGTACACA 


ACACGCATGA 


TCTCAAAGTT 


9360 


AAAGATTTAC 


CTGTTTTATC 


AGATAGAAGC 


ATGATGAGTA 


TCTTAGAAAT 


AATTCATTAT 


9420 


GCTAAAGAAG 


AATTTATTTC 


TCCtTACGAC 


GAAGACTTAA 


ATTCATTGTT 


TGATATATTC 


9480 


CAGAATAATA 


CCGAAAAAGA 


AAATAAAuAL. 
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9540 


GAGCTAAAAG 


AACATTCTAT 


AATATATTGG 


AATCTTTACT 


TAATAATAAA 


AATCCAATGT 


9600 


TCTGAATACG 


ATTCAAGAAT 


TGTCCCTACA 


GACTCACAAG 


ATTTATCTGA 


GTTAAAAAAA 


9660 


ATGaTTTTAT 


CAAaACaAAA 


GTTTACTCTT 


TACGACTATA 


AAATTGTTAC 


TAATCTATCT 


9720 


CTTGTATTTT 


CTTACAAAGA 


ATTGCAACC 








9749 



(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

AAAATCAGAT ATCTTATAAn nGnAATCAAG CACGTATTCG CGATAAGGGT AAAAAATTAC 60 

GTAAGAATCG GCACAGCGCT TCTTTAAAAA GTAAATTAAA CTTTGACAAC TTTTTCGAAT 120 

AATGGTACTG TTAAGAAGTA AATTTGTGAA TAGAGGAGGT CGAAGAACAT GGCAACAAAG 180 

aAAGCAGCTC TTGCTTGTTC CGTTTGTGGC TCTCGCAACT ATTCTAAATC CGTTAGTGAA 240 

GGTAAACGTG GCGAACGCTT AGAAATCAAT AAATTCTGTA AATATTGTAA TCAATACACG 300 

TTACATAAAG AAACGAAATA GATTTAGAGG AGGGTATCAA CATGAAATTT TTCCGCAGCG 360 

TTGCTGATGA GATGAAGCAA GTAACTTGGC CAACAAAAAA ACAATTGCGT AAAGATACAT 420 

TAGTTGTAAT TGAAACATCT ATTTTATTTG CAGCATTGTT CTTTATCATG GaTACTGTTA 480 

TCCAAACGGC ATTTGGcTGG ATTTTAAAAT AATTAATTCT AGTATTTCCC GACGAGGGGT 540 

GCTATAATAG TATTCGAGAA AAAGCTTCGG GAAACTGAGC TTTTTTTATT TGCTCGAAAA 600 

AAATAAACTA AAACATTTAA AGGAGCCG/UV TCAGTAATGG AAACTTTTGA AAGAAATTGG 660 
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TAcGTATTGC ACACCTATTC AGGCTATGAA AATAAAGTAA AwGCAAACAT. TGAATyACGT 


720 


GCACAAAGCA 


TGGGGATGGG CGATTATATT 


TTTCGTGTCG TTGTACCAGA 


AGAAACAGAA 


780 


AAAGAAGTCA 


AAAATGGTAA AGAAAAAGAA ATCGTTCATA AAACTTTCCC tGGTTATGTG 


840 


TTAGTGGAAA 


TGATTATGAC CGATGATTCT 


TGGTATATTG TTCGTAACAC 


GCCAGGCGTA 


900 


ACTGGTTTTG 


TAGGCTCACA TGGTGCTGGT 


AGTAAACCCG CTCCTTTATT 


ACAAGAAGAA 


960 


ATTAACCACA 


TTTTACGTTC AATTGGCATG 


AGCACCCGTC AATCAGACTT 


AGAAGTAGCG 


1020 


CTAGGCGATA 


CAGTGAAAAT TATCGAAGGT 


GCCTTTTCTG GTCTTGAAGG 


TGTTGTGACA 


1080 


GAAATCGATG 


AAGAAAAACA AAAATTAAAA 


GTAAACATCG ATATGTTTGG 


TCGTGAAACA 


1140 


AGTACAGAAT 


TAGACTTTGA ACAAGTCGAT 


AACATTGATT AATAAACAAG 


CTGACAAACT 


1200 


ATTGTTCAGC 


TGATTGATAC AGATTGTAGG 


GTCTGGAAAA GCAGTCAACT 


GACTTCTTTT 


1260 


CC7UVGCCCTT 


TTTTAGTTCA TCGCAGAAAG 


GAtGrATU^AA AATGaACATG 


CCCAAAAATA 


1320 


TCnGTtATTT 


TTCTTTGCTA ATGGGTCTtG 


tTCTATTATT AAGTGCTTGc 


CAAATTGGGG 


1380 


CaACTACGAA 


GGATGACAAC CAAGCCGCCa 


CAAAAGAAGC AACTGTTGAG 


TTAAACCGCA 


1440 


CAACAACACC 


AACGCTTTTT TTTCaTGGTT 


ACGCAGGAAC TAAAAATTCG 


TTTGGCTCGT 


1500 


TACTGCATCG 


CTTGGAGAAA CAAGGTGCCA CAACTCAAGA ATTAGTGCTA CTCGTTAAAC 


1560 


CTGATGGGAC CGTGGTTAAA GAGCGAGGAG CTTTAAGTGG CAAAGCGACG 


AATCCCAGTG 


1620 


TTCAAGTTCT 


ATTTGAAGAT AATAAAAACA ATGAATGGAA TCAAACAGAA 


TGGATAAAAA 


1680 


ACACATTACT 


CTATTTACAA AAAAATTATC AAGTGAACAA AGCCAATATT 


GTCGGGCACT 


1740 


CTATGGGTGG 


TGTTAGTGGT TTACGTTATT 


TAGGAACCTA TGGGCAAGAT 


ACATCGTTAC 


1800 


CTAAAATTGA 


AAAATTCGTC AGCATTGGAG 


CACCTTTCAA TGATTTTATT 


GATACGAGTC 


1860 


AACAGCAAAC 


CATCGAAACG GAACTAGAAA 


ACGGCCCCAC AGAAAAAAGT 


AGCCGCTATT 


1920 


TGGATTATCA 


AGAGATGATT AATGTTGTTC 


CAGAAAAACT GCCCATTTTA 


TTAATTGGTG 


1980 


GTCAATTAAG 


TCCAACAGAT TTAAGTGATG 


GAACGGTGCC GTTATCTAGT 


GCCTTAGCAG 


2040 


TCAACGCCTT GCTAAGACAG CGAGGAACTC AAGTCACTAG CCAGATTATT 


AAAGGAGAAA 


2100 


ATGCACAACA 


TAGTCAATTA CATGAAAATC 


CTGAAGTAGA TCAATTGCTA 


ATCGAATTTC 


2160 


TATGGCCGAG 


TAAAAAaTAG CAaAAAGGCA CATGTTCCAT AAAAAATGGA ACATGTGCCy 


2220 


TTTTCTATAC 


TTTGGCTAGC GAAATAGCTT 


TTTTCAGCTA TAATAAATAA 


AATTATTTAT 


2280 


TTTTATGAGA 


TATTGGAATG GAATCCTCAT 


GATTTAAATA AGCACAAATC 


AAGGCTTCAA 


2340 


AGGTGATTAA 


TGTTTGGCCA AAAAACATTT 


GTGGCTGAGC GACACTTGTT 


GAAAACGGGC 


2400 


TGTAAGAGTC 


AGTTGAAAAA ACGAGATCTG 


CCAGTCGACC AAGAGTTGAA 


TTTTTGGCGC 


2460 


CAACAAAAGA 


AATGATAGGA TGTTTTTTCT 


CTTTCGCAAG CAGAGCTGTT 


TCTTTTAAGT 


2520 


ATTTTTCTTC 


ACCAGAATGT GAAAGGATGA TAAAAAGAAA ATTCTGATTA 


TTTTGACTAT 


2580 


TTATCGAATG 


AGTATAGGCT GTTGTGATGC 


TGGGGATGCC ATGAAAATTG 


AAAACTTCAC 


2640 
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TAATGTAAGC 


CGCCAAATGT 


CGTGAAAAGT CATGACCTAA 


AATAACAAAT 


AAATGACTCT 


2700 


TGTGTTTGGC 


AAGCAGTCGA 


CAAAACTCGT TGGTTGTTTC 


AAATGAAGGT 


GCTGTTTCAA 


2760 


AAGGGATTGG 


CTGAGAAAAA 


TGGGCTTCTT TTAATTTGTA 


AATAAGTTCA 


TTATAACCTG 


2820 


ATAAATTTAA 


TTTTTTGGCT 


AAACGAATGA TGGAACTAGG 


CGAAGAAAAG 


CAGTGTTTTG 


2880 


CAACTTCCCG 


AATACTCAAA 


TCGGCCGCAT TCGTACTTTG 


TTGATCCAAA 


TAATACAAAA 


2940 


TCTGaCTTTC 


TGTTTTCGTT 


AAATGaTACG TTTCTTTTAA 


GTAATCTAAG 


TTCATGAGCA 


3000 


AACCTCCTTT 


ATCAGATTAT 


AAATAATTTT GTGGGAAAAA 


GGTATCAAAA 


AATAAAAAAA 


3060 


TGAGGTGTTT 


TTATGAAAAG 


GGCAACAAAG CAAAGGCTGT 


CTTTGGCAGC 


AATCATGGTT 


3120 


CTACTTCTCT 


CGGGCTGTGG 


AAGTGTTGGG AAAGAAACCA 


AAAAGCAAGA 


ACAACAGGTA 


3180 


TTACGGGTCG 


GGATTGATTC 


GGAATTATCA ACGGCAGACG 


TGTCGTTGGC 


AATGGATAAT 


3240 


ACCGCAGCAG 


ATGTAATGAG 


CCAAGTAGGG GAGGGACTTT 


TCTCCTTTGA 


CGAAAAAGGA 


3300 


GAAGCGAAAC CAGCATTGGC AACTGAAAAA GTACAGCCCT CCAATGATGG TTTAAGCTAT 


3360 


ACTTTTACGA 


TTCGAAAAGA 


TGCAAAATGG AGTAACGGCG 


AGCCAATCAC 


AGCAAATGAT 


3420 


TTTGAATACT 


CTTGGAAGCG 


CACAGTGGAC CCAAAAACAG 


CTTCCCCGCA 


AGCGTATTAC 


3480 


TTTGAAGGGT TAAAAAATTA TCGTGCTATT GTTGACGGTA GCAAATCTAA AGAAGAGTTA 


3540 


GGGGTAACAG CCATTGATGA CCATACCTTG GAAGTAGAGC TAAGCTATCC 


TATGAGTTAT 


3600 


TTTCAACAAT 


TATTGGCGGT 


ACCAGCTTTT TATCCTTTAA ATGAAGCATT 


TGTCGAAAAA 


3660 


ACGGGCAAAA ACTATGGTAC 


ATCAGCTGAG TC7VACACTTT 


ACAATGGCGC 


CTTCACATTA 


3720 


GAAGGTTGGG ATGGCACGAA TAATACTTGG TCCTATGTGA AGAATA7UVAA TTATTGGGAT 


3780 


CAAGCGAATG 


TTTCGCTAGA 


TAAGGTGGAT GTCCAAGTAG 


TTAAAGAAGT 


CAATACTGGG 


3840 


AAAAATCTTT 


TCGAAGGGAA AGAATTAGAT GTTGTAAAAA 


TTTCTGGAGA 


7VATTGTTGCA 


3900 


CAAGAACAAG 


GCAATGCAGC 


TTTGAAAATT CGTGAAATTC 


CTGGAACGTA 


TTATATCCAA 


3960 


TTAAATACGC 


AAAAAGATCT 


TTTGGCAAAT AAGAATGCAC 


GTCGAGCAAT 


AGCATTATCA 


4020 


TTGAATTCTG 


AGCGTTTAGC 


TAAAAATGTT TTAAATGATG 


GCTCAAAAAA 


AGCACTTGGC 


4080 


TTCGTGCCAA 


CAGGTTTCAC 


TAATCAAGAA ACGCAAAAAG 


ATTTTGCAGA 


GGAATTAGGA 


4140 


GATTTAAATC 


CTAGTGAACC 


AGAAAAAGCG AAAGAGTTAT 


GGCAAACGGC 


TAAAAAAGAA 


4200 


TTAGGAATTG 


AAAAAGCGGA 


GcTAACGATT TTAAGTTCGG 


ATACAGAAAA 


TGCTAAAAAA 


4260 


ATCAGTGAGT 


ATGTTCAaGG 


AGCTTTAGCA GATAATTTAG 


AAAATTTAAC 


AGTCAATGTT 


4320 


TCACCAGTTC 


CTTTTAATAA 


TCGTTTAGAA AAAAGTCGCA 


GCGGAGATTT 


CGACATTGTG 


4380 


GTTGGTGGCT GGACGCCaGT ATATGCTGAT CCAATCGATT TCTTAAACTT ACTGCAATCA 


4440 


AAAAATTCCA ATAATTTTGG 


TAAATGGTCT AATAAGACCT 


TTGATCAGTT 


GCTTCAA6AA 


4500 


GCAAACGT7\A CTTATGCAAA TAAATATGAA GAACGTTGGA AAACATTACA AAAAGCGGAT 


4560 


CAATTGGTTG 


CGGAAGAAGC 


CCCCCTAGTT CCTCTTTATC 


AATTAACAGA 


AGCACGCTTA 


4620 
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GTGGCCGATT 
GTCTCTATCG 
AATTCAGGAA 
CCTTCAAGAC 
TCTTGAGAAG 
CGAAGTGGGT 
AGGTGGCTGG 
TACCTATCTG 
TCAGCGACCG 
AGTAATTGAA 
TTACAACGAA 
TTGCTTAGTG 
GACAGTAACT 
CAATCCTGAT 
AGAGATGATT 
GATTACGAAT 
TGTTCAAGTT 
AGGAGCCCCG 
TGTAGATTTT 
AGATGCTGAC 
ATTTATGTAT 
GAAGACACCG 
ATTCAAAAGA 
CGTGGATTGA 
AGCGGCTTTC 
TTTTTCTTAG 
AGCGCTAGTG 
AATGCAGcTT 
TTAGAGGAAC 
GAATCTCTGG 
GTAGCAACTA 
TTGCAAACAC 
GAAATGTTAC 



CTGTCCAAAA 
GCGACAAGTA 
CTTTCAAATG 
GCAACTGCGG 
AAAAATAATC 
TTTATGATTC 
GTCCCTAATA 
CCAGGCGTAG 
TTAACCATTG 
ACTTACAAAA 
CAGACAGATC 
GATGTCATGG 
GCACAAGAAG 
TTAGCTATTG 
CAATCAGCAA 
CCGGAATTTC 
TCTGTTCGAA 
ACGATTGTTG 
CAAGATTACC 
AAAATTCAAG 
CATGCGATAT 
TTTTATTCCG 
GTGCCAAAAA 
ATTTAGCTTA 
CTCGCATGGA 
GTTATCATGC 
TTGCTTATGA 
ACGCTAGCGA 
AACTTTTTCA 
GACGTTATGC 
CAAGTCAGGT 
CTGCAACTGt 
CAATGATTAA 



TTTAGTCTAT 
AAGGTTGGGA 
CTAATGGTGT 
ATTTTACTAT 
TAAGTGAAGG 
AAGCAATTAA 
CGATTTCTGC 
TTACTAGTCG 
CTGATTTGAC 
TCGATCTTGG 
TTTTTTTAGG 
AGGAGTTGAA 
AAGTTGGCGA 
TTTTCGAAGG 
TGGGAAAAGG 
AAGAGTATGC 
6TGGTGGTGG 
TAGGAATTCC 
AAGCGGCGAA 
CACTGGTTCA 
TGAAGGATTA 
AGAATTATAT 
TGAACAAATT 
TGCGCGCTTG 
TTATATGATG 
CGGTATTGGT 
TTTAAAAATA 
ATTAGGTGTC 
AGAAAAAATG 
AATCAAAAAT 
GTTAACTTCC 
AAAATTGCAA 
ACGCATTGAT 



883 

GGTCCATTAG 
AAAAATGAAA 
TTCAGGCTTT 
TCAACGGCTA 
TCCAGTAGTG 
AGAAAACGGC 
ACAAAAAGTT 
GCCCCCTCAT 
GATCGACATT 
TGCACCGGTC 
AAAAGCTTTT 
AGAAGAGACG 
AAGAGGAGCA 
CTGCCCGGCT 
ACCGATGTTA 
ACTAGAGATT 
AACCAATGGC 
TGTCCGTTAC 
AGAATTAGTC 
GCCATTGTCA 
GCAGGTATTG 
CACCAACATG 
ACAGAAATTA 
GCAGAAATGG 
AGTGGCTTGG 
AAACAAAAAG 
AACGACTTAG 
CCTGTTGGTT 
ATGCCCGAAG 
AGACCCATGC 
TTTGCGTTGT 
TGTGTGACGA 
GGTAGAACTG 



GTTCAGGCTA 
AAAAGTCAAG 
GAAACAGAAG 
GATTCCATTA 
CTTTTTGATG 
TTGTTGCGTT 
CGGATTCGTA 
TTTATGACAC 
GGTGCTACCT 
ATTCCAGATG 
GATTGTCGCA 
CTTCCATTCA 
CTTATTGCCG 
GATGATACGG 
CGTTATTTTG 
GCTAAAATAC 
ATGGCGATTA 
GCTCACACAC 
ATTCAATTAA 
AAGGAGTGGA 
CGACATTTGA 
TGGCTTGGTT 
CTATTGCCGA 
ATGAACGAAT 
ATAGTAGCTA 
GGAATATGGA 
CGATGAATGA 
TAATTATCGG 
TTCCCTTTGT 
AACAAGTTCG 
CTGAATTACC 
CAGCACAAGC 
TTTCATTTGT 



TTACAAATCA 
CTGTTCAATT 
TCGTTAGAAT 
AAAATTTATA 
CTCATAGTGA 
TTTTTCCGTT 
ACCGGGAAGG 
CAGAGGAACG 
CAAAAGAAGA 
TGACGTGCTG 
TTGGCTGTGC 
AGTTAGTTGC 
CAAAACAAGT 
CAGAAACGCC 
ATGTTTCTAT 
AC7VAAATTCC 
CACAAGTTCA 
CCCATTGCTA 
TCAAAAATTT 
ACAAATGAAG 
TATGGAAAAA 
AATTGAAGGG 
TTCGCATAGC 
TTGCTTAGTG 
TGATGTGGTC 
TCATGGGTAT 
AACGACAATT 
TGAATCAGGC 
TTCCACCAAA 
GGAAGCTATA 
CCGATATGCC 
AGATCGTATT 
AGGTGAAACG 



4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 
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ATGAAAGATG 


TCATGAATGG 


AATTGTCGCT 


GTGGTTGGTT 


TAGGCGGAAC 


AAGTTATTAA 


. .6660 


AGTAAGAAGC 


TTACGGAAAA AACGAATGGA 


CATTTTTTTC 


TGTAAGCTTC 


TTGCTATAGT 


6720 


TATTTCTATT 


TTTGTTTCTC 


TAGAATTTCT 


TTGCTTAAAC 


GTTGTGCAGT 


TGGTGTCACC 


6780 


GCAAGACcGC 


CTTCAGCGGT 


TTCCTTGAAA ATTTGTGGCA 


TTTGGCGTCC 


CACTTGGTAC 


6840 


ATTGCGTGGA 


TGACTTCATC 


TGGCGGAATG 


ACACTTCGAA 


TCCCCGCTAA 


AGCCATGTCA 


6900 


GCTGAGATGA 


AGGCTTGAGA 


AGAACCTAAT 


GCGTTTCGCT 


TCACGCAAGG 


GACTTCAACA 


6960 


AGTCCCGCAA 


CAGGATCACA 


AATGAGGCCC 


ATCATATTTT 


TTATCGTAAT 


GGCCACAGCT 


7020 


TGAGCAGATT 


GATCAGgCGT ACCGCCAGAA GCCGCAACTA 


AAGCGGCACT 


TGCCATTGCA 


7080 


CTAGCAGAAC 


CGACTTCAGC 


TTGACAACCA 


CTTC6GCACC 


ACTGATTGAG 


GCATTATTGG 


7140 


CAATCACTAA 


GCCAAAAGCC 


CCAGCTGTAA 


ATAGAAAATC 


CAATTGTTGT 


TGATGCGTTA 


7200 


ACTTTAAGCG 


ATCAATAACT 


GCCAACAAGA CGCCAGAAAC 


TACTCCGGCA 


CTGCCTGCGG 


7260 


TTGGTGTAGC 


ACAAATCAAG 


CCCATTTTTG 


CATTAACCTC 


ATTAACAGCA 


ATCGCATTTC 


7320 


GCACGGCTTG 


TAAAATGGTC 


TCACCGCTTA 


AAAAGTTGCC 


TGAATGTATG 


TAATCATTTA 


7380 


AGCGGGTTGC 


ATCGCCACCA 


GTTAGCCCTG 


TTACTGAGGT 


AACACCAGCG 


ATTCCTTCAG 


7440 


CGACGGATTG 


TTTCATAACG 


GCTAGATTTT 


TTTCCATTGT 


TTCAATAATT 


CGTTCACGAG 


7500 


AATACCCGCC 


ATGAGCCATT 


TCTGTTGCAA 


TCATTAATTC 


GCTCACAGAA 


GGGTAATCTT 


7560 


TCGCTTGAGC 


TACCAAATCT 


TCAATTGAAT 


AAAACATCGT 


ATTACTCCTT 


TACGTTCATT 


7620 


TAGTCAAAAT 


AGTTTU^CGCT 


ATATATATGA 


GGGATTTCAG 


CTAATTGCAT 


GACAATGTCA 


7680 


CCGACTTCGG 


CTTGATCAAC 


TTCAATAATC 


ATAATCGCTT 


TTTCCCCTTT 


AGATTCTCGG 


7740 


GTAACGGTCA 


TTGTACTAAT 


ATTGATATCG 


CTAGCTGAAA 


GGATGTTGGT 


CACCTTAGCA 


7800 


ATCATGCCAG 


GAACATCTTG 


ATGAACAACG 


ATAAATGTTG 


GCGTGCCCAT 


ACTTAAAGAC 


7860 


AATTTGAAGC 


CATTTAATTC 


AGAAATTTGA 


ATGTTTCCAC 


CACCAATTGA 


AATTCCTGTC 


7920 


ACAGATAGTT 


TTCGGTCGCC 


AGAAGAAACC 


ACAATTTTAA 


CGGAATTAGG 


GTGGTCTGCT 


7980 


TTTTCACTTT 


TAGGAATAAA ACAAACTTCC ATTCCTTGTT 


CATAGGCAAT 


TTCCAAAGAA 


8040 


TTGGATAGTT 


GCTCATCATC 


TGGTTCCATG 


CCTAAAA6GC 


CGCCAACTAA 


GGCGATATCT 


8100 


GTTCCGTGAC 


CACGATAAGT 


TTTGGCAAAA GACTCATATA 


AATAAATGTC 


AACGCTATCA 


8160 


GGTTGCTGTC 


CAAAAATACT 


GCGAACCACT 


TTTCCAATTC 


TTGCCGCTCC 


AGCTGTATGA 


8220 


GAACTGCTT6 


GACCAATCAT 


AACAGGTCCG 


ATAATATCAA 


AAACACTATT 


AAAGCGTAAC 


8280 


TCTTCCATTT 


CAGCACCTTC 


TTCATTAGAA 


TATGTTTTCA 


TTGGAAGTCT 


AACACAAAAA 


8340 


ATGGCGGGGA 


ACTAGGCAAT 


CGCTTGTGTT 


TTAGCGAATA 


ATTTAAATTT 


TTATCAGAAT 


8400 


GAAAGCAAGG 


GGCCTTCGTT 


GACAAATCAG 


TGAAGAGATG 


CTAAATTCTA 


GATAATAAAA 


8460 


GACGAAAAGA 


GGGATTCGTG 


CGTTGAAAAT 


TTAAGTTAAT 


TTTTTAGAGA 


AACTGTATAA 


8520 


CTTTGCCTGA 


ATATAACGGG 


TGGAGTGTGC 


GGATTTTCGG 


AAATTAGCTT 


AAA6AATTCA 


8580 
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TTTCCGTAAA 
AAAACAACTA 
CACGaTGGAT 
CTTATTARGA 
TTCGTCTATT 
GAGGTGACTT 
CCTGAnGTTT 



GCTGCATCAA 
TCTTTTGCCT 
ACCAATTGGA 
TgTTACAAAA 
TTCCACAAAC 
CTTTTGAACA 
ATGGCGTCCC 



ACCACGGGTC TTTTTGTATA CCTTGGATTG ATGCCGCTGA 
TTTTGCTTTA TGGAGGTATA TTTATGTCGA AAATTGAACT 
ATGaTAATCA AGAAGCGTTG CTTTTTGaTC AGGCAAATAT 
AATTAGGaTk GATTGGCCGC AATGGCcGTG GGAAAACAAC 
GCAGTTGGaT TACCAAGGAG AGATTCyTCa TCaAGTCGAT 
AGTTGCAGAA GAACAACAGC TCACTTATTA TGTCTTACAA 
GTGGnAATTA 6AACGAGAAT TAACGCTTTT AAACGTTGAT 

TTTCTTCTTn ATCAGGCGGC G 

(2) INFORMATION FOR SEQ ID NO: 156: 

(tt SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6491 base pairs 

(B) TYPE: nucleic acid 
to STRANDEDNE5S: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 
ACAATAAACG GGAGGGATCG TTTATGAATA TTGAAAAAAT GACAACCACG CTACAAsAGG 
CGATTGCTGA AGCTCAAAAA GTTGCGGTGA CACGTCAACA TCAAGAAATT GATATCGCAC 
ATTTATGGAA AATTTTTCTT CAACCAAACC ATTTTGGACG TAATTTTTAC ACAGATGCTG 
GTTTAGATGT T6ACGCATTT GAACGAGAAG TAGATAACGC GTTAGATGAA TATCCAAGTG 
TGGCAGGCGG TAATGTGCAA TATGGTCAAA ATTTAAGTCA AAATTTGTTC CATTTATTGC 
AAGAAGCTGa TTCACTTCGA GAAGAtTCCA AGATGaATTT TTATCaaCCG rAATTGtACT 
TTTAGCTTTA ATGrAATTGa AAAATTATCG TTTaACmAAA TATTTAATGc mACmAGGcAT 
TACGGaAAAA GAGTTwAGrA AAAATATTGA AGaGATGAGA GGAGGAGATC GTGTGACTTC 
TCAAAACCAG GAAGAACAAT ATAAAGCACT AGAAAAATAC GGTGTTGACT TAGTACAACA 
AGTAAAAGCA GGAAAACAAG ATCCAATCAT TGGTCGTGAC GAAGAAATTC GTGACGTTAT 
TCGGATTTTA TCAAGAAAAA CTAAAAATAA TCCAGTCTTA ATTGGTGAAC^CAGGTGTTGG 
TAAAACAGCG ATTGTTGAAG GATTAGCACA ACGAATCGTT CGTAAAGATG TTCCCGAAAA 
CTTAAAAGAT AAAACCATTT TTTCTTTAGA TATGGGCGCC TTAATTGCGG GAGCAAAATT 
CCGTGGCGAA TTTGaAGAAC GGTTAAAAGC TGTTTTAAAA GAAGTGAAAA AAAGTGATGG 
CAAAATCATT TTATTCATTG ATGAAATTCA TAATATCGTC GGAGCTGGTA AAACAGAAGG 
CAGTATGGAC GCCGGAAATT TATTAAAACC GATGCTAGCA CGTGGTGAAT TGCATTTAAT 
CGGTGCCACA ACGCTTGATG AATATCGCCA ATATATG6AA AAAGATAAAG CCTTAGAACG 
TCGTTTCCAA AAAGTATTAG TCAAAGAACC AACAGTAGAA GATACAATTT CTATTTTGCG 
TGGCTTAAAA GAACGATTTG AAATTCACCA CGGCGTGAAT ATTCATGACA ATGCTTTAGT 



8640 
8700 
8760 
8820 
8880 
8940 
9000 
9021 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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TGCTGCCGCT ACTTTATCCG ATCGCTATAT. TACTGATCGT TTCTTACCAG ATAAAGCGAT 1200 

TGACTTAGTC GATGAAGCAA GCGCAACGAT TCGAGTGGAA ATGAATTCGA TGCCAACAGA 1260 

GCTCGATCAA GTAACACGTC GGTTAATGCm ACyAGaAAtT GAAGAAGCGG CCTTGAAAAA 1320 

AGAATCAGAT GATGCAAGTA AAAAACGGTT AGCCAACTTG CAAGAAGAAT TAGCAGATTT 1380 

GCGTGAAGAA GCCAACTCAA TGAAAATGCA ATGGGAAACG GAAAAAGAAG AAGTT7\ATGC 1440 

GGTTTCCAAT AAACGTGCAG AGATTGATAA AGCA7\AACAC GAATTAGAAG ATGCTGAAAA 1500 

TAATTATGAT TTAGAACGAG CTGCTGTTTT GCGTCATGGA ACAATTCCGC AATTAGAACA 1560 

TGAATTGAAA GAATTGGAAG AAAAGAACGC TAAAGATAAC GTCAAAATGG TACAAGAATC 1620 

GGTTACTGAA AACGAAATTG CGCAAGTGGT CGGTCGTTTA ACTGGCATTC CTGTGACAAA 1680 

ATTAGTTGAA GGCGAACGAG AAAAATTAAT GAAACTGAAT GAAACATTAC ACAAACGTGT 1740 

GATTGGTCAA GATGAAGCCG TCGATGCTGT TAGTGATGCG GTGATTCGTT CAAGAGCAGG 1800 

CTTACAAGAT CCAAATCGcC CACTCGGTTC GTTCCTTTTC CTAGGACCAA CTGGTGTTGG 1860 

TAAAACAGaA CTTGCTAAAG CTTTAGCTGA AGATTTGTTT GATTCTGAAG ATCATATGGT 1920 

ACGGATTGAC ATGAGTGAAT ACATGGAAAA ACATGCCGTG TCTCGTTTGG TTGGTGCCCC 1980 

TCCaGGCTAT GkTGGTTwTG AAGAAGGTGG CCaGTTAACG GAAGCTGtTC gTCGrAACCC 2040 

tTATACAATT GTCtTATTAG ACGAAATTGa AAAAGCGCAC CCAGATGTCT TTAATATCtT 2100 

ATTACAAGTT TTAGATGATG GTCGTTTGAC AGATTCTAAA GGTCGGGTCG TTGATTTTAA 2160 

AAATACAGTT TTAATTATGA CAAGTAACAT CGGCTCTCaG CTTTTATTGG AAGGTGTGAC 222*0 

ACCTGAAGGA ACAATTCCAG AAGAAGTTGA AAATCAAGTC ATGAATATTC TAAAAGGACA 2280 

CTTTAAACCA GAGTTCTTAA ACAGAATTGA TGATACCATT TTATTTACAC CATTAAGCTT 2340 

GGaTAATGTG AAAGGAATCa TTGGTAAAAT GACCGCACAA CTTGCTCATC GTTTAGAGCA 2400 

ACAAGAAATT GTGTTGGAAA TCACTGATGA AGCTAAAACA TGGATTGCTG AAAATGGGTA 2460 

TGAGCCAGCT TATGGAGCTC GTCCGTTAAA ACGTTTCATT ACTCGCGAAG TCGAAACACC 2520 

ATTAGCAAAA GAAATTGTTT CTGGACGAGT AATGCCAAAA ACAAAAGTCA CGATTAGTCT 2580 

ATTAGACAAT CAATTAGTTT TTGAAAATGA ACCGATAGAA GAAGTGTAAA AAAGAACGAG 2640 

CGAATTCGCT CGTTCTTTTT TTATTATGAC AATCGTTGCT TAATTTCTTC CAACAAATCC 2700 

TCTTCTGAAG AAACAACCCG ACCATTCAAT TTAATCAAAC CAACTGTGTA AAGGTTTAAA 2760 

TAATGGAATT GATTTTCAGC GATTTCGTTC AAAGCATCAA GCTTTTGGTG ATTATCAGCA 2820 

CCTTGTTGGC GGCTGTCTGT GTAAAGGGCA ACCACCGGGA TTCCTTTGGC ATAAGCAACA 2880 

CCAATTTCTG AAGcTACGCC CGCATCAATG GTTGGTCCGT CTAATAAAGC AACCAATAAG 2940 

TCGCTAGCAA GGACGTTTTC TGTATCTGCT AAGGCAATCA TTTTGCTATC GGCATAAGCA 3000 

GATTTGTCGT TAATCGCCGC ATTTTCTTGT GGTAAATAAA GATCGATTGT TTTATCCAAT 3060 

TGTCGAATTT GTTCGACTAA ATAGGCATTA TAACGTAAGT CAGCTTGCGA AAATAAAGGT 3120 
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CCTGCAAAAT AAATTTTTGT CATTTGTCTG. TCTCCAATCC AAGGGTTTAG AAATAATCAC 3180 

TGAAATMGC TTCTTGGTCA GGAAAAATCC GTTCCAAAGA ATTTAAAGTT TCTTTATCTG 3240 

TATCAATACG TTCAATTCGA TGTAAGTACG AAGGGCGTCG GTAATTCATG ACTAAACAAG 3300 

TTAGTGTTTG AATATCTAAG TGAACGGCGG TGCCTAGAGG TTCATCCGTA ATGGTCACTT 3360 

GATCATTTTC ATCCCAAATT AAGCCAAAAA TACCATTATT CCACTCAGCG ACAGGATCTT 3420 

TAACGACAAA ATGGAAGGGC TTAGCTGTAC TTTCAAAAGG AAAGTTTTCT AAAAAGGCTT 3480 

TGACATCAAC AATTCGCGCC ATATAATAGG GCTCAATGCT TTCTTTAATT TGGCTGTCTT 3540 

CTAAAAGAAA GGCTAATGGT TCGTTTTTAT AGATATCGCC CTTGACCCAA TAAACCATCG 3600 

AAAAATGCGC AGTAATAAAA TTCCaCAAGC CATTTCGGGC TTCTTGGTTC AAGTAAAACA 3660 

TTTCCTTAAT ATGAAATACC TCATCGGCcA CCCAATAGAA AAGTACTCCT AAAGGTTCTT 3720 

GATTGGCACC ATAATAGACA GCAGCAGTCC GTTCTTCCTC GTTTTCGAAA CGCCAATATT 3780 

CTTCCCAGTT GAAGGCACTA CGAATGAGTG CGCCGTGATT TTGACGAGCG AATCGCGCAT 3840 

AGACATCAAA GACATCTGGG TGATCGACAG CTAACCGTTC AATCATTCCT GGAACAGGAA 3900 

CGGTTTTTGG TAATTGCGTA TCCCGAATTT TAAAAGATAA CTTGTCGGAC ATAATTTCCC 3960 

AACCTTTACG TCGGTAATAA GGGATGTTGT AAGGGAAAAG ATAAGAAATC CACTGTTTAT 4020 

CTTGGCGCAT TTCTTCTAAT GCCGTTTGAA TCAAGTCTTT CATTAAACCG TGGTTGGCAT 4080 

ATTCAGGATA GGTACCGACA CCAGTAACGC CGCCCATTTT ATATAAAGCG CCATGGATAT 4140 

TGACTTCGCA TGGATAGATA GCAATTTGTG AAATTAATTG ATTTTCGTGA AACCAGCCAA 4200 

ATACTTTTGA AAGTTCTAAA ATCGGTTGTT TTGATTTGAT AAATGCGCGT TTGTTTTCAA 4260 

AACCACTTTC TTCAATATCT GCTTCAGTCA CTTGAAAAAC ATAGGAAAGA AGTTCATTAA 4320 

ATTGATCAAT GTGTTCyTCC yCaACAGGCT TTAAnGGTCa GTTGCTTtcC rAAATtCyTG 4380 

gTTCcATCCc ATTTTTcCAT CTCCAATAAA TTTAcTGTTT TTAgTATACC AAATTTTTCG 4440 

AAACTCTGTT AGCCTTTCGA AACCAAAAGA ATAAAAGGGA AGTGCTTTAA ATCTTCTTCA 4500 

AAAATTGGTA TAATGAAAAG TGCTGATTAT CCTAGACCTT TAAACATGAT AATTTGATAA 4560 

AAAAGGCAAC TAAAATATAA CAATTGATTA ATGAAGCTAG CAGGGAGCTA GCTTTTCTTA 4620 

AAAGAAATGA GGCAAAAAAT GAACAATAAA GAAATGAAAG CAAGACAAGA GAAAATTCGT 4680 

AATTTCTCGA TCATTGCCCA CATTGACCAT GGGAAGTCAA CTTTAGCCGA CCGGATTTTG 4740 

GAAAAAACAA ATACAGTTAG CAGTCGAGAA ATGCAAGATC AATTACTTGA TTCAATGGAT 4800 

TTAGAGAGAG AACGCGGCAT TACTATCAAA TTAAACGCAA TTGAATTAAA CTATACAGCC 4860 

AAAGATGGTG AAATCTATAC TTTCCATTTG ATTGACACAC CAGGGCACGT CGATTTCACC 4920 

TACGAaGTTT- CTCGTAGCTT GGCAGCTTGT GAAGGGGCTG TTCTAGTTGT TGaTGCGGCG 4 980 

CAAGGaATTG AAGCGCAAAC GCTAGCaAAT GTCTATTTGG CATTGGATAA TGACTTAGAA 5040 

ATTTTACCTG TTATTAATAA AATTGATTTA CCCGCCGCTG ATCCAGAGCG TGTTCGGACA 5100 
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GAGATTGAAG ACGTAATTGG AATTGATGCA TCGGAAGCTG ^ttTAGCAAG TGCAAAAGCA 5160 

GGGATTGGGA TTGAAGATAT TTTAGAACAA GTGGTGGAGT ATGTACCAGC TCCATCAGGC 5220 

GATATTGAGG CTCCTTTAAA GGCTTTGATT TTTGACTCTA TTTACGATAG TTATCGGGGG 5280 

GTCGTTTTAA ACATCCGTGT AATTGACGGT GTCGTTCGTC CTGGGGATAA AATCCAAATG 5340 

ATGAGTAACG GTAAAACGTT TGATGTAACA GAAGTCGGCG TTTTTTCACC GAAACCGATT 5400 

GCTCGTGATT ATTTAATGGT TGGTGATGTG GGCTATATCA CCGCTAGCAT TAAAACGGTT 5460 

CAAGATACAC GGGTCGGGGA TACAGTGACT TTGGCCGACA ATCCAGCAGC AGAAGCACTA 5520 

CCAGGCTACC GCAAAATGAA TCCAATGGTT TATTGTGGTT TATATCCAAT TGATACGTCG 5580 

CGCTACAACG ATTTACGGGA AGCATTAGAA AAATTACAAT TAAATGATGC GGCGTTACAA 5640 

TTTGAACCGG AAACATCGCA AGCTTTAGGG TTTGGTTTCC GTTGTGGTTT CTTAGGTTTG 5700 

CTGCACATGG ATGTTGTTCA GGAACGTTTG GAACGAGAAT TTAATTTAGA GTTAATTACA 5760 

ACAGCACCGT CTGTAATCTA TCACGTTAAT AAAACTGACG GAACAACCGT TGTTGTTGAC 5820 

AACCCAGCTG AATTTCCAGA ACCAGTAACG ATTGAATCTG TGGAAGAACC TTATGTTAAA 5880 

GCGCAAATCA TGGTGCCAAA CGATTATGTA GGAGCAGTAA TGGAATTATC ACAACGTAAA 5940 

CGTGGCGAAT TCATTACAAT GGATTACTTA GACGATTATC GTGTAAACGT AGTTTATGAA 6000 

ATTCCGTTAT CTGAAATCGT GTTTGACTTT TTCGATAAAT TGAAATCAAG TACAAAAGGC 6060 

TATGCATCCT TAGATTACGA AATGGCTGGC TATCGTACCA GCCGCCTAGT GAAAATGGAT 6120 

ATTCTATTAA ATGCTGAAAA AGTGGATGCT TTAAGCTTTA TTGTTCACCG AGATTTCGCA 6180 

TTTGAGCGTG GTAAAGCGAT TGTTGAGAAA CTGAAAAAAC TAATTCCACG TCAACAGTTT 6240 

GAAGTCCCAG TTCAAGCGGC GATTGGTCAA AAAATTGTGG CTCGTTCAGA TATTAAAGCC 6300 

TTACGCAAAA ACGTACTGGC TAAATGCTAT GGTGGCGATG TTTCTCGTAA ACGTAAATTG 6360 

TTAGAGAAAC AAAAAGAAGG GAAGAAACGG ATGAAAcAAA TTGGATCCGT GGAAGTTCCT 6420 

CAAGAAGCCT TTATGGCGGT TCT6AAAATG GACGACCAAG ATAACGCGAA ATAGCTTGTT 6480 

TCCAACGGAA A g^g^ 
(2) INFORMATION FOR SZQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1014 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

CCTACGAAAA ATATGGCACA AAGAAAGTTG GAAAAATGAC AAACTTGTCG ATTTTCCAAC 60 

TTTCTTCATC TATTCGTGGT TGGCTTGCTG AACTGCATCC ATCACGGCAT AACGGAAGCC 120 

ATTTTTTTCT AGAGCTACGA CACCGCGAAT CGTACTTCCT CCTGGAGAAG TCACTTCATC 180 

TTTCAATGCT CCTGGGTGTT TTTTCGTGTC TAAGGCTAAG GTCGCCGTTC CTTTTATCAT 240 
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ATTTCATAAG 


ATAAGTCTCT 


TGGTAACCCT 


XCTAAGACTG 


CTGCATCTCC 


300 




ATAAAGATAT 


CAACAAAAGC 


TGGACTGCAA 


CCAGCCACAG 


TACCAAAAGT 


360 


TTCCAGTTGC 


/^/^rfi Wl/** Ik K^f^ Ik Ik 

GCTTCACGAA 


TTTCTTTGAC 


TAACCCTAAA 


GTAGCAAGAA 


GTTCCATTGC 


420 


CAGGTCCTTC 


GTTTGGCCTT 


CAATTGTTGG 


TGCCAAGGCT 


ACGCCAATCA 


CGCCTTGATT 


480 


CACGCTAACT 


GGTGTATTAG 


GAATGGCATG 


AACAACTTTG 


GCTTCTTTAT 


TGCCAAGAGC 


540 


CATTTGTGCT 


TCTTGTATGG 


TATGACCTGT 


TGCAACAGAA 


ATCAATAGAC 


TTTCGTTGGA 


600 


AAAATGAGGG 


GCTAATTJ'T 


GAAGAACAGG 


TAACACGACC 


TTTGCTCCTG 


TCGCAATAAA 


660 


AATAACCTTA 


CATTCTTGGA 


AAAGTGCATA 


ATCTGTGACA 


AGTTGAAAGC 


CCCACTCTTT 


720 


TTGAAGTGCT 


TCTGCGGTTC 


CACTTGATCC 


GCCTTTCACA 


TATAAATGTT 


CCTTCTTGAC 


780 


TGCCTCAGCT 


TTCAGTAACC 


CTTTAATCAT 


GGCACCGCCC 


ATACTGCCAG 


CACCGATAAA 


840 


GCCAATTTTC 


ATAGCATTCC 


CTCCTTAGTT 


AAATAArGTG 


TTtGTGkCAA 


AAGTCAATTA 


900 


GCTTTtGACA 


CnAACACTAA 


CAGTCGkACt 


TTTTcCGtAC 


aGCtGkTTCG 


TTAaTTaATT 


960 


gGTAAtcCAT 


wACGaCkGCT 


CaAAcGCgCA 


ATTTTTCaGA 


AGtCyGaACC 


AACT 


1014 



(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 



TTATAAAAGA 


CCCTTCCTTA 


AAAAGCAATG 


TTTCTTCCAA 


TCAAAAAAAC 


GAATTATCTA 


60 


AAAAATGCAA 


CTTATTTGCT 


CAAAAAAGCA 


AGCGATGATT 


GACAAACGGT 


CGTTTTTAAC 


120 


TGATAATGAG 


AATCGTTATC 


AACTAAAAAT 


AAAGGAGTTA 


CACAATGAAA 


CTTTTAAAAA 


180 


AGACGGTCCT 


AATTGGTACA 


ACCCTTCTTC 


TTGGTTCATT 


CTTACTCGCA 


GCTTGTGGTA 


240 


ATACGAATAA AGAAGCCAAC 


AAC6CTGACA 


AAACACATGA 


AGTAACAGAT 


ACCTTAGGCA 


300 


ATAAAGTAAC 


CGTCCCCGCG 


AAACCCAAAC 


GGATTATTGC 


GAGTTATTTA 


GAAGATTATC 


360 


TAGTTGCATT 


AGGAGAAAAA 


CCAGTGGCAC 


AATGGACAGT 


TGGACAAGGC 


AGCATTCAAG 


420 


ATTATTTAGC 


GAAAGAATTG 


AAAGATGTCC 


CCACTATTTC 


CTATGACTTG 


CCATATGAAG 


480 


CGGTTCTAAA ATTTGAACCT 


GACTTATTAT 


TAATCAGTTC 


ATCTGCTCTA 


GTTGAAGGCG 


540 


GTAAATACAA AGAATACAGT 


AAAATTGCGC 


CAACTTATGT 


AGTCAAAAAC 


GGCGAAAATG 


600 


TCACCTGGCG 


TGATCAATTG 


GaAGATATTG 


CCACTGTTTT 


AGATAAAAAA 


GAACAAGCGA 


660 


AAAAAGTGTT- 


AGAAGATTAT 


GATACCTTAA 


CCAAAGGCGT 


CCAAGAATAT 


CTTGGCAAAA 


720 


AAGATGCTGG 


CAAATCTGCG 


GCAGTCTTAT 


GGGTAACCAA 


CAACCAAGTC 


TTTATGGTTA 


780 


GCGATAATCG 


CTCAAGCGGA 


ACCGTGCTCT 


ATCAGGACTT 


AGGCCTCCAA 


GTTCCAAAAT 


840 
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TAGTGGAAGA AATTTCTAAA AACGCTACT(3 CGGATTGGAA» TCAAGTTTCT TTAGAAAAAT 900 

TAGCTGAGCT TGACGCAGAC CACATTTTCC TTGTAAACAG CGATGAAXCA GCACCTCTTT 960 

TCCAAGAAGC AATTTGGAAG AACTTACCTG CTGTGAAAAA TAACCAAGTT CATACCTATG 1020 

ATAAAAAAAG TAGTTGGTTA TACAACGGAC CTATTGCGAA TACTCAAATT GTTGAAGATG 1080 

TAAAAAAaGC GCTCTTAAAT TAAAAAAATG CtTGGATGAA AATAATCATC CAAGCATTTT 1140 

TTTTAATTAG CAATTGCCGT TTCTAAACCA ACTTCAATCA TTTCATTAAA CGTCGTTTGG 1200 

CGTTCTTCTG CCGTTGTTTC TTCGCCTGTC ACCAAGCTAT CACTCACCGT CATAATCGCC 1260 

AATGCTTCTA CATCAAATTT TGCTGCTAAA TAATATAACA TTGCCGCTTC CATTTCAATC 1320 

GCTAATACCC CGTATCGACC TAAACGAAAG ACATCATCTA AACTATCTTT GTAAAAAACA 1380 

TCATCTGACA ATACGTTACC AACATGGGTA GTAAAGCCTT TTGCTTTAGC AATTTCGTAA 1440 

GATTTGAGTA AAAGATCAAA GCTCGCAATT TGTGGAAAAT CATATTTAGG AAAATCATTG 1500 

CGAATCGCGG AAGAAGGCGT CGCTGCTGCT TGTGCGATAA CTAATTCACG AACGTTGACT 1560 

TTTTCTGAAA TAGAACCACA CGTACCAACA CGAATTAACT TCTTCACATC GTAAGAGTTA 1620 

ATTAACTCAT GGGCATAAAT ACCTGCTGAA GGCATCCCCA TCCCTGTTCC TTGGACTGAA 1680 

ACACGCTGTC CCTTATACGT TCCTGTGTAA CCTAACATCC CACGAACTTG ATTATAGCAG 1740 

ACAGGATTCT CTAAAAAAGT TTCCGCAATA TATTTTGCTC GGAGGGGATC CCCTGGTAGT 1800 

AAAATTTTAT CGGCAATTTC ACCTGGTTTT GCTTCGATAT GCACACTCAT TCTGGTTTAC 1860 

TCCTTTTTTA TAAAGCTGCT AGTGTTGCTT TCACTAACTC TTTGAATTGT CCTTTAACGC 1920 

GTTCAGTTGT TTCAACTACT TCTGCATGAT TTAAGCTACT TTGCATACCA GCTGCTAAAT 1980 

TCGTAATACA AGAGATGCCT AGTACTTTCA TGCCACTATG CACAGCTACA ATGACCTCCG 2040 

AAACAGTTGA CATGCCAACT GCATCCGCAC CCATCGTCCG TGACATTCGA ATTTCTGCAG 2100 

GCGTTTCATA TGTCGGGCCA GAAAAGCCCA TATACACGCC TTCTTTTAAA TCAATATTTT 2160 

GTTCCGCAGC CACTTTTTTT GCCACTTCCC GATATTCTTG CGTGTAAGCA TGACTCATAT 2220 

CTGGAAAACG CGGCCCGATT TCTTCATCAT TTTCACCAAT TAACGGATTA TCGCCTGTAA 2280 

AATTAATGTG GTCATTGATT AACATCAAAT TTCCCGGTGT ATATGTTTCA TTTACGCCAC 2340 

CAGCTGCGTT GGTTACAATC ATTGAATGAA TGCCTAAAGC AGCCATCACA CGAACGGGAT 2400 

AAGTGACCGT TTGCATCGAA TGTCCTTCAT AATAATGAAA ACGCCCTTGC ATCGCCAAAA 24 60 

CTTTTTTTCC AGAAAGCGTT CCGTAAACTA ATTGTCCAGC ATGTCCAACG ACTGTTGAAA 2520 

CAGAAAAATG GGGAATTTCA GAGAAAGGAA TAGCAATCGC ATCGGTTATT TCATTAGCTA 2580 

ATTCACCTAA CCCAGAACCT AAGATTAGAC CGAAGTCTGC TTGTTGCACC CCTTTTTCTT 2640 

TCAAGAaCTC AGTTGTTTCG TTTAAATGAT TCGTTAACGT TGTCATGTGT TGCCTCCTCG 2700 

AATTATTGTA ATTGATTTAA AAAGCTTTGA CCGTTTTCCG TTGCAGGAAC TTCAAAGTTT 2760 

TCAGCAATCG TTGCTGAAAT ATCTGAATAA AATCCTTGTG GCAAGCTGCC TTGTTTTGTC 2820 
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ATTTTTTTAC 


TATAAGCCAA 


TAAAGGCACA 


TATTCACGTG 


TGTGGTCTGT TCCTGGGAAA 


2880 


GTTGGATCAT 


TCCCGTGATC 


TGCTGTAATT 


AAAAGCAAAT 


CATCTTCGGC CATAGCTGCC 


. 2940 


ATAATCTCTG 


GCAAACGACC 


ATCAAAATCT 


TCGATGGCTT 


TTGCATAGCC CTTTACATCA 


3000 


CGACGGTGAC 


CATACAGCGC 


ATCGAAATCC 


ACTAAGTTAG 


TAAAACTTAA ACCTGTGAAC 


3060 


TCTTTGTTCA 


TGACCGTTAA 


TAATTGATCC 


ACCCCATCCA 


TATTGCTCTT CGTGCGGATA 


3120 


GCTTCAGTAA 


TTCCTTGTCC 


ATTAAAGATG 


TCGTTAATTT 


TCCCAACAGC GATCACATCA 


3180 


TTGCCGTTTT CTTTCAACGA ATCTAAAACA GTATGGCCAA AAGGATCTAG AGCATAGTCA 


3240 


TGACGGTTAC 


TGGTTCGAGT 


GAAATTACCT 


GGTTCACCAA 


CATACGGGCG AGCGATAATC 


3300 


CGACCAATCA 


TATAAGGTTC 


GTCTTTCGTA 


ATGTCACGAA 


CATATTGACA AATTTTATAC 


3360 


AATTCTTCTA 


ATGGAATAAT 


GTCTTCATGC 


GCCGCAATTT 


GTAAAACAGG GTCTGCTGAA 


3420 


GTATAAACGA 


TTAGATCCCC 


AGTTTTCATT 


TGATGTTCGC 


CATAATCATC AATGACTGCT 


3480 


GTCCCGCTAT 


ATGGTTTGTT 


ACAAACAACT 


TTTCGCCCAG 


AAAAATCTTC AATTTGTTTC 


3540 


AGTAATTCTT 


CAGGAAAGCC 


GTTTGGGAAT 


ACCCGAAAAG 


GTTTTTTGAT GTTCAAGCCC 


3600 


ATAATTTCCC 


AGTGACCGGT 


CATAGTATCT 


TTTCCAACAG 


AAATTTCTTC TAATTTGGTA 


3660 


GCATAGCCAT 


CGTGGTCTGC 


CACTGCTTTG 


ACACCTGTTA 


AAGGAGCAAT AGTTCCCAAT 


3720 


CCAAGTTTTT 


CTAAATGAGG 


AATGGTTAGT 


CCCGCTTCTT 


TGGCAATATG ACCCaGTGTA 


3780 


TCACTGCCGA 


CATCTCCAAA 


TTTTTCAGCA 


TCTGGTGCTT 


CACCAATACC GACTGAATCt 


3840 


AGTACGATTA 


AATGAACACG 


TTTAAACACA 


CTTGCTTCCT 


CCTCTAAGGT CGTAAATACG 


3900 


TCAAGAATGA 


CAATTATTTT 


ACGCTAGCCT 


TTTAAATAGT 


CTCGAGAATC AAACAGAATA 


3960 


ATCTATTTAT 


TCTCAGCTCT 


TCAATTATAC 


CATGAATCCG 


CTATTATTGC CTGAAAGAGC 


4020 


ATTAAAATTT 


TCACTGATTC 


CTTTGATTTT 


AGCAGGGCTT 


AAAAATTGAA AGAAAAAAAG 


4080 


TCTCCCACCG 


TAATTAGGAA ACTTTTTTAT 


TGTTTTTTTC 


GAATAGCCTG CTCCTTAAAA 


4140 


AGAACACGAC 


TCTTTATCCA 


ATCGAAACTT 


TTAACACTCC 


AAGCGGTCTC CCTCTTTTGG 


4200 


CACATTTAAA TACCAGATAA CCGCTTCTGA TAAAATCACA CCAATGCCAA TGGGCGGATA 


4260 


AAAATAAATA 


CCGATATATC 


CGATTGCGAA 


AAGACAAAGA 


TGAATGATTT TTGTTGCCTT 


4320 


ACTATAAGAT 


TTCGATTTTC 


TTAACTTAGC 


TGCACCTTGT 


GGATCTTTCA CTACATTGTC 


4380 


TTCATAAATG 


ATCCTTGTTA 


GCAAGAAGTA 


G6CAAAATCC 


CaAACCGTAT ACACAACAAG 


4440 


ATAAAAATAA 


ACAGGTGCCC 


GCATCCACCA 


AAATTCACTC 


ACCCAAGCAG TTGCTACCGG 


4500 


AATGAAGGCC 


ATGGAGAATA 


ACCAAAAATT 


ATTTGCCCAA 


AACGCTTTTT TAGAAAACCA 


4560 


ATGAGTTACG 


GACATTATGT 


AATGATGATT 


GTACCATGCG 


ATACAGATAA AAATAAAACT 


4620 


GACAATAAAC 


•GATAAAAAAT 


ATGGGATATT 


TTCTATCAAA 


CCTGCCAAAA CACCCACTTG 


4680 


AGGAGTTTTA ATTTCTAAAA 


TCATGATGGT 


CATGATGATG 


GCAATGACTG CATCACTAAA 


4740 


TGCTTCTACG 


CGTGTTTTAT 


TCATTGAACG 


TCCTCCACTT 


TTCCTATGCT TACTACAAAC 


4800 
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AAAAGTATAG CAGAAAAACC AACGATTTTT G7UWVAAAGC .TGCTAGACCT GTTTGGTTTT 4860 

TTTCAAAATT TCAACACTTC CTTCGAATTG TAAAGAACAG CTAAAAGAAA AmAGAAAAGT 4 920 

tACCTTTATC AACTCGCTTT TTTACAGTGA GGCAGTGCCT AATGCCGCTT GTGTTCAACT 4980 

GATTATCCTA AGAAAAGAAA TGGCCCCCCG TCGCTTTGAC GGAGGGCCTA TTTCTTTTCt 5040 

TATTTCGTTT TGACGTAATT TTTCCCGTTC GCTTTTGGTC CCGATGATTT TCCGAAGAAC 5100 

ACAACTAGGA CTAAAATGGT TAATACGTAT GGTGCTGCTT GTAGATAAAC TTTTGGAATT 5160 

TCTGAAATAA TTGGTAGGTT TTCCCCAGCA ATACTTAAGT TTTGTGCAAA GCCAAAGAAT 5220 

AAGGCTGCGC CCATTGCCCC AAGTGGATTC CATTTACCAA AGATCATTGC GGCCATTGAA 5280 

ATAAACCCTT GTCCTGCAAT TGTGGTAATC CCAAAACGAC CCGCAATACT TTGAGCAAAG 5340 

ACAGcTCCGC CCATACCGCC TAAGAAGCCG GATAGTAAGA CACCAGAATA ACGCATTAAG 5400 

TAGACATTAA TTCCCAATGT ATCTGCCGCT TGTGGATGTT CCCCCACAGA ACGTAACCGT 5460 

AAGCCAAAAC GTGTTTTATA AATGACAAAC CAAGCTAAAA TCGCAATCAA AATTGCTACG 5520 

AAAGCTGGCA AAGAGGTTTT TGCAAAGAAT AGTTCACCAA TCACTGGAAT TTTTGACAAT 5580 

CCTGGGAAAG AAAAATAACC AAATGATTGT GTAATCATAT CTGTTTGTCC TTTTTGATAA 5640 

ATAACTTTTA CTAGGAAAAC CCCTAGCGCC GGTGCCATCA AGTTAATGAC AGTACCACTG 5700 

ATAATATGAT CTGCTCGTAA GTTAATAGTT GCAACGGCAT GTAAGAGAGA AAAGACAATC 5760 

CCGACAACGC CTCCTACTAA ACAAGCAATC CATGGTGTCG CCGCGCCAAA TGTATCAGCA 5820 

AACGTCAAGT TAAAAACGAC CGAACTGAAA GCACCCATTA CCATAATACC TTCCAAACCA 5880 

ACGTTAACCA CACCACTTCG TTCTGAGAAC GTTCCGCCTA ATGCTGTCAA GATGAGCGGA 5940 

GTTGCATACA CTAATGTTT6 CGTGATGATT GAGGAAATTG TTGCTAAATC CATTAGATGT 6000 

CTCCCCCCTC TTGATTGTTT TTGTCTGGCG CCGCTTCAAT TGTTTCTACA ATCGCTACTT 6060 

CTTTTTTACT ACCAGACGCT TTTGCTAACA AGAAACGAAT TAAATAACTA ATTCCCACAA 6120 

AGAAAATAAT CGCTGCAATA ACAACATCAA CTAATTCGGT TGGTACACCA GCTATCAACG 6180 

GCATCCCTTG TCCACCAAGT TTTAACATCC CGAACAGTAA TGCGGAAAGT AAAATCCCGA 6240 

TTGAGCTACC TGCTCCTAAT AGAGACACGG CCATACCATC GAACCCAATG CTCAACGATG 6300 

TGTTTTGAAT AAAGAAGTTT TGGAAAGTTC CCAATCCTTG GACCGCACCA CCTAAGCCAG 6360 

CTAGCGTTCC TGAAATCATC ATGGATAAAA TAATCGTTCG CTTGCTGCTC ATTCCAGCAT 6420 

ATTCAGAAGC AAATGGGTTT AACCCCACCG ACCGAATTTC AAAACCAAGC GTTGTTTTTT 64 80 

TCATCAAGAA CCACACAAGA ATAAGGAAAA TGAAGGCCAA GAAAATGCCG ATGTTCAAAC 6540 

GAGAGCCATC ACTGATACTT GTTAAAAACG GTGAGCGTAA TGATGCATTC GCACCAATCA 6600 

TTTTGGTGAT CCCTTTGTTC GCCATAATAC TTGGTTTCAT GACATTATTA GCAATATGAT 6660 

TTCCTGTGTA TAATAAAATA TAATTCAACA TAATCGTAAC AATAACTTCA CTCGTCCCAA 6720 

AATAGGCACG CAAGAAACCT GGAATGGCTG CTGCTAAAGC TCCTGCAACT GCCCCAACAA 6780 
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TT^TCGCCAC 


CGGCAACGCA 


ATCATCCGGG ' GTGCATCAGG 


GATGGATAAA 


GCGACCCAAA 


6840 


TACTAGCCAC 


CCAACCACAT 


AAGGCTTGTC 


CAGAAAGCCC 


GATATTAAAG 


AATCCAGCCG 


6900 


AATTAGCCAC 


CGAGAATCCT 


AAAGCGGTAA AAATCAACGG 


TGCTGCGGTG 


ACAAAAATCT 


6960 


CACCAATACT 


TTTCTTTCCT 


TGGAAAGCTG 


TTTCAAACAT 


CGCTTGATAC 


CCAGCCACTG 


7020 


GGTCATAGCC 


AAAGATAACC 


ATAATAATTG 


CCCCTAAGAG 


GAAACCCATA 


ATAACAGATA 


7080 




TAAAATATTT 


CTTATTTTTT 


CAGAACGATT 


ATTCAACAGT 


TTCACCTGCC 


7140 


TCTACTTTTG 

X W X X X ^ X v> 


ATAACTCAGC 


ACGGGCTTCT 


TCCAATGAGT 


AGCCAGCCAT 


TAACAAGCCC 


7200 


AATTfATTTT 

flfi X X X X X X 


CTGTCGTTTC 


TTTCGGATCA 


ACAATTCCGA 


CAATTTCCCC 


CGCATGAATA 


7260 


ACAGCGATAC 


GATCAGAAAC 


GTTTAAAATT 


TCATCTAATT 


CAAAACTTAC 


TAATAAAACA 


7320 


fif^TTT AT T TT 


X nx v« X V* X X 1 w 


TTCAATTAGT 


CGACGATGAA 


TGTATTCAAT 


AGCTCCGACA 

«X\J^^ X %^%^\J*»X^»* 


7380 




O 1 O X X X X \7 


AGAAACAATC 


AACAAATCTG 


GATCGCGGTC 


CACTTCTCGG 


7440 


f^PAATAATP^^ 


^ 1 X X XX OVrf X o 


ATTCCCACCT 


GAAAGCGCCC 


GAGCTGGTAC 


TAATTCATTA 

X X X x x 


7500 


fir; & nTPP^zT a 




TTCAATTAAT 


TCACGGGCAT 


GTTTGTTAAT 

sj X X X V9 X L r\r\x 


TATTPGATAG 

\^r\x x\^\3r\xn\J 


7560 


tttaaaatap 


^ X V3 1 X OXVTV^ X 


GTATGGTTTT 


TGGTAATACG 


TTTGTAGGCC 


AATATTTTCA 

X «^ X X X X v*«» 


7620 


n A a Tf^ T p A 


1. Va« X v.* X X 


CAAGCCATAT 


TTATGTCGGT 


CT T C AGG AAC 


GTGGGCGACA 


7680 


CCCATTTCCCl 


TAATTTCCCG 

X/VlX X X^^VtfO 


AGGTTTGCGA 


TTAGTAATCT 


CT T TT T C ATT 

\^ X X X X X W» X X 


CAAATGAATG 


7740 


CACCCGTTTT 
unVat^^vax x x x 


CGAfTTTGCT 

\*\Dn\^ XXX U\3 X 


TAATCCTGCT 


AAGGCTTGGA 


TTAATTCTGT 

X X j^nx X \^ X w X 


TTGTCCATTG 


7800 


CCATCAATCC 


CCGCAATTCC 


AACAACTTCC 


CCTGCTCGGA 


CATCTAAGCT 


AAGATTTTTC 

• X X X X X 


7860 


ACTGCATCTA 


AGCCGCGGCT 


TTCTTTCACT 


ACTAAATCTT 


TAATTGATAA 


AATTGTTTCT 


7920 


TTCGGTTGCG 


CTGGTTTTTT 

X X X X X X X 


CTCTGTTTTG 


AAGGAAACAG 


AACGACCCAC 


CATCATATCG 


7980 


GCTAACTGTT 


GCGAAGAAAC 


GTCTTTCACA 


TTGACGGTGC 


CAATACTTTG 


CCCCCGGCGA 


8040 


ATAACCGTAC 


AACGGTCCGC 


CACTTTTTTA 


ATTTCATCTA 


ACTTATGTGT 


AATTAAAATG 


8100 


ATCGATTTGC 


CTTCATCGGT 

\« X X \^»m X \^ W>9 X 


TAGTTCTTTC 


AAAATAATCA 


TTAATTCATC 

X X X X ^^mrn X 


AATTTCTTGT 


8160 


GGCGTTAATA 


CTGCGGTTGG 


TTCGTCAAAA 


ATTAAAACGT 


TCGCACCACG 


ATATAAGGTC 


8220 


TTTAAAATTT 


CAACCCGTTG 


TTGCATACCC 


ACAGAAATAT 


CGCGAATATA 


GGCATTGGGA 


8280 


TTTACATCCA 


ACCCGTATTG 


CTTAGACACT 


TCTTCAATCT 


CTTTTTTCGC 


TTTTTTCCGG 


8340 


TCTAAAATAC 


CGCCATGGGT 


AGGTTCACTA 


CCTAAGAT7VA 


TATTTTCAGT 


CACAGTAAAG 


8400 


GCTTCAACTA 


ACATAAAATG 


CTGATGGACC 


ATTCCGATTC 


CTAAACGATT 


GGCTGCAGTA 


8460 


GGACTTGTAA 


TGTTGACTTT 


TTCGCCATTC 


ATATAAATCT 


CACCTGAGGT 


CGGTTCCAAT 


8520 


AGCCCTGATA 


AGATATTCAT 


CAAGGTCGAC 


TTGCCTGcAC 


CATTTTCACC 


TAGCAGTGCG 


8580 


TGGaTTTCAC- 


CAGGTTTTAC 


TTGTAAATCA 


ATTTGGTCGT 


TGGCTTTGAA 


CGTCCCAAAT 


8640 


TCTTTGGTGA 


TTTTACGCAT 


CTCAATTACG 


TTTTCTTCTT 


GAGACAAGGT 


CTATTCACTT 


8700 


CCTAACAAAA 


TTATATTGGC 


AATACTTTAG 


AGGGTAATTT 


TGACCCTTAA 


CTAAGTAGTT 


8760 
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GAAGAACTTC 


AACTATTTAG 


TTAAAAATCC. AAACATCATA 


AAGGGTTCCC 


TTGGCAGCCG 


8820 


TCGCTGCCAA 


GgACCACTTT, 


AXTTJTAGAA 


GGAACAATAA 


TGAGCCCTTC 


TTTTTTCATT 


8880 


TAGGAGCTTA 


AATTATTCTG 


GTTTTTCTGG 


GACTTTTACG 


TCACCTGAGA 


TTACTTTATC 


8940 


TTTTGCTGTT 


TTAACAGCTT 


CTTTTGTTTT 


GTCGTTTAAA 


TAGCCGTCTG 


TTAAGTCAAC 


9000 


GCCACCATCT 


TTTAATCCAT 


AAACTAAATG 


TTCGCCACCA 


GGGAATTTGT 


CTTCTAACGC 


9060 


ACGGTTGGCA 


ATATCTTGAA 


CCGCTGTGCC 


GACACCTTTA 


AGCGTTGAAG 


TTAACGTGAA 


9120 


GTTGTCTTCT 


TTGCCGTCTT 


TTGTTTTGTA 


CTTGCCATCA 


GCATCTTGAT 


CGCGGTCAAC 


9180 


GCCGATTACC 


CAAACTTTGT 


CGCCAGAACC 


TGATTCATTC 


AAGTCTTTTG 


CTTCTTGGAA 


9240 


GACCCCTTGT 


CCAGTCGCAC 


CAGAAGCATG 


GAAGATGATA 


TCAACGCCGT 


TTTGGTACAT 


9300 


TGCAGCAGCT 


AAAGCTTTCC 


CTTTGGCAGG 


ATCAGCAAAT 


GAAGCCGCAT 


ATTTCGTATC 


9360 


AACAGTAATT 


TCTTTACCTA 


ATTCTTTCGC 


AGCATCAGCC 


ACACCTTTTT 


CAAAACCAGC 


9420 


TTGGAAACGG 


TC/^TTACGA 


CCCCTTCTTC 


ACCACCAACA 


AAACCGACTT 


TGTTCGTTTT 


9480 


TGTTTCATTT 


GCAGCAGCAA 


CACCGGCTAA 


GTAAGCTGCT 


TCATTATCTC 


TAAATGTTGC 


9540 


AGAAACGACA 


TTCTTTTTGC 


CATCGATTTG 


ATCATCGATT 


AAAACAAAGT 


TTGTATCAGG 


9600 


GTTGGCATCT 


GCTGCAGAAG 


AAATTGCATC 


TTTTAGCAAG 


TAGCCAATAC 


CAAAGATTGT 


9660 


GTTGAATTTA 


CTTGATACCG 


CTTGGTCAAT 


ATTGGTTGTA 


TAGTCAGCTG 


CATCATTCGA 


9720 


TTGAATATAT 


GCATACCCTT 


TTGAACCTTC 


TGGTAAATCA 


TGTTCTTTAC 


CCCAAGCTTG 


9780 


CAATCCTTCC 


CAAGAAGATT 


GGTTGAACGA 


CTTGTC7VTCC 


ACGCCGCCTG 


TATCTGTAAT 


9840 


GATTACAGCA 


CTATGCGCTG 


CATCCCCTTT 


GCCACCGCCG 


CTTTCAGCGG 


TTTTGCCTTT 


9900 


GCCACCACCA 


CATGCTGCAA 


GTGAAACTGA 


TAAACCTAAT 


GC/VATCAAAC 


TAAAACCGAA 


9960 


TAATTTTGCT 


TTTTTCATTT 


CTGTAAAGCC 


TCCTAAAAAT 


AGTTTAATGT 


ATTTAAACAA 


10020 


CTAAGTTGTT 


TCTAAGGTTG 


GTCTGGGACT 


TTTACGTCAC 


CAGAGATGAT 


TTGTTCTTTG 


10080 


GCTTCTTTAA 


CTGCCGTTTT 


GGCTTGATCG 


CTTAGGGCTT 


CTGTTGTTAA 


ATCAACACCG 


10140 


CCATCTTTCA 


ATCCATAAGT 


TAAATGTTCG 


CCGCCAGGGA 


ATTTATCTTC 


TAATGCAAGA 


10200 


TCAGAGATTT 


TTTTCACAGC 


AATATTTACA 


CCTTTAATGG 


TTGAAGCTAA 


CGTTAAGTTA 


10260 


TCATCCTTGC 


CATCTTTGGT 


TGTATATTTA 


CCGTCTTCGT 


TTTGGTCACG 


GTCAACACCA 


10320 


ATCACCCAAA 


CTTTATCTTT 


TGAACCTGTT 


TCATTTAGCG 


CTTTGGCTTC 


TTGGAAGATT 


10380 


CCTTGTCCTG 


TAGTCGCCGC 


TGCGTGATAA 


ATAATGTCGG 


CACCTGCTTG 


ATACATAGAA 


10440 


GAAGCTAAGG 


CCCGTCCTTT 


TGAAGCGTCT 


GCAAATGTCC 


CTGCATAAGT 


AGACGTAATT 


10500 


TGAATATCTT 


TGCCTAATTT 


TTTGCCAGCG 


TCTGCTACAC 


CTTTTTCAAA 


ACCAGCTTGG 


10560 


AAACGGCCAA- 


TAACAGGTCC 


TTCAACCCCG 


CCGATAAATC 


CTACTTTGTT 


GGTTTTCGTA 


10620 


GTGTTAGCTG 


CTGCCACACC 


TGCTAAATAC 


GCAGATTCAT 


TATCTCTAAA 


TGTTGCAGAA 


10680 


GCGACGTTGT 


TTTTCCCATT 


AACTGTATCG 


TCGATTAGAA 


CGAAATTCGT 


TTCTGGGTTG 


10740 
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GCGTCCGCTG 


CATCTACTAC 


TGCATTTTTT 


AGTAAATAGC 


CAATGCCAAA 


AATCGTTTTA 


10800 


AATTGACTTG 


AAATGGCTTG 


GTCAATGTTT 


GATGTATAAT- 


CAGATGCTTC 


ATTTGATTGA 


10860 


ATATACGCGT 


ACCCTTGAGG 


TCCTTCTGGA 


AGTTTGTGCT 


CTTTGCCCCA 


CTCTTGCATT 


10920 


CCTTCCCACG 


CAGATTGGTT 


AAAAGATTTG 


TCATCAATGC 


CGGCTGTATC 


TGTGACCATT 


10980 


ACGACACTAT 


GTTGTGGATC 


ACCTTTTGTG 


TTGGCGTTCT 


TTTTGCTATT 


GCCGCCACAT 


11040 


GCAGCTAACC 


CAATGGTTAA 


GGCAACTGCT 


GCCAATCCTA 


GACCTAAAAA 


TTTTGCTTGT 


11100 


TTCACTTTCC 


GAAAAGCCCC 


CTACTATTTT 


ATCTATTATT 


TCATTCGCAT 


TTATGCTATT 


11160 


GAAATCTACT 


TTGAGCCCAA 


CGAAACCTTA 


TGTCCTTTTT 


TTAAAAATCT 


TTATCTGTAA 


11220 


ATGAATAGGG 


CAATAATGCC 


TCAACTGTCA 


TTGTTTTAGT 


CGCGCCATCT 


TTACCAACTA 


11280 


GCGTGACCGG 


CATTTGTGGA 


TCAAAGAATT 


CAGCCATTAC 


TTGACGACAA 


GCGCCACATG 


11340 


GAGAAATCGG 


TGCCTCTGTA 


TTACCTGCTA 


CAACTAAATG 


TTGGAAGGTC 


CGTTCATCTT 


11400 


CTGAAATGGC 


TTTAAAAATC 


GCCGTTCTTT 


CTGCACAATT 


TGTTAAACCA 


AAGGAGGCAT 


11460 


TTTCAATGTT 


GACTCCTTGA 


TACATTTTCC 


CGTCTGCAGT 


CACTAAGCAA 


GCGCCCACAG 


11520 


GAAAATTAGA 


ATACGGTACA 


TAGGCGTGTG 


CCAATGCTTC 


TACGGCACAA 


TCAAGCCATT 


11580 


CTTGTTTTAC 


TGTCATCTTT 


TTCGCCACAC 


CTTCGTTTAA 


TTTGAAATAC 


GAATTAATAC 


11640 


CCTTCGCCAG 


TTGCGCCAGA 


AACAATTGCT 


ACACCCGTAC 


TTGCACCGAT 


GCGTGTTGCC 


11700 


CCCGCTTCAA 


TCATTGcTAA 


CGCTTCTTCT 


GCATTATGCA 


CGCCACCAGA 


AGCCTTAACG 


11760 


CCCATGTCAG 


GACCAACTGT 


CTCACGCATT 


T^CGAATGT 


CTGCTACTTT 


AGCGCCGCCT 


11820 


GTTGAGAAGC 


CAGTAGATGT 


TTTAACGAAA 


TCGGCACCTG 


CTTCTTTGGC 


TAATTCACAA 


11880 


GCTTTAACTT 


TTTCTTCGTC 


TGTTAATAAT 


GCTGTTTCAA 


TAATCACTTT 


GACTAACGCT 


11940 


TTTCCTTTGG 


CTGCATCTAC 


TACGCCTTGA 


ATGTCTTGAC 


GTACGTAATC 


GTATTGtTGT 


12000 


GATTTTAATG 


CACCAATATT 


AATCACCATA 


TCCACTTCAT 


TCGCACCATT 


TTTAATCGCA 


12060 


TCAGCTGCTT 


CGTATGCTTT 


AACTTCTGGT 


GTATTTGCCC 


CCAAAGGAAA 


GCCAATAACG 


12120 


GTACACACAG 


CGACGTCGGT 


ATCTGCTAAT 


TGCTCGCTAG 


CAAAAGCGAC 


CCAACAAGGA 


12180 


TTGATACATA 


CAGAGAAAAA 


GTTGTACTCC 


TTAGCTTCGT 


CAATAATTTT 


TTGGACAGCG 


12240 


GCTTCTGTCG 


CTTCTGGTTT 


CAAAATAGTG 


TGGTCAATCA 


TGCGGTTTAA 


TTCCATGATG 


12300 


AGAATTCCTC 


TTTTCTTTTT 


GTTTTATTCX: 


GTAATAATAT 


CGTGAATCAA 


AATAGGCTCT 


12360 


TCGCCGGTTG 


GTCCAATTTC 


AATGTTTTTA 


TAAAGCAATT 


GTTCGACTTC 


TGTCACGTCT 


12420 


TCACGATTGG 


CATAAATGGT 


TAGTAATGAT 


TCGCCTTTTT 


CAACTGGGCT 


CCCTACTTTT 


12480 


TTGTGAAGTT 


TCAAGCCAAC 


TGCAAAATCA 


ATCGTATCTT 


CTTTCGTAGC 


ACGACCAGCG 


12540 


CCCAGCAACA 


TTGCGGCAAC 


GCCAATTTCA 


TTGGCAACGA 


TTTTTTGTAC 


CACGCCTGTT 


12600 


TCTTTGGCAG 


GTAAATCAAA 


TTGATAAGTT 


GCGGTTAATA 


ATTTTTCTGG 


CTGGTCAACA 


12660 


ACGGAACTGT 


CGCCGCCTTG 


ATTTTCAATC 


ATTTCTTTAA ATTTTGCGAG 


CGCTCGACCG 


12720 
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CTTGTTAATG CTTCTTCTAA TAAAGCCCGT .GCTTCGTCTA 


ACGTCTCGGC 


TTTTTTAGCT 


12780 


AAAACGACCA TCTGACTTCC TAAGACATAA CACATCTCCA 


CTAAATCTTC 


TGGCCCATTT 


12*840 


CCTTGCAAGG TCTCGATTGC TTCAACGATT TCTAAACTAT 


TTCCGATGGC 


CTCCCCTAAA 


12900 


GGTTGTGACA TGTCTGAAAT TACTGCCATC GTTTGGCGAT 


TTGCCAGTTT 


TCCAATGCGA 


12960 


ACCATTGTTT CCGCTAATCG ACGAGCACTT GCTAGGTCTT 


TCATAAAGGC 


CCCTTCACCA 


13020 


GTAGTCACAT CTAAGACAAT CGCATCTGCC CnTGCGGCAA 


TTTTTTTACT 


CATAATGGAA 


13080 


CTTGCAATTA ATGGAATTGA GTCAACAGTG GCGGTCACAT 


CGCGTAACGC 


ATAAAGTTTT 


13140 


TTGTCTGCTG GTGCTAGATC ACCTGACTGA CCAATTACTG 


CCACTTTTGT 


ATCGTTCACC 


13200 


AATTGAATAA AACGTTCTTC TGTCAATTCA ACTTTAAAAC 


CAGGAATCGG 


TTCTAATTTA 


13260 


TCCAAcGTAC CGCCCGTGTA GCCTAAGCCA CGACCAGACA TTTTTGCTAC ATTGACCCCG 


13320 


ACACTAGCAA CTAACGGCGC TAAAATAAGC GTCGTAGTGT 


CGCCTACGCC 


ACCTGTAGAA 


13380 


TGCTTGTCGA CTTTGATTCC TTCTATAGAA GAAAGATCCA 


CAATTTCGCC 


TGAATGGACC 


13440 


ATTGCTAACG TCATTTGTGT AATTTCTTCG TCTGTCATAT 


CTTGGTAAAA 


AACAGCCATT 


13500 


AAA/VAGGcAC TTACCTGaTA ATCTGGaATT gAACCmTCTG 


tGtATCCGTT 


GATAATAAAT 


13560 


TGAATTTCTT CGCTTGTTAG TTGATGCCCG TCaCGTTTCT 


TTTCAATAAT 


GTCTACCATT 


13620 


CTCATGAATT CTTTCCCTCC GATTTCCCGa TAATGTATTA 


TACAACAAAA 


TCGTTCAGCT 


13680 


TTAAACCATT TAGTAAACCA GTTTACTATT ATTTTTTTCT 


TGTGAAAACG 


CAAACAACAC 


13740 


GAACATTCTA TTTCTAAGCT GTTTTTTAAA TCGGTTTTGT 


GCTTTTTCTA 


GCAATAAACG 


13800 


TCGCATCGAA AACTTTATTT GGAACTTTCC CTGTTGGATC 


TGCAATAGCT 


TCCACCAAAA 


13860 


ATTTTGCTGC ATAATAGCCA ATATCAAAAC TCGGCTGATA 


GACAGTTGTT 


AAACTCGGCA 


13920 


CCAAATATTC GGAAATTTCC AAGCCATCAA AACCAACCAC 


CGAAATATCG 


TCAGGAATTT 


13980 


TCTTGCCCGC TTCTTCAATG GCTTGATAGG CCCCCATCGC 


CATTTCATCA 


TTGCCACAAA 


14040 


AAATTGCTGT GATTTCTTCA TTTTTCAAAA CTTCTTTCGC 


CGCAGCATAG 


CCGCCATGCA 


14100 


AGTTCAATTC CCCAGCACAA ACATAATGCG GACGAAAAGG 


TAAATTATTG 


TCCTGTAACG 


14160 


CATGCTGATA CGCCGTCACT CGTTCCGTCA AACGATAATA 


GCCGGCACTT 


TCACGCAACA 


14220 


TGCCAATATG CTGATGCCCT TGTTGAATCA AATGATGGAC 


CGCTTGATAA 


GCACCTTCGT 


14280 


ACTCTTTAAC GATTAATCGT CCACTATCTC GTGGATTAAT 


CCCACGATCG 


ACAAGAATGA 


14340 


CAGGAATCGG ATGTTCTCGT CGGTCTTTTA AAATATGATT TTCTGGCAAA ACATTCGGCG 


14400 


TGGCAAAAAT AATGCCATCC ACCGACCGAT GAATCAACTC 


GCTAACATAT 


TGCATCGCTT 


14460 


GCTCGGAATC ATGCTTTGAA TTGCATAACA AAATCATGTA 


GCCTAACGAA 


TTTAAATATG 


14520 


TTTCTACACC TTCAATCACT TTTGAGAAAA AGAAATCCGT 


CACATCTGGC 


ACAATCATAC 


14580 


CAATGGTTTT GGAATGACGG TTGATTACAT TCGAAGCAAA 


GTAATCTGGT 


TTGTAATGAT 


14640 


TTTCAGCGAC TACCGCCAAC ACCTTGTTGC GGGTTTTTTC 


ACTGT^CGA 


. CTCCCTTTAT 


14700 
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TATTCAGAAT TTGTGAGACG GTTGTGACAG AAACACCAGC CATATCAGCA ATTTCTTTAA 
TCGTTAGTTT CATCGCGCCC CTCCCAACGT CTCCTTTTGC ATTGACAGAC ATTTGTAGAC 
GTGTTACAGT CTTCAAAATA CTTATCGAAT CTTTTACATT TTAACAAAAA AAAGACAAAA 
GAAATAGACT AAAAAGATTA AAAAACAAAA 6AAAATGGAG GCTATTTTTT TCATGGAAGA 
ATCCCGTGTC CGTCAATTAC CAAAAATCGA GCTGCACTGT CACTTAGATG GGTCGATTCG 
ACCAACCACT TTACGCACCA TTGCTGAAAA ACAAAACATT CCATTACCCC AAGACGAACA 
AGCCTTGAAA GAGTTAGTGG TTGCCCCTGA AAAATGCACC GATTTGAATG ATTATTTAAC 
TCGGTTCGAC TTCGTGCTCA CCTGTCTACA AACAGCTGAG GCTTTGCAAG CAGCGGCGTA 
TGACGTGATT AGCCAAGCTG CCGAAGATGG CGTGGCTTAC ATAGAGGTGC GTTTTGCGCC 
TTCCCAACAT ACAGAAAAAG GCTTACGGTT ACCTGAAATC GTGACTGCTG TTTTAACTGG 
CTTAAAACAA GGCGAAGAAG ATTTTGGTGT AAAAAGCAAT GCACTGTTAT GCGGCATGCG 
GCATGACCAA CAACAAGCCA TAGAAAAAAT CGTTCACTTA GCGCATGATT TCAGAGAGAC 
AGGCGTTGTC GGCTTTGATT TAGCTGGCAA TGAAGTCGAT TTCCCTCCAT ACACATTTGA 
AGACGTATTA GCTTTAGCCA ACCAATTGTC AATTCCATTA ACACTCCACG CTGGGGAATG 
CGGCTGTGGC AAAAATGTCG CTGATGCTGT CACTTTAGGT GCTACTCGGA TTGGTCACGG 
TATTGCTTTG AAAGATACAC CCGAATATTT GGCGCTCTTA AAAGAAAAAA AGGTTCTTTT 
AGAAATGTGT CCAACTAGCA ACTTCCAAAC AGGAACCGTC AAAACATTGG CTGAATACCC 
TTTTCAACAA TTTATAGAGG CTGGCTTAGC CGTCTGCATC AATACCGATA ATCGAACTGT 
TTCAGATACT ACGTTAACGA AAGAATTCAT GAAATTGGCC ACTTGGTATC AATTAAGTTA 
TGACGAAATG AAGCAACTCA CTAAAAATGC GCTTGCAGGT GCTTTTTTAT CGCCAGACGA 
AAAAAAACTG TTGAACCAAA AAATTGATCA AGCGTATCTA TTCTAAAAAA GAACGCCACT 
AAAGTGGCGG TTCTTTTTTA CTCTGCTTCT TTAACAtTCG TAAAATATAA TAGCCTTTGT 
CTTTTGTCAC AATTTCAACA TTGCCAAATA CTTCAGCCAT TTTCTTTTCT GCACTAGGTG 
CACCTTGTTT CTTTTGAATA ACCACCGTTA GTGTGCCACC AACTTTCAAC AACGGAAGAG 
CCCCTGTTAA AATGCCATGG ACAACTTTTT TGCCTGCACG AATGGGCGGG TTGCTAACAA 
TCGCCGCGTA GGTCGTTTCG TTTAAGGTTT CATACACATT GGAAGAATGG ATATCCACAG 
TAGTAATTTG GTTACGTTGG GCGTTCATTT GAGCTAGCCC AACTGCCCGT TGGTTGACAT 
CCACCATTTC CACCAGTCGA CCAGTCGCCG CTGCTAAGGA CAACCCAATT GGTCCATAGC 
CACAACCAAC ATCTAACAAA CGGCCTGCTG GTAAATTTTC CCATTCAAAG GCATCAATCA 
ACACACGAGA ACCATAATCC ACTGTTCCTC TTGAGAAGAC CCnG 
(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 



14760 

14820 

14880 

14940 

15000 

15060 

15120 

151B0 

15240 

15300 

15360 

15420 

15480 

15540 

15600 

15660 

15720 

15780 

15B40 

15900 

15960 

16020 

16080 

16140 

16200 

16260 

16320 

16380 

16440 

16484 



wo 98/50555 PCTAJS98/08985 

898 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 



CCGAGTAGTA 


CCAGAGAGGm AnTCCATGTT wAAAAcATTA CTTCTTTmCA ATCTAGTTAC 


60 


AGTAATATGa 


AATTTTGTTG AAGTGTTTTT AATTCGCCAG AAAAAAAGGT ATACTTGTTA 


120 


TAATrACAAA 


ACTAAAAAAT TACATTTACA TATAAGTTAA AAAAGGAAAG TTGGGGACGT 


180 


ATCAATGAAG 


AAAGAATCAA TGTCACGTAT CGAAAGAAGG AAAGCACAAC AAAGAAAGAA 


240 


AACGCCAGTA 


CAATGGAAGA AGAGCACTAC TTTATTCAGC TCGGCGTTAA TTGTTTCATC 


300 


TGTAGGAACG 


CCCGTTGCGT TACTACCAGT GACTGCTGAG GCAACAGAAG AGCAGCCAAC 


360 


AAATGCGGAA 


GTTGCCCAAG CACCTACTAC GGAAACTGGC TTAGTAGAGA CACCAACAAC 


420 


AGAAAcTACG 


CCAGGAACTA CGGAACAACC GACAACGGAT TCGTCAACAA CGACTGAATC 


480 


GACAACTGAA 


TCATCAAAAG AAACACCAAC AACACCAAGT ACCGAGCAAC CAACAGCTGA 


540 


TTCAACTACA CCTGTGGaAT CAGGaACGAC TGATTCTTCA GTAGCAGAAA TTACGCCAgT 


600 


AGCTCCTTCA 


GCAACCGAGT CTGAAGCAGC GCCTGCGGTT ACACCCGATG ATGAAGTAAA 


660 


AGTACCAGaA 


GCTAGAGTAG CTTCTGCGCA AACTTTTTCA GCGTTATCAC CGACGCAAAG 


720 


TCCTTCAGAA 


TTTATTGCCG AGTTAGCTCG TTGTGCACAA CCTATTGCGC AagCCAATGA 


780 


TTTATATGCA 


TCAGTGATGA TGGCTCAAGC AATCGTTGAA AGTGGTTGGG GAGCAAGTAC 


840 


GCTATCTAAG 


GCACCAAACT ATAACTTATT TGGGATTAAA GGCAGCTACA ATGGACAATC 


900 


TGTCTATATG 


GATACATGGG AATATTTAAA CGGCAAATGG TTAGTGAAAA AAG7U\CCTTT 


960 


CCGTAAATAT 


CCTTCTTACA TGGAATCATT CCAAGATAAT GCGCACGTGC TAAAAACAAC 


1020 


TTCTTTCCAA 


GCGGGCGTTT ACTATTATGC TGGGGCTTGG AAAAGCAATA CAAGCTCGTA 


1080 


CCGCGATGCA 


ACTGCTTGGT TAACAGGTCG TTATGCGACA GATCCTAGCT ACT^TGCTAA 


1140 - 


ATTAAATAAT 


GTCATTACCG CATATAACTT AACTCAATAT GATACACCAT CTTCTGGTGG 


1200 


AAATACTGGG 


GGCGGAACAG TTAATCCAGG AACAGGCGGC TCGAACAATc AATCAGGAAC 


1260 


GAACACGTAC TATACTGTAA AATCAGGaGA TACCTTGAAT AAAATTGCCG CsCAATATGG 


1320 


yGTGAGCGTT GCTAATTTAC GCTCATGGAA yGGCATCTCT GGCGATTTAA TTTTCGTTGG 


1380 


TCAAAAACTy ATCGTGAAAA AAGGTGCTTC AGGTAACACT GGTGGCTCAG GQAGCGGTGG 


1440 


tTCTAACAAT AATCAATCAG GAACGAAyAC GTACTAyACt tAAAtCAGGs GATACCTTGA 


1500 


AyAAAATTGC CGCCCAATAT GGCGTGAGyG TTGCTAATTT ACGCTCATGG AATGGCATyT 


1560 


CTGGCGATTT AATyTTCGyT GGTCAAAAAa TtATyGTGAA AAAAGGTACT TCAGGTAACA 


1620 


CCGGTGGCTC 


AAGCAATGGT GGTTCTAACA ATAATCAATC AGGAACGAAT ACGTACTACA 


1680 


CGATTAAATC GGGCGATACC TTGAACAAAA TTTCTGCACA ATTCGGTGTT AGTGTGGCTA 


1740 


ACTTACAAGC 


CTGGAATAAC ATCAGCGGTA GTTTGATTTT TGCTGGTCAA AAGATTATCG 


1800 


TGAAAAAAGG 


CGCCAACTCA GGTTCAACGA ATACGAACAA GCCTACGT^AT AATGGTGGCG 


1860 
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GTGCGACAAC 
TTGGCGTGAG 
CTGGTCAAAC 
ATAGTGCATC 
TGCAATACGG 
TTTATATTGG 
GGT/U^CATGA 
GATAGAGAGC 
ACTTTTTTGA 
AGGTGACGGT 
GTTCACGTCC 
AAGTCAGAAT 
ACTAGGAGGA 
TGGGCTAAGA 
CTAGATATGT 
ACAGCAACCG 
ATGGGCTGGG 
CCAGCTGAAT 
TTCAGCTATG 
CAATGGATAT 
AACTGGGTCC 
GAACGTGGCG 
GCTTATGCAG 
GATATGCAAC 
GGCACAGAAG 
TATACTGTTC 
GCAGCTGTAG 
TTAGCAAAAG 
CAAGAAATTC 
ATGGCGGTCC 
ATCCTACCAG 
CATATCAATT 
GAGTGGCTAG 



ATCCTACACG 
TGTTGCTAAT 
AATCATCGTG 
AGGCAAACGC 
AATCAGCATC 
TCAAACTTTA 
GTTTTATAAT 
GTATGGCTGG 
ACAAACCAAG 
GACTGACC6T 
TATAGGAATT 
AGTGACAATG 
ATAACATGAG 
ACAATTGTTT 
TTCCCTATCC 
ATATTCTTTC 
ATGCGTTTGG 
TTACTAAGAA 
ATTGGAATCG 
TTACAAAATT 
CTGAATTAGG 
GTTATGATGT 
ATCGCTTATT 
GAAATTGGAT 
AAAGTTTCAC 
TAGCTCCTGA 
AAGCATATAT 
AAAAAACAGG 
CAATTTGGAT 
CAGCACATGA 
TAATCGCAGG 
CTGATTTCTT 
AAGAAAATCA 



ATTAAATCAG 
CTACGTTCAT 
AAAAAAGGCG 
CATACAGTTA 
CAAAAAATCA 
AAAGTTGGTT 
AGAAAAAAAC 
TGGAAATACG 
TAGGAAAAGT 
TTCTTTGACG 
TTTCCTATAG 
GCTGCAGGTG 
CTACAATCAC 
CAATACATTG 
ATCTGGACAA 
ACGTATGAAA 
TTTGCCAGCA 
AAATATCGAA 
TGAAATTAAT 
ATATGAAAAA 
AACTGTTATT 
GGTTCGCCGA 
AGAAGATTTA 
TGGACGTTCT 
GGTGTTTACA 
ACTAGAACTA 
TGAAGAAACC 
TGTTTTCACA 
TGGCGATTAT 
TGAACGGGAT 
TGGCGACATT 
AAATGGATTA 
CGTAGGGAAA 



GTGATACGCT 
GGAACGGGAT 
CTTCTGCAGG 
AAAGCGGTGA 
AACAATTAAA 
AATTTAAGAT 
AATGAGAAGG 
TACAGAAGCT 
CGGTGACGAT 
AATTTAGGTG 
GACTTTTTTT 
GCAAATTTTA 
AAAGAGATTG 
GACGACCCAA 
GGCTTACACG 
CGTGCGCAAG 
GAGCAATATG 
ACATTCCGTC 
ACCACTGATC 
GGGTTAGCTT 
TCAAATGAAG 
CCAATGCGTC 
GAGCTTGTTG 
GAAGGAGCCA 
ACCCGTCCTG 
GTGAAAAAAA 
TCAAAAAAAT 
GGTGCGTATG 
GTTTTAGCAA 
TACGAATTTG 
ACAACAGAAG 
AACAAAGCAG 
AAAGAAGTAT 



GAATAAAATT 
CAAAGGCGAT 
TGGCPATGCt 
TTCACTTTGG 
TGGCTTAAGC 
TG7VAAAAAAT 
AATAGTAGAA 
TTTGAACTCG 
CGTTAAGACG 
GTACCACGTT 
TATCACTAAG 
AGAATGAAAA 
AGAAAAAATG 
ATAAAGAAAA 
TAGGTCACCC 
GCTATAATGT 
CGTTAGATAC 
GCCAAATTAA 
CTGAATATTA 
ATGAAGCAGA 
AAGTCATTGA 
AATGGATGCT 
ATTGGCCAGA 
ATGTGACCTT 
ATACCTTGTT 
TTACGACACC 
CTGATTTAAA 
CTATAAATCC 
GCTATGGCAC 
CGAAAACATT 
CCTATACAGG 
AAGCCATCGC 
CTTATCGTTT 



TCTGCACAGT 
TTAATTTTTG 
TCTTCAACAA 
GGCTTATCAA 
GGGGATACAA 
AGCTATCTTT 
AGAACCATGT 
CCTTTAAGTT 
TTTGAGGTTA 
GCATTTGTAT 
TGGCTAGTTT 
ATTTTATATT 
GCAAAAATAT 
ATTTTATGCA 
GGAAGGCTAT 
GTTGCATCCA 
AGGAAATGAC 
TTCACTAGGA 
CAAATGGACA 
AGTTGCGGTT 
TGGAAAAAGT 
GAAAATTACT 
GAGTATTAAA 
TAAAGTCGCT 
TGGTGCAACC 
TGAACAAACA 
TAGAACGGAT 
AGTCAATGGC 
AGGCGCAATC 
TGGCATTGAT 
GGATGGACCG 
TAAAATGAAT 
ACGTGACTGG 



1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 
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TTATTCTCTC 
ACAACCACAA 
CGCCCAAGTG 
GACCCTGAAA 
AGCTCTTGGT 
GAAAAATTAA 
CTGCATTTAC 
ACCAAAGAAC 
AAAATGTCTA 
GCGGATACGT 
AATGAAAATG 
GATGAAGAAG 
GTTTATCATC 
GCGATTTCTC 
GAATATGTGG 
CTATGGCAAA 
GAT^CGGCCT 
GGCAAACAAA 
CATGAAGCTG 
GATAAATTAG 
GACGTATCAA 
GTTCTTTACA 
GCGGCGTACA 
CTAAAATAGA 
ATTTTTTCAA 
ACTTTTTTTT 
GTTGGT7UVTT 
GGGCAACAAT 
CTTCCATTAA 
AAATGGTCAG 
TCTTAGCTM 
GCAAACCTTT 
TCATTCCGAA 



GTCAACGCTA 
CGGTTCCTGA 
GAACTGGGGA 
CTGGCATGAA 
ATTACTTACG 
AACGTTGGTT 
TTTATGCTCG 
CTTTCCAAAA 
AATCACGTGG 
TACGTCTTTA 
GATTAGAAGG 
GCAAAATGCG 
AAACGGTTAA 
AATTAATGGT 
AAGGATTTGT 
TTTTAGGTAA 
TAGTAGAAGA 
ATGTCGCTCG 
TTAAAGAATT 
TAAATATTGT 
AACGTCAGGC 
TGTGATTGTT 
GAAAAAGGCG 
GATCATAAAT 
CTTCAGTTCC 
AGGCAACTGT 
GGTTAAGATC 
AAAGTTCAGG 
TAAGCCAAAT 
AATAAGTAAT 
GACACCAACG 
TGATAAGGCG 
AACCATCAGA 



CTGGGGTGAA 
ATCTGAGTTA 
ATCGCCATTA 
GGGAAAACGT 
ATTCATTGAT 
ACCAGTTGAT 
TTTTTGGCAT 
ATTATACAAC 
CAATGTTGTA 
TGAAATGTTC 
AAGTCGTAAA 
TGACCGAATT 
AAAAGTGACA 
TTTTGTGAAT 
CCAATTACTT 
CGAGGAAAGT 
TGAAGTGGAA 
TGGGTTAAGC 
TATTGAAGGA 
TGCAAATTAA 
TCTTTTTAAG 
ATGACTGAAA 
GGTTCTTTTA 
AATAAAAGAC 
TCCTTTAATG 
TAAAACTTAG 
ATACGTGAAA 
CCCCAAGCTT 
AAAGACATAC 
AGCCGTTTTT 
ATGAAAACAT 
TACAATGAAA 
AAGCACATTA 



CCAATTCCTG 
CCTCTACGTT 
GCAAACATTG 
GAAACGAATA 
CCTCATAATA 
ATCTATATTG 
AAATTCTTAT 
CAAGGTATGA 
AATCCCGATG 
ATGGGCCCAT 
TTCTTAGATC 
ACCACAATTA 
GAAGATATGG 
GAAGCCAATA 
GCGCCAATCG 
TTAACTTATG 
GTAGTTTTCC 
AAAGAAGAAT 
AAAACAGTGC 
GTTTACATGT 
GTTGTTTTGT 
GCT^GGCGAT 
AAAGTCCATC 
TAATCAAGCC 
AACTATCACT 
ATAAGAAATC 
AGGGACCGAC 
GGAAATACGC 
AAGTCACCAT 
CTTTATTAAA 
CTAAAATAAA 
ATTCTCCATG 
ACGTAgTAAA 



TGATCCATTG 
TACCAGTAAC 
ATGAATGGGT 
CTATGCCACA 
AAAATGAAAT 
GTGGTGCCGA 
ATGATATTGG 
TTTTAGGAGA 
ATGTGGTGGC 
TAGATGCTTC 
GCGTTTGGCG 
ATGATGGCCG 
CAAACTTGCA 
AAGTGGATGC 
CGCCACATAT 
TCCCTTGGCC 
AAGTGAACGG 
TAGAACAAAT 
GCAAAGTGAT 
TTTTATTAAA 
TTTACCTTTT 
GAAGAGAAAA 
GCTAGATAAT 
AAGAAATAAG 
AAGATAAATC 
TATTTTCTAA 
AATCAGCAAT 
ACTTAGTGAC 
GCTGATAATC 
GGGTAAA6AA 
AGCAACGATA 
CAATAATAAA 
AAGCATTCCT 



GGAAGATGGA 
ATCGGATATT 
CAATGTCGTC 
AT6GGCTGGA 
TGCTGATTTT 
ACATGCGGTG 
TGTGGTTCCT 
AAACAACGAA 
TAAATATGGT 
CATTGCTTGG 
TCTGATTGTT 
TTTAACGAAA 
CTTTAATACrA 
CTTACCTTAT 
TGGTGAAGAA 
AACCTATGAT 
AAAATTACGT 
TGCAATGAAC 
TGCTGTTCCA 
AAAAGAGCCG 
TGATTCATAT 
AAGTAACCAA 
GTTGATCCAA 
CTAATACGAT 
AAAAAG6TTA 
GTTTGTCGTT 
TGATAAGGAA 
AACGTCTGGC 
ATGAGACTCG 
AAAGCGATTT 
AATCCAGGAA 
TTATAGGTGC 
TCTTTTTTAT 



3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 
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TAGTASGCAT TTTTTTCTCC TTTCATTTAT TGAGAGTAAT GATTGTTCTA GGCAGAAAaG 5880 

GACAAATTCT CTCGACTTTT ACTAGTGTAG CAAAAATAAA AGTTCTTCAT AAGTTAGAAT 5940 

TTGCTAAGAA TTTGATTCAA ATAAAAAGGT GTACTAAGTC AGCAGTACAC CTTTTTATTC 6000 

GTTATTTAGT TTATTTTTGT GTTATTTGTA ACGTCCTCTC CAAACCATTT TTGAGAAATT 6060 

TTACTAAGGG TGCCATCTTT TCGTAACGTT TCAAAGGCAG TATTGATTTT TTGGACTAAT 6120 

TGATTGTCTG ATTTGCGGAC GCCCACAGCA AAATCTTCAT TGTCATAGCC TACATGAGAA 6180 

ATAGTATAGT TTTTTAAATT ATCTTCGTGG GAAAGATAGT AGTTGGCGTA AACGCGATCG 6240 

ATTAGGAGTC CGTCAATTCG ACCAGATTTT AAATCTAAGA AAGCTTCATT AAAGCCGTCA 6300 

TATAAAATAG GTGTTTGGTC TTTAACAAAT TTTTTCAAAA CGTCAGGCTG ACTTTCGAAG 6360 

CCATCATAGC CAGAAGAGCC GTTTTGAACC CCTAAAATTT TGCCTTGCAT GTCGCTCGCT 6420 

GTTGCAATGT TTTTTTCTTT TAAAGAAACA AGTACTTGGT CGTTCGTCAT GTAAGGTTGT 6480 

GTGAATTGAA CTTTTTCGGC CCGCTCGCTC GTTTTAGTGT AGCCGTTCCA AATAAGATCA 6540 

ATGGTTTGAT TTTGTAATTC TGTTTCTTTC ATAGACCAAT CAATCGGTTG GAAGTCAACG 6600 

GAAATGCCAT AAAGTTTAAA AACCGCTTTG GCT7AGTCGA CATCAAAGCC GACAATTTTG 6660 

CCTGATTTAT CTTGAAAACC CATGGGCACA AAGGAGTCAT CTAAGCCAAT AATAATCCGT 6720 

TTTTCTTCGT TAATCCGTGT CCATTGATCT TCGTTGCTTT TTCTTTTACC ACTU^CCTGCA 6780 

AATAGGAGTA AACTACAACA GATAACCAGC AAGGCTAAAG AATATTTCTT TTTCATCATA 6840 

AACTCCTTTC TATTGGACGG GGGCAACCGT CAAAAGATTG TCTGCAATTT TTTCAGCGAA 6900 

AGCCATATCG TGGGTCACGA CAATCTGTGT CATGCCTTGT TTCCGTAAAT CTAAAATAAC 6960 

TTCTTCCACT TGTTGGCGTA AAGCTGGATC TAGCGCGCTC GTTGGTTCAT CGTACCCAAG 7020 

AACTTTTGGC TTCATTGCTA AGGCACGAGC AAGTGCAACT CGTTGTTTTT GTCCACCGGA 7080 

AAGTTGGTAC GGATAGAGTT TTTCTTTACC AGCCAAGCCC AATTTTTCCA GTAATTCTAA 7140 

AGCTTCTTGC TGACTTTTTG CTTTTTCTTC TTTTAGTACT AATGTAGGTG CTAAGGTAAT 7200 

GTTTTCTAAA ACTGATAAGT GAGGGAAAAG TTGGAAATCC TGGAAAACGA CCCCAACCAC 7260 

TTGATCGGCA TTGTCCATTT CaGCAGGATT GAAAGGAACA CCATCCAACA AAAGCTCACC 7320 

AGAATCAATG GTTTCTAAAC CAGCCAAACA ACGAAGTAAA GTGGTTTTTC CGCCGCCTGA 7380 

AGGACCAACA ATCGTTAAAA TTTTACCATC TGGAATTTCT AAATTTAAGT GATCAATAAT 7440 

GGTCCGGTTG TCAAAATGTT TGGTTAAATT TTTAATAGTT AACATGGTAG AGTCCTCCTA 7500 

TTTATAATAT TGATAGAATT TTTCAATTTT CTTAGAAGCA ACAGTCAAAA TTGCCGTAAA 7560 

GAGTAAATAA ATCACTCCCA CTAAAGCTAA TGGCACTAAT GTAACATCGC GGCTCATCGC 7620 

AATTTTACCA GCACGTAAAA GATCGCCTAA GCCAAGAACA TAAATAAGAG AAGTATCCTT 7680 

GACCAAATTG ATAATTTCAT TTCnTACGGA AGGCAGCACG ATTTTAACAA CTTGTGGCAA 7740 

AATAATCCGC GTGACTGTTT GAAAACGTGT TAAACGTAAA ACCTTAGCAG CTTCATATTG 7800 
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GCAAAGTACG 
ACAATACCAA 
CCGCGCATTA 
AAAATAAAGG 
TTCAGTGTCA 
TTTTTTCCTC 



CAGCATAATT 

TTAATGGTAA 

ACCAAATGTA 

CAATGATGAT 

TGCCAGTCCG . 

CAACAATTTT 



ACCTTGGGGA ATTGCTTGAA TCCCCCCGCG GAAAATTTCA 
TAAAATAAAT GCGAATAAGG CCGCATCGTA ACGTTCAAAG 
GCCGTAAAAA ACAAAAATTA ATTGTAACAA TAGCGGAGTG 
GAGGTTAATT AAGTAACGTA AAGGTTTAAA CGAACTTTGT 
CCCCAGTGGA ATCGAGCCAA GGAGCGTAAA GAAAMGAmT 
CTAACTAAAG CAGGTAAAAT TTCTAAAATA TAATCCATCA 
(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6253 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 
CAnCCCAAAT TTnTAGCTTG ATTCTGCAGC TTTTGATAAC aTTTGGTCAA TTTTTGGTAA 
TCCATTAAAT CAACTAACTG ACCTTGTTTT TUVCATGkkrs tAAAGTCTAA ATCCCAAAGC 
ATTTCTTTAG TCGGCTGTTG CAATAAATAA TACATAGAAT GACTACTAAT AATTTCAGTG 
GTCGGTTCTT CAATTGTTTC ATAAAAAGTC ACAAAATATT CTCGCTCAAT CACAGTAATA 
ATTTCTTCTT CTGAAAGCTC ACAATCTTCT AAATTTTGAT AGAATATCAA TAATTTAATC 
CGTTCATCTA TTTTATTCAG CGAancGACt AATTCAGGAA AAAGAACCGA CTGACAATTT 
TCTACTAATT CTCTTGCCTT ATTTTGTTCA TTTTTATAAA AGTTGTATAA CTTTTTCCGC 
TGTTTTAACG AGTTTAGTAA GTTATCGGGT ATATTCCAGT AATATTGTTT CTTCATAGCA 
TTCCCACTCT TCTTTTTTAT GTACAATTTT ACAAAATTGG TCTTAGCCAA TTTCGTCTCT 
AGTTATTCTA TCAAAAAAGT GATGCTTTTT TACTGACaAT AATTACTCGT TTTACGCCCA 
AAAAGTGAAA ATCTAGCTGT TAAGTAATTA TTGGGGCTTG TTyCCCTAGG gCCGAaAAAA 
CTITTTCCTT TTAAAAGTCC AGAAAAAGAC CTACACTAGA GATAGACAAA CGTTGGAGGG 
ATTTAAATGG ATTTACACTT AACGAATAAA TTAGCATTAA TTACTGGTTC AACCAAAGGA 
ATTGGCAAAG CAATTGCGAT TGAAATGGCT CGTGAAGGGA CCGATGTCAT TATCAATGGG 
CGCAATGAAG CCGAAGTAAC TAAAGTTGTT TU^AGAAATAC AAACAATGTT TCCAGACACT 
CATCCTCAAG CAGGGACTGC CGATATTTCC ATTGAAAGTC AACGAGCTAC TTTGCTTGAA 
AAATTCCCTA AAGTCGACAT TTTAGTGAAC AATATGGGGA TTTTTGAGCC AATGGAATAC 
TGGGACATCG ATGACGCCAC TTGGGAAAAA TTTTTTACTG TGAACGTGTT GTCAGGCAAT 
GCATTAGCAA-AAGCTTATCT ACCTAAAATG CTTGCACAAG ATTTTGGTCG CATTATTTTC 
ATCGCTAGCG AAGAAGCGGT GATGCCTTCT GGCGaAATGC CCCAATATAG CATGACAAAA 
ACGATGAATC TTTCCTTAGC TAAAAGTTTA TCCAACTTAA CTGTTGGCAC ACATGTCACC 



7860 
7920 
7980 
8040 
8100 
8160 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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GTTAACACGG TTATGCCTGG CTCAACCCTT ACCGAAGtGT AGAAAAAATG TTGGAAGATA 1320 

TGTACGCTGA TTCAGACATT CCCAAAGAGG ATTGGGAAAA AGATTTCATG AAAAATCATC .1330 

GTTCTCGTTC ACAAATCCAA CGGCTCATTC GACCAGAAGA AATTGGTCGT TTTGTTACCT 1440 

TTGTGGcCAG CCCAGATTCT TCTTCTTTCT CAGGAGAAGC CTTAAGAATC GATGGCGGCT 1500 

TAGTTCCTWIC AATCTTCTAA AATAAACAAA AAAGACAGCC AGCAAAAATT CACATTTTTn 1560 

GCTGGCTGTC nTnTTArGCA TATTAATTGA TTACTTxnCCA AAAGTAAGTT ATATTTATCT 1620 

CGAATTCGAA wTAAAAAGAG GTGAAroCAAT GAACCAACAG CAAGAAGCTT TAAAAGCCTA 1680 

TATCGGTTTA TTAAGAACCA GCCATCGACT AGAGCAACTT GCCAAGCAAG ATGTCACTTG 1740 

TTATGACTTA AACATTACAG AATTTTCAGT ATTAGAGCTG TTACTCCATA AAGGTCCTCA 1800 

GACCATCC7VA AAAATTAAGG AGAAAATTTT AATCGCTAGC AGTAGCACCA CTTATGTTAT 1860 

TGACCAATTA CATAAAAAAG GCTATGTAAC GCGCACCCCC AGTGAAAAAG ACCGACGCAT 1920 

TACTTACGTT GAATTAACAG AAGCTGGAAA AACATTAATT GAAGAAATTT TCCCGACGCA 1980 

TGCAAAGCGA ATTGCAGAAG CATTTGAACA ACTCTCTTCC GAAGAATTAA CACTTCTTCA 2040 

AAAAACTTTA CGAAAAATAA CAAATGAaAC GAAATGAGGA AATAATGATG AAAAAAGAAG 2100 

ATCAATTATT AGGAATCCAC CACGTTACAG CTATGACAAG TGATGCAGAA AAAAACTATC 2160 

ACTTCTTTAC AGATGTTTTA GGGATGCGTT TAGTCAAAAA AACAGTTAAT CAAGATGATA 2220 

TCTATACTTA CCATACCTAT TTTGCTGATG aTTTGGGTAC ACCAGGTACA ACCaTGACCT 2280 

TTTTCGATTT TCCCAATAAC CCTAAAGGAT TAAAAGGAAC CAATACAATT TCAAGAACAG 2340 

GGTTCCGGGT TCCTTCAGAT GCAGCTTTGA CTTATTATGA AAATCGCTTC AaTGAATTTG 2400 

CTGTCAAACA CACAGGCATT TCTGAAGAAT TCGGGAAAAA AGTCCTTCGC TTTTGGGATT 2460 

TTGATGATCA AGCGTATCAA TTAATCTCTG ATGAATTAAA TCAGGGCGTT GCAGCGGGCA 2520 

CCCCTTGGAA AAAAGGACCT GTTCCAACAG AATTTGCGAT TTATGGCTTA GGACCTGTCG 2580 

AAATAGCTAT TTCCTATTTT CATGAATTCA AAGAAGTCTT TGAAGAAATT CTAGGCTTTC 2640 

ACGTAGTGGC ACAAGAAGGC AATCGCTATT TACTAGAAGT TGGACAAGGT GGCAATGGTG 2700 

CCCAAGTCGT TTTGGTAGAC GATGATACTA GCTCACAAGC GCAACAAGGA TATGGTGAAG 2760 

TACATCACGT TGCATTCCGC CTAGCGGATC GTAAATCACT TGGGACTTGG CAAGCGCTCT 2820 

TTGATCATTT AGGCTTACAA AACTCTGGCT ATGTCGATCG TTATTATTTT GAATCATTGT 2880 

ATGTTCGCAT TGGGCATATT TTAGTCGAAT TAGCCACCGA TGAACCAGGA TTTATGGGGG 2940 

ATGAACCTTA CGAAACATTA GGAGAAAAGT TATCTCTTGC GCCATTTTTA GAAAACCGTC 3000 

GTGAATATAT TGAGAGTGTG ATCAAGCCTT TCAATACAAA ACGAGCCTAA GGAGGAAATU^ 3060 

ACATGCATTC AaTTTTTAAA AAA6GACATC CTGAAGCACC TGTCTTTGTG CTACTTCACG 3120 

GTACAGGTGG TGATGAAACA TCTCTCCTAC CAATTGCCCA AGAACTAAAT AAACAAGCTA 3180 

CTGTGCTAAG TATTCGTGGT GATGTTTCAG AAAATGGAAT GAATCGTTAT TTTAAGCGCC 3240 
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TAGCGGAAGG 


TCATTATGAC 


TTAGAAGATC 


TAGAAAAACG 


CGQCGAGGCG 


CTTCATAAGT 


3300 


TTATTCAACA 


AGCCGCTAAC 


GAGCATCAAT 


TTTCATTGGA 


TAAAATTATT 


TTTATTGGCT 


3360 


ATTCAAATGG 


GGCCAATATC 


GCTATTCAAT 


TATTGCTTAC 


TCATCCCGAC 


AGCTACCATC 


3420 


AAGCTGTCCT 


CTATCATCCC 


ATGTTTCCTG 


TTGAATTGAC 


CAATCAACCA 


GACTTGACCG 


3480 


ACACTTCTGT 


TTTATTATCT 


CTAGGAGAGC 


ATGACCCGAT 


TGTCCCACTT 


CCTGAGAGTA 


3540 


TGCGTGTGAT 


TCAACTATTT 


CAGAATCATG 


GAGCAACCGT 


ACAAGAGGTT 


TGGACACAAA 


3600 


GTCATCAATT AACTTATCAA GAAATTAAGG AAACACAAAC TTGGTTGGCA CATCTGTCCT 


3660 


CTTAACAAAA 


AAGTGGGTCC 


TAAGTCATTT 


CGACTTAGGA 


CCCACTTTTT 


CTATTTTAAG 


3720 


ATAATTCACT 


GTCTGAGCTT 


CTTTTTATTC 


TGCCATTTGA 


ACAACTATTT 


TACCAACTGC 


3780 


ATGGTGCGTT 


TCACTGAGTG 


CATGTGCATC 


ATAAATCCCT 


TGTCTAGAAA 


AAGGGAAAAC 


3840 


TTCACCAATA 


ATTGATTTAA 


CTTTTCCAGC 


TGCCATCAAA 


TCAGCAATTT 


TTTGTAATTG 


3900 


TTCTCCATTT 


GGTTGAAGCC 


AAATACTTTC 


AGCAGAAATA 


TTTTTTTCGG 


CTGCCAATTG 


3960 


TTTATCTTCA 


ATTCCCACAA 


TTGAAACAAG 


ACGACCGGTG 


TTTGGTTTTA 


AGACGGCAAA 


4020 


ACTATTTTTT 


TGAACCTCAC 


CACCCATTGT 


ATCAAAGACT 


AAATCAACAT 


CAGTTAATAC 


4080 


CTCTGCAAAA 


TTCGTTGTAT 


GATAATCAAT 


CACTTCATCT 


GCACCAATTT 


TTTTCAGTAA 


4140 


GGTATGATTT 


TTTGCGCTAC 


GGTcGTGATG 


ACATGTGCGC 


CCGCTTCTTT 


TGCTAGTTGA 


4200 


ATCGCATAAG 


TACCGACCCC ACCTGCACCC 


GCATGAATTA 


AAACAATTTC 


TCCTTCTTTA 


4260 


AGGTGACCAT 


GATCAAACAA 


TGCTTGCCAA 


GCGGTTAAAC 


CAGCCAACGG 


AACGGCAGCC 


4320 


GCTTCTTCAA AGCTAATTGT 


TTCAGGGATT 


TTtGCTAATA AATGATCATC 


CACAATCGTT 


4380 


ACTTCTGCAT 


AGGTACCAAA 


ACGAGTAGTT 


TCAGGACGAG 


AAAAAACTTT 


ATCGcCAACT 


4440 


TGCCAATCCG 


TTACTTGACT 


CCCcACCTCT 


GTAATCACAC 


CAGCGACATC 


CCAACCAAGA 


4500 


ATAATCGGAA AGGACCAATC AAACATCTGT 


TTtAAATATC 


CTTCACGCAA 


TTTCCAATCA 


4560 


ATCGGATTAA 


TTGATGTCGC 


GTATTCTTTT 


ACCAACACTT 


GATGTTCCGA 


TAATTCTGGC 


4620 


AGTGTTACTT 


CTGCTTCTTC 


AAGTACCTCT 


TTACTTCCGT 


ACTGATTAAT 


CACAACAGCT 


4680 


TTCATT^CC 


AACTTCCTCT 


CATTTGTTTC 


ATTTTTCACA 


CTTTTAGTAT 


ACGnCCTATT 


4740 


GCCTTCTTTT 


TATATAAGAA 


TATGCTTTAG 


TTAAATTGAG 


AGAAAGTAGC 


TGTTTTTAGT 


4800 


CTATTATTCA TAaCGTAAAG ATTCAATTGG ATCTAATTTT GgCTGtCTTC GAGCTGGTAA 


4860 


AGTTCCTGCT 


AAAAAAGCaA 


TAAACATAAT 


AACTAAGATA ATCGTTAGTG 


AAGATGGCAA 


4920 


TGAAAATTGA 


ATTAATTTAA 


ATCCTGTTAA 


AGCCTTTAAA 


AAAGAATCCG 


TTGCCAAACG 




ATT7UVCTAAA 


TTTCCAACCC 


CAACAGCTCC 


TAAAATGCCT 


AATATTGAGC 


CAAAAAAGCC 


5040 


AATTAAAGCC GCCTCaACAC TAAAAATTGT AAAgACtTTC 


CCATTGCTAA 


GACCCATGGC 


5100 


TTTCATTAAA 


CCAATTTCTC 


GCGTTCTTTC 


TTGGACAGAC 


ATATACAGCG 


TGTTAATAAT 


5160 


GCCAAAGcTT 


GCAGCTAACA 


AGGCAATTGC 


ACCAAACATG 


GTTAAGACAC 


CAGTGATTGC 


5220 
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ATTGATTATA TTACGAATCA TGCCAATTTC ATCTTCGACA GTAGTCGCTA 
TTTGTCTAAA TCTTTTTTGA TGTCCTTAAT CTGCTCTGGC GTGCTGTCTT 
AGCAATAATC ATCGCGTATT GATTTTTTAA ATGTTCTGGA AGATCTGCTT 
GACAACTTTA TCAATCAATG CTTTATTCAT CAGCGAGAGC CCATTTTGAA 
ATTTCTAACG CCAACAATTT TTGCTTCAAT AACTTGCTCT TGTCCTTTCA 
GATACCTAAT TGAACTGTTT CCCCCACGGC AGCTTTACTA GATGTATAGC 
TACATATTCT GGTGACAAAT TAATTTCGAA GTCCTGGCTC GTTTGCGAAA 
AGCCGCTAAA TCAATGGTCA TCTCATCTAG CGCAGAAGTT GCTGAAAATA 
CTTATCTGCT CCTTTTATAT AATCGATAGC AACGGATTTC ATTGGTTCCA 
ATCCGAGATT TTTTTAATCT TCTCAATATC TTTTTCAGCA AGCATACTTT 
CGTACTGGTT TTTTTCTCTG GATTATATTT ACTTGGTTCT GTTCCGTTAC 
CTCCATTTTT GGCTGAATAA ATAACTGATT TGCACCACCG ACACTGCCGA 
AATATAATCA TTTACACCAA TATTGACTCC TGTAGTTAAA GCTATGGTAA 
AAAGATTGCA ATAATGGTCA ATACGGTTCG ACCTTTATTA CGCATTAAAT 
TGATTTTAAA ATATCTCTAA ATTTCATTTA CTCATCTCCT CCTACAATCA 
AACATGAACT TGCCGATCAC AACGTGCTGC TAAGTCTGGA TCATGTGTCA 
GGTAATcCTT tATTTTTATT TAAAyCA/UT AACAATTCTT CAATTTTCTT 
GAATCTAAAT TCC 

(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 
TGTGGCAACT TAAAAAAGAA CAAAATGTTC CCATTTATCG CAGTATTATG 
TTCAAAAGAT TCAAACAGGC GAATTATTAC CAGGCGAAAA AATCCCTTCG 
TGGCAGAGTA CTTGTCTGnC AATCGTTCAA CCGTTGTTCA TGCATTAGAC 
ACTTAGGCTG GATTGTGCGA AAAAGAGGGA GTGGAnGTnT GTCAACGAAG 
GGTAATGATG ACGCCACGAA CGGATTGGCA TCGTTATTTA GAACAAAATG 
AGTTCATCCC TTGGTGACAG AGATTGAGTT ACGTGCTAAA CAAAATTTGC 
AGATATGTAT ACAGGCGAGT TGCCGTTAGA CTTAATTCCC AGTTTCGATT 
AAATTGGAAA CAATTTTTAA AAGAGGAAGC ACAAGATGAA TTAGGTTATT 
AAAAGAAATC CAAGCCATTA CAGCAACGGA ATATCAGCTT CATTTACCAA 
ATTGCTTACT TCAGGTGCGC AACAAGCACT TTTTTTAGTT TTACAAGTTT 



AATAGCCAGC 
TTTTAACTTC 
GATTGATTGA 
TGACACTGGC 
AGGAACTTGA 
CGAGCGCTTT 
CTTTACGGCC 
CATATTTATG 
CTGATGTAAC 
GTTGTTGAAT 
CAACATTCAT 
CTTGTTTGTC 
ATGCTCCAAT 
TGGTACTAGC 
AGcCGTCTCG 
CAATAATtAa 
CCCAGTGGCA 



GAGATGATTG 
GAACGTAAAC 
GAACTGGTGG 
GAAAATGGGG 
TGTTCAAACA 
XGACTGACTT 
TTCCAGCATT 
TGCCGTTAAG 
GTGAAAGTTT 
TATTGAGACA 



5280 

53'40 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6253 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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AGGAGATAGT GTTGCCGTAG AAGATCCATC TTTCTTCTAT GCCTTACCAA TTTTCCAAGC 


660 


TGCTGGTGTG 


CGCTTATTTG 


GTGTGCCCAT GACAGAGGAA GGGATTGATC TTGAAGCGCT 


720 


AGAACAAACA ATTCGACAAC 


ATCGCATCAA AATGGTTATG GTTAATCCTT CCTTTCAAAA 


780 


TCCGACTGGG ACAGTGATGC CTTTGAGAAA AAGAGAACAG CTTGTGAAAG TCTGTCA7UVC 


840 


GTATCAAGTT 


CCTATTTTAG 


AAGATGATGT TTTTGGGCAG TTAAGTTTTA TCCCGAAAAC 


900 


AGAAATTCCT 


CCTTTAAAAA 


AATTAGATCC AGATAATGTC TTGTATATCG GTTCGTTGTC 


960 


AAAAATTTTA 


GGTTCGACCA 


CGAAAATTGG TTGGTTAAGT GCGCCCGCCT CCGTGACGAA 


1020 


ACAAATTGCA 


GAAGCCAGAA 


AAATGATGGA TGTTTCGTTG AGTATTTTTC CACAGATGCT 


1080 


CGCTAAAATG 


GCGATTGAAG 


ATCCTTCTTT CTCTGAAAAA ATTACGTTAT TAAATAAACA 


1140 


AGTGGAACAA 


CGAGCAACAG 


CTGTATATCA AGTATTTAAA TCGTTGTCAG AATGGGAAGT 


1200 


TTCGCCAGTA 


AAAGGCGGCT 


TTTATTTATG GGCACATTGG CGTCAAGGGG CACTTAAGCC 


1260 


AGAAGATTGG 


CAAGTGTTTT 


TACGAGAAGG TGTTTTGGTA GCACCAAGTG TAGCTTTTAG 


1320 


TGAAAAACGT 


GGAAGTATTC 


GCTTGAACTG TTCCCGAATT TCACCAGAAG AAATGCCTCT 


1380 


TTTTTGTGAG 


CGAATGGTAC 


GGATTACACG ACAGCTTTCT GAAAATAGAC AAACAAAAAT 


1440 


AGAACAGTAA 


GTAAGATTTT 


TCTTATAAAA ATCATTTGCT GAGAGAAAAC GCTACAACTT 


1500 


CCCTTTTCTA AACGAATATC 


CAATGCTATA CTTAA/^GT AAAAACAAAA AAACAAATGA 


1560 


GGTGTTTTCC 


ATGTCTTGGG 


AACAAGTTTA CCAACAATGG TTAGATGAAG AAAATATTCC 


1620 


AGAAAATTTA 


AAAAATGAAT 


TAAAGGACTT AAATACAGAC CCTGAAAAAT GTGAAGATGC 


1680 


ATTTTACGCA 


CCATTAGAAT 


TTGGGACTGC CGGGATGCGT GGGATTTTAG GCGCCGGTAT 


1740 


TAATCGGATG 


AATATCTTTA 


CGGTTCGTCA AGCAACAGAA GGACTTGCAC GTTTTATGGA 


1800 


TACGCAAGAT 


CCTGAAACAA AACGTCGCGG TGTAGCTATT GCCTATGATT CACGTCATAT 


1860 


GTCTCCAGAA 


TTTGCAATGG 


AAGCAGCTAA AACATTAGCT AAACATGATA TTCCTTCTTT 


1920 


TGTGTTTGAA 


AGCTTACGAC 


CAACACCAGA ATTATCTTTT GCGGTCCGCT ACTTTAAAGC 


1980 


TTTTGCGGGA ATCATGATTA 


CGGCTTCTCA CAATCCAGCT GCCTATAATG GCTACAAAGT 


2040 


CTATGGTGAA GACGGTGGTC AAATGCCGCC AGCTGATGCA GATGCATTAA CTAAATATGT 


2100 


TCGTAGTATC 


GAAAATCCTT 


TGAAAATTGA TGTCTTATCA GATGAAGAAG TTGCCCACAG 


2160 


CGGTTTAATT 


AACATTGTTG 


GTGAAGAAGT CGATAACGCG TATTTGAAAG AAATTAAAAC 


2220 


AGTTACGATT 


AATCAAGAAT 


TAATTAACGA AATGGGTAAA GAATTAAAAT TAGTTTACAC 


2280 


TCCCTTACAT 


GGAACTGGGA 


AAATGTTAGG AGAAAAGGCA TTAAAACAAG CCGGTTTTGA 


2340 


AAAATTTGTT 


TTAGTACCTG 


AGCAAGCCGT TGCTGATCCT GACTTTACTA CAGTTAAGTC 


2400 


ACCAAACCCT 


GAAGAACATT 


CAGCTTTTGA ATACGCGATT CGTTTAGGGG AAAAAGAAGG 


2460 


CGCTGATTTA 


TTGATTGCAA 


CAGATCCAGA TGCTGATCGT TTAGGTGCTG CTGTGCGGAT 


2520 


GCCAAACGGA GACTACCAAG TATTAACAGG AAATCAATTA GGTTCAATTA TGATTCATTA 


2580 
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TATTTTAGAA 


GCCCATCAAC 


AAGCAGGCAC 


GTTACCTCAA 


AATGCTGCGG 


TTCTAAAATC 


2640 


AATCGTTTCT 


AGTGAACTTG 


CTACAGCGAT 


TGCTGAGAAA 


TGTAATACGA 


AGATGTTTAA 


2700 


TGTTTTAACA 


GGTTTCAAAT 


TTATTGCTGA 


AAAAATTCAA 


CAATACGT^G 


AAGACCACAG 


2760 


TCAAACGTTT 


ATGTTTGGTT 


TTGAAGAAAG 


TTATGGTTAC 


TTAGTGAAAC 


CTTTTGTTCG 


2820 


TGATAAAGAT 


GCAATTCAAG 


CGTTAGTTTT 


ATTAGCAGAA 


GTAGCGGCTT 


TCTATAAAAA 


2880 


ACAAGGAAAA 


ACATTATACG 


ATGGTTTACA 


AGATATCTTT 


GAAGAATTTG 


GTTACTTTGA 


2940 


AGAAAAAACT 


ATTTCCGTAA 


CAATGAGCGG 


AATTGAAGGC 


AGCGGTAAAA 


TCAAAGCTTT 


3000 


AATGGCAAAA 


TGTCGTGAAC 


AAGCGCCAAC 


TGAGTTTGCC 


GGGATTCAAG 


TTGCTCAAAC 


3060 


GGAAGATTTT 


AAAGAATTAA 


CACGCACTTT 


TGCAGATGGT 


CAAACAGrAC 


AATTACAAAC 


3120 


GCCTCCTTCT 


GATGTGTTGA 


AATATCATTT 


AGAAGATGGC 


AGTTGGATTG 


CTATCCGACC 


3180 


ATCAGGAACA 


GAACCAAAAA 


TTAAATTCTA 


TCTAGCGACA 


AAAGCAACAA 


GTAGTTCAGA 


3240 


AGCTTCTGAA 


AAAATTGCTG 


CGTTTGAAGC 


nGTCGTTAAT 


GAATTAACAA 


AATAAAATAA 


3300 


ACGAAACAAG 


CT 










3312 



(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1428 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 



TATCTTCGTG 


AAATTCAAGA 


AGACTTAGCA 


ACCCTTCCTG 


AAAATACCTT 


AGAGATTCAG 


60 


ATCCATCACG 


GAAAAATTTC 


ATTACATACG 


ACTTCCAATG 


ATTTTCTTTC 


CCACTATTTT 


120 


ktAtTATTTA 


AtTAtTaCTG 


kGtCCaTyCT 


ACTGACTTCa 


TyCTTTTTCm AGcAATTATC 


180 


AAAAAAGrAA 


ACmATTCTGT 


CTGGAAAGTG 


AGTCmAGrAA 


CTAATTTTAG 


CTCATCTTAC 


240 


ATTTACAAcG 


GTTAAaTAAC 


ATTAATAAAC 


TATTTGGTTT 


GTATGGAATT 


TCCGTCCATT 


300 


TTTCCCCCTC 


CGCGAGTAAA 


GTGATTACAG 


GTTCTGAATT 


TCAAGTCCTA 


TATGCTTTTC 


360 


TTGATGTCTA 


CTGGACCATC 


TTTTTAAATA 


CCTTAGCCTT 


ACCACGTTAT 


ACCTCTACTT 


420 


ATTTTGAAGA 


TATACTGACG 


CAGCATATTA 


AACCAGAAAT 


TCTTGAACGC 


TTGGAACACG 


480 


CTTCTCTACT 


AAAATTAAAT 


TTACTTTTAC 


ACATCGCAAA 


ATATCGCTTT 


CCATATACTT 


540 


GTGAAGATGA 


TCTCCAAAAA 


GAATTTGAAG 


AACATCCTTA 


TTTAAGGCTT 


TTCTCAGACT 


600 


CAAAACAAAC 


TATTTTTTGT 


TCCCAATCAA 


ATTTTTCAGA 


AGATCAACAG 


CGGGTTGTAG 


660 


ATATCATTGC 


ACCCATTGCC 


ATTGATAATT 


TTGATTCAGC 


AAAACGAGCC 


AAAGATTTAC 


720 


TGGCTGCTCT 


AAATGaAATT 


TCTTGCCCAG 


AATATTTCTA 


CGTTGATTCT 


TTATTAACCA 


780 


ATTTTTGCGA 


aGTTTTTCAA 


ATTCCAAAAA 


TGCGTGAATT 


TGATCGTCAT 


TACTTCTTAC 


840 
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TTATTTTATT 


TAAAAATATT 


ATCTATAACC 


AATTATTTAT 


TTCACAGCAA 


CCAAATACCG 


900 


CAATTCTAGA 


TTTTTTAGTC 


AATAAAAATA 


GTACTTATCA 


AAAAGATTTA 


CAGAAAAAAA 


960 


TCTATGCTTT 


TTATCAAAAC 


TTTTCAGCGC ATTCCCCTTA TCAGTTAAGA 


TTTAAGGATC 


1020 


AGGAGAACCC AATTTGGACA TGTGAATTGT TGTTACGAAT 


TTATAAGCAC 


TATAGTCCCC 


1080 


CCAAAAAAAT 


TAAAATTGGT 


GTTGCATATT 


CTCGCGATTT 


TTACATTGCT 


AGCTTTATTA 


1140 


TGATGAAAAT 


CAAGCAAOCA 


TTTAGTGAGG 


ATATTATTTT 


TGAACGAACC 


GATTTTTCTG 


1200 


GTTGTGATAT 


TGTGATTACT 


GACTATCCCt. 


TATTTGATTT 


ACCAAAAGAA 


ATAGACCGAA 


1260 


TTTACATTTT 


AGAAGAAGAA 


TTAACCCGAG 


AAGATTGGGA 


ACTAATTTTC 


AGTAAAATAA 


1320 


GTATCGCCAT 


TTTTGATTTG 


CAATATTAAA ACAAACATCA 


AGCGCCAACC 


TGTCTCACCA 


1380 


GCCATAATGA AAACAAGCGA CCTACAAACC ACTTCTGCCA GTTTTAGTCT 


GACAGAAAAT 


1440 


AGTTTGTAGG 


TCGCTTGTAT 


ATTACTGTTC 


TTTTCTATGT 


TACCTTCTWUV 


GACAATCTAG 


1500 


TAACGTACTT 


CAGTCCTTAA 


AGTGCTACTG 


TTCCACCGCT 


GATCAGTTTT 


TAAGCCTGAG 


1560 


ACGArGATCC 


AAAGTGTTTT 


TTGTCTCAGG 


CTCTTACATT 


TTTTCCAACA 


AAGGTATTGc 


1620 


TaGGCGCTAC 


TTGAAGCTGA 


AAACTAGTTT 


CTCTATCGAG 


ATTCCTCATC 


AGTAGACTCA 


1680 


AATTTAAAAC 


TTTTTTCTAT 


AAAAGTGaCT 


TCTTTTGCGT 


ATGTTTTTAC 


AAAAGAAGAA 


1740 


TAGTCTTTGG 


GATCTTTGAC 


CTTTTCCAAT 


TGGTCAACAG 


AGACTTCTAA 


TTCAAATCCT 


1800 


GTCCCATCTT 


TAAAGAGATA 


ATATTTTGAG 


ATGTCTGTAT 


ATCCTTGATT 


CCGTTTTAAA 


1860 


ATATAGCCGG 


TTTGTACATC 


AGCAATTTTA 


TCAATTGGGT 


ATGCTTTGAT 


AAACTCTTTA 


1920 


CTTGTATCCA AATAATCAGA 


AAGACTGTTG 


ACAAGATTGC 


CAGAAAGGTA 


CATACCTGCC 


1980 


GATGTATAAT 


AGTCTCTTTT 


TTCATAAAAA 


CTGATATTGC 


CTAGATATTT 


TGTATCTGGA 


2040 


TACTCCTCTG 


TAGAAGGTAA AATTAAAAAG 


TTCTTTCCGC 


CACTGGTTTC 


AATATCGGCA 


2100 


TCGCCGATAA 


TTTCTATTTG 


GTAGCCTTGA TTAAATAAAA GTTTATTTTC 


TTCTATTGTG 


2160 


ACTTTTAGTG 


GTTTTTCTAA ATCAACAGGT 


CCAGTTGTGG 


TAAGTCCAGC 


TTCTTTTAAA 


2220 


GGATAAACAT 


CTTCTGATGA AGCGGAGTAA 


TCCCAATAAT 


CATCTTTCtC 


CGCAAGATCA 


2280 


ATATATACTA 


TAAAGCCAAT 


TGCTACAGCA ATGCCTATCC 


AAATAATTTT 


TTTGTTCTTT 


2340 


TTTTGTTTTT 


TCTTTTTTTT 


CGTCTGTGGT 


TTTACCACTT 


TTTTATTGGG 


GTTAGGAAGT 


2400 


TGCTTTGCTG 


TATCTGTTCr 


TGTGACAGGA AAAAACGACC 


ATTTTGGCTT 


TTGAAAAAGC 


24 60 


GCATCATATG 


kATCATAAAA 


ATTCTTTTGT 


TTAGCCATTT 


TTTCTCTCCA 


CTAATTTTTT 


2520 


TGATAAGGCA 


ATTTCCTCAG 


ATGCTTCGGA 


GACTAATCTT 


TCATAATCAT 




2580 


TTCCGACAAC 


ACTCGTATGG 


TACTTAAGAT 


ACCATTGGTT 


GCAAGTGTTA 


TTTCwTCTGT 


2640 


TGACAATTTA-TCTTCTTTAA TGCTTAAAAA CCGTTcTGAT 


GCTCGTAAAA 


TCTCTGGTAC 


2700 


CTTATGATAT 


AAAAAATCAC 


TAAAAtGAGT 


AATTGcTTCA 


GGATACTTCA 


CTAAGTGTTT 


2760 


AAAGATGCCT 


TtAGCGGAAG 


CAACACCCAG 


TTTTACTTGT 


TCAATTTTTT 


TTAAATTTCG 


2820 
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TGACTTATTc 


GTTAGTTCGT 


TTAAATAAAT 


AATTTGTTTT 


TTXAATTCGT 


CCATAATTCT 


2880 


TTGGTACATC 


TCTAGGTCAC 


TGTCCGTTAG ATGGTGCgTC 


GTTTGATATT 


TTTTTAGATT 


2940 


TTTTTTATAa 


CGGAAATACG 


TACGACCATT 


TCTATAAGTA 


ACCCAACCAA 


TGTACCCAGT 


3000 


CCCCACTAAC 


AAAACCATTA 


TCCCATTCAA ATCATGTACT 


CCTTTTTAAA 


TTAGTATATA 


3060 


TTACTGAATA ACACACTGTC 


CAGCTTAAAA 


TTGACAAATA 


AGTATGAAAT 


GTTACTCCCC 


3120 


CAATTACTTG 


TGCACTTTTA 


TTGTATCATA 


TAGGAAAATT 


GTTCTGTTCC 


GCATTTGGAC 


3180 


AAATTGTCTA AATGTCTAAG 


GGAATCTTTT 


TTTGTTTAAT 


ATGTAGCAAA AAACACTTGC 


3240 


AAGAATTGAT 


TACTTCTCAC 


AAGCGTTTTT 


TATTGTCTAA 


AACAACTATT 


TACATTTCAT 


3300 


TTTATTAAAT 


AGTGAATAAA 


TAAATTACTG 


CAGCTAATGA 


AGCaCCTAAG 


ATAGGTCCCA 


3360 


CAATTGGcAC 


CCAAGAATAA 


GCCCAATCGG 


AATCACCTTT 


TGTTTTCACA GGAATAATTT 


3420 


GGTGCGCTAA ACGAGGTCCT 


AAATCACGTG 


CAGGGTTAAT 


TiSCATAACCT 


GTAGGTCCGC 


3480 


CTAATGATAG 


ACCTAGTCCT 


AAGACCAAAA 


TCCCAACAAG 


CATTGGGTTA ATGCCAGCCG 


3540 


CTAAGTCGTT 


TTGAGAAAAG 


GCTAGTAAAC 


CAAAAACCAA 


GACAAATGTG 


CCGATAATTT 


3600 


CTGTTAAAAC 


ATTGGCTGGA 


TAGTTACGGA 


CAGCTGGACC 


AGTGGCGAAG 


GTTCCTAAAA 


3660 


TAGCACCTTT 


GTCTTCTGTA 


ATGTTCCAAT 


GGGGTAAATA 


AGCTAACCAG 


ACAACTAAAC 


3720 


CACCAATAAA 


ACCGCCTAGG 


ACTTGAGCTA 


CGATATAAGG 


TAAAACCATG 


CCCCATTCAA 


3780 


AGTTTCCAGT 


AATTGCCATC 


GCAACTGTTA 


CAGCAGGATT 


TAAATGCGCT 


GGACTCATAT 


3840 


ACCCAGACAT 


GTAAACAGCT 


AATGTAACAG 


CAGCGCCCCA 


ACCTAAAGCA 


ATAACGACCC 


3900 


AACCAGAAGC 


AAAGGCTTTG 


CTTTTCTTCA AGTTAACTGC 


GGCACAGACG 


CCATCCCCTA 


3960 


GTAAAACTAA AATCATCGTT 


CCGAAAAATT 


CACCGAATAA 


TTGTGTCATC 


ATCGAAGTTC 


4020 


CCATTTTTTC 


TTCACCCTAC 


TTTCAAATTT 


TTTAAATCTG 


ATTCTTCAAT 


TACTTTTTCT 


4080 


AATGTTTCAA 


TGTGTCGTGC 


TCTTTCTTCC 


GCTGTCCACT 


GATAATGCTG 


TGCCATTTCT 


4140 


TCAATGACTC 


CCGCTTTCAC 


TTGGTCCAAA 


CGATCACGCA 


TAAATAATAA 


GTGGTTGGTC 


4200 


CGTCGCAATA AAAAGTCGAC 


GGGTGTTAAA 


GCCATTTCTT 


CTTCCATCGC 


ATAGTTTAAA 


4260 


GAGACAGTTT 


CTGTTAAGGT 


TAAACCAGGA ATTACTTTGG 


CATCATCAAT 


CATTTCAAAA 


4320 


ACAGTTGGTA CGTTGGAGCC 


ATATAAATGA 


GCTAAGTATA 


GTGCATCCTT 


TTCACTTAAC 


4380 


CCTTTAGACA 


CGCCATGTTT 


TGCCAACGCT 


TCTAATTCTT 


CGTCTACCGT 


TGCAGGGTTC 


4440 


AATTTTCCGC 


CTGAAACTGG 


ATAGTTTTTA 


GAATCAATTA 


AGGTGAATTC 


TTTCTGGTAT 


4500 


TCTTCTGCTA 


AGATGGTTTG 


AA7CATTTTC 


ATCGCACCTT 


CTGCCATTTT 


ACGATAATCT 


4560 


GTTAATTTCC 


CACCAGCCAA 


AGTGAGCAAG 


CCATCCGCTG 


ATCTTTCCAA 


TGAGCTGCCT 


4620 


CGAGAGACAG 


CTGAAGGATT 


GACTTTATTT 


TCCACCAAAC 


TGTCTTCCAG 


ATGGTTTAGG 


4680 


ACTTCTTCAA 


TTTCTCGTTT 


ATCTGTTTGA 


TTTTTTTGGT 


ATCTGTCAAC 


GACATCAATG 


4740 


ACTTCATCAA 


TACTTTTATC 


TGAAAGTTTG 


CCATTGTTTC 


CCCCATTATA 


ATCTGAGCCA 


4800 
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CCGTTAGCAG 


AAATTAAAGG 


TCGTAAGCCA 


GCCCAGCTGG 


CT.TCAATATC 


ATCAATTGTC 


4860 


AATTGTACCT 


CTGGATAACG 


ATTATTGACT 


ACTTCCAATA 


GGTAATCCAC 


ATCACTTTGT 


4 920 


TCTACTGTTG 


GATGTTGAAA 


GTCTCCATGG 


TAATCTGTAT 


CCGTCGTCCC 


AAAATATGTT 


4980 


TTTTCTTCTC 


TTGGAACAAC 


GAAGACCATG 


CGGCCATCTT 


GTTTACCTGT 


ATCAAAATAC 


5040 


GTTGGTTGAG 


GAACATTTAA 


ACGGCTTTTG 


TCGACAACTA 


AATGAACCCC 


TTTGGTTGGA 


5100 


CGCATTTGTG 


GTGTAAATGA 


ATCGTTTTTG 


TCTAAGCCTC 


TTAATTTATC 


TGACCAAGGA 


5160 


CCAGTAGTAT 


TAATCACAAC 


ATCTGCGTGA 


ATCTCAAACT 


GTTCGTCAGT 


TAGTAGGTCC 


5220 


TTTGCTTGAA 


TGCCAGTAAT 


TTTCCCTTCA 


TCATTGTATA 


AAAAGCCCAC 


CGCTTGAACC 


5280 


CGACTCACAG 


CATGCGCCCC 


GTCAGCTACT 


GCTTGTTTAA 


TATTTTCAAT 


AACTAAACGA 


5340 


GCATCATTAT 


TTCGGTAATC 


TAAATAAACA 


CCGCCACCTT 


GTAAACCTTC 


ACTTTTTAGT 


5400 


TGTGGCTCAC 


GTGCCAAAAC 


CTCGTCTTTT 


GTTAACAGAT 


AGTTGGCGTA 


CTTGGTACCT 


5460 


GTTACATCAG 


CTAACTGATC 


ATATAAATCC 


ATTGCTATTT 


TGACAGAAAA 


TAGATTAAAT 


5520 


GTTGCATTTG 


GCTCATCATA 


AATCGGCAAT 


AACATGGGAT 


CCGCTTTAGG 


GATGTGTGGC 


5580 


GCAATGGATT 


GAACGACGGC 


TCGTTCTTTC 


ACTGTATCTG 


CTACCACCTc 


CACGTCAAAC 


5640 


GTTTTCAAAT 


AGCGGATACC 


ACCATGAACT 


AATTTGGTTG 


AtCGTGAAgA GGTGCCCTCT 


5700 


GCGAAATCCT 


GCATTTCAAT 


TAACCCTGTC 


TTCATTTTAG 


CGGCACTGGC 


TTGCAGGGCA 


5760 


ACCCCTGCGC 


CAGTAATGCC 


ACCACCAATA 


ATTAAGACAT 


CTAAGTGTTG 


TGTTTTTAAA 


5820 


GCGTCGATTG 


ATTGTCGGCG 


TGTTTCTATT 


GAGAAAGTCA 


TAGAAAATTC 


CCCTTTTCCA 


5880 


TTTACTTATT 


TTTACGTTTG 


AATTGTTGGG 


TAGCTGCTAC 


TGCTTGTTGC 


CAACCTTCAT 


5940 


ATAAATCTTC 


TCGTTCTTCT 


TCCGCCATAA 


TTGGTTCAAA 


TTGTTGTCCT 


TCTTCTTGGA 


6000 


ATGCTTTGAT 


TTCTTCTAAG 


TCTTTCXAGA 


AGCCAACCGC 


TAAGCCCGCC 


AAGAAAGcTG 


6060 


CGCCTAAGGC 


CGTTGTTTCT 


AAGTTATGGG 


CTCTTTGAAC 


AGCAGTGTTT 


AAAATATCTG 


6120 


CTTGGAATTG 


CATCAAGAAA 


TCATTGTTTG 


CTGCTCCACC 


GTCTACTTTT 


AAGACGGGAA 


6180 


TATCTATGCC 


TGTATCTTCT 


TTCATCGTGT 


CAATAATGTC 


ACGAACT.TGA TACGCAACTG 


6240 


CTTGAAGTGT 


CGCTTTGACA 


AAATCTTCTC 


TCGTGGTACC 


GCGTGTTAAT 


CCAAAGACAG 


6300 


CCCCACGCGC 


TTGTGAATCC 


CAATATGGTG 


CGCCCAAACC 


AGTGAATGCC 


GGAACCACAT 


6360 


AAACTTCATT 


GTGCCCTGTT 


GACGCTTTGG 


CTACTGCTTC 


AGATTCAGCA 


GCCGTTTGCA 


6420 


ACATTTTTAA 


GCCATCTCGT 


AACCATTGAA 


TCGCTGAACC 


AGCCACGAAG 


ATACTTCCTT 


6480 










AGTGGTAAGC 


AGATTATTTT 


6540 


TAGAAAGTTG 


AGGTTCTTCG 


CCAGTGTTCA 


TGACAATAAA 


TGAGCCTGTT 


CCATAGGTAT 


6600 


TTTTAACCAT 


CCCAGGTTCA 


AAGGCCATTT 


GACCAAATAA 


GGCGGCTTGT 


TGGTCACCAG 


6660 


CCATCCCAGC 


GATAGGAACT 


TCACTTCCGT 


AAAAATGGTA 


ATTTTTTGTT 


AATCCATACA 


6720 


CTTCAGAGTT 


CGAAACGACT 


TTCGGTAACA 


TCACTCGTGG 


GATATTTAAA AGATCCAAAA 


6780 
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TTTCTTGATC 


CCAATCTAAA 


TCATGGATAT 


XAAACAACAT 


CQTCCGACTT 


GCATTTGAAT 


6840 


AATCAGTGAC ATGAGTATCT. JCCCGTTAATT 


TCCAAACGAG 


CCAAGTGTCA 


ATTGTCCCAA 


6900 


ACATCAATTC 


TCCATTTTCT 


GCACGTTCTT 


GCGCACCTTC 


TACGTGATCT 


AAGATCCAAC 


6960 


GAACCTTCGT 


TGCTGAGAAA 


TAAGCATCAA 


TGATTAACCC 


AGTTTTTTCA 


TGAATCATTT 


7020 


CGCTGTAGCC 


ATCTTCTTTT 


AGTTGATCAG 


CAATAGGTGT 


CGTTTGGCGT 


GATTGCCAAA 


7080 


CAATTGCGTT 


ATAAATGGGT 


AACCCAGTT6 


CTTTGTCCCA AACAACGGTT 


GTTTCCCGTT 


7140 


GGTTTGTAAT 


CCCAATTCCG 


GCAATATCTG 


TAGGTTTCAC 


TCCTGACTCA ATTAAAGAAC 


7200 


CTGCAATAAC 


GGATTGAACA 


GAATTCCAAA 


TTTCATTGGC 


ATTATGTTCC 


ACCCAGCCTG 


7260 


CATTAGGAAA ATATTGAGTA AATTCCTTTT 


GTGAACTACC 


AATTTTGTTT 


CCTTTTTTAT 


7320 


CAAAAATAAT 


TGCACGAGAA 


CTAGTCGTTC 


CTTGATCAAT 


CGCCATAATG 


TATTTTTCTT 


7380 


CTGCCATGAT 


AATTCCTCCT 


AAACGTTTTT 


GAAATCGTTT 


TCTTTATGTC 


CCTATCTTAC 


7440 


AAATAAAAGA 


ATAACTTU^C 


ATGTCAAAAA 


CTTTTTTTAT 


TTTAAAAAAA 


TTGAAAGGAT 


7500 


TTCTACTAAA 


ACCAACAAAG 


ACAACGCTTT 


CAATAATACT 


TTTATTATCT 


CTAATTTTTC 


7560 


AATAAAAAGA 


AACATTTTTG 


TTTAAAAAAG 


CCTAATTTTC 


GACTTTTAAT 


TGTAAAAAGG 


7620 


CTGGTGCGAG 


GATCAGACAA 


TTTCCGTCAC 


ACCAACCTTT 


TGTTTATTTC 


TCCCGCAAGG 


7680 


TGCGTATTTT 


CTTTTTTAAT 


TGTTGCACAT 


CATAGGTCGA 


ACTTAATTCT 


GACAAAACAT 


7740 


AGACTTCCGG 


TGTTCCTTGA 


TAGCTCGTTT 


GCTGATAATT 


GGTCACAATT 


AAATCATATG 


7800 


TTTGTTCAGG 


CTGATACCTC 


TCCACCAGCG 


TGCCATTCAT 


ATTTTTAAGA 


CTAAGCATTA 


7860 


ATACTTGTGT 


TAACGCCGCT 


TGATAAAGAG 


CATCCATTGC 


TAAATGAATT 


CCAATTTTTA 


7920 


TTTCACCAAT 


CATTTTAAAT 


TCAATTAATG 


CCAACACGCT 


TAAATATTTG 


ACCAGAGAAA 


7980 


GTTCATGCAA ACTATCTCCT 


GGTTGATACG 


TTTCATCAAA 


ACAAGCTAAA 


GCTTTTTCTA 


8040 


ATAACGTTGT 


TGAAAAAGAA 


GCTAATTCCC 


GACTAGATAA 


TTGCTGTTCT 


TTTTCCCAGA 


8100 


TATGGGCCCG 


ATCAAAGACT 


TCCAATTCTC 


CTTTAAAAAA ATACAGTTTT 


CCATGAATTT 


8160 


GTGAAAAATA 


GTAAGCCATG 


GTTTTCTCAA 


GAGTAGGTTC 


AAATTTATGG 


GGCCGATAAT 


8220 


ACAAGACCAC 


ATTTTCCAAG 


ATAAACGTAT 


CAGCTAAAGA AACAGCGGAA CGACGTCCCC 


8280 


GAATTAAGGA 


ATATTCATCA 


AAAGCTGATT 


CTGGTAAGAC 


CGACATTGTG 


ATTAGAAAAA 


8340 


CAAAATGAAT 


CATGCTTTCT 


CCTTCGTCTA ATTCTAATGA ATAGCGACTA GAAAAACGGA 


8400 


AAAACAATTG 


TCGGACTTGC 


TTAAATAACG 


GATCTTCTTT 


AAAATCAATC 


ATTTTCTTTC 


8460 


TCAATTGTTG 


ATATTTTTTT 


GAAGTGAcCG 


CTAGGCGCTT 


TTTCGTAATA 


CTCAACCAAA 


8520 


GGCTAACTTT 


TAACGCATTA 


TGTTCATTAA 


AAGTAAAACC 


TAAGCCAGTC 


TCAATCCCTT 


8580 


CAATTATCCG 


TTTATTTAAC 


GGCGACATCG 


TTTTTTTCTG 


GTGTATTTCA 


TAGGGAGTAA 


8640 


TATACCAATA 


CAACTGAAAG 


TAAAAGTAAC 


GAATCTGCAA 


TTCTTCTCCC 


TGTAGTTGTC 


8700 


CATTTTTTAT 


TTGCAAATCA 


AAGACGGCTA 


ACAATTGATT 


GAGTTCCTTA 


ATCTTTCGAA 


8760 
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ATAAAGAAGA 


CTCACTAAtC 


aTAAATTTAG 


TCGtTAATTG 


AACAATAGAA 


AACTCTTTAT 


8820 


TACGATATAA 


ATAATCtAAA 


AGTTGAAATT 


TAAGAGACTC 


TTTTACAAAT 


GCTTCAATTA 


8880 


AATGATTCAA 


CGAAAAAAAA 


TCCGATAAGG 


TGATTGATAG 


CTGTTGGCCA 


TCATAAGTAA 


8940 


GCTGACAATC 


CTGTCCAAAA 


GGTTTCAAAA 


AATAATGTAA 


ATCCTCTAAG 


TCTTTTTCAA 


9000 


AGGAGGcTTT 


TGTGACCGCT 


AAAAATGCTG 


tTAAATCAGA 


GACTGAGCAG 


GTGCCGCCTG 


9060 


CTAAGATGGT 


TTGTTGTACC 


GTTAATJ\ATT 


GTCGCATCGC 


TTTCTTTTCt 


AATAATGTCT 


9120 


CAACTTTCAT 


CGTTCTCTCT 


CCAATAAGCC 


GAAGCACCCA 


TCAAATTTGT 


AATATCATAA 


9180 


ATATTGGTTA 


AGGAAGGTTG 


ATAAAAATAA 


TCTTTTTGTA 


ATAGGTCTGT 


CAGTGTCTGA 


9240 


CCTGTTTGGA 


TACTTAATGC 


TAAAGTATTT 


ATTTTTTCTA 


AACAATTGTT 


TTTTGAACAA 


9300 


AGCTGTGCAC 


CAAGGACCCG 


TTGCGTCACT 


TTATCATAGA 


TTAATTTGCC 


TAAAATTTCT 


9360 


6TCCCATGTT 


GTAGAGGCGG 


CGCTGGCTGT 


CTAACAATTA 


TGGAAGCAAG 


TGTTTGCGGA 


9420 


AA7VAATAAAC 


CTTCCGTTTC 


GGTTAGACCT 


GTACTAGCAA 


GGTAATAGTC 


CCCCACTTTT 


9480 


GTTCCCATTG 


TTCGCAAAGA 


GCCAATGAAT 


CGGTGTGTTT 


TTTCTTCTAA 


ATTGTTGGCG 


9540 


ACTACTAAGC 


CAGTTCGTAC 


CGCATTGTTC 


ACTAACGGAG 


CATAAAAAGT 


TTCCGCGACT 


9600 


GGTTCATTCA 


TAACTGAAAT 


GCAATCACCG 


ATGGCAAAAA 


CATTCGGCAC 


TGAAGTTTGT 


9660 


AAATAAGCAT 


CCACCGCGAT 


TGTCTGATCC 


AGATTCCGTT 


GAATTTTTTT 


ATCCAAATAG 


9720 


GCTAATTGAG 


GATGTAAGTT 


TAAAGCAAAA 


ATCCCACTGT 


CACAAGAAAT 


TTCCTGCTCG 


9780 


CTTGTTTCAA 


GAACAATACC 


ATTTGCTGTT 


TCTTCAATTC 


CTAGGACCGT 


TTCTTCAAAA 


9840 


TGGAAAATTA 


CTGCTTGTTT 


TTCCAAAGAT 


TTTTGAACTT 


CTGCGACCAT 


TTCTTTATCA 


9900 


AAATATTTAG 


GCAATAGATT 


TTCCAAGCTT 


TCAAAGACAT 


GGACCGTTTT 


CTTCATTTTC 


9960 


ACAAGAAAAT 


CAATAGCTTC 


CATTCCAATG 


GGTCCTGCAC 


CAATGACGGC 


TACTGTCTGG 


10020 


CTATTTTCCA 


GTAATG6AAC 


AGCCGCTAAA 


GCACCTGATA 


AAAATTTATA 


TTTGAGTAGT 


10080 


TTT TCTGTTT 


GACTACCACG 


AATTTGCGTG 


6AAAACTGGC 


TCGCGCCTGT 


CGCTAAAATC 


10140 


AATTTATCAT 


AAGAATACCA 


CTGTTGCTCT 


TCCTTTCGTG 


TCCAAGCAAT 


CAATTGGTTC 


10200 


TCAACATCCA 


TTGCCACAAC 


TTCTCTATTT 


AAAAGCAACT 


GAATCTTTTG 


ACGACGTAAT- 


10260 


TCCTCTTCCG 


TTATGTAACG 


TGCTTCGTGT 


AATTCATTGA 


TGGTATGATT 


AAAATACGCA 


10320 


CTTAAGCCAC 


CAGATA7UVTA 


CCCCACTGTT 


GCTTGTTTAT 


CAATTAAAGA 


AATTTCTGCT 


10380 


TGGGGATATT 


TTTTCCTTGA 


AGCAATTGCT 


GCGGAAATAC 


CTGCGAATGA 


TGCACCAATA 


10440 




TCATTTGTgT 


CC- 1 Cv- i \- 1 L» 1 








1 nsnn 

J. U«J Uvl 


ATGATTAAAA 


AGAaTGCTAA 


AgCAACAACt 


ATAGtTTTCC 


CTTAAAATAA 


TTATtTTTGC 


10560 


TGAGCGCATT 


GCTTTTCAAG 


CACTTTCATT 


TTCATTTATA 


ATAAATGTTA 


CTACTGTTTG 


10620 


AGGAGGAGTT 


TTCATGTCTG 


CAACAAAAGA 


GCAATTGGaA 


CTATTTTTAT 


TTTATTTATC 


10680 


TGAAACCCAT 


ACAAAAAGTT 


TATCTTTACA 


TGACTTAGTT 


ACAAGTCGAC 


CACGCCCAGA 


10740 
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AGAAGCGCGT 
TCTCTACACA 
CGAAAATTTT 
ATTGAATGAA 
TGTTCTATAA 
GATTGAACGT 
GAACTATTCT 
TTATGTGCGA 
AGGTTGTTTT 
GAAAAGGAGT 
AATTAATAAT 
ATTCGTTAGT 
ATGGCAAAGG 
TCTGCATCAA 
TATTCTGTAT 
AATGTTCCGA 
TGAATTTCTG 
CGAACTTCTT 
TCATtAATTG 
TTTCTCTtTC 
TCTTCATTCT 
GTTTTAATGT 
AAGTTCGTAT 
TGTTTTGTCG 
TGATAACCAA 
CCGATGACTT 
TAGGACTGTA 
TAGGCATACA 
TGCCATTCTC 
CACTCCTTAC 
TATTAAGCAC 
ATAGCAAAAG 
AATAAATCAT 



ATTTTATTAA 
AGTGTTCCTG 
TATTTTGAAT 
CGTCTAGAGG 
TAGAAGAGAA 
TTAAAAGAAG 
GAAGCAGATT 
AATGTTGAAA 
ATTATTAAAT 
ATTTACTCAA 
ACAATACACT 
TATTTTTATT 
TTAATTCATC 
TACCTGTTTC 
AAGCATGTTG 
CTTTTTTGAG 
CAATTCGTtC 
CGTAATATGG 
TTkGAACGTA 
ACTCCATTTT 
TTTCATTCTT 
ATCGATCCCT 
CACGAATCGT 
CTGCGGCAGC 
TTTCTGGATA 
CGCCTGTTCG 
AGTTCCACTT 
TCACATTTTC 
TGACAATAAG 
ATAAAGAAAA 
CTTTACCTGC 
AAGTACCTGC 
TAAATCCTGG 



ATATTAACGA 
CTCTTGAAAA 
TAAAACAACA 
AAATGAAAAT 
CAATTCGTTG 
AACTAAATTT 
ATCAACAAAT 
ATTAACAAGC 
AAAAAAAATA 
TAAGAGTAAA 
TTCTCTTTTG 
AGTTTTTTAT 
CGTTGCTTGA 
TGGATCTGTT 
AAATTTTTTG 
CATTAATTCA 
GGCTGTTAAA 
ATyCATGACT 
AAATCTTAAA 
ACACTTTTTC 
AACAGAGAAA 
AACAAGCATC 
TGAAATAACT 
TTCAATCGCA 
GCTCTTACCT 
CTTCTTCTCT 
TAGCCAGTTT 
ATCTTGCAAA 
TCGTTGTGTT 
ATTGACTGGT 
TTTACGTTGG 
TACAATTGCC 
AATAACAATC 



913 
GGTTTTCACT 
CAACAAGTCT 
TTTTTTCAAT 
TGCCTTTGAA 
AGAAGGGGTC 
GCCATTTTTT 
CAAGAAAGAG 
TAAAACAACC 
ACAAAAAAAC 
ATGAAAAATA 
TCCATAAAAA 
TTTTTCAAAA 
TCCTGCTTTT 
GCTAAAATCA 
TGAATGCCAA 
TATTTTTCAG 
TCATTCACTT 
TCGCCAATGy 
TTTGGTCTCA 
TATTCCATCT 
ACATAGTGAG 
TGGTTTCGAA 
TCTGGAAAAT 
TACCCTTTGT 
TTTACTAATT 
AACGCCCATA 
GCGACCTCTT 
AATTTTTTCA 
TCGAAAATAA 
TTAAGTACCA 
AAGAACTTCA 
CATTGATATG 
GTTGCAGCTA 



TACTATCATA 
GAGCCTTTTT 
GAAGAAGATG 
CAATTGACGG 
CTTATGAAGC 
AATGGTATGC 
TTAATTCAAT 
TTTGATTTTA 
CACCGGCAGT 
AAAAGGTTGT 
AGACATGAAA 
GAATGTTGCT 
CCTCTGAAGC 
TTTCATTGCA 
GTACTTGTGG 
TACCTTCTTT 
tATTCTTATC 
CTtCTGTkGT 
TAACCATCCC 
GACAAGTGGC 
GCATCGTTTT 
TAGCGACATT 
TTAATACATC 
GCCATTCTTT 
GATTGGTGAG 
AGCCGTAGCC 
CATCAGAAAA 
ATTCCACTTC 
TTTTCCGTTT 
GCCAATTTTC 
CAATTTCCAC 
CATCTAAATG 
ATAGCAAGAA 



GCGCTCGTGT 
TTCAAGCCTT 
AAACCAATCA 
ATGACTACAA 
AAGAAGAAGT 
TAGAAGAAAA 
ACTTTGATGA 
ATAAAATCAA 
TTTGCCGATG 
TGTTGGTATG 
AAAATATTAA 
CATCCGCGTA 
ATTAAATAAA 
TCCTGCTACA 
CGTCGGTTTT 
AAATTTTGCt 
GATCGCTTGA 
TTTAACGATA 
TCTTTTCCCT 
ACTTTTTCAG 
GCCACGGTAT 
CATAGAGGCT 
GAATGCGTAC 
CACACATAAA 
ACCACACTCG 
GTATTCTTGA 
AGCGTGTTCA 
ATCTTCCGAC 
CAAGCCCTTT 
TTTTCTTATT 
AATTGGGATG 
TGTGACGCTG 
CGAAACAAGA 



10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
. 12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 



wo 98/50555 PCrAJS98/08985 

914 



ATCCCATAGT 


TAAATGATTT GTTGCGGAAT AAGCCAACTT 


TGAAAATTGA 


TTGATAAATC 


12780 


GACTTCACAT 


TAAAGGCATG GAACAATTGA ATTAATCCCA 


AAGTAGCGAA 


GGCCATGGTT 


12840 


AAAGCATCGC 


CGTGTTGTAA ATCATACAAC GcTTGCGCCG 


ATAAGTTTGT 


GTTAGCTGCG 


12900 


GTATGTGCTG 


GGAACATAAT TGACATTTTG TACACAATCA AAGTCAATGC 


CCCTTGCGTA 


12960 


ATCCCTTGAT 


AAACAACACT GCTTAAAACG CCGCCAGAGA 


AGAAGTTCGA 


TTTTTTACCA 


13020 


CGAGGTTCAT 


GGCTCATCAC ATCACGTTCT GCTGGTTCCA 


CACCTAACGC 


AATCGCTGGG 


13080 


AAAGTATCCG 


TTACTAAGTT AATCCACAAT AAGTGAACAG 


GTAACAAGGT 


ATCCCAATTG 


13140 


AGCATTGTTG 


CAATAAACAA GGTTAATACT TCCCCTAAGT 


TAGCTGAAAG 


TAAATATTGA 


13200 


ATCGTTTTTT 


GGATATTTGA GAAGACTTTA CGTCCTTCTT CTACTGCCAC 


AATAATTGTT 


13260 


GA7VAAGTTAT 


CATCGGCAAG GACCATATCT GAAGCACCTT 


TTGATACTTC 


TGTTCCGGTA 


13320 


ATTCCCATAC 


CAATCCCGAT GTCAGCAGCT TTTAAGGCTG 


GGGCATCATT 


TACGCCATCA 


13380 


CCAGTCATCG 


CAACCACTTT TCCTTCTTGT TGCCACGCTT 


TAACAATCCG 


AACCTTGTGT 


13440 


TCTGGTGAAA 


CACGAGCATA GACAGAGTAG TGTCCCACGA 


CTTGTGCGAA 


TTTTTCATCT 


13500 


GACAATTCAT 


TTAATTCAGC GCCTGTAATG ACTGCATCGT 


CGTCGCCTTC 


TTTGATGATG 


13560 


CCTAAACGTG 


CCGCAATTGC TTCTGCTGTA TCACGGTGGT 


CACCCGTAAT 


CATAATTGGA 


13620 


CGAATACCCG 


CTTCTTTCGC GACTTTTACA GCATCTGCAG 


CTTCTTTTCT 


TTCTGGGTCA 


13680 


ATCATACCGA 


CTAAACCAGC AAATGTTAAA TCTTTTTCCA CAAGTTCAGA ACTCATTTCG 


13740 


GCTGGAATCG 


TCTCAACATA TTTATAAGCC ATTCCTAAAA 


CACGTAAGGC 


TTGTTTCGCT 


13800 


AATGAGGTGT 


TGGTTTTCAA AATTTCTTGA CGTTTTGTTT CATCTAATGG ACTTGTTTCT 


13860 


CCATTTGATA 


AGATTTCCGT ACAACGTTTC AATAATTCAT 


CTGGTGCACC 


TTTGACAGAA 


13920 


ACTAGGAAGC 


CACCTGTTTT TAATtCATGA ACAGTAGTCA 


TTAATTTACG 


ATCAGAGTCA 


13980 


AAAGGAATCT 


CTGCCACACG AGGTTCAGCT GCTACTTTTT 


CGGTTACGTT 


AAACGCATGA 


14040 


TCTAAACCAA 


ATTGTACTAA GGCAGTTTCT GTTGGGTCAC 


CAATTAATGA ACCATCTTGA 


X H X i/ V/ 


GCGATTTTTG 


TATCATTCGT AAAGTTCATG ATTTTTAAAG 


CCATATTATC 


GGCAGGGATT 


14160 


TCTGTGGAAG 


CAGATAAAAC TTGACCATCT GTATACAACG 


CTTCAACGGT 


CATTTGATTT 


14220 


AAGGTTAACG 


TACCTGTTTT ATCTGAACAG ATAATGTCTG 


TACTTCCTAA 


TGTTTCAACG 


14280 


GCTGGT 








14286 



(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 
CAATGCTCAT TTCCACTCnT CCTAAATCTG TTATTGCCAG TAGTTTTGCT CTAACTTTTC 60 
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TAAATaATGk GaATAGTAtC CATGCTGTCG aACCTCAAAT TCTTTTAAGT AATCtAATGG 120 

AATACTCyTA CGtTCTAAGT TTTTTTGCAT ATTTAAATAA TCATCAACAT CCaCTAAAAA 180 

TACTTGATGA CTTTTGTTGA ATTGAATTAA TAAAAATGCT GTTCCACCGC TTAATTTAAA 240 

AnCCTTTAAA AAATTTTTTT GGTGATCTTT TAACATTGGT TTATTTCCAA CATAAAATGG 300 

AAATGCAGTT TTATTTTCAG TAGATTTGCA ATCGAAGGCG ATTGGCCGTC CCTTCAAATG 360 

TCCGATAAAA TCACATCCTG TCTTGTTAGT TGGAATCACT ACGGGTTTTC CACCAACCCT 420 

TATAGTTTTA GTTCCATTAG GTATTTTTGC TACTGTGCCT TTTCTGTTAC GACAATACCA 480 

CTCGTTGGTT tGTTCAATCA TTTTTTCAAA CTGACTCCAC GCTTTCATCA CAGCACCTCT 540 

CTCTTAATTT CCCTCTGGAT TTAACCAGTT ATGCATTGTA ACTTTTGGTA TTGAATCGAC 600 

AGAAATCTCC TCTTGATCAT TTCGCATATT GTATTCACTA ATTTTTTTTA TAGCTTGATT -660 

TTTTGTCTTT ATTCCCTTAC GTTCCCAGCT TAACAATATG CGGTCCATAT ACTTTAGACT 720 

GAATGCCTGA TTAAGAACTG CCTCTTTAAG GGCTAATTGA ATAAGGTCAT CTGGATAACC 78.0 

ATCTTCTGTT TGCCACTGTT TAATCATTTC CATTTCGaTA GGCGAAAGTG GTCtCCCaAA 840 

ATTTTGTTCg ACGGcAGCGT ATATATTTAA tTTaTTTTTA TCTGTTTCTG GTGAAAAAGA 900 

ATTATTTGTG TTTATGTTTT GTTTATGTTT AATTAATGTG CCACTGTTCG TTGACTGCGT 960 

TGTACTCTTT AGTGTCACTT TATCGTTACT CTGTTGTGTA CTCTCTTGCG CACTATTTAA 1020 

CGTATAAAGT ACGCATACCT TGTAAGAAGT AGCTTTTCGA CCATTACTTT TAAAATCAAT 1080 

CAAACCTAGT TGCTTTAACG CGnTTCTATT TTTATTAATT CCTGAGCGTG ACAAACCTGA 1140 

CAAAGATTCT AATGTTGCAT TAGCTGCTGT AAACCATGTA GCCCATCCTG CTTTGTTGTT 1200 

TATGGACATT AATGCACGCC ATAAAGCAAT CTGACCTGAT GAAAGCTTTT GTTTATAAAG 1260 

CAAATAATCG TCAAACGCAA GAATCTGTTG TAAATAATTC AATGTCACAC CTCCACTCTC 1320 

TATGCAATAA TTATtTTtTT ATTGGTTATC TtTTCTATtT CTtCTTTAAA TTTTTCTGGa 1380 

TCACTATTCT TGTCGCTTAA ATGGATGAGA TATATTTCTT CTGTTTTTGT TAAATCAGTT 14 4 0 

GACTGAAAAA ACTTTTTGCA GGCATCTATG CTCATATGCG TTCTTAAAAT TCGATCTTGA 1500 

ACACTTTTTG GCAGTTTGCT CTGTCTTACT AACTTGATAT CATGATTACA CTCAACGAGC 1560 

CAATGAGTGA CGTCTTTAAA CGTTTTAGGT AAATAGTTAG TATCTGTAGC AAATACTATT 1620 

TTCTTTCCAC TAGGTGAAAG AATAAGAAAG CCCAGTGGTT CTCTCGCCCG TGCCTTTTTG 1680 

TCATCGTGAA TAGTGGCAAA AGGTTTAACT AACCAATCAC CAATTTTCTG TTGCTGGTTA 1740 

GCTTTTAAAA TATGTGACCG TCTATTTATA CCTAATCCTT CTAAAGTTCC TCGTGAAGCC 1800 

CACACATCAA ATCTTCCAGC GAGTAAAATA TCATTGATAT ACTTTGAGTG GTCACCATGT 1860 

TCATGTGTTA CAAGTAAACC TTGAATATTT GAAAAATTAA TACCTTGTTT CATAATATCT 1920 

TTAGGCTTTA ATCCTGCTTC TAACATTAAG GAAGAGTTTC CATCTGCAAG TAAATAATTA 1980 

TTACCTGCAG ATGAAGAACC TTGTATATTA ATTTCAATCA TTAAAAGCCA CGTCCATCTG 2040 
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TTTCGCTAGC 


AATATTTGTG GTTAATTCAT 


CAAAAAACGC 


TGTCTGGGTT 


GGTTCGTTTT 


.2100 


CTATTTCTTG 


ATTCGTTGTT GCTTTTTGAA 


TCGGCTCTTC 


TGGTTCGATG ATAC3TCTCAT 


2160 


CTGCTGCAGT 


TTCTTTATCG AGTTTTACTG 


CTTCTTGTTT 


GACTTCTGTT 


GGTTCTTCAT 


2220 


CAAAATCGAA 


CACTTCTTGT GCTGTTTGCT 


CAGTGACATC 


TTTTCTAACA 


GATTTAACCA 


2280 


CTTCATCATC 


TTGCGACTGA TACTGCATTG 


CGACATAGGC 


GTTTTCAAAA 


TTTTTAGGAA 


2340 


TTTTCTTAAC 


AATGkTGTTT CTCATTTTAC 


GAACAATCAT 


GGATTCTCTA 


CTTTGTGGAG 


2400 


ATTTCCATGC 


GGGACTAATA TAAGTCTGGT 


ACTCTTCGCT 


ATCCAAGATG 


TCATCTAAAG 


2460 


ACATTGTTTT 


CAAATTGTTC ATGATTTCTT 


GCTTTTTATT 


TTCGATTTCT 


GTTCTCTGAC 


2520 


TCTGAGTTGC 


TTTGAATTTA TTTTCAGCTA GTCCAAACGT 


CTCATTCATC 


AAATTTTGGT 


2580 


TGATGTGGGC 


AATTAAGTTT TTAACAACGT 


CTGCACGCTC 


TGAAATATGG AACTCAACAG 


2640 


ATCCGTCTGT 


CATTTCAACT GGATAAACAA 


CTCTAACCAC 


TTTTCCTTTT 


CCAGTGGGTC 


2700 


CCCACTCTGG 


ATCAGTCACA GACAGCCCTT 


TGTATCCAGG 


GTACGAAAAA 


TGGTCCTCTT 


2760 


CCCGAACTTC 


CCAATGTCGA TGAACATGCG 


CCACATTTCG 


TCCGAACTTT 


GAAAGAATGG 


2820 


CGTCATTACC 


ATCTCCTTCA ATTCCCATTT 


CAATCTGTTT 


AATCCAAACA 


TCTTTCGGCT 


2880 


TGCCCGTGTT 


TGGATCATTC ACTTTTCTTT 


TAACATTTCG 


GGTTTGAAAA 


TATaCTTCGC 


2940 


GAGGAATCGC 


CGATGCATTC ACTTGTAATG 


CTGCGATGGT 


CATTAGCGTT 


TCTGTTATAT 


3000 


TCGTTGAATC TACATCATTA ATTGACAAAC TGGCATTTGT 


aAGCATTGTA 


TTAAtTGCTT 


3060 


GAAwTGAGCT 


GATTACGCAT TGTTTTTGAT 


ATTCATTCAT 


ATTAATTCCG 


TTGCTTACAA 


3120 


GCTGCGAttC 


AACTTGTGGT AAAAACGTAT 


CGTTGATTTT 


AGTTAATCTA 


TTCTCGAAAA 


3180 


TTTTATTTTT 


TGCAATTTCA TTGGTCATCA 


ATATCTATTC 


CTTTCTCTGT 


CAAAGCAGCC 


3240 


ATCCATACTT 


TTTTAATTTC GTCGGGACAA 


TGACTCATAG 


CATCTTTCCA 


AGTTGGCCAT 


3300 


CTTCCGTTTT 


TGTCATAGAA TTTATATTGA 


TATGTTAAAC 


TTTGCTGATT 


ATGTGGTTGC 


3360 


TCTTCATCAT 


GCtTAACTGC ACAAATTTCG 


CAGGTTCCTT 


CCGGAACGTT 


TCCGATCATC 


3420 


ATAAACCCAG 


TTTCGTCTTT TAGAAATCCC 


ATCTATTTTG 


CACCTGCCTT 


TTCAATTGaA 


3480 


TGCTGTTCAA TTCTAATTTT TTTAyCTTTT TCGCTTACAT ACATCGCAAT 


GACTTGTGTA 


3540 


TCTACTTGAA 


CAGAGTCTCT ATTGTGATTA GTCAAACCTT 


CAGCGTTATC 


GATAAAAATT 


3600 


GGAACGATAT 


AGCCTTCCTG TTTCATTAGA 


GTGTTGGAGA 


CATCTAATCC 


AGCTTGCATA 


3660 


CGGCTACCGT 


TATTTAGAGA ACTGAACGGT 


ACTCCGTCAA 


TCATCGGTTC 


ACATACTGCT 


3720 


TCATTGAGTC 


CTCCATCTTC AAAGAAATCG 


AACAATTTCC 


ATTTAACAAC 


TGAGAAATGA 


3780 


CTATTTATAA 


TTTCTTGCAG CATGTTCTGT 


TTTGTTATAA AGAATTCTTC 


AAATAGTACT 


3840 


AATTTTTGTA ATACTTCACC TTTCTTATGA 


GATAATTGGC 


GCTCTTGTTC 


ATTGAAATCT 


3900 


TCAATAATCG 


ATAGCTGTCT TTCATACTCT 


TTTAGCAAGG 


CTAATTTTTC 


ATCAATCATT 


3960 


GCGATTTCTT 


TATCTATTTC TGTAATCTCA 


CTTGTTTTAG 


CGGAAGTTTG 


TTCTAGAATA 


4020 
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GCTTCGTTAC 


TTTGAGTAAT 


GTACTCTTGC AATTTTTTTA 


TTTCTTTAGT 


GATAGTTGAA 


4080 


TATTTTTCAG 


TTGCTTCAAA 


TGGTATTTTG 


TCTAATTTTA 


ATGAGGAAAT 


CTGCTGCTCT 


4140 


ACATCAGCTA 


AGTTTTCCTT 


CACATCTTCC 


AAGTGTTTTT 


TAGCAATGTT 


ATATGCTTCT 


4200 


GTTTTAATCA ATAGTTGTTC TTTTAACTCG CCGATCCCTT TTTTTAGGgC TTCGCGATCT 


4260 


TTGTTATTTT 


GAATGCCTTT 


TTCTCGaATT 


TCAGAGAGCT 


TGATTTGCTT 


ATCCGCTTCA 


4320 


AACTGGGCTT 


TTATTTCTTT 


ATTTGTTAAC 


TCAATTTCTT 


CTGCACGTTT 


TTGCTCTTCT 


4380 


TCCTCATGAT 


GTCGTTTCAT 


TTCATCTTGa 


TCCTTGACAT 


CATATGGACG 


ATTACAGTGC 


4440 


TGACAAACTA 


AAAGATTTTC 


ATTGAAAGAT 


AATTTGGTAT 


AAACTAGACC 


GCCTGTAAAT 


4500 


TCTTCAGCTT 


CTACTTCATC 


ATATTTGTCG 


TACAATTCTT 


CATGCTTTTT 


ATTTAATGCA 


4560 


ATCAGTTCAT 


TATCTTTTAT 


AGAAaCCAAA 


CGTTCTGTTA 


CATTAAGACT 


TGATTCTTCA 


4620 


TCTGCATACG 


TCTTCTGGGC 


TTTATTGAGA 


TCAGCAAAGA 


GTTTTGACTT 


ACCTTGCTCA 


4680 


ATGCCATTAA 


TGCGTGCATT 


CTGTGCATTA 


TCATGCTTCA ATTTAGCAGC 


AGTTAATTCT 


4740 


TCTTGTTTTG 


TATTAAGACT 


TGCAATTAAT 


TCTGAAATAT 


TACCACCATT 


TCTAATAGTA 


4800 


ACGAGCTGGT 


TTTCGATGTC 


ATTTTTCTTT 


AATGTCAACT 


CGTTGCGAGT 


GGTTAATAGT 


4860 


TGCTCTTTGT 


TGATATTTTC 


AATATCTGGc 


AATGCTGCTT 


GAATACCTTC 


AATTTTTACA 


4 920 


GGAATATTTT 


TCAGCGTTTC 


GTTGATcGCT 


TTTGTTCTTG 


AAGCACACGC 


TCACGAGCTG 


4980 


TTTTAATGTC ATCATTACCA ATAATTtCTT 


TTAATTGGTG 


AATTGATGGC 


GATTCGTTGA 


5040 


TAATCTCTkC 


ATCTGTCTTG 


CTGCCAAAGT 


AtTCAAAAAG 


CTTTTGACGT 


CGTtCATCTG 


5100 


CCACTAACTG 


CTCACAAAAA 


TAAGTCACAC 


TAGTCAGATT 


CTTAAATGTG 


TCTTGATCTA 


5160 


ATACTTTTTC 


TACTTCATCG 


TCAAATGCTT 


TCTTAGTTGT 


GGTCTCTAAT 


CCGTCTACGA 


5220 


GGTATTTTGT 


AAACATCTCA 


TAAGACTTGT 


GTTCCGAATT 


TCGTTTGATA 


ACCTCTTTGT 


5280 


CTCCTCGGAC 


CTTTTCAAAC 


TCCTTTGCCT 


GACCATTGAT 


AGTCAAAACG 


ACAGTTACTG 


5340 


ATGTCTGTTT 


ACCGCGAATT 


GGCTCGCTAT 


TTTCATCAAG 


TGGACGCCAC 


TGAATCTTAG 


5400 


TACGTTCTTT 


TGAATCTTTG 


TTGAACAAGC 


ACCAGAGAAA 


aGCATCaTAG ATAGTTGTTT 


5460 


kGCCTGCATC 


GTTGTCTCCA AAAATATCAA 


TACTTTTACC 


ATTTGGTTCA ATCATCAGGT 


5520 


CCGATATACC 


TTTAAAATTA 


TGAATACGGA 


TTGACTCCAA 


GCTAATGTTT 


TTCATACAAT 


5580 


TAATCTCCTT 


CAATTTCGAT 


ATTGTCTTCT 


AAGATTGCTC 


GTATAACTGT 


TTCTTTGCGT 


5640 


TTGTTACAAA 


TTTCAGTAGC 


GACATCATAC 


CAAGCACTAC 


TATGATTTTG 


ATTCAGCTCT 


5700 


GTTGTTAAAT CATATCTATA ACTTTTACTA ATTATTGGAT CAACTAACCA ATCCATACGT 


5760 


GCAATTAAGG 


CTGTAGCGTA 


TAATTCTCGT 


TCTGCGCAAT 


TTTCAGTCGC 


GTTTTTAAAC 


5820 


CATTCTTCAA 


ACCGTTTTGA 


CATGGTTAAT 


TTATTTGACA 


CATTATCCAT 


TTCATTTCCT 


5880 


CCAGATTTTG 


TGTTATAATT 


CTCATGTATA 


ATTTTTGTAT 


GGGACTTAAT 


CGTTTGTGGA 


5940 


CGAGTGAGTC 


TCTTTTTTTG 


TGTAAGCTAA ATATTTTGCA 


TCGTCGTATT 


TCATGAACCA 


6000 
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AACAACTGCA ATTGGTCCAA TAATCAAAAG TAGATAGcTC GCTGGAACAC TATTTTTAAT 6060 

TaCTAACCCT MTATAAAGA CfiAGAATAAA TGATCCTAAA ATTCTAGCTt CGTATAAGCT 6120 

TTnTGGTTTT CTT?Cc?TCA TTTATTTTCC ACTCCTATTT ACCCATAGCA CTCTGATAAA 6180 

CATTAAAAAA AGTGCT.-Ji.TA TAATCGCTGT TAAAAATTGT GATTTAATCA AAGATAAAAT 6240 

AAACACAAAG AATAAA.-.TTC CTACGGAAAG TGTCATCATT GTTTTAATTG CAAGTTCCTT 6300 
ATTCTTCACT AGAGACCCTC CTATATAAAG TTTCTTTCAT ACCATTCTTC TAAATCATCG . 6360 

ACTTCTATTC TTACCAATCC ACCATTTCTT TTAGCAGGTA ATGGATCAAT TTCTCGTTTC 6420 

ATCCATCTTG TAACAGTAGC AGCTGAAACA TTTTTTCTTT TTGgCTACTT CTATTGCTTT 6480 

CAAAGGTGGn AATTTcTTTG aATTAACTTT TCGaTCkTTT TTgaTTTAAA ATCAATAACT 6540 

TCTAGTGcCA TTkGwATyCT CCTTCCTCAT GtATCCTAGT kGTTCCCAAT ATGGAAAACG 6600 

TTGCTCACTT AAATAACGAA TATCGATGAA AGCAAGATCA CACAAGCTAC TAAGTAGTGT 6660 

GATTCAACAA TTACTCAnCA AATAnCCATT GCGnAGCACT ATTTAGAAAT ACTGTCAGAA 6720 

AAA €"'23 
(2) INFORMATXOM FOR SEQ ID. HO: 164: 

(i) SEQUEKCE CHARACTERISTICS: 

(A) LENGTH: 27360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

TAAGCGAATT GGACATCATA TGTCAATAAC AAATGTTGAT AAAAAAGGTC AATTAAACGC 60 

TATTTTGAAA AATAAAATCA GCTTTTAGAT AATTTTGCCA AAAATTTCGC TTAATGAGTA 120 

TTAAAAATAT TTTTTTATTT TATGTGCTCA AATATTTTTC AAAAAGAAAA AATTGTATTA 180 

AGGTAATAGT AGAAAGGGAG GAGATTTAAT GACAGAACCA AACTTAAAAG ACCCAAGAAA 240 

GTTATTTTAT AGTGATGGAT TTCCGCAACA AGATCAAGAA ACTCCCGCAT TACAGGACAA 300 

AATGGTCCCT AAACCAGATT GTGGGGAAGA CAGCTATGTC GGAAATCACA AATTAGAAAA ^ 360 

TCGTCGGGTA TTAATTACAG GCGGTGATTC TGGGATTGGA CGAGCTGCGG CGATTGCTTT 420 

TGCTCGTGAA GGCGCAGATA TCGCCTTACA TTTTTTCCCA GGTGAAGAAA AAGACGCCGA 480 

AGAAGTGGCG CACTATATAC GAGAAGCTGG CCGAAAAGTC GTTCTTTTGC CAGCAGATTT 540 

AAGAGATAAA CAGGCACCTG AAGAATTGGT TGCACAAGCT CATGAAGCAT TAGGCGGCTT 600 

GGATACCCTC 6TTTTAAATG CAGCCCAACA AATTTCCTGT GCCGCTATTG AAGAATTACC 660 

AATGGAACAA 6T6ATAGACA CTTTCCATGT CAATATTATT GCGATGTTTG GCATTGTCAA 720 

AGCAGCAGTG CCTCACTTGC CTGCCGGAAG TAGTATTGTG ACAACGACTT CCGTCCAAGC 780 

CTTTAATCCC AGTGAGCATT TGTTGGATTA TGCAGCAACG AAGGCATCAA TTGCCAACTT 840 
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TACCGTTGGT TTAGCAAAAC AACTGGCACC AAAAGGCATT CGGGTAAATG GCGTGGCTCC 900 

AGGACCAATC TGGACACCGT TACAGTTAGA TCATGGACAA CCAATTGAAG AGTTGCCAGA 960 

ATTTGGGCAA CATTCTTTAC TGGAACGTGC GGGTCAGCCT GCTGAATTAG CCCCTGTTTA 1020 

TGTTTTCTTG GCGTCCAATG ATGCAAGTTA TGTTACGGCT CAAGTTTATG GCGTAACTGG 1080 

TGGAGAAGCA ATTAATCTTT AGGAGGGATG TTCCGTGAAT AAAGTTGTTA AAATACTTTT 1140 

GCTTCTAGGT TCACTGATTT TAATGGCATT ATTTGGCGCG GTGGCCATCA CCACTTGGTC 1200 

AGCCGATACA GCTTGGCAAC CACCAGATAA TGTCCAATGG TTCTTGCAAG ACAACGAGTA 1260 

CGCACGTCAA GCAATCTTTT GGGTAGCGCT GGTTTTACTA GGAATTATGC TCTTTCTCTT 1320 

TTTATTCATT CTTTTATTCC CAAG7UVGAAA AGGCACGTTT GAATTAAAAG AAGAGCAAGG 1380 

AAAGTTAGTG TTACACCGTA AAGCCATCGA AGGATTTGTA CGCTCTAGCT TGCAAGACAG 1440 

TGACTTTCTG GAAGCACCAA CCATTCGTGT T^GGCGACA AAACGGCGCA TTAAGGTCAA 1500 

TGTTAAAGGC ACACTGAAAC GAACGGCAGA TTTAATTGGG CAAACAGAAG CTTGGTCAAA 1560 

AACAATGGAA GAACGGCTTG CTCGTTTGGT GGGAACTGGT CACAAGATTA CGATTGATGT 1620 

TCATTTTGAA GATATTCAAT CAATGAAACG ACCACAACAA CAGCAAACAC AAGCACGTGT 1680 

GGAGTAGGAG GGATGTACAA TGGTTGAATG GATAAAACAA TATGCCGTTC CGTTTU^TTGG 1740 

TGGCCTAGTC GGCTTGGTTT TTGCAATCTT GTTTCTGTCA ATCGGCTTCT TTAAAACATT 1800 

GTTAGTGGTA CTCCTGACAG GACTAGGTAT TTGGCTAGGT TTGTATTTGG AACAGACAGG 1860 

CATTGTCAAA AATTATTTTC ATCATAAATA GGAGGTTTTG GCTATGGAAA ATGCAAATGT 1920 

ATCGATTGTT AAAGGTGAGT TGACGTTTGA GGATAAAGTA ATCCAAAAAA TTATTGGCAT 1980 

TGCTTTGGAA GAAGTCGACG GTTTGTTAAC TGTGGaCGGT GGTTTTTTCT CGAATTTACG 2040 

AGATAAAATG ATTAACAGTG AAGATGTTAC AACAGGCATT CACACCGAGG TCGGCAAAAA 2100 

ACAAGTCGCA GTGGATATGG ACATTGTAGC CGAATACGGG AAAGATATCG AGACCATTTA 2160 

TGATCAAATG AAAGAGGTTA TTACTCGCGA AGTCCAACAA ATGACTCACT TGGAAGTAAT 2220 

CGAAGTCAAT GTCAACGTAG TAGATATCAA AACACAAGCA GAGTACCAAG AAG7VAAGTGA 2280 

AACGrTCCAA GATAAACTTG GCAACGCCGC TGAAGCGACA GGTAGTTTTA TGTCAAAACA 2340 

AACCAACAAA GCCAAACATG CGGTAACTAA AGGCAAGACA AAAGTCAAAG AAAACACGGC 2400 

ACCTCGTGTT CAATAATTGG TTATTATTTT ATTAAAAATA AAGCAATAAT AGAAGAAACT 2460 

AGGAGGAATG AACCATGAGT AACGAAAAAT TCAATAATGT ACCAAAACCA GCAGGTUVACG 2520 

GACCACACAC ACCAGAAGAT CATGCAATCA AAGGTGAATT AACATTTGAA GATAAAGTCG 2580 

TTCAAAAAAT TATCGGTTTA GCCTTAGAAA ATGTCGATGG TCTATTAACA GTCGATGGCG 2640 

GCTTTTTCTCAAATATTGCT GAAAAATTAG TAAACACAGA TAACGTCACA GCGGGTATTG 2700 

ATACCGAAGT TGGTAAAAAA CAAGTCGCTG TTGATATGGA TATCGTGGTA GAATACGGCA 2760 

AAGATATTCA AGACATTTAC GAAAAAATGA AAGAATTAAT CAGTCGTGAA GTGAAAAAAA 2820 
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TGACCCACTT 


AGATGTGATT 


GAAGTGAATG 


TTAACGTAGT 


TGATATCAAA 


TCAAAAGAAG 


2880 


AATACGAAGA 


AGATAGCGAA 


ACGGTTCAAG 


ATAAAGTCAC 


AGGCGCTGCC 


AAAAGTACCG 


2940 


GTGAATTTGC 


TTCTGAACAA 


ACAGAAAAAG 


CAAAAAAAGC 


TGTGAATAAA 


GGAACAGAAC 


3000 


AAGTCAAAGA 


AAATATGGAA 


CCACGTGTAG 


AATAAACAGT 


TAAAGGGAGG 


AAACAATCAT 


3060 


GGGCTTTATT 


TGGGCATTAA 


TTGTCGGCGG 


GGTCATTGGG 


GCAATCGCTG 


GAGCAATTAC 


3120 


TAAAAAAGGA 


TCATCGATGG 


GCATTATTGC 


CAATATCATT 


GCAGGGTTAG 


TTGGTTCAAC 


3180 


AATTGGTCAA 


GCCATTTTAG 


GCACATGGGG 


ACCAAGCTTA 


GCTGGGaTGG 


CTATTGTGCC 


3240 


ATCGATTATC 


GGGGCAGTGA 


TTTTAGTTGC 


CATCGTTTCA 


TGGATTTTAG 


GACGAAAATA 


3300 


GAGAAAGTTT 


AAAAAAATAG 


GGCATAAGTC 


GTTTCGGCTT 


ATGTCCTATT 


TTTCTTTTTC 


3360 


TATGTATGAG 


GAAGTTTCGT 


ATGAAAAGTA 


AATTTTGAGC 


AATTAGCCAA 


GAGCTTATTT 


3420 


TAATTAAAAT 


CATTAGTCAT 


TTTGGCTTTT 


CAACAAAAAT 


CAACAACATT 


TGTTGAAGTT 


3480 


ATTTGTAGAT 


AACGTTTATC 


ATTCACAAAT 


TTACATTTTT 


TTGTAACGGA 


TATCTCGTAT 


3540 


AGTAATAACT 


GCAAATGAAT 


TTCAGAAAAT 


TTTTTGATAA 


GTCATCAAAG 


TGAGGGGATA 


3600 


GCGGTCTTTT 


ATTGAAAAAT 


AGCAACACAA 


AATAAACAGA 


ACAAAAGAGA 


GAGGAATTCA 


3660 


AAGAATTATG 


ACAGAACAAA 


CACCAACAAA 


GCTTTTGAAT 


AAAGGCTTTA 


TCAGTATTAC 


3720 


AGTGATTAAT 


TTCATTGTCT 


ATCTTGTTTA 


TTATTTATTA 


ATGGTTATTA 


TTGCGGTGAT 


3780 


TGCGCAAGAT 


AGCTTACACG 


CTACTTTAGG 


TCAAGCGGGT 


TTGGCCTCAG 


GAATTTACAT 


3840 


CATCGGGACA 


TTGCTTGCAC 


GACTTTTCAT 


GGGGAAACTC 


CTTGAACTGA 


TTGGGCGCAA 


3900 


ACAAGTCTTA 


CGTTAT6GGG 


CTTTGTTTTA 


TCTTTTAACG 


ACTGTTGCGT 


ATATGTATAT 


3960 


GCCAAGTATG 


GGCATTCTTT 


ATCTGGTCCG 


TTTTTTAAAT 


GGGTTTGGTT 


ACGGAACCGT 


4020 


TTCGACAGCG 


ACCAACGCAA 


TTGTTACCGC 


CTATATCCCT 


AAAAACAAAA 


AAGGGGAAGG 


4080 


GATCAATTAT 


TATGGTTTGA 


GTACTAGTTT 


AGCAGCCGGC 


ATTGGGCCAT 


TTATTGGGAT 


4140 


GTTATTGTTA 


AATGTTTCCA 


ACTTCCATGT 


CATTATTAAC 


TTTTCAATTA 


TTTTAATTTT 


4200 


GTTAACAACC 


ATTGCTTGCT 


TCATTTTCCC 


AGTGAAAAAT 


ATTGAGTTAA 


CACCTGAACA 


4260 


TCGTGAAGCT 


TTATCTAAAT 


GGAATTTTGA 


TAGTTTTGTT 


GAAAAGAAAG 


TTTTATTTAT 


4320 


TACATTTATC 


GCCTTTTTAA 


TGGGTCTTGC 


TTATTCAAGT 


GTCTTATCTT 


TCTTATCTTC 


4380 


TTATGTCAAA 


GTAATCGATT 


TAGTTGACGT 


CAGCACCTTC 


TTCTTTATCG 


TCTATGCGGT 


4440 


CGTGATTACC 


TTAACACGTC 


CATCGACAGG 


TCGGATTTTC 


GATGTCAAAG 


GTGAACGCTA 


4500 






TTTTCT T AAC 




1 Lk^L LAI XAA 


V3 1 Al va/iV^ i/iV-» 




GTCTGGTTGG 


ATGTTATTAG 


TTTCAGGTGG 


GTTAATTGGC 


TTAGGTTATG 


GTACTTTCAT 


4620 


GTCAAATGGC 


* CAAGCGGTTT 


GTCTTCAAGA 


AAGCCCAAGT 


CCTCACCGTA 


TCGGGATTGC 


4680 


TTTATCGACT 


TATTTCATCG 


GTTTAGACCT 


TGGATTAGGG 


GTTGGTCCGT 


ACGTTTTAGG 


4740 


TGAATTACGT 


AACTTTATGT 


CATTCCAACA 


AATGTACTTC 


TTAGCAGGCT 


GTATCCCAAT 


4800 



wo 98/50555 



PCT/US98/a8985 



921 



CGTTTGTACC 


ATTTTATATA 


TGGTCTTCCA 


GAAAGCTAAA 


AATGATGCCA 


AAGATTTGTC 


4860 


TCTTGAAACG 


ATTGAAGAGA 


TTGAACACGG. 


CTCAGAACTA 


TAAATGGACA 


AGGAGTGATT 


4920 


TCAATGCCTA 


ACGATGAACG 


GATTTTAAAC 


GTTTTACAAC 


AAACACTAAC 


GGAAATTCAA 


4980 


CGCATGAACA 


CGCGTTTAGC 


TAAAATTGAA 


ACAGAGTTAG 


CCGATGTCGA 


TCAACATCAC 


5040 


GCGACTTTGG 


ATCATTACTT 


GCAAGAGCTA 


GAATTGGTGA 


CAGAAAATAA 


CAACGTCATG 


5100 


TAAAGTGAGT 


AGTTAGTAAA 


TGAATGTTCT 


ACGTTTCGGT 


ATCACTAGAG 


TGGAACATTG 


5160 


AAAATTTTCG 


TAAAAAACGT 


CAAAAGGTAG 


CAGTTTTTCT 


GCTACCTTTT 


GGCGTTTTTT 


5220 


GTAATTTTTA 


TCACTTAAAA 


TGTTTCACGT 


GAAAACTTTT 


AAAAGTGGCG 


TGGACTTTGT 


5280 


TTGGTGACAA 


TTTGTGTATT 


TAAATTTTCT 


AAGCTTTGCG 


CTAGTTCACC 


AATGCTAGCA 


5340 


GAAAAATCAA 


CATAATAAGT 


GGGAACCGTT 


GATTTGACAC 


TAACTTCTGA 


AAAGATATCT 


5400 


GTAATAATTA 


CATCGAAAGG 


CTTTTCTGGT 


GAAGCGACAA 


AAATTAATTT 


TTTATCCACG 


54 60 


GCTTTACTTA 


GCTTTTCATA 


AAGGGTATCT 


AAAATTAAAT 


TACTTTGATT 


GGTCCTTACT 


5520 


AAGACACGAA 


TCGGTGGTAA 


AGATTTCGTT 


TTCAATGTAC 


AAAAAAGAAT 


AAACAAGCAG 


5580 


TCAACTAACA 


ATTTTTCATT 


GCCAATCAAT 


GTCGG7\AGCA 


TTTCAATTGT 


AAGGCGCTTC 


5640 


ACATCTTTGT 


AGAGACTTGG 


ATTTTTTGAA 


GTGCTTCTTT 


TAAAAAGCTT 


TCTTGCACAA 


5700 


ATGGATTGGC 


AATCGTGACA 


AAGGTCTTAA 


AAAAGACACT 


TAAATTTTGG 


ATAGAAGTAT 


5760 


ATAGATAAAG 


ATACTCGTGA 


AATGTCAATT 


TAATTTGATA 

xwvr* X X x\^*^ X 


AAAAGACGTT 


AGTTGATAAA 


5820 


TGATCTCTTT 


AATGGGTGAG 


CCTAATAGCT 


GGACCATATG 


CATCCGTGCT 


GCAGGAATTT 


5880 


CCAAGACATC 


TTGTGTGGCG 


TAgcTGaCAC 


TAATCGTCAA 


TAAGAAAAAG 


GCAAAAAGGC 


5940 


CTTCTTTTTG 


TATAAATTTT 


GTTGGAACTT 


GGTGTAAGGC 


TTCTTCTTTA 


AAAAACAATT 


6000 


CAAATTCACT 


CAGATTAACA 


AAGGGATTAA 


CAATCGCTTC 


TTCCTCCACG 


AAAGTTTCTT 


6060 


TGATAA7VAAA 


GCCCTCTTTC 


CAACGCGCTA 


AAATCACCGC 


AGAAAAAATC 


AAGTCTTTTG 


6120 


TAAAATCTTG 


TGTTGTATTG 


AAAGAAACAG 


AGTGTTTTAA 


ACGTC6AAAG 


AGACTATGGA 


6180 


CTTCTTTTTT 


AGTAATCGTG 


TGCAACGGCC 


AATCCCCTTC 


AGAGTAATTG 


TTCCAAAAAA 


6240 


GCATCATAAA 


GAAAATCCGG 


ATATCTTTTT 


CTTTGCCCAC 


CAACACATTG 


GTACGTTTTA 


6300 


AATTGATTTC 


AATCTGATAT 


TTTTGGAGCC 


ATTGGTTTAA 


TTGTCCAATT 


TTTCGTGTAA 


6360 


AGGTTGATGA 


AGAAATCTCT 


AGGACTTTTT 


CAAAATGTTT 


GCTATCAATT 


GCATTATGGT 


6420 


TAAAGGCATA 


GGTCAGTACT 


TGATACGCGA 


AACTTTTTTT 


GACAAAGTAG 


GGCATCAAAA 


6480 


ACGACAGTTG 


GAAATAGGGT 


GCAAACCAAA 


ATGTGATTTC 


TTTGGCATTT 


GCTGATAAAG 


6540 


AAAATAAGTT 


CTCCCAGCCG 


TTTTTTTCAA 


TTTCTGTTTG 


GAGCAAAGCT 


ACAGATTGTA 


6600 


GAATGGTATT 


TTGTTGCATT 


TTTAATTGAG 


TCGCTAAAGT 


AGAAATCGCC 


AATGTTTGTT 


6660 


CTTCTGCTTC 


ACGACGTTCG 


AGTMCTCTT 


CTAAAAGAGC 


AAAACTTTTA 


CGATAAGGTA 


6720 


CATCAAGTAA 


CCCTGTTAAC 


ATCCGATTCC 


CTCCAAACTG 


TTTTCCCCAC 


GCAATTTAGC 


6780 
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AAAATAAATT 


TTGCGAACTA CTTGTTTTAG 


GTTATAGTAT TCGTCTAAAT 


AAGGAACGGG 


6840 


ATGCCAATAA AAAGTTTTAC TGGCGTGAAA AAGCGGATTA TTTTTGTCAa.TAATGACTAA 


6900 


GTCAGGCACT 


TCTTGAATAT GATTAACAAA 


TTTCAAAGGA ACATCGCAAT 


AATTATTGAT 


6960 


TTCCTTCGCT 


AACCAATTTT CTTGTATCTG ATTAATAGAT GAATGAAGTA 


AAATTTTAAC 


7020 


AGGAATTTGT 


TTATTTAATA AAACACTCCG AATAATTAAA AAAATGGTTT 


CTTTAAATAA 


7080 


CGCATCTAGT 


GGATGTTGTT TGTCAAAAGG 


CAAACGCTTT AAGAAATAGT 


TGGTTGTTCG 


7140 


TGAAACATTT 


GGGTATTGTT TTTCATACAA AGAGACATTA GCAACCTCAT AGCTCAATTC 


7200 


CTCTTCCACA 


GGGAAATGAA TCAGTCGTTC ACTGGCATTG GATAAATTAA CCAACAAAAA 


7260 


GACCACTTCT ATTGCGCTTA AATCCAACTG TAATTGCGTA CATTTTTCAA AAATCCAAGT 


7320 


AGCGTGTGGA 


CTGTTATCCA TTCCCATGAT 


CGCTAAGTTG GCAAAGTTGA 


TGGTTTCCAT 


7380 


TTGGACCAGA 


GAGTACATCG TTAATGTGCT 


GAAGAAATCA TAATAAAAAT 


GAATCTCAGC 


7440 


TTGCCAAGTG 


TCAGGCTCTT TTTGTTCATC 


AATAAAACAA TCATTCACTA 


ATAGTTGGAA 


7500 


TTCTGAATAA 


GGAAAGGCCA AATTAATAGC 


AGCAACTTCT GGATAATTGG 


TGATTGTAAA 


7560 


ACCGGATTGA 


TAACGGAGTA AACCTATGGA 


CATAATTAAT TTTAATTTGC 


GTAAATCAGT 


7620 


ATATGTAAAG 


TAAGGGTAAC GGTTTTTGAT 


GATTTCAATG AAGTTCTGCA 


ATTGTGCTTG 


7680 


ATTAGTCTTA 


GATAGAAATG GTACTAGAGA 


AGTTCCTGAA ATGAAGAATA 


AGGAATAAAA 


7740 


GAAACT6CGA AAATCTTTCT CGTTACCACA AATTTTTTCT CGTCTTTTTA 


AGTTTAAATA 


7800 


ACAATTAAAT 


CTAGCTAGAT GATTATTTAA 


ACGGTCACTG ACTTGTTTAA 


AAATAGAAAA 


7860 


GTTAATAAAG 


TGCTCTTCAC AAAAATCTTT 


GGCTGAAGTG TAGCGTTTTA 


AAAGTAAATC 


7920 


CAGGGCTAAC 


ATATAGGTTA GTGACCCGCG 


GACAACAGCT TCTTTCATTA 


ACTCAATCGG 


7980 


GATATTTTCA 


TCGGCGACTA ATTCTATGGA 


CTGTTGATTA TTAATAATAC 


GAATATGTAC 


8040 


ATTTTCATGT 


GTATTGACTC GAGA7VATTAA 


CAAAGTTAAT TGATTCAATA AATTGTTGAT 


8100 


ATTGTAATGG 


GAAAATTCAT ATTTTCGTCG 


GAGTACTTGC ATGGGGACTT 


TTTCTTTACT 


8160 


GTCATACAAG 


TATTCAATTA GGGAAAGTAA 


CTGAATTTCC TGTTTGGTTA 


ATAAGTTATA 


8220 


CATAATAACC 


ACCTTTCTAA TTGGTTATGA 


TTGTTTGATT TTTCAGATAA 


TACAGATAGG 


8280 


GCATTCTTTT 


CTATATGACA TTGACGATTG 


CCAGTGAAAG AACTAGTTGT 


TCTTATGCTA 


8340 


CTGTATCTAT 


TCTTAAAAAT CAAGAAGAAC 


TGAATGTGTC AAATGGAAGA 


AAACATGGAC 


8400 


AAAAAATGAT 


AGAGCAAAAA ATTATTAAGA 


AAAACAACCA GTTTATATAA 


AACATTTAAA 


84 60 


TAACAAAAAT 


GATAATTTGA GTCTATTGAT 


TTTTTAGCTT AAATAGCTTA 


TACAAATAGG 


OOc.\J 


TTTTTTTGTC 


TGGTTAGTTC TTATGAAAAA AATTAAAAAA AACGGTATAG 


TGGAAGAAGG 


8580 


AATAGTAAAA -ATAAGAAAGG ATAAGGACGT 


GTTTTCATAA AGATGAAAAG 


GGTTTCTGGG 


8640 


ATTTCATCAA CTTTTTCTAT TTGGAGGAGA AGTTGTTGAT AATTTTTGCT GAAAAATCTC 


8700 


GTTATACTCT 


TCTTATTCTT ATAGAAGAAT 


CTATAGGAAA AAGCAGAAAA 


TGAATCCGCA 


8760 
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TTCGAAGTAA GAGTGCGTTT 


TGAGGAGATA 


AAAAGAAAAG 


GTGTGTTGGC 


TGTCTGTTTT 


8820 


AGCAAGGAAG 


AAAGACAACA 


GAAGAGCACT 


CACTTATTTG 


GTTTAAAATA 


AAAAATTGGT 


8880 


ACGAAGTGAA 


CGTTCTCTTC 


TATGTGTCGT 


TAGTAGAGGA 


AGGATGAAAG 


AAATGAGAAA 


8940 


GAATGGTCCA 


ATGGTAAACC 


GTTGGCTCTA 


CGGGTTGATG 


TGTTTGTTAC 


TTGTTCTAAA 


9000 


TTATGGCACA 'CCACTCATGG CTTTGGCGGA AGAGGTTAAC AGCGATGGCC AGTTAACGTT 


9060 


AGGAGAAGTG 


AAGCAAACCA GCCAGCAAGA AATGACCPTA GCGCTTCAAG GAAAAGCACA 


9120 


ACCAGTAACA 


CAAGAGGTTG 


TAGTGCATTA 


TAGTGCCAAT 


GTGTCAATCA AAGCTGCACA 


9180 


TTGGGCAGCG 


CCCAATAATA 


CGCGCAAGAT 


TCAAGTGGAT 


GACCAGAAGA 


AACAGATTCA 


9240 


AATTGAATTG AATCAGCAAG CGTTAgcAGA TACGTTAGTC 


TTAACGTTGA 


ACCCTACAGC 


9300 


TACAGAAGAT 


GTGACGTTTT 


CTTATGGACA ACAGCAACGA 


GCGTTGACGT 


TAAAGACTGG 


9360 


TACTGATCCG 


ACAGAATCAA 


CGGCAATCAC 


GAGTTCGCCA 


GCCGCATCAG 


CGAATGAAGG 


9420 


TTCAACAGAA 


GAAGCATCTA 


CAAACTCCTC 


TGTTCCTCGT 


TCGTCCGAAG 


AAACTGTCGC 


9480 


CAGCACGACA 


AAAGCGATAG 


AAAGTAAAAC 


AACTGAATCG 


ACGACTGTCA 


AACCGCGCGT 


9540 


AGCAGGACCA 


ACAGATATCA 


GTGATTATTT 


TACAGGTGAT 


GAAACAACGA 


TTATCGATAA 


9600 


TTTTGAAGAT 


CCGATTTATT 


TAAATCCTGA 


TGGAACACCA 


GCAACACCGC 


CGTATAAAGA 


9660 


AGATGTGACC 


ATTCATTGGA ACTTTAACTG 


GTCGATTCCA 


GAAGATGTGC 


GAGAACAAAT 


9720 


GAAAGCAGGC 


GATTACTTCG 


AGTTTCAATT 


ACCTGGCAAT 


TTGAAACCTA ATAAACCAGG 


9780 


TTCAGGTGAT 


TTAGTTGATG 


CAGAAGGCAA 


TGTCTATGGA 


ACCTACACAA 


TTAGTGAAGA 


9840 


TGGTACGGTT 


CGTTTTACCT 


TTAATGAGCG 


AATCACGTCT 


GAAAGTGACA 


TTCACGGGGA 


9900 


CTTTTCTTTA 


GATACTCATT 


TGAATGATTC 


AGATGGGCGG 


GGCCCAGGAG 


ATTGGGTGAT 


9960 


TGATATTCCT 


ACACAAGAAG 


ATTTGCCGCC 


TGTAGTGATT 


CCAATTGTCC 


CAGATACCGA 


10020 


ACAACAAATT 


GATAAACAAG 


GCCATTTTGA 


TCGAACGCCC 


AATCCTAGTG 


CGATTACTTG 


10080 


GACGGTAGAT 


ATCAATCAAG 


CGATGAAAGA 


TCAAACAAAT 


CCAACTGTGA 


CGGAAACATG 


10140 


GCCAACAGGG 


AATACCTTTA 


AGTCCGTGAA AGTCTATGAG 


TTAGTGATGA 


ATCTTGATGG 


10200 


AACAATTAAA 


GAAGTGGGTC 


GCGAACTTAG 


TCCAGATGAA 


TATACCGTTG 


ATAAAAATGG 


10260 


CAATGTGACG 


ATTAAAGGTG 


ACACCAACAA 


AGCGTATCGT 


CTTGAGTACC 


AAACGACGAT 


10320 


TGACGAGGCG 


GTTATTCCAG 


ATGGCGGCGG 


CGATGTGCCT 


TTTAAAAATC 


ACGCGACGTT 


10380 


AACAAGTGAT 


AATAATCCAA 


ATGGGTTAGA 


TGCTGAAGCA 


ACTGTTACCG 


CCACATATGG 


10440 


CAAAATGTTA 


GACAAGCGCA 


ATATAGATTA 


CGACGAAGCC 


AATCAAGAAT 


TCACTTGGGA 


10500 


AATTAACTAC AACTATGGTG AACAAACCAT TCCAAAAGAC CAAGCAGTCA TTACAGACAC 


10560 


AATGGGGGAT 


• AATTTAACGT 


TTGAACCAGA 


TTCTTTACAT 


TTATATTCAG 


TGACATTTGA 


10620 


TGACAAAGGA 


AATGAAGTCG 


TTGGAGCAGA ACTTGTGGAA GGAAAAGATT 


ACAAAGTGGT 


10680 


AATCAACGGA 


GACGGTTCCT 


TTGCAATTGA 


CTTTTTACAT 


GATGTGACTG 


GCGCAGTCAA 


10740 
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GATTGATTAT 


AAAACCAAAG 


TTGATGGAAT 


TGTCGAAGGC 


GATGTTGCCG 


TGAATAATCG 


10800 


TGTGGATGTT 


GGCACTGGTC 


AGCATTCAGA AGATGATGGC 


ACAGCCAGTC 


AACAAAATAT 


10860 


TATTAAAAAC 


ACTGGTGCAG 


TTGATTATCA 


AAATTCAACG 


ATTGGTTGGA 


CGTTAGCTGT 


10920 


GAATCAAAAT 


AATTATTTGA 


TGGAAAATGC 


CGTGATTACG 


GATACGTACG 


AACCAGTTCC 


10980 


TGGCTTAACT 


ATGGTACCCA 


ATTCGTTGGT 


TGTCAAAGAT 


ACAACCACTG 


GTGCTCAGTT 


11040 


GACGTTAGGC 


AAGGATTTCA 


TGGTAGAAAT 


AACTCGTAAT 


GCAGATGGTG 


AAACAGGCTT 


11100 


TAAGGTAAGT 


TTTATAGGGG 


CGTATGCCAA 


AACAAGTGAT 


GCCTTCCACA 


TAACTTATAC 


11160 


TACCTTTTTC 


GATGTTACCG 


AGTTAGACGC 


TAACAATCCT 


GCGTTGGACC 


ATTATCGAAA 


11220 


TACCGCTGCC 


ATTGATTGGA 


CGGATGAAGC 


AGGAAACAAT 


CATCATTCAG 


AAGATAGTAA 


11280 


ACCGTTTAAA 


CCTTTACCTG 


CTTTTGATTT 


AAATGCGCAA 


AAAAGCGGTG 


TTTACAATGC 


11340 


CGTCACCAAA 


GAAATCACTT 


GGACGATTGC 


GGTTAATTTA 


AGTAATAATC 


GTTTAGTCGA 


11400 


CGCCTTTTTG 


ACGGATCCAA 


TTTTAACCAA 


TCAAACCTAT 


TTGGCTGGGA GCTTGAAAGT 


11460 


CTATGAAGGC 


AATACAAAGC 


CAGATGGTTC 


GGTTGAAAAA 


GTGAAACCAA 


CGCAACCGTT 


11520 


GACGGATATC 


ACAATGGAAG 


AACCAAGCGA 


GAAAAACCAA 


T^TACTTGGC 


GTGTTGATTT 


11580 


TCCTAATGAT 


AGTCGTACGT 


ATGTGATTGA 


ATTTAAGACG 


TCTGTTGATG 


AAAAAGTTAT 


11640 


CGAAGGTTCG 


GCTAGTTATG 


ACAATACCGC 


ATCTTATACA 


AACCAAGGTT 


CTTCACGTGA 


11700 


TGTGACAGGA AAAGTTTCTA 


TTCAACATGG 


TGGCGAATCA 


GTGAAAAAAG 


GTGGCGAATA 


11760 


CCACAAAGAT 


GATCCAGATC 


ATGTGTACTG 


GCATGTAATG 


ATCAATGGCG 


CCCAATCGGT 


11820 


TTTAGACGAT 


GTGGTTATTA 


CTGATACACC 


CTCACCAAAC 


CAAGTGCTAG 


ATCCCGAGTC 


11880 


ATTGGTGATT 


TACGGTACCA 


ACGTAACAGA AGACGGAACT 


ATTACGCCAG 


ATAAATCTGT 


11940 


TATTTTAGAA 


GAAGGAAAAG 


ATTACACACT 


GGaAGTTACC 


ACCGATAATG 


aAACAGGACA 


12000 


ACAAAAAATT 


GTCGTTAAAA 


TGGCCCATAT 


TGAAGCACCT 


TATTATATGG 


AATATCGTAG 


12060 


TTTAGTGACT 


TCTTCAGCGG 


CGGGGAGTAC 


AGACACGGTA 


TCCAACCAAG 


TGTCAATTAC 


12120 


TGGAAATGGT 


TCAGAAGTCG 


TTCATGGGGA 


TGACAATGGC 


GATGTGGTCG 


TTGACATTGA 


12180 


TCACAGTGGC 


GGGCATGCCA 


CAGGGACTAA AGGCAAAATT 


CAGCTGAAGA AAACAGCCAT 


12240 


GGATGAGACG 


ACTATTTTAG 


CAGGCGCCCA 


TTTCCAAATT 


TGGGACCAAG 


CTAAAACACA 


12300 


AGTCCTACGT 


GAAGGTACAG 


TAGATGCCAC 


CGGGGTTATC 


ACATTTGGTG 


GGTTGCCACA 


12360 


AGGGCAATAC 


ATTTTGGTGG 


AGACAAAAGC 


ACCAGAAGGC 


TATACAGTTT 


CGGACGAATT 


12420 


AGCTAAAGGC 


CGAGTCATTA 


CTATTGATGA 


AGAAACTTCA 


GCCGAAGGAG 


CACAACCAAC 


12480 


CATTATTAAA 


AACGATGTCA 


ATAAAGTATT 


TTTAGAAAAA ATGGATGAGA 


AGGGTAAAAA 


12540 


GTTAGTCAAT 


•GCTCGCTTTA 


AATTAGAGCA 


TGCCGTAACC 


ACGCCGTTTA 


CTCATTGGGA 


12600 


AGAAGTTCCC 


CTTGCGCCGG 


ATCGAACCAA 


CGCGAATGGC 


CAGTTAGAGG 


TGGATAGTTT 


12660 


AAAACCAGGG 


CTTTATCAGT 


TCACAGAAAT 


CGAAGCACCG 


ACAGGCTATC 


TTTTAGACAC 


12720 
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GACCCCCAAA CGATTCATCG TGACACAAAA TACGAGCGGA CA^TTCGTG ATGTTCATGT 


12780 


CAAAATi^CTT AATTACCAAG GTTCTGCTGA ACTAATTAAA AAAGACCAAG CAGGCAATCC 


12840 


ATTAGCAGGT 


GCTGAATTTT CAGTCCTTGA CACCACAGGA CAAGCAGTTC GAGAACACTT 


12900 


AGTTTCGGAT 


GCAAACGGAA AAGTCACAGT GACGGATTTA GCCCCAGGAA AATATCAATT 


12960 


TGTGGAAACC AAAGCGCCAG CAGGGTACCT TTTAAACACT GAACCAAGTG CTTTCACGAT 


13020 


TGCAGCAAGC GATCGGGGCA AACCAGCAAC AGTTATAGCA ACGGCTAACT TTGTTAACTA 


13080 


TCAAGGCACG GCTAAATTAA TCAAAAAAGA TGTGAATGGA CACTTATTAA GTGGTGCGAC 


13140 


ATTTAAAGTG 


CTTGATGCGA AGGGAGAAAC GATTCAAACA GGCTTGACGA CAAATAATCA 


13200 


AGGGGAAATT GTTGCAGAGC ACTTAGCCCC AGGAAAATAT CGCTTTGTAG AAACCAAAGC 


13260 


GCCAACAGGC 


TATTTATTAA ATACCACGCC AGTCCCATTT GAAATTGCTG AGAAAAATGC 


13320 


TGGTAAACCA 


GCGGTCGTGG TTGCTAGTGA CAACTTTGTG AGTTACAAAG GGGCTTTCCA 


13380 


AATCGTGAAA ACGAATAGCG CAGACCAACC ATTAGCAGGT GCTGTTTTTG AATTATATGA 


13440 


TCACAATAAA 


CAATCATTAG GGATTACAGC AACGAGTGGC AAAGATGGcA AAATTATCTT 


13500 


TAGAGACTTG 


GCGCCAGGTA CCTATTATTA CAAAGAAATC T^AAGCACCAA AATTACCAGA 


13560 


TGGCGCAGAT 


TATATTATTT ATCCTGAATT AGTAAAAGTA GAAATTCGTG GTGATTTCAA 


13620 


AGGTGATCCG 


GAGATTTTCC AATTAGGGGC CTTCGCCAAT TTCAAAGGAC GCGCCGTCTT 


13680 


TAAGAAAATT 


GATGCCAATG CGAACCCACT TCCAGGAACG ATTTTTAAAT TGTATCGAAT 


13740 


CG/^AAACGGG 


GAAAAAATCT TTGAAAGAGA AGTAACTGCT GAAAAAGATG GTTCATTGGC 


13800 


TATGGAGGAT 


TTAGGTGCTG GTAGCTATGA ATTAGATGAA CTGGATGCAA CGGATGGCTA 


13860 


TATCGTCAAT 


AAACAACCCA TTTATTTTGT AGTGAAGAAG AATTCAAATG ATAAACAACC 


13920 


ACTAGATGAG 


TTAGAGTTTG TAAATTATCA AGCAGAAGTA ATGGGACGTA AAGTCAACGA 


13980 


GCAAGGTCAA 


ACCTTAGCGG GTGCAGTTTT TGCAATTTAC AATGCCGATG AGCAGAATCA 


14040 


GCCCCAAGGT 


TCACCGATAA CATTCTTGAA TCGTGCAGGA GAAAAAGTTT CTGAAATAAC 


14100 


AACGGATAAG 


ACTGGCGAAA TTTACGCTAA AGGGCTAAAT GAAGGGCATT ACGTTTTAGT 


14160 


GGAAACGAAA GCACCAACAG GCTATCTGTT AGACACAACG CTACATCCAT TTGATGTAAC 


14220 


CGCCCAATTA 


GGAAAAGAGC AGCCAATTGC TTTAGGCGAT CTTATCAATT ATCAAGGAAC 


14280 


TGCTCAATTA ACCAAAGAAA ACGAAACAGG TGAAGCATTG GCAGGTGCGG TGTTTAAGGT 


14340 


CATTGATGAA 


ACAGGGCAAA CCGTAGATGG ACAAACCAAT CTGATGTCTG ACAAGCAAGG 


14400 


CAAAGTCATT 


GCGAAAAaCT TAGCACCGGG AACGTATCGT TTTGTGGAGA CaCAAGCGCC 


14460 


AACTAGCTAT 


CTTCTTAATG AAACGCCAAG CGCAAGCTTT ACGATTGCCA AAGACAACCA 


14520 


AGGCAAACCA 


•GCCACTGTGG TACTTAAAGC ACCTTTTATT AATTACCAAG GTGCTGCCAA 


14580 


GCTGGTGAAA ATTGATCAGC AAAAGAATGC CTTAGCAGGT GCTGAATTTA AAGTGACAGA 


14640 


TGCAGA6ACA 


GGGCAAACTG TCGCTCGTTC ATTACGTTCT GACAACCAAG GGTTAGTTCA 


14700 
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AGTGAACCAC 


TTACAACCAG 


GAAAATATAC 


GTTTGTGGAA 


AGAAAAGCAC 


CGGATGGTTA 


14760 


CCAACTGTCT 


AAGCAAGCTG 


TCGCATTCAC 


.TATIGCGGCA 


ACAGCGAAAG 


ACAAACCTGA 


14820 


ACTCGTGAAT 


GCGGGCACGT 


TTGTTAACGA 


GAAACAACCT 


GTATCCAAAA 


AAACAAAACC 


14880 


AAATCAGCCA 


ACAACGAAAC 


AAGCAGCTAG 


AGAGACAGGT 


TGGCTTGGTT 


TACCGAAAAC 


14940 


CAACACACAA 


GTCAATTACT 


TCTTTGTCTT 


TATCGGCCTC 


ATGTTGGTCG 


GTTTGGCAAG 


15000 


TTGGCTCTTC 

X * www X W A A w 


TATAAAAAGA 


GCAAGAAATA 


AGCAAAAGCT 


ATAAAAAAAC 


AGAAAGrGCC 


15060 


TGGAACAAAA 


ATCACTTTGG 


ATTTTTGTTC 


CAGGCyCAAA 


ACCTTATAAA 


CGGCGGTAAC 


15120 


AGAACCAACT 


CCT TCGG AAA 


TAAGCCGAAA 

X ****X*X^ X#XJ«W«*X 


TTCTCCAAAA 


ATTAAAGAGC 


AATTTTCGGA 


15180 


AATTTCTTCT 


TATTTCTCGG 

X XXX X X^X^XJ 


AGCTAAACGG 


TTCTGTACCG 


ACCTCTTTTT 


TCATCTTAAG 


15240 


CTAATCAACT 


GAACAAAAGC 


AGCTCCTTCT 


TATTGACATT 


CTCAGGTAAG 


CCTTCATCTC 


15300 


AGTTTACCCT 


TTGAACAGAA 


CTTACTGGGT 


TCTTGGAGAT 


AACCGCTTTT 


GCCTCAGCCT 


15360 


TTGTTTTTPG 


ATGCTATCTT 

X X3X^ X X X X 


TAGGTATTTT 

X J*\D^J X X X X X 


GGCTAAAATA 


TTGTACACTT 

X X X X X 


TACATGTGCA 


15420 


AG AAAT TT AG 


TTAGAGATAT 

X X I»\^^^\J^^ X ^» X 


GAGGATAGAA 


TGATGAAAAA 


GATGATGGTT 


TTATACAACG 


15480 


AAACTTCAGG 


AAGTAGTGAA 


AGTAAAGAGA 


TTGCAGAACG 


CTTTAAAAAA 


GCTGCAGAAG 


15540 


CGAGAGGGGA 

Wwnwnwwwwn 


AGCGGTTATT 


TTGCAACCTT 

X X \7XM^r»WW X X 


CCAATCCAGA 


CATTGATCCT 

X>X^ X X X \^ X 


GAAGAMTGC 


15600 


GGAAAAATGC 


PAAAGAAAAT 


CAAGTGGGTG 

wnn\3 X wV3V3 X w 


Tf TTGGTCGT 

X w X X V3w X \^\3 X 


CATTGGTGGT 

wJ^X X \3w X X 


GACGGGACCA 


15660 


TTCATCATGC 


CGTTCAAAAT 


TTTAAAGACA 


CCATTCGTGA 

wfl X X V« W X W 


CTATCAAATA 

\^ X 1^ X w J^rv^ X r\ 


GGGATTATTC 

XJ^JXJ'X X X X X x^ 


15720 


CAGGTGGAAC 

^4^Uw A OU/VIW 


AGTCAATAAT 


TTTGCACGTG 

XXX wwnw w X \J 


TGTTAAGTAT 

X w X X <M*w X n X 


TCCTTTGAAA 

X WW XXX yjM^TMn. 


GAAGAAGACG 

\^ AUAiXJf^X»X7«XX^ X^ 


15780 


CCTTTGAGAC 

WW AAA \U^WflW 


CATTTTGGCA 


GGACAGACCA 

\3 wrliW«»wa* W W*» 


CACCAGTTGA 


CTTTGGGATG 

X^ XXX XJX^X^«X X X7 


GTAAATCAAG 


15840 


ATGTGATGAT 


TTCGACCTTA 

X X wwnw w X X 


ACGATTGGAC 


TATTAGCAGA 

X AX A A «XX7Xm*X7«X 


TACAGCAGCA 

X «XXm«XJX^*^X9 x^*x 


AATGTGACCC 


15900 


AACAGGAGAA 


ACAAT^AATAT 

«aX^»lsXB^SMa X *aX 


GGTCCTTTAG 


CCTTTACAAA 


GCAAT TTTTC 


CGCTTATTAA 


15960 


TGAAAAAGAA 


AAAATACAAA 


CTAAAAATTG 


ATGGCGATGA 


AAGACGTTGG 


CATGGCAAAG 


16020 


CCCAATTATT 


AACGATGACC 


ATGACGAATT 


CTGCTGGTGG 


CTTTACCAAT 


TTCGATGCCA 


16080 


ATGctACGCC 


AGATGACGGC 


GAAGTCCACA 


TTATCATTTT 


ACCTAAGTTA 


GTTTTTTACA 


16140 


AGTTTGTCTA 


TTATTTACCT 


AAAATTATCA 


GAGGACAATT 


AAATGAAATT 


CCTGGTGTCG 


16200 


TGTATTTCTC 


TGGGAGTCAA 


TTTAAAATTA 


GTGGCGAAAA 


AGACAAGAAA 


GTCCAAACAC 


16260 


GAACAGATGG 


CGACCCTACT 


GATGATTTAC 


CGATTAAAGT 


AACGGTGGAA 


GCTGGGGGAC 


16320 


TCAACGTGTT 


TGTTCCTGAA 


ACGTCCACAT 


CTACGAAAGA 


ATAAAAGAAA 


CAACTGTTAT 


16380 


TTTTTCGTGG 


AATAGATTTC 


TCATTTTGTG 


TACAATAGAA 


CAGAAGAGAG 


GCAGTTGAGT 


16440 


GAACAGTGAA 


ATGGGATGAA 


TATGTTGATA 


TATTCGATAC 


AGATGTCGTT 


TCTAAAATTA 


16500 


TTATTTTAAG- 


AAACTTGCCA 


ATTAATGAAG 


GCTGGATTGA 


CATCAATGAG 


TTAGCAGAAG 


16560 


AATTACAATT 


AAACAAAAAA 


TCAGTCAAAA 


AATACGTAGA 


ACTTCTAAAA 


GATGACATTA 


16620 


CGTATTATGT 


TGTGGATAAT 


GATGCTTCTT 


TGGCTTTTGA 


AAAGGGGCAT 


GGCTATCGCT 


16680 
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fjTr'T a./2PP a a 


oi,»l9 X X V^LrAutA 


aanTTTaTap 

AAk^X X X AXAV,* 


A a aTTT*i**p 

Ai:iV«AAX X X X X 


UbvauAiAbX i 


1 cn A f\ 
lb /4U 




a^TaaT^caa 


uAi X XAl Av«X 


XV#uAX X 


*i»a aT/sp a aTT 

1 AAluuAAxX 


. b lbL«UAi7U,b£L. 


lobUU 








i X l..^OOV#UV« 1 1 


oV.* X CAAAbiavi 


fnrpfp A ^ a ^ A A A 

X 1 lALAGAAA 


IbobU 


mm rp*p A 7\ 


Ai iiaGCsl X X A 


GiAAl TAAI A 


AAAvyXA\,^x iv? 


bAAAAx, IbAA 


^^^A itm^it It ^ 

GGCAATGAAG 


16920 


CGTATTTGCG 


CCATTTTGCT 


TACACTATTT 


TTATGGACTT 


A A m^^oni^ A m 

AATCCGTGAT 


ACTTGGCCTT 


16980 


TTGACTATAT 


It y^ik ft ^it It It 

ACAAGAA^CG 


^it it^m^^^A^^ 

GAAGTCGCAC 


A A A ^O^fP/^/^ A 

AAACCGTCCA 


AAAAATGATG 


GCTTTTTTCA 


17040 


ACGTAGAAGT 


TGCGGAAACG 


GTGACACGCC 


GAGTTAAATA 


TATGGTCGCT 


ATCTCAAAAG 


17100 


fvi ^^^^ rib Ik ^»m^^^> 

TCCGTAGTGC 


GCAAGGCGCC 


GAATATCGAC 


CTTCAGCCGA 


GCAAGAAACT 


TATTTAGAAG 


17160 


GTAATAAACA 


mntik ^^^^ik Ik 

TTACCAAGCC 


mm nt m m it ^ it 

TTTATAGAAG 


It ^ It m.^ Ik ^ik 

AGATGACAGC 


TACAACCCTT 


TTCCTTAACA 


17220 


AAAACGATTT 


AACTTTTGTG 


TACTTTTTCT 


TGTTAGCCAA 


TGACCAATTT 


TATCAAGATG 


17280 


AAGCCACTGC 


ACAAGCAATT 


TTAGCGCATT 


TTGATGAGAC 


TGAAGCACCT 


GTTTACATGT 


17340 


TGACTCATTT 


AGTCCAACAT 


TTCTTATTAG 


AAGAAATACA 


CATGACGCCG 


CAATTGGAGA 


17400 


AAATTCGTCC 


GCAAATTTTT 


TATTATTTGT 


TTGCTACACA 


TTTATTTGCC 


GAACTTTACT 


17460 


ATGTCCAAAA 


TGTGAAAATG 


TACAATGTCT 


TTTGGAATAA 


AATCAAAGAA 


AAATATCCGG 


17520 


TTCTTTTAGC 


AGAGTTAATT 


CAA6GTTTAA 


ATCAACTTTA 


CGAACAAACA 


GGCAATGAAT 


17580 


TATTTACAAA 


TAAAGAATTT 


TTGGCTTTGC 


GTTATTCTAG 


TGTTTTGGCG 


CTTTCCAATG 


17640 


ATTTAATTCA 


TCGCGAACAA 


AAAATTGTTT 


TAGGGTTAAA 


ATCAGGCTTG 


CCAGTACTGG 


17700 


AAGAAGAACG 


GGTGAAACTT 


TCGATTGAAA 


ATAATTTTCG 


TCGGTTTTAT 


GCATTGGAAG 


17760 


mmmmit ^^it mit 

TTTTACGATA 


CAGTGAAoCA 


AGCAAAGAAG 


AATGGCAAAA 


TATTGATTTA 


CTATTAGTGA 


17820 


CTTCTTTGGC 


AGATGTGGAA 


ACGGGCCATC 


^ It ^ It ^ It m^vm m 

CAGAGATCTT 


TTTATTTTCA 


AGTGAGTTAA 


17880 


CGTTTGAGCA 


It mit mit It m^< «fi> 

ATATAATCGT 


mm^it It Av^it It It 

TTGAACCAAA 


It It It m It ^mit It It 

AAATAGTAAA 


It It mm It It ^«^it ^ 

AATTAACGAG 


AAAAGAACCA 


17940 


CGTCACTTTA 


^ mit 'K It It % It mit 

ATAAAAAATA 


GGCTATTTTG 


ATGCACATAT 


CTGATAAATT 


GTAAAAAAAT 


18000 


mmrpip/^ip jt )t it it 

r 1 TTCTAAAA 


A /**n* A A A A A A A 

AGTAAAAAAA 


GACaAGAAGA 


It It 3t^^*m/*<it It m 

AAACGTCAAT 


It It s "tti It ^^m^m 

AAGTAGGTGT 


m^ It mm it ^^m it 

TGATTACCTA 


18060 


U r r ATTGACG 


TTTTCTCGTT 


GTTTTTTAAT 


Ik A A m A^>1*^J^^ 

AAATACTGCG 


TCAATTCTTC 


m^it It Ik n mm Ik m 

TGAAAATTAT 


18120 


U i. L.AftAAAv3(j 


TACTTTTTTT 


GCTTTTTTTA 


m^^mA It Tt m 7t it 

TCG T AAAT AA 


m It m^ n mm it 

T AGATG AT T A 


/^r^TT% It mm^ It m it 

GCTATTGATA 


IdIoO 




lAL. 1 ACTTTT 


TTTAACACaTC 


A /^T*f* A A A A A A 

AGTTAAAAAA 


TAT TGAGG T T 


TCT C ACAAGA 


1dz4U 


nl X 1 JL X^ X Al 


aTaa*pa/^AAA 
Ai AA 1 AvjAAA 


r» A A A r^r* A 
CiAUt^AAljrCKjA 


Ab VjAACab Ubo 


^ATA A^A*P^A 

GA r AAGATGA 


AAAAA^Af^AT* 

AAAAACAGAl 


lb JUU 


nnaafaaaTT 

VJvsrVn^MrVn X X 


r'aapaaaT/^T 
V3AA0AAAI L. 1 




A/^A A A A«T» A r*»P 

AUAAAA X Ab 1 


GCZAGGUATCL 


^ An**!* A A AA^f^A 

GATTAAAGGA 


lo JbU 


ATTGGTTGAA 


GCTTTAGAAT 


TGGAAGTCAC 


AAATCAAAAC 


TTGTTAAAGG 


TAACAAGTAT 


18420 


CTTACATATG 


AATCCAAAAT 


TCAAAAAAAT 


TTATGCATAC 


GAAGACAGTC 


GTGTTATTAC 


18480 


GCTGTATCAA 


CTTTTACAAA 


ATAAGCCGTT 


GGAAGTAACG 


GAATGAAAAG 


AGGGAAACAC 


18540 


ATGAAGAAAA 


CAATGCTCAT 


TCTTCTATTT 


GGTTTATTGA 


CAGTGGTTGG 


TCTGCCAATG 


18600 


GTGACCGAAG 


CGGTTGAACC 


AGTTAACGCA 


ACAGATACAA 


CGATTTTTGG 


TGCGCAAGCG 


18660 
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AHATPAAAPA 


OATAAGTTAC 


AGACGTTGCT 


GAGTCAGACA 


18720 






ATTTTTAPPA 
nx X X X xn^v*n 


PAGGGAAGTT 


ATGCTCTGTC 


AAAAGATATT 


18780 




r'AAATTATPA 


ATT A ATTH^^ P 


f^ATAPAAPAG 


GAGCCACTAT 


TTTACATAAT 

XXX X x 


18840 




r'APPI^ A T*pr» A 


nPTAAPAr:AP 


APAAPRTAPG 


GGACGAAAAC 


CAATGTAPGA 


18900 


IV TV 71 TV 'P A 

CTTCAAAAIA 


1 i\3d X i. ioA 


PC^^ATPAAT 


f^TC^APPTTAA 


AATTGACGAA 


TCAATTAACG 




^•PH^/^ JV RR^li 

CTAGCAAALrA 


A T A TP TTT T A 
AiAiL»l 1 1 1/i 


T A ATPP ATT A 
XMnXV^V^nX Xrl 


AAAGCATTPG 


TGGTCAATTT 


AAATGCAGAC 




ATTGGCCjTTA 


AuAX i xv#Au(9 


PAATATTTTP 


ATHPfZAf^APA 


CGGCACATAT 


GCAATCGGGA 




GGTGATfTlA 


TV C^rT* TV CCf^ A T* 

ACCliAliCuA i 


•r*P A P A TTPPT 


I^I^TTATTPAA 


CGCCAAGTCG 


CTTCCAGTAT 


1 Q1 40 


TV »P^n»/^ TV TV n^o 
ATGTCAGA rb 


1 AbAC A 1 (^b i 


/^/* A ^ A A TPT A 
bbAb AA X b 1 A 


TTT^^TTT A A 
XXX bb XXX rVr\ 


7WVGTGACAGA ATTAGATGCG 




ATTAAAAbTA 


CCAb 1 Cb 1 i b 


Ar* A/^TT Ar'r'A 
AbAb 1 X AbbA 


^Or* APT ATT A 
bbbAblAX lA 


CACGTTTGCA 


AACAGCCATT 


X 7£ O W 


GAAGCCGGTG 


CT AT T T CAGT 


TV r^CT* A A A A 

Abb TAA 1 bAA 


i^AAAATTAT/^ 

bAAAAi lAXb 


TATCCACCGG 


GGTTAATTCT 


X Z70^ U 


TTTAATATGC 


TG AAGG ATG T 


TV TV ^'P/^'P'T'^A A 

AAb rb I 1 bAA 


*^A T A ATTTTT 

bAi AAX XXXI 


TCTATTCACC 


CTATGATAAT 


X -7 JO U 


GAAAACTTGA 


ATGGTCTAGT 


GGGAGATCAx 


f^f^ TV TV 'P'P'T'TV T"P 

bbAA X 1 1 A X X 


TCCGTGGCGC 


CCAAAACATC 


1 A n 


ACGGTAGTAG 


GGAATCATTT 


ACGbGGACTA 


OTV TV TV TV rp A^/^fT»/-» 

CAAAATbbTC 


CAGCTGGCGG 


ATTTAAGTTT 


X^OUU 


AAaTCAGGCC 


GTAATATTAC 


TV TV *T*f^ TV TV T* 

GATTATGAAT 


TV TV n*n* A *pn«P#^^ 
AATTATTTbb 


GAAATACAGG 


TCTCATTATG 




TATGGAACAC 


CAGAGATCGG 


mmm^^^Tv iv TV 

TTTGGCAGAA 


ACZACAAbCAb 


AGGGGGCTAT 


TTCAGAACTC 


X7D4CU 


tctaactggt 


TAGTGGCTAA 


TAACATTTTT 


A T T/!V A A r^T 

b AT TbbAAb X 


ATTGGGACAA 


TCAATATGCA 




TV H ff/^/*/^ TV fT*f^/^ 

ATTGGGATGG 


TV TV ni> A ^TV TV 

AATAuAATCG 


A r» A T A r" A /^/^ A 

AbATAbAbbA 


A ATA A^A AT^ 

AAX AAbAAX b 


TGTTCAATGG 


CGTATTTATC 


1 Q74n 


AACAATwAGT 


rpm/-»mr¥» TV A/^T* A 

TTbTTAAb 1 A 


r**^ A T» A AT A TT 
bbAlAAXAX X 


PPCP A A A APP 
bbbbAAAAbb 


GACGTCGGGA ATTGTTAATC 


X 70 V w 


^/*"p TV f^Pt^r^f^r^f 
Gi^TAGTGGCG 


oTbbb 1 Tbbb 


Ar*r*AP A A AP^ 
AbbAbAAAbb 


A /If'l* TT T^5T/^ A 


AAGACAATAC 


TCGAGATGAT 


19860 


r*^/**r*<i>'Piv TV TV fV 
GGCCTTAAAA 


TV f^r^Cf^r^tS. TV TT 

AbbbbbAAi 1 


ATTPPTA/^AA 
AX X bb X AbAA 


A ATTC/Z AP A^; 
AAX X bbAbAb 


AGGCGGACTA 


TCGCTTAATG 


X i7^€m\S 


AuCAAL. A I 


bbbAAxwll 1 


A ATTTP APP A 
AAX X JlbnLrV^/i 


APAPTTTATf: 


AGCAATATAA AAATACGCCA 


19980 


TV 'Pfpor*^* ^1*1* TV 

Al TCCG&Tl A 


r*H. A A^P Ar^fPTT 

bAAAi Ab ill 


AbbAAbbb^ i 


PTP/2PT A PA A 


CGATTGTGCA AGGACAATCA 


20040 




AbbAAb 1 bb X 


Tr^/^TAAT APA 
XbbXAAXAw\ 


A ATP APPPP/^ 


ATGAAGCAGT 


CCCAGCCGCT 


20100 


AAAAiCvjlCA 


A 1 bbAb Abb 1 


r'TTA AATPAA 
bi i/\HAlb/Vfi 


ATPPnPPAAP 


AAAAAGTGAC 


TGTTCAATTA 


20160 


TV pfp rp TV TV 21 a 


b/\bbbib/\l 1 


r* PT A A P A /^T/^ 


aapptapppt: 


TGACGGTAGA 


GGCCCCTGCA 


20220 


a A 21 A a r:^*TT<^ 


ATTTnTPPPA 
/\ 1 X i o i 


(^PTAPAAAPP 


f^TTTATf^PTT 

U X X X A X VJv« X X 


CGATTGGTGA 


AGCAAACCAA 


20280 


TATar'AflTTT 
x/ax r&v.«x\\j X J. 1 




nPTATTPAPT 


nPPATTGGGC 


CTAAAACAAT 


TGTTCCTTCC 


20340 


TATTATCAGC 


AGGCGACACA 


ATTGTTAGCA 


GAAGGTCAGG 


AGAGTCAGGA 


CAAAACCCAA 


20400 


GAGCAAGTGG 


ATCAGTTAAC 


AAGTAATTTG 


CAATCTGCAA 


TGAAAGTATT 


AGTCAAAAAA 


20460 


GCCGATATCA 


•CGCTAGAGCG 


TACAGAAGCA 


GAAAATGAAC 


TGGCAAGCGT 


TCATAAATTA 


20520 


GACGAATCTG 


TATATACCAA 


AGACTCTTGG 


CAAGCCATGC 


AAGAAGCCTT 


AATAGATACA 


20580 


ACAACTGGCG 


AAGGAAGCTC 


TAAGCAGTTA 


CAACAACTGC 


TTGCTTGGTC 


AGATGAAGAA 


20640 
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TTATTAGAGC 


CAACACTTGG 


TGGCTTTAAA 


ACGCCTGCAG 


ATGCACAAAa 


GCGTATTAAT 


20700 


CAATTAACAC 


AAACCATAAA 


AACGGCGTTA 


CTTCTGTTAG 


TTGAAAAATC 


AACAGAGACA 


20760 


ACTAGTAACA 


CAAGCGAAAG 


CAGCACCTCA 


AGCACAACCA 


GTGAAACCAG 


TAATACAAGC 


20820 


GAAAGCAGCA 


CCTCAAGTAC 


AACAAGTGAA 


ACAAGCAATA 


CAAATGAAAG 


CAATACACCA 


20880 


ACTACAACCA 


GTAAAACCAG 


TAATACAAGC 


GAAAGCAGCA 


CCTCAAGCAC 


GACAAGTGAA 


20940 


APAAGCAATA 


CAAATGAAAG 


CAACACACCA 


AGTACAACTA 


GCGTIAACCAG 


TAATAPAAGP 




r;AAAGTAGTA 


CCTCAAGCAC 


GACAaGTGAA 


AGCAGTAGTA 


CCAGCGAAAG 

wwr%ww\SrW^^9 


CAGCACGCCA 

x*«»0 x<» mW^ 0 W 


21060 


a AfCACAA 


GTCAGAGCAG 


TAGTACCAGC 


GAAAGCAGCA 


CvTCAAGCAC 


GACAAGTGAA 


21120 


APCAfiTAATA 


CAAGCf^AAAf; 


PAGPAPGPCA 


AGCACGACAA 


GTGAGAGCAG 


TAGTACGAGT 
X no X nwono x 


21180 

X XO V 




pnnpAAr^ap 


aaPPAGPGAA 


APPAGTAGTA 


PPAGPGAAAG 


TAGTAPPTPA 
X AO X AVi\« X 




21 r^T a r"r^ 11 r* IX a 

MVS 1 \3r\\^r\r\ 


f^Tr^aaapaar: 


r»A aTar^aaaT 

V-AAl A\^MAA1 


rza AACPA APA 
oMMMoV>MMv-M 


PAPPAAGTAP 
VfMV^^AAo X AV^ 


aappaGTaa A 

AMLrV.^0 X AAA 


91 "inn 


ML^MMo 1 M.VJ 1 


ppa<^pf^aaafz 


TafZTGPTTPa 
X Ao i X 1 


acTar'aapPA 

Mo X M\-.MMl^V>rM 


ppppaappaa 

OV- 0 V^MM\^\_MM 


paaTapaaGT 

VrfMM 1 Mv^MMO X 


^ X^ WV/ 




rr^ppaanpap 


TnTaaappaa 

i O 1 AAAV^lj AM 


anTanppaaa 

Mo 1 M0V-.OMMM 


ppaaapnppa 

oV^MMMoo^V^M 


PaaTTPTGTP 
OMM 1 1 X 0 X 


91 49n 




T a a <r:Tr*T a a 

J. /ioMvj i v> 1 Art 


TpaanaTPPT 

X ^AMoAi L.L' 1 


aaTnap/^PTP 

MM i oMV^O^ X \^ 


aaaPTaaTTP 

MMMVj i. MM I 


a a afiPPPTPT 

AMAoV-.^V# 1 1 


91 dpn 

^ X 1 0 




f^Tr* a a a r*a a a 


/^/zaaTPaPTa 

OUAAl \.«Au X A 


GpaGAAaapp 

OV^AoAMMML< V.# 


AAGPAaPTAa 
MMoC>AMV^ X MA 


apaGaTapAG 

MV^MoM X M^^MO 


91 SdO 




A A TP/^ TP A flfl 


AAf^AGTPAAA 
AAlmfAo i ^rfAAA 


AAAGPGGATA 
MMMOV^OOM X A 


ATAPGAPPAA 
A X A^o ACtV^AA 


A ATTGPP A AG 
AAX 1 OV^VfMMO 


91 fino 


AAAAAATTPr* 


PP A A A APACn 


GGAAPAATPT 


TPAGPAGTTG 


GGAGPTTPPT 

OOnOv^ X X \*\^ X 


AGGPPTGAGP 


21660 


X X X X X AXV^OX 


T A^SPr'A T AfiT* 


A APPT ATTGT 
AA^LrlAl XVsX 


TTTAAAGTGA 
X X XAAAVsXoA 


A A AG ATA APT 
MMMOM X MA\« X 


PTTTAAAAAP 

V« X X XMMMnMV^ 


91 790 


C/tATAA^TAA 


AP A TPPT^ A A 


APAAGAAATA 
ALmAAo AAA 1 A 


OuA^X XoX X X 


P A GG ATGTTT 
^AouAXoX X X 


TPATATAAAA 


21780 


TTAAAAf^AAf 


xnv.*A\3X X X X X 


AHTGAAAAAA 
MVa X uAnnMnn 


PAPAPAAAAG 


TAPTTGAAAT 
X MVif X X vsMrm x 


TPTGTGATAT 
X v^x ox omxcix 


21840 


AACGGTAAAG 


TAAAt^Anr^A 


PAAAAAAPAA 


AGGAAGTGAA 


ATGTTTGTAT 
nxox X xoxj^x 


ACACCAGATG 


21900 


AAATTTTAfSn 
rvmx X X X AvfU 


XAX XAlaXAX X 


G A A A ATGGTP 

OMMMMX OV3 X V« 


AGAAAAAAAT 


TPAAAAGPPT 


TTGGTAGPAA 
X X 00 X n\3\^t%r\ 


21960 


AACTPATTTT 
rin\.*x^«/\x XXX 


AGCATTTATP 

£\\3\3t\ X X X n X v* 




TGATTTPPPT 

X \3/\ XXX WV^Vf X 


PGGPTATTTG 

V«\7V3V» X n X X X V3 


GCTTATGTGC 

O^X Xf^XOXO^ 


22020 


w X \J X X X 


CTCTATTCCA 


GCAGATTTAG 


CCAGTGTTCA 

^\^fW9 X w X X 


AGCTTTAGTG 
nwv« XXX n,^s x w 


GGAGCTGCGG 


22080 


TTTTCCCAAT 

X X X X ^ X 


TCGTPTAATT 

X X V# X X X 


GTTATTTTAA 
\9 X x^x X X xnn 


TGGCTGGTGG 


TGAGTTAATC 


ACAGGAAATA 


22140 


TGATGGCGGT 


GAGTACGGCT 


TGGTTTGCTA 


AAAAGGTCTC 


TTTCAGAGAA 


TTACTCGTGA 


22200 


ACTGGGTCAC 


AATTACATTG 


GCAAATATGG 


TTGGCGCCCT 


ATTTGTTGCC 


TATTTTTTTG 


22260 


GCCATTATTT 


AGGTTTAACC 


AGTAGCGGCG 


CCTATTTAAA 


TCAGGTTCTT 


TCTCTTGCAG 


22320 


AACATAAAGT 


TGCAGCAACT 


TGGCTTCAAG 


CAATTATTTC 


AGGTATTGGC 


TGTAACTGGT 


22380 


TCGTAGGTCT 


GGCCTTATGG 


CTTTGTTATG 


GTGCGAAAGA 


TGCAGCGGGG 


AAACTATTAG 


22440 


GCATCTGGTT 


TCCAGTAATG 


ATTTTTGTTG 


CAATTGGCTT 


TCAGCATAGT 


GTGGCGAATG 


22500 


CATTCATCAT 


TCCAGCCGCA 


ATTTTTGAAG 


GCGGCGTTCA 


GTGGGGAGAC 


TTTTTCCGTA 


22560 


ATTTTATTTT 


TGTTTATTTA 


GGTAATATCA 


TTGGTGGTGC 


CATTTTTGTT 


TCTGGCTTTT 


22620 
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ACTATTTAAG 


CTATCGCAAA 


CATTAAATCC 


ATTTAGCAGA 


TAAAAACCTA 


CACAAAGGCT 


22680 


AAAIGCCITT 


GTGCAGGTTT 


TTTCTTTTTA 


AACAAAAAGG 


AAAACGGTTG 


AAAAATTATT 


22740 


GCTAGATTTT 


TAAAACAATG 


TTACATTAAT 


GTTACAAAAA 


GATTTCTGGA 


GGTAACGAAA 


22800 


AATGAAAAAA 


ATTGCTTTGT 


TCAGTATGTT 


AACGTTCAGT 


GTATTGTCTT 

\^ A 1* A A \^ A A A 


TAAGTCTAGC 


22860 


AGGATGTGGA AACAAAAAAA 


CAGCAaGCAC 


AAATGATTCT 

«V]b*A A %7«A A A A 


AAGCCAAAGC 


AAGAAACAAA 




GAAAGCCACG 


CAGAAATCCT 


CTAGCCAACA 


AGAAATGAAA 


AGTAGTCATT 


CGTCTGTCAC 




GGGTCAAAAT 


TCTAATGTGA 


CAGGGGAAAA 


TCCGTCAGAA 


AATGTCACCC 


AGCCTTPTCr 




AGGAACTGAT 


GAAACGAATG 


AAGTCCCTCA 


AAAPrAAGCA 


PPTGATACAA 


ACATTACAAT 




TACCAATGTT 


GTTTTCAATC 




TGAAATTAAT 
xunrmx x/mx 


GGTAPTAPAT 

\3V9 X An^ X AV« A X 


TAPPTAATGP 
X r\\^\^ X nn X OS* 




AACCATTACA 


GCAACGGTAG 


TCflflTflATfZr' 


X 1 ^ 1 uVr ALr An 


nP af2/iT<^ TTT 


TTT ATfZPrZ/^a 
1 1 XAlVa^ouiA 




TGCCAATGGC 


AATTTTACAG 


1 riri 1 1 tvj 1 


^ AVjr AVj^ VjuuA 


ppr^apTapTP 


aaTTaaTPfZP 

AAl lAAlV^ov. 




AACCGTTGAT 


CAACGGAATA 


o 1 Ln-riiv^V- -I o 1 




aTTPpaaPTT 

All OL.AAV3 1 1 


pappapaapa 

t^AvjoAV-AAvjA 


9*^ "^AH 


AGCAGCGCTT 


TCTTTTAGCA 


A T a T T bl r* <^ 21 T 


T c a TPfr; a a a 

X oA X ^^»*VjAAA 


paaripnapaa 

v,-AAo\3WjAV^AA 


TTTPT^^/^Taa 
1 1 IV^iVJOlAA 




AACAGCACCG 


AATGCAACTA 


1 1 1 X r\\j 1 o X ^ 


apfiTr^anaT 


ft a T f^pn Pfzr:/^ 

oAl Lj^La^Ljoo 


T<^aTTTTar:p 

1 VjA 1x11 AoV^ 




AAGTTTTACT 


GCGGATGCCC 


rlrVuV^OMn 1 1 1 


papanppa^^T 


aaTTTanTTP 

AA XIX AO X 1 


ppnppapaaa 


9*^ ^9n 


AAATCGCTTA 


GATGTTACGT 




AATAf^^^nPA 
AA X AV3U\9A^ A 


PPTTAPTTi^T 
X X A^ X X \3 X 


TT^^ATTTAPP 
X XoAX X XA^ly* 




AAATTAAAAT 


AGAAGAGAAG 


TCATPAAAGf 


CACCAACC^CT 


AGTTAGPGTT 

A\9X X AV9Va>V9X X 


^^GTG^^PTTTG 

U\3XU\J^X X XV9 


23640 


ATGGTTTAAG 


CTTCTTCTTT 


X X 1 /\X UAX X V3 


Pf^f^ATATfSTA 

V^\30A X AX V9 X A 


AATAfZAT^ZGT 
AA X AVjA X \s\3 X 


ATTTTTTf^A A 
AX X 1 X X XuAA 


9*1700 


ATACCTAATC 


GATCGGCAAT 


PTTTATGAPA 


UAAX^X X XOA 


GATTAAAAAT 
UAX XAAAAAX 


TPPTTGATPA 
XV<*V«X XuAXV^A 


o-aTcn 


TATAAACGAA 


TAACAATTTC 


X X XAX X X xvan 


TTP A A ATtZTtZ 
X X V.rAAAX O XV3 


aTAPATTTGG 
AXA^AX X Xul3 


ATPGPTTTPT 
AXv^ul^X X Xv^X 




ACTGCTTGTT 


TTACTTTTGT 


f*acTr;aAr;TA 

y^rVj X X f\ 


f^Tft ATT A ATT 
UXUAX XAAX X 


PGTPAATaTT 


PTPA^AaAAA 
\3 1 LtAVjAAAAA 




TGTTCA6AAT 


TTTTTUVGCTC 


TTnnf5Af!ATA 


TPATTTTfITT 

X V«AX X X XOX X 


GTPPTGAAGG 


TAATAAGTPT 
X AA X AAU X ^ X 




TGCAAGTAAT 


CAATTAATGG 


CATTTCTAAA 
^^^x X xwx#vm 


TGC AT AT TG A 


TACACATPAA 


TPf A AT AAT A 


24000 


CGATCATTTT 


CACCGATAAC 


TGGAATGGTG 


GAAGATTTTA 


AAATAACCCC 


TTTTTGATTT 

X X X X X wAX X X 


24060 


TTATTGTAGT 


AGGGTTTCAC 


AGCGTAATCA 


TGTTCCGACT 


CTACTTGACT 

\^ X €a\* X X ysns^ x 


TAACATTTTT 

X #1r%VM%X X X X X 


24120 


AACGCCATAT 


CAGTAATGGG 


GGCACCTTCt 


TGCGGTTACT 


GAAATGGCCA 


TTGATGACTT 


24180 


TGATAGCCGA 


ACTTTCCAAA 


TTTTCTAAAC 


TATGAAGAAT 


GACTTCATGG 


GTACTACCAA 


24240 


AGAAGTTCCC 


TAATCCATCA 


AGCATTAAGG 


CATGTGAATG 


TAAGATTTGT 


TTATCTACTT 


24300 


CACTTAAACG 


TAAATATGAC 


TCCaAGTGAT 


ATCCTCCCCT 


ACACTGAAAT 


TAATTTTAGT 


24360 


GTAATACTTT 


TTTAATGGAA 


AAGCAATAAG 


ATTCTTTACG 


TtAAAAAAAc 


nCaTtGACTA 


24420 


AAcTTTkGTT " 


TAGTCGTATA 


CyCkGAATGT 


GGAATGAAAT 


TTTTGTTACT 


CATTGTACTT 


24480 


TAGTTCATCG 


GAATGCTCGC 


TCATTCTTTC 


ACTTGTTGaG 


CaATAAAAAt 


TTTaTTCAGG 


24540 


AGTACCATCC 


GTTATAAAAA 


AATATAGTTG 


GAGGCAAAGA 


AATGAAAACG 


TTATCAGGAA 


24600 
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TCTTGATGTT 


AGTCTTAGTT 


TTATGTGGCA 


TGTCCCTATT 


CTCTTTTAAA 


GCACCGCGAG 


24660 


GGAAAAAATC 


TTTAAGTGCT 


TTGTCTGGT.G. 


CAGCTXGJGC 


CACCTTTCTA 


CCAGAGGCGT 


24720 


TTTTGCGGTA 


TGCAGTAGGC 


GGCGTTTTTC 


ATTTAGATTA 


TGTGTCGCAA 


ATTGGTGAAA 


24780 


CTATGGGTAG 


TCTAGGTGGT 


TTAGCAGCTG 


GAGCTCTTGT 


GCCGCTTGCT 


TTCGGGATTA 


24840 


GTCCAGTTTT 


TTCAATTATT 


TTAGGTCTGT 


CATTGATTAA 


AGTCAGCTTG 


TTACCAGCCT 


24900 


TTATTACCGC 


GTATCTTTTA 

X X X X X X *x 


TCTTTTGTTG 


TTGAGCAGAT 


GCAAAAACGT 


ATTCCAGAAG 




GGATTGACTT 


ATTAGTCGTT 


ATTTTAGTCA 


TTCCAGTCGC 


TGCT TCGTT A 


GTTGCTGGTT 

V# X X V*V*P X \Jf\Jf X X 


25020 


TTATTCAACC 


CGTTGTATTA 

X^XJ X X\JX<^X X** 


GGTGTCCTAG 


AAGTTATTGG 


GGGCACGATT 

xJ\J X X 


TTAAGTGCAG 


25080 


TAGATGGCAA 


TCC7VTATGTG 


ATGGGCGCAA 


TTTTAGGTGC 


TATTATCCCG 


ATTGTTGGTA 


25140 


TGACACCGTT 


AAGTTCAATG 


GTTTTAACTG 


CCTTGATTGG 


TTTGACTGGT 


GTACCAATGG 


25200 


CTGTCGGTGC 


GTTGACTTGT 

\^ X X X X V7 X 


TACGGCAGTT 


CCATTGTCAA 


TGCGGCGCTA 


TTTAAAAAGT 


25260 


TAAAACTAGG 


CACAGCTTCA 


ACCCCGTTAG 


CTGTGGCAAT 


TGAGCCATTA 


ACACAAGTCG 


25320 


ATATCATCAG 


TTCCAATCCA 


ATTCCTATTT 

«x X X V^V^ X £^ XXX 


ACGCAACGAA 


TTTATTTTPA 

XXX rX X X X X 


GGAATGGTTA 


25380 


GTGGCATTGT 


AG TGACCTTC 


TTTGGCTTAA 

XXX w Ul V<r X X TXTk 


AAGTACCTGT 


GAPAGGAATG 


GCAArAPrGT 


254 40 


GGGCTGGTTT 


ACTTCTGArA 


TTAGGTAACA 


AT GCC ATT C A 


AArG APT T T A 


TT A GC AGT AG 


25500 


X X /AX Va^nVi^ 




wx\3r\x\7x X xv? 


GATTTTTAGG 


TGPGrTGGTr! 

X V7 WV3Vi# X WW X W 


TTTAAACGTT 
X X x/i/mw\3X X 


25560 


ACA6AATTCA 




AATACACCAC 


AATTTGAAGG 


aACGACGGAA 


GTAACCAGTG 


25620 


AAAAAGAAAA 


AGAAGCAGTA 


AAAACCAAAG 


GAGGAGTTAC 


GTATGAACAC 


AGCTATGGCC 


25680 


ATCAATAAAC 


CTAAAAAAGT 


GAACAAAGTA 


ATGAACTACA 


AAAGTTGTTT 

•vwr»w X X XXX 


TGACATTATC 

X ^MTW^** X X »» X W 


25740 


GGTCCTATTA 


TGATTGGTCC 


TTCTAGTTCG 


CATACAGCTG 


GGGCCTTAGC 


GATTGGCTTA 


25800 


GCGGCAAGAA 


AGTTATTCGG 


CGGAACGCAG 


AAAAAATTGT 


6ATTAAATAC 


TATGAATCTT 


25860 


TTGCAGACAC 


GCATAAAGGA 


CATGGGACTG 


ATTTTGCAAT 


TATTGGCGGA 


ATTCTTGGAC 


25920 


TAGCAGCAGA 


TGATGCCAAC 


GTAACACAAT 


CCATTGAACT 


TGCGGAAGAA 


CAAGGAATTG 


25980 


AGATTCAATT 


TTTAGAAATG 


AGTGAAGAAA 


GTCCAGTGAA 


GCATGCTAAT 


ACAGCATGTG 


26040 


TGACACTCAG 


TGATGCAACG 


CACGAAATTC 


ATTTAACAGG 


AATTTCTGTA 


GGTGGCGGCA 


26100 


CTATCGAGGT 


GAAATACATT 


GAATTAGACG 


GTTTCAATGT 


CCAACTACAT 


GGTCCTTTAC 


26160 


CTATTTTGTT 


AGTCATTAAT 


CAAGAAGAGC 


AGGCGATGCA 


AGCTTTTAAA 


GATACGTTAC 


26220 


AAAAAAATAA 


TATTCAAGTG 


AATGCGGTGT 


CTCGTTATGT 


GGAAGGGAAT 


CAAATTTTAT 


26280 


TCATCTTTGA 


CTTAGATTCG 


GCACCAATTT 


CCTCGGTGAA 


AGAACAACTG 


TTTTCACTAG 


26340 


ATGACACATC 


GAAAATAATC 


TTATTATGAT 


AGGA6TTTGA 


CATATGTATC 


TTTCGATTGA 


26400 


AGAATTTATC- 


GATAAAGCTG 


AAAAAACAGG 


AAAAAAAATC 


TCTGATTTAA 


TGATTGAGCA 


• 26460 


AGAAATGGAA 


CGTAGCCAAA 


AAAGCTATGA 


AGAAATTTGG 


ACACAGATGG 


GTCAAAATTT 


26520 


GGATGTCATG 


GATGCTGCTG 


TCAAACGCTC 


GCAAGAAGGA 


GCAGGCGTTT 


TTTCTCCTAC 


26580 
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/■AAAAftA 1 


CGCGCAgcTG 


GCAAAACGTT 


26640 


IV Tr* 21 T r* li T 




k« 1 oWw 1 X w\ 


aPPPPPIkPT21 


GGTACTAATG AAGTCAATGC 








LJiAW/oCCAAL* 


LtuC X X VaVi^ 


AGTGGCACCT 


TACCAGGGGT 




TTTAACAtaUv^ 


Al lAAAAAlA 


CUX XuAAX X X 


AAV«V7\.^u X OA X 


CAACAAATTC 


GTTTCTTGTT 




TACCAGTTCT 


TTATTTGGAA 


TGGTCACAGC 


ik/^iMkl»PPP 


ATGATTGCTG 


GTGCTGTTGG 


zbooU 


TGGATGTCAA 


GCCGAAGTAG 


GCAGTGCTTC 


n /^r*r»ii'npppp 


GCCGCTGCAG 


CTGTTGAAGC 




GGCTGGCGGA 


Ik ^ ^^^^ 

ACACCACGCC 


AATCTTCTGA 


AGCTTT TGCT 


ATGGCTTTAG 


GGAATTTATT 


/ /uuu 


AGGTTTAGTT 


TGTGACCCAG 


TGGCTGGGTT 


AGTCGAAATT 


CCTTGTGTCA 


AACGAAATAC 


^ /UbU 


AGTAGGTGCA 


GGCAATGCGT 


TAATCGCTGC 


AGATATGGCT 


TTAGCTGGAA 


TTGTAAATAA 




AATACCAGCA 


GATGAAGTGG 


TGGAAGCCAT 


GAATAAAGTC 


GGCCGTCAAT 


TACCACGAGA 


27180 


ATTGAGAGAA 


ACGGGTTTAG 


GTGGCTTAGC 


TGGCACAGCG 


ACTGGCCAAC 


GAATGAAGAA 


27240 


TGAAATTTTT 


GaAAAAGTAA 


AATTCACCAT 


TGnTTAAAAT 


TTAATCATGA 


AGAGGaAtAa 


27300 


yGCTGGTTGA 


AAAATTCGCA 


ATGTGAGTGT 


CAGAAAGTAA 


ACCGGTnGGC 


nAGAGTTAnC 


27360 



(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 



CAGAAACTGA 


ACAAAGTGAA 


TTCCCTTTTG 


CGATTCACnA 


AAGTCACATT 


TTCAcGcAGG 


60 


tTTaAatCTt 


CCTGAAaaTT 


TtAAaGGTaT 


TTTTttGCTt 


GAaGAaaGTT 


ATTATaCTTT 


120 


AAATGGAAAA 


GAACGATTCA 


AATCAGATAT 


CTTTCTCTTT 


GAAGTAAATT 


CGTCAAGTAA 


180 


CGTTAAATTA 


AGTTCCATAA 


CTGTTCCTAA 


AAACTATGCT 


AATAAGAAAT 


ATGAAGAAAT 


240 


TGAATCTATT 


CCTTTTTCTG 


AAATGACCAT 


TTCAGAAAAA 


TTTGTTCCGA 


TTGTTTACAA 


300 


AAAAGAGAAT 


GAAACTTTTA 


TTGGTCAATC 


TGAAAGTAAA 


TTTACAGAGA 


AGAATCATTT 


360 


CATATTAAAA 


CAAACGATAT 


CTGAAAAAAA 


ATTGATAATC 


AAAGAAGAGA 


TAATGAGAGG 


420 


AAATAAACGG 


ATATTCGGAT 


TCGATAAACC 


TATCGAATAC 


GTAAGAGTTA AAGAAAATTA 


480 


AAAAAGCTTT 


TTUU^GAATA 


TAATTTAACA 


GTGAAGTAAT 


TTCTTTGTAA 


AATAGTAATT 


540 


ACTTCACTGT 


TATCATATTA 


ATAAAGATTT 


TCATCATTTC 


TGAAAATATC 


TTCTCTTTCC 


600 


AAGAAGAATG 


CTAATTCAGA 


TGTGTGTGGT 


TAAATTGATA 


GTCTGCAACA 


TTAAACATTT 


660 


GACTGATTCA 


CTTTCC7VAAA 


TTCTATTATA 


CCAGTTAAAA 


TCGCTTAAAT 


AGAAATACAC 


720 


TTCTAGCAAC 


ATGTTCCATA 


GTAAAAGTAG 


CACTTGTTAA 


AAATAACTAA 


TCTGCAACTT 


780 


ACTATCCTTG 


AAAATAAAAT 


TCTCAAACAC 


TTTACAAACA 


ATTATTTTTT 


AGGAAGATGG 


840 


TTTGTTATTC 


TTATATTTCG 


AGAAAAATAA 


AAATAGCAAA 


AGATACATTA 


CGATACAACT 


900 
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ATACTATCAR TACAAAATTA GArACCTCnT CTGCCCTTTG TTCAGTGATA AAAGCAATTG 960 

AGTTAGGAAT AATARATGTT~CACCTTa-AT AGACTTATTT ATTCAGTCAG CTAAACGCTG 1020 

TTAGATACAG CTATAGAAAT CTiJiTTP-kG ACATAGTTAG ATAAACAACT ATCCTTTATA 1080 

AAAATTGGTT CCATGTCTTT AGTACAA.---A GAGTACTATC ATTATTTAAT TTAAGTAAAT 1140 

AACGGATTTC AGAACTCATC GTTGATTCSA CGAGCACTCT TATTCAATTG GATAAGCGTT 1200 

ATCTCCTCTT TTTTAAAGTA ATTrGTATTC AGCAACCTAG AATATTGATA GTGCCGATTA 1260 

TTTAGACAGC TATCCS6ATA ACTTGGft&rA . ACTGAAACTA GAGTAAAAAA TCTAATAAGG 1320 

TGCTTTTAAA ATATCATCAT TTAAGRTACG AAATCGCATA CCCCGAAAAA TATCTGCTCT 1380 

CTTTA6TTCT ACAATTTCTG ATATCCAA-nT AfiTTCAATTT GAGTTAACGC TGGAAATGGT 1440 

GATTCATCTT CTGCTTTGAT CAGCTGTTirC AATGTAGCAG TCGTTACTGT TC6TGGAGAG 1500 

AAGGTGAAGT ATTGCGTTGC TATCGCATCA CTCGTAGTGA ATATTTCTTC TGAACCATCG 1560 

GAAAAGGCTA TGGTTACTTG TTTCCAATiyv CTATCATGGG GAAAATCAGC ACGTAATACC 1620 

AATCCAAGCT TATCGAGTTC AACTTCTCSG CCAAAATCAA TTGTTATTTC AGCATCTTTT 1680 

TGCTGATTGA TCCCCCAAGA TTGATAGGSG TACGATCCAT GATCTCTGIT GGCCAGAATT 1740 

CCATCAATTG CATTTCTAGC AAAAAAAG7C GCATCGTTTC GTGTTTCAAC ATTTGCATAA 1800 

GCGTGGGGAT AAACATTGGC CACATCTT7T TGATCATGAG GATTGAGCGC CCAGTTACGA 1860 
TAACTAGTCT GTTCTTCAGC AAAAGCCT7T CTGACAAAAA AGTAATGCTT CTCGTCAAGA 1920 
AAAGCCGCTG GTGGATAGGC TTCTCGCA.-.T TTTTCCGTCG TAGGAATTTC ATAGACCCAC 1980 
TCGCCTTTTG ATAAATACAC TARAGAGGAT CCTAAAGCAT TGTCTAGTTG AACATAAACT 2040 
AATTGTTTTT CTGCGGTTAC AACTTTAA^.T TTATCTCCTG CTTGAAAAGC ATACTTTTTC 2100 
GTTGCCAAAA CTGCTTCATC CACTCCaCrG CTGCACATAA CTACTTCATT TTCTTGATTA 2160 
ATAATT6ATA ATTCTATTTG CATaCTTTCT TCGCTCCTTT TACTTTTTCT TTAAAAACTT 2220 
ACCCGCAATA AATAAAAGGA 6AGGCGGCCT TAACCGTTTC CTCCCCTCTT TATTCTCTTA 2280 
AGGTTGTTTA CCAATATAAG CTAAAATTCC GCCATCAACA TACAGAACAT GACCATTTAC 2340 
AAAGTCAGAG GCCTGTGAGG CAAGAAAGAC T6ATGGACCT GCAAGATCGA CCGGATCACC 2400 
CCAGCGTGCT GCTGGCGTAC GACCGACGAT AAATTGGTCA AATGGATGAC GTTCCCCGTT 2460 
TTCTTGCAGC TCTCTCAGCG GCGCTGTTTG CGGTGTCGCA ATATAACCAG 6ACCGATACC 2520 
GTTGCACTGG ATATTGAATT GACCATATTC AGAAGCAATA TTTTTTGTTA GCATCTTCAA 2580 
ACCACCCTTT GCCGCGGCAT AAGCACTAJVC AGTCTCACGA CCTAATTCAC TCATCATTGA 2640 
GCAAATATTA ATAATTTTAC CCCCGCCTTT CTCGATCATC TCAGGAATAA CCGCTTTTGA 2700 
CATAATAAAC GGCGCATTCA AATCAACA?C GATGACTTGT CGGAAATCTG TCGCTGACAT 2760 
ATCTGTCATT GGAATTCGTT TTATGATTCC AGCATTATTA ACTAAAATAT CGATT6AACC 2820 
AACCTCTTCT TTGArTTGCT TGACCATTGC TTGAACTGCT TTTTCATCCG TTACGTCACA 2880 
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GACATAGCCT 
TTCTTGAGTC 
TTCAAAGCCA 
CATATCCATA 
ATTGGAACCA 
GAATAGTTGC 
TCATTCCCCA 
GTATCTTCCT 
GGCATGGTAT 
CTTTCACAAA 
TCCCCAGTTT 
TGCGCCGGAG 
ATCACTTGTC 
CGACCGTCGA 
TCCAAGAAAT 
GGTGTTACAC 
GGTACA7UVTA 
ATATCTTTTG 
TACTTAATAT 
TTTCTGAAAA 
CAAATTCAAC 
CAAAAGCATC 
CATAATACGT 
CGATCCTTTT 
AATAATAAAC 
CCAATTCTCC 
CTGCTGCCTG 
ATTTCTTTGC 
CTGTAATGCC 
GAAACTCTGT 
GTTCACCGAT 
AATCACTCAT 
CAGGCACTTT 



TTTGCTTCAA 
AAATTATTGA 
ATTCCATAAA 
TTAAAATCCA 
TATCCATGTC 
TTGTTCCTAC 
TTACCAGATG 
CCATATCAAA 
TCCAAGAACT 
TATTGGGATG 
CCATTGGCTT 
CAGAACTAAT 
GAGTTTCTTT 
TTTCAATAAA 
AATCGACACC 
CGCCAAAGAT 
CTTTTTCAAC 
GACTGTGCGT 
TTAACGAACG 
TTGATTACAA 
GGTTCTTTGC 
ACCTCCGCCA 
ATTCTCCAGC 
CGTTGAATAG 
GAATCCTGAA 
AGCTGAACAA 
ACTCCATAGT 
GGCTTTCACT 
TGAAATATGA 
TACTTCCGCA 
CCCCGTTTCC 
CACACCGTTT 
ACTTGCAAAA 



TGCCTTCTGC 
AAACGATCGT 
CTGCGCCCGT 
TCTATTCTCC 
GGTATAAGTG 
ACCTGAATGG 
CTTTGTCTCA 
GTAAACATAC 
GCCCGGCTCT 
AATATATTGA 
AATCTCATCA 
GTAA/^TTTC 
CCCGACATAA 
ACCCGGACCG 
TAATTCTTTG 
CATGCGATCA 
TAAAAATTCT 
ATATCTCGTA 
ATTCGACCAG 
TCCCCGTGAA 
CCGTCAAAAT 
ACTCGATCAA 
CATAAAGTCC 
AAGCATTTAA 
TTCATCCGGT 
TAATCGACAT 
TTCCCACGAT 
AATTCAACCG 
AATAGCTCAA 
AAGCTTGACC 
ATGTAATAAG 
CGACGCAAAT 
GAAACCTGAT 



TTTGTAATTT 
TGCTCCTGCT 
AATCAATGCG 
CACTTTCATT 
ATATTCTCTC 
ATCGACCAAC 
GCTGGGTCAC 
GCTTCCATTC 
AAAATCGTAT 
TAAATTTTAC 
ATACTGATTT 
GCAGGATCAG 
TAGCCATCTT 
CCAATATTAA 
TTCAAAATAA 
TTGTGGGTAT 
TTCCTCAATT 
TCCATATTCT 
AACCTGCTGT 
CAGTATGCTT 
CTGAGATCAA 
CGATTGGATT 
CTACCAACTG 
TATTGGGAAA 
CTTCTGGAAT 
AAGGCAAGAT 
AATTAATATC 
TCCATTGACA 
TCCCAGAAAA 
CCGCTCGATC 
TTCCTAAACG 
GTTTCACTGC 
GACCAAAATT 



GCCGATCCTT 
GCCGCTAATG 
ACTTTCCCAT 
CGTTCTATTT 
CACACATCGC 
TCGGTGAAAT 
CCATCATATG 
GACGTTCATG 
AGCCCATTTG 
GTTCATTTAA 
TTACATTTGG 
CTGCATCTTC 
GCTTCTTCAT 
TGACTCCTAA 
TCTCTAACGA 
ACGTCAAACT 
CTTCTGTTGA 
GCATGTTTAT 
CAAAAATTGT 
CAATGCTGAT 
TCCATGCAAG 
TATTTTATGA 
ATTGTTGCTA 
CTGCTGCTGC 
CTCCAATAAA 
TTCTTTTAAT 
AAAGCTGACT 
CCATTGCACT 
AAGCTTTTCA 
ATAAGTTACA 
CGcTCCGCCA 
CGCTTGACCT 
GGCTAATGAA 



CATCGACTGA 
ATTTGGCGAT 
CTAAGCGAAA 
TAGTTCATCC 
CCAGATAAAT 
CACGGCTTGT 
GAAAATTTTT 
CGTATGACAC 
CAGCTGACAG 
TGTCAATGCT 
GTACTTGTGA 
TGAAGAGAAA 
CGCTTCTTTT 
TTCACGCCGT 
CTCAGTCGTC 
GACTTCATTA 
ATAATGACTA 
TTCCCTCTTT 
TCGATCTCTA 
GCAGCCGTAG 
ATCCCCGCCG 
ACTTCAGACT 
GCCGAAGAAA 
ATTTGTTGAT 
TAACGCGCAT 
GCTTTCCCAG 
AACACACCAA 
GACAAAGCTG 
TAAGACCACG 
CTAGCAGCAC 
AGTTGTACAA 
AGTGAGTTAT 
ATAGCTACAT 



2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 
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TTGCCTCGCC 


GCCGCCATAG 


TGACAGATCA ATTGCTGTGT 


TTCGTTAAGA 


CGAACACCTA 


4920 


TATCCGTTGA AAGTCGCAAC 


ATAATTTCGC 


CTAAAGTTAG 


TATGTTTGCC 


ATAATTAGTT 


4980 


TTCCTTTATT 


TCTCTAAATT 


TGGCCATATA 


TTGTTTTGCC 


AAAGCCGTTA 


CTTTACCAAA 


5040 


CTCACCAGTT 


TTAGCGTGAG 


CTAATAAGTT 


TCCGCCAACA 


CCCACTGCCG 


TCACACCCGC 


5100 


CTCAAACCAT 


GACGCCATAT 


TATCCAGACT 


AACACCGCCT 


GTGGGCATAA 


TATTCAAGTA 


5160 


CGGCAAAGGT 


GCTTGGAATG 


CAGAGATAAT 


ACTCGGACCA 


TAGGTACTTC 


CCGGAAATAA 


5220 


CTTCACGATA 


TCTACTCCGC 


TCTTCAAAGC 


TGTCTGCATC 


TCAGTAATTG 


TCATACATCC 


5280 


TGGCAAGTAA 


GGAACTTGAT 


ACAAGTTACA 


AATTTCAGCT 


GTCTCCCGAT 


TAAAGCTAGG 


5340 


GCTTACAATA AATTCGGCTC 


CGGCCATTAT 


AGCTAGGCGA 


GCCGTCACAG 


CATCTAACAC 


5400 


CGTGCCCGCG 


CCAATCACGA 


TATCTTTCTC 


GTCATGATAT 


GTTTCAACTA ATGACCGAAT 


5460 


GACTTCATCT 


GCTTGAGGGA 


CTGTA7UUVGT 


CAATTCTAAT 


CCAATCATTC 


CGCCCTCTAC 


5520 


TACTGCATGA 


CTAATTTTCA AAGCTTCCTC 


TTTAGAATCT 


GCTCGCAATA 


CTGCGATAAC 


5580 


TCCCGCATTT 


TCTAAGCGTT 


TCAATACTTC 


AACTTTTTTC 


ATTCGTCTAC 


GCCTTTCTTA 


5640 


TAAAGCTTTA 


TGGAATCAAA 


TATACGAGCA 


CCAGATTGAC 


TATACCGCGC 


TACTTCTTTT 


5700 


TAAATTCCTT 


CTTCTGGCAG 


TTCTTCTGTT 


tCATTGTTTT 


TTTCAAAAaG 


CraTTVATATtG 


5760 


CTCTTTCCAG 


CCTGTACGAC 


TGTTTCAGCA ATTTCAAATG AATCCGTGAC 


GCTATTTCGC 


5820 


ATCATCAAGC 


CTTCTAATAG 


TATTCCTAAG 


TTTGTGCCAC 


TCACAACAAA AACTTGATCT 


5880 


TTTTCCTCAG 


AGAATTCAAC 


ACAGGTTTTA 


AATGGGGTAC 


CACCCAACAA 


ATCTGTGAAA 


5940 


CAAATAGCTT 


GATCAAACAT 


TTGAATTTCT 


GCCGCTACTT 


TTTGGCGATA 


CTCATCTACC 


6000 


GTCTCCTCGG 


CTAAAAAGGG 


AATAATGATC 


ACATCTTCTA 


GTTCTCCAGC 


AATCATCGCT 


6060 


AAAGCACTCT 


GCATCCCAAT 


CGAAAATTCA 


CTATGCCCAG 


TAATTAATAA 


TTTCATCTCA 


6120 


TTGCTCCTTT 


ATTTAATCCG 


AATCTTTGCT 


TCCTTTTGAT 


CAATCAAAAC 


7VAGCCGACCA 


6180 


TAAAAAAATT 


GGCCATATTC 


ACTCACTAAT 


AGTTTATTCC 


CTTTGACAAT 


ATCTTCCCGC 


6240 


ATAACAAAGA 


GATCAAATTT 


TTCCGATCCG 


GCGATTATCC 


GAAATCCCTT 


AGCCTGACAG 


6300 


AATTGCTCAT 


TTTCTCCTGT 


TTGACTCACT 


TTTATTGGCG 


TAATTTGGAT 


ATCCTCTAAC 


6360 


GGAGAAATGA 


CTGTTGCTTG 


AATTTCTTTT 


CCCGTTTTGT 


AACAAAACTT 


GTTTGATAAG 


6420 


CGTGGGTGCT 


CATTCAGCTG 


GTTATAAATC 


TCTGAACCTT 


TTGCCACTGA 


TTGCTGTGTT 


6480 


TGGCCTCCAG 


CAAAAAGGAG 


TGTGTACTTA TGCTTGTTCG 


TAGTTAAAGC 


AAACCGATGC 


6540 


GCTTCTTTTT 


GACAATTTAT 


CGACGGTGCC AAATTATAGG 


TACTCGTAAT 


TTCAGTCTCT 


6600 


TTCTGTCCTG 


CAAAGCTATC 


AATAATAACT 


ACGGAGTTGA 


TCGACTTTAA 


ATAGATGAAG 


6660 


cTGCGCTCAA AAATCATTGG 


ATTCTGATCC 


GCCTTATCCA 


GCCAGCCGCA 


TTCTGCATVAA 


6720 


AAACCGACAG 


AAAGTTCTTT 


TATTTGCTGA 


AATAAGGGTG 


TCGGTAATTT 


GTCATAACCC 


6780 


CACGTATCGG AAACTAAAGT ATGGGGATTT TCTGCGATAA ACATCGTATT GTGCGAAgCG 


6840 
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CACTCTTTTA GCTGAAGTCG CTCTGATTTG 


TTGACATAGC 


TGTAACGACC 


ACTATCGGAA 


.6900 


AATAAGTCAT CCCCTTGTAG TTGTAGTGTA AATCCACCTG 


TAGAAGCATG 


ACCATGTGCG 


6960 


CTCCCATGCA GACCGTTAAA AAGTGTAAAA TAGATATCCT 


CTGCTTTGTA 


CGCCATCAGC 


7020 


CCACTTGATT CGCCACGAAA AAGTTCTTTT 


GGCTTCATTG 


TTTCCCAGAT 


TCTTTCTTCA 


7080 


TAAAGATCCC CCGTCCAAAG CCTTGCCATA 


TTCGCAGTCA 


TGGAGGGTTC 


A/yVTATGAAA 


7140 


CCTAATTTGC GATAGATATC ATATACGTAA 


TGAAAGTTGA 


CATGATCACT 


ATCATTGATC 


7200 


GGATTTAGGA TATCTTGGTT ATCCGCCAAA 


TAATGGGTAG 


AGAAAATAGG 


TGTTTTTAAT 


7260 


TTCATGCGAA GATCTAATGG TAACTGGACT 


TCAAGATATT 


CAGAAATCTG 


CAATAGATAC 


7320 


ACGAATGTCA TCAAAACTTC GTGCTGGTAC 


AGCGGGCTCT 


GCTCCCAATG 


AATTCCATCT 


7380 


GAATAGAATT GTAGATCAAG CTGTTCAGCT 


AAACGAGACC 


ATATTAGATC 


CCTCTGTTTA 


7440 


CTGGTCACCA GTTCTGGAAG AAATAAATCA ATAGCTGCCA TTCCACCAAT 


TGCCAACACA 


7500 


CCCCAATTAC TGAGCCTGTA TTTATCGATA 


TAGGACCGCT 


CCAAATAGTC 


CAGATGGATC 


7560 


AGCAAGGCGT TGTTCAACAC ATCATCAATT 


CCTAATAGTC 


TGAAATCAGC 


GATTGGAATA 


7620 


TACGTCAAGC TTTTCATCCA GTTTGTTACT 


CGAATCCCflA 


CATCTAACGG 


ACGCCAAACA 


7680 


TCCCTATTCG TCGAATTTGG CTCACCCTCA 


TCGTTAATAA AATCAATAAG 


CAAGCTGTGC 


7740 


CATTTCTGTA AGTAACGTTC TTTTTTAGTA AGTGCATATG 


CTTGTGCCAG 


ATCTACAAGA 


7800 


AAGCTTTGAC GACTCAACAT AAAGAGCCAT 


TCTGGATCAT 


CATCAGGATA 


CCGATTCCAC 


7860 


GCATACTCTT TTAAAGAATA AGGAATGGAG 


CAAGCTTCCA 


TATCCATTGC 


ATCATTGTAA 


7920 


ATAATCTGAT CTTTCAGCAA ATAAGT^GTA 


CGTTCAAAAA 


GTGGCTTCAA 


AAATTGTGAC 


7980 


TGTTGCTCCA AGTACTCTGA AAGCCAATCT 


TGTTGAAAAA 


ATTTTAGTTG 


GTAGTCTTCG 


8040 


ATTCTTTGAA GATTCATCGT GTTCCCGCTT 


TCAAATTTTT 


ATTCGCCTCA 


GGCTTAAAAT 


8100 


TACAGTATAC TTCCTCATTC ACTCCGAAAA 


GTGACAAATA 


CGGAGACAAG 


AATAATTTTC 


8160 


CAAAAATTAA AAATCTTGTT TCTGATTACC 


CTCGTTTACT 


ATGCTAAAAT 


TCCTAGGACA 


8220 


GACGCTGCTA TCCCAATAGC AAATAGTAAG 


ATCAATAGTT 


TATACGTGTT 


CCAATTTCTC 


8280 


TTCTTGATCA AATAAAACAC TAAGATAGTT 


AAAATTACTG 


GAAGCATTTT 


AGGCATAATC 


8340 


TTATCTAAAA TATCCTGAAT AGCGATTACT 


TGTTTTTCAC 


CGGATTCTAC 


GTGATGCGAA 


8400 


TACTTTATGG CGATGTTGGC CTTAACAAAT GATGTTGCTA ATGCCGCTAT AACAGTTACT 


8460 


CCGACAATAT TAGCAGCCGA AGAAATTTTC 


CCGATCTTAT 


CACTCAATGT 


TTCGATGACA 


8520 


CTCGTTCCAA GTTTAAAGCC TAAATAGCCC 


ATTAATAACT 


TGATAATCAA CATCGAGATT 


8580 


AGCATTGAAA ACCAAAAGCC CATCGGTGCA 


AAACCCAAAC 


CATCCATTGC 


TAACCCGGCA 


8640 


AAGATTGTTG AGAATAAAGG AGCAATCCCA 


AATTGAGCGA 


TTGAGTCTCC 


AATTCCCGAT 


8700 


AATGGTCCCA TTAGCGCCAT TTTGATGCTG 


CGCGTATCCG 


TGACACTTTG 


TCCATTGTCA 


8760 


TACATTGCCA ACTGCATACT AGTAATGAAT 


GGCAGCATTT 


GAGGGTTGGT 


ATTATAAAAC 


8820 
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TCTAAGTTTG CACTCGCCGT TTCTTTCAGC TTCTCTGGTT CATCTTTATA GATTTTCTTC 8880 

AGCGCTGGCA TGATCGTGTA TAAATAACTT ACCCCTTGAT AAGATGTATA ATTAAAAGCG 8940 

TTCTGTAACA AATACGACCG TAGTGCTGTC TTCATGTAGT CTTTTTGGCT TAATTTTGGT 9000 

GTCCTAGATT CCATTACTAA AATCCTCCTC TTCCATCAAG GTGGTTTGGC TTTCGTCTCT 9060 

TATGCTCTTT TCTCCAACAA TCGTTTTATC TTTTCCATAA TATTGAATCA ACGCAAAGAC 9120 

AGTTCCTACC ATTGCGACAG CCATCGTAGG CAGCTCTAAA TAAGCTGCAC AAATATAGCC 9180 

TAGTAACACA TAAGGTGTCA GACTCTTTTC GAGCATCACA GACATGATCA TTGCAAAACC 9240 

TATCGCTGGA ATAAGCCCGC CAGCAACAGA AAAGCCATTG ATTAGCCAAT TTGGTAACGC 9300 

TTCAACAAAT GCCTGCAATC CAGGACGACT CAAGGCCGCT AATAGTCCAA TCATAAATCC 9360 

TGCAACGGCA AAAGCTATAT ATGTTGAATT AGCAATCAGA TTAAAACGTG TATATTTCCC 9420 

TTGCTCAATT ACATTCTTCG ACCATTTAAC ATTCCCAGAA AATATAGTAT AGATCGCCGT 9480 

TGTTACCAGT TGGAACAATA CTGCAAAGGG AAATGATAGT GCTAACGCGG ATTCAGGTGT 9540 

TACCCCTTGA TGTTTAAGCG TAATGGCCAT AATCGTACCA ACGATTCCGG GTCCAACTGG 9600 

ATTGGGTGGT ACGGTTCCGC CTGCTCCAAC ACCAAATCCC ATATAAGCCA ACTCGGCTAA 9660 

AGCACCAAAT GTTAAAGCCG TTGGAATGTC ACCCAAAACA ACACCTACAA AAAAGGACAT 9720 

AACCAGTGCA CGGTTTGTGT AAATTCCTAA TAGCTGCCCC GaATAACAAA AAGCTGTGAT 97B0 

CT^TCCGATC AATATACCTT GAACTAATGA AATCTCCATC TGCTTCTCCT TCTCCTTCTC 9840 

CTTCTAATTT AATTAGATAT TTTTTCCATC AATACATCTA AGTACTGTGA GCCATCATTT 9900 

GAAAGTGGTG AAGTCTTGGT GTTGAAAGTT ACACCATAGT TATCTCGCAT CAAGCAGAGA 9960 

GCTTGCTTAT CTTCTTTACC TAMTAGATA AATTGAGAGA TTTTCTCCTT GCCTTCGTCA 10020 

GCATGAATAT TCCCAATATT TAACTGCTCC ATTGGAAAGC CCAATTTGCA TAGTTCTAAG 10080 

GCATCATGCA AGTTTCCAAC GACTACAAAA ATTGTTTGAC TAGGGGCTGC TTTCCAAATA 10140 

ACTTTTGCCG TACGTTCGAT CGTCCAAAAA CGAATATTTG TTTCTTTCGG GACTACCGTT 10200 

TTCATCAGCG TCTGCTGTAA GCTATCCTCT GCCGCTTTAT CATTGGCACA AATAACTAAG 10260 

TTCACTCCTA AACTCTTGAT CCATAGTTGA CCTTGTCCAT GTATCAGACG TTCATCTACA 10320 

CGCACCATTT TTACATTGGG TTTTTGCATT TTCTTTCCTC CTGTTTTTAC CAGTACATTT 10380 

TCCAATTTTT TTTGAATCGC ACTAAAGCTT CTAGATAGAA AT-AATCACCC CAAATATTAC 10440 

CTTCATTTAC GCCTTTACCA CTATGCCAAG AATAAACACC CTCATTCAGT AAAGCTGTAA 10500 

TTCCGTCTGT TTTGGATTCA GTATAGTTTT CAGATAAGCT TCTTAGCATG GAATGctGCG 10560 

CACACTTGTA AACCAACTTA TGACTGTTCG TCTCAGGTAA AAACGCGTCC ATTAGACTCA 10620 

TACCACAAAC AGCAATCGcT GTCGCAGAAG TATCTCGGGA CTGTTCGGAA CCATCCATAA 10680 

AAATCAGATC CCAATAACTA ACAAAATCCT TCGGcAAGCG ATTCAAGaAA TAATTCGTAA 10740 

TCGCCtCAAA AATTTCTGTA TTCTCTTCCT CTGGACAGTA GGAGCGGAAC AATGCCAAAC 10800 



wo 98/50555 PCTAJS98/08985 

938 

CATaAATCAA CCACGATTGT CCCCTCGCCC AGCTTGAACT ATGGGAATAT CCTTGGCGGG 10860 

TTTTTcCATA CAAAGGATCG CCCGTTTCAG CATCGAaGAA AAACGTATGG TAAGCAGATG 10920 

AATCTTCTCT AATCGCATTA TCGATTGTCG TATGATAATG TTTAACGGCT ACTTCACGAT 10980 

AACGTTTATC TTGAGTCACT GTTGACGCCC AAAATAATAA TGGAATATTA AGTAAGCAGT 11040 

CTACGATCAA ACGATAGTTA TCTTGATTGC CCAATTCTCC CCAAGCTTGG ATAAATTCTC 11100 

CCTTAGATTG GTAGCGCTGC AATAATTTTT CTGCTGCACT TATCGCGGCA TTCTTAGCAG 11160 

CTTCATTTCC AGTCAACTTA TACGCACTTA CACAGGATAA GCTGTATAAG AATCCCAAAT 11220 

CATGATGCTC GACTTCAATT TCATTTTCAA TTCGGTAAAT AAAACTCTTG ACATTTTGTT 11280 

CTGCAACTTC ACGGTAATGC TTTTCCCCAG AAACTTCATA GCACAGCCAT AATATTCCTG 11340 

TCCAAAAACC GTTTGTCCAT TCTGTATTAT CCATAGCTAt ATACGTAwTG TCCGTTGkTG 11400 

ckGGTGTTGG GAATTaTCTC CCAGTTT 11427 
(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 



CCCTGnGACT 


ATTTTTCTTA 


TCTCTTTTAT AAATAGAAGA GAAAGTATTC TGATTTTGTG 


60 


TTTTAAGCTT 


TTTATGCTAG 


AATAACGCCA GAACATAAGG AAAGGGACGA GwCAwTTGGA 


120 


ACCCATTTAT 


TTAGATCATG 


CGGCT^CAAC ACCGCTTCAT CCAACGGTCA TTCAAGCGAT 


180 


GACTGAAAAT 


ATGCAAACGA 


CTTTTGGCAA TCCATCAAGT ATTCATCAGT TTGGACGAAA 


240 


GGCACACGGT 


CTTCTGGAAG 


AAGTGCGTCA AACGATTGCC GAGAGCTTAC AAGCCAAACC 


300 


CCACGAAATT 


ATTTTTAATA 


GTGGTGGCAC AGAAGGGGAT AACACAGCCA TCTTGGCAGT 


360 


TGCTTTTTCT 


CGTCAAAAGG 


AAGGAAAACA TATTATTACT ACGGCAATTG AACATCCCGC 


420 


TGTGTTACGA 


ACGATGGAAT 


ACTTAGAAAC GTTAGGCTTT GAAGTGACTT ACTTGCCTGT 


480 


GAATGaAAAT 


GGTCAGATTT 


CAATGGATCA ATTCAAAAAG TCTTTACGCG AAGAAACGAT 


540 


CTTGGTTTCA 


ATGATGTATG 


GCAATAATGA AATTGGAAAT CGATTACCGA TTGCTGAAGT 


600 


TGGTGCAATT 


CTTAAAAATC 


ATTCGGCGAT TTTTCATACA GATGCCGTCC AAGCTTATGG 


660 


AAGTGAAGTC 


ATTTTACCTC 


ATGAATTAGG GATCGACTTA TTAAGTATTT CCGCTCATAA 


720 


AATCAATGGT 


CCAAAAGGCG 


TAGGTTTTTT ATTTAAAAGC GATGCAATCC AATTACCACC 


780 


TCTTTTACAT 


GGTGGGGAAC 


AAGAAGAAAA ACGACGTGCG GGCACGGAAA ACTTAGCTGG 


840 


AATTATTGGC 


ATGGGTACTG 


CCGTTTCATT ACTAACTTCT GCAGAAAAGC AAGCAAGAAA 


900 


AACAGCCTAT 


CAAAGTTTTC 


AAACGATTAT TTTAAAAGCG TTAGAAGAAG CCAATATTGA 


960 
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TTTTtCmATT AATGGkGaAC CAACCAATCG TTTAGCGCAT GTCTTAAayC TTCATTTAAA 
nGGAATcCCA GTGATTTGCT CTTAATGCAT TTGGACTTAA GAGGTATTGC GATTTCAACA 
GGCTCGGCTT GTACAGCGGG AACTGTGGAT CCTTCGCACG TGTTGACAGC GATGTATGGA 
GAAAATTCCT CAGCTATTAA AGAATCAATC CGGATTAGTT TTGGCTACGG GAACACACCA 
GAAGAAATTG CGACTTTTTC TGAAGTACTA GTTGCAGTGA TTCAACAATT GAAAAAATAA 
TCTGATTGCG GCAGCTACAG GTTTTCGCTA TAATGAAGAT AACAATGAAA ATAATGAGGT 
GGAACCCATG GCATTTACAA CAACGGCTGC TGTTGAAGGC TCAACAGTTT TTTACAAAGT 
ACATCCTAGT GCAAAACGCT ATACATTAAA AGATAACGGC TTTACTGAAA CAAAATCAGG 
GAACTTTCAA TTGATTCGTT CCTTAGATCC AACGCCTCAG AGAAATGAAG GCTTCAAATT 
GAAAATTACG ATCACTGCTG ATCTAAAAGA GTTGAAAATG TCCATTACAA CGGCAAATGG 
TTTAAAACCA ATGAATATTT TcAAGTWVTGA ACAACATGAA ATGAGTAAAG AAAAATACTT 
TTTCTTAATG GATGGCTTAA TTAGCCGTGG TGTATTAGAG AAAGTAGAA 
(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19031 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 
GGTGAGTGTT CTAGCAGTGC TGTTTGTGGC AGCTGCTTGT TCCTCAAAAC CAGAAGCGAA 
AATTAATACA GAACCATACT CTGACCGTCA AACAATGCTT GGTACCTACG TTCAAATTCG 
CATCTACGAT GATGGCAAAG AAGACGTCTT ACCGAAAGCC TTTGCCCGTG TCAAAGAACT 
AGGCGATAAA ATCACCGTCA ACCAACCAGG TTCAGAAATT GATGAArTCA ACCAAGAAGC 
AGGCGTGAAA CCAGTCAAAG TCTCAGATGA CTTGTATCCT TTGTTGAAAA AAGCCTATGA 
ATACAGCAAA GACTCCCGCG GCGGCTTTGA CATGGCAATT GGCCCAATCA CCTCTATGTG 
GCACATTGGC TTCGATGATG CCCGCAAACC AAGCCAAGCC GAAATTGATC AAGCCTTGAA 
ACTTGTCGAT TACACGAAAG TCAAATTCAA CGACAAAGAG CAAACCGTTT ATCTTGAAGA 
AAAAGGCATG CAACTAGATT TAGGCGCAAT TGCCAAAGGC TTCATCACCG ATGAAGTCGT 
GAAAGTTCTG AAAGACAACG GCGTTACCAC CGCTATTGTC GATTTAGGTG GGAATGTTTA 
CGTCTTAGGA CACAGCCCGC GTGGCAAAGA CATGGATTGG ACAGTCGGGA TTCAAGATCC 
GAATAAAGCC CGCAACACAG TCTTAGGTCA AGTCAAAGAA AGCAACAAAA CCTTAGTCAC 
ATCAGGAATT TATGAACGCT ACTTAAAAGT CGATGGCAAA ACCTACCACC ATCTCTTTGA 
TGGTGAAACT GGCTATCCGT TTGATAACGA CATCGCGGGT GTCACCATTA TCACCGACAA 
ATCCATCGAT GGTGATGGTC TCTCAACAGC CGTCTTCTCA ATGGGCGTCA AAAAAGGACT 
AGAATATGTC GAAAGTCTGA AAGGAACCGA TGCCATTTTC GTCACAAAAG ATGACAAAGT 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1669 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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CTACATCAGC 


AAAGATATCG 


AAGGCAACTT 


TGAAATCGGC 


AAAGACTCTG 


GCTATACGAT 


.1020 


GGGGAATCGG 


GCCGATTTGA 


AATAAGGAGC 


GACTAAAAAT 


GTCTTTAAAT 


GTGTTTTTAA 


1080 


AAGTAGTGGA 


AATTCAGACG 


AAGCTTGCCA 


GTCTCTTTCC 


CTTCATTATC 


GGGGTCTTGT 


1140 


TTTCAATGAC 


CTATTTCGGG 


GAAGTTCAGT 


GGGGCAACAC 


GCTGGTTTTC 


TTCATTGGCA 


1200 


TGGTGGTCTT 


TGATATGGCC 


ACTACCGCCA 


TTAATAATTA 


CATGGACTTC 


AAAAAAGCCA 


1260 


AATCAGACGT 


CTATAAATAT 


GAAGAAAATA 


TTATCGGACA 


ATCAGGCGTT 


TCACCACAAC 


1320 


TGGTCCGCAA 


CATGATTTTC 


GCCATGATTG 


CCTTCACTGC 


CGTTGTGGGT 


GCGTACTTGA 


1380 


CGGTTCAAAC 


CGGCTGGCTC 


TTCCTTGTTT 


TAGGCGGGAT 


TTGCTGCTTC 


ATCGGGATTT 


1440 


TCTACACGTA 


TGGACCGATT 


CCGTTATCTC 


GCATGCCCTT 


GGGGGAAATT 


TTCAGTGGCT 


1500 


TCACGATGGG 


CTTAGGCATC 


TTCGTCTTAA 


CCGTTTACTT 


GAACGTCGTC 


ACCAATCCGC 


1560 


CTTTTTACTT 


AACGGTCGAT 


TTTGCTAGCG 


GCGCCTTTCG 


CTTAGACGGC 


AATCTCTGGG 


1620 


CGGTGCTTGC 


GATTGTCGTG 


GCGTCCTTGC 


CCTTGGTCTG 


TTCGATTGCT 


AACATCATGC 


1680 


TCGCCAACAA 


TTTACGGGAC 


TTAGACACGG 


ATATTGAGAA 


CCACCGTTAT 


ACATTAGTCT 


1740 


ATTATATTGG 


GCGCCCATCG 


GGGATTGTGC 


TGTTTCAAGT 


GTTAGCGCTA 


sctGGGTATC 


1800 


TGGTGATTTT 


ATTCGGCTTC 


ATTAGTGGTA 


TTTATCAATG 


GCCTGTGTTA ATTACATTTG 


1860 


CCACTTTACC 


TGTCATTTGG 


AAAAATATTC 


AAACGTTTAA ACAAGAATTG 


CCGCAGCCGA 


1920 


AAAGTTTTCG 


CCATTCCATT 


AAGAACTTAA 


TGGTTTTCAA 


TGGCACCTAT 


GCTTTAGGGC 


1980 


TATTCATTAG 


CGCCTTACTT 


GGTTAGTCTT 


GCGAAAAAAT 


AGCGGGCTGG 


GCGAAAAAAA 


2040 


TACTGTTTTT 


TTTCAGAGGC 


CTGCGTCAAA 


ACAATCAAAC 


AAAGCTCAAA 


ATGTTAGTAT 


2100 


TCGTTGCTAA 


CATTTTGAGC 


TTTTCTTGTT 


AAAAAGAGCG 


GAATCTTAAA 


ATTTTGTCAG 


2160 


GGGATTTTGG 


TAAAAATTTG 


CCGAAATCGC 


CCTTTTTTTG 


AGCCGATTAT 


TAAATAACAG 


2220 


CTGTTGTAAC 


TCTTGTCAGA 


ATAGGAACCT 


TCAAAATTTT 


CGCATGGGGT 


GCGAAAAAAT 


2280 


CGTCAATTTC 


TGCTGGGGGT 


TTCTTTGTTA 


TTATAGAGGA 


GTAACAAAAG 


GTTGTTTTGT 


2340 


CTTTCTTGTG 


TAAAAGGGCA 


AGAAAGGCTA 


GCGAGTTAAA 


AGGAGGTTTT 


TCAATGAAAA 


2400 


AAAAGCGTTA 


TTTAATGATT 


GTGTGTCTAC 


TATCTTCTCC 


TAGTTTTTTT 


ATAAATGTTG 


24 60 


AAGCGTCTGA 


TGGTGGTTCT 


AGTTCGGTGG 


GGATTGAATT 


TTACCAAAAT 


CCGAGAACAC 


2520 


CCGCTCCTAA 


AGATCCCCCA 


CCGAAAACAG 


ATGCGCCAGC 


TGCTGATCCC 


AAGGAACCAG 


2580 


CTGGTCCTCC 


GCAAGGAGAT 


CAACGAAgTG 


GTGGTTCGAC 


ACAGACCACC 


ACAACTGGCT 


2640 


CAAC6CTCCC 


TCGTACAGGG 


AGCAAGAGTC 


AGGCAAATTT 


GAGCATTCtC 


nGnTTCGCCT 


, 2700 


TAATCGGTTT 


GGCGGGAATC 


GTACATAGAA 


AGAAGGGACG 


ACATGAAGCA AACTAAGTGG 


2760 


CAACGATTAG 


CAACCATTGG 


CTTGTGTAGT 


TCTTTAGTAA 


TTAACGCCTT 


TTCTGGTGTG 


2820 


ACGGCAGTTG 


CGGAAACCGT 


GACGATTGAA 


AGTAGTCCGA 


CCGCCGAAAG 


TAGTGCCAAG 


2880 


GAAGAGACGC 


AAGCAAGTAG 


CGTGAAGGAA 


GAAACAACGA 


AAGCCAGTAC 


GGAAAATAGT 


2940 
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CAAGTAACAA 


CTGACACGAG TCAGGAAGAA 


GCAACGAAAG 


AAGCGGAGAA AGAAGAACCG 


3000 


CAAGCAGAAG 


TGGAACAAGC AGAAACACCA 


ATCATTCCTA 


AACCAAAAAA AATCAATATG 


3060 


AAGGCAACTT 


ATTCATTTTC TGCAGAAACT 


TATCAGTTTG 


GATTTGTGAA 


TGAATCAGGT 


3120 


CAATTAATAA 


ATCCAGATAT TATACCAATT 


ACGTATAGCT 


ATGCCAAAGG 


ATCATGGAAG 


3180 


ACAGATGGTT 


ATAATCGAAA GTGGACTAGT 


ATGGTTCAAG 


GGAGTGCTTC 


AACCGTAGGA 


3240 


AACTTAAAGA 


ATGTAATAAT GCCAGCT^ACT 


TCTGTAGTTA 


TGCCACCAGG 


ACCGTCATAT 


3300 


GAAGGAACTC 


AAGAGGTGTA CACAAACTTT 


TCAATTCGCA 


TACCAAAATA 


TTATGCATCA 


3360 


GCGAGTCTCT 


ACAATAGAGA AGGTAAAATT 


GATTCTACTT 


ATCCGTTACC 


TGCTATTGCA 


3420 


CTAGCAGGTA 


CTAGACCGCT ATCTTTGACT 


CAAAGTAGTG 


TAATTAGTGC 


ATTGGCGCTG 


3480 


ACCAGTAAAG 


GAGACAATGT TTATACACCA 


CGGGAAACAT 


TTTTTGGAGG 


AGATCCTGCA 


3540 


GGTGTAAAGT 


TTACTAATTT TTTGTATCGT 


ATAAATGACT 


TTGATGTGAA 


AGGTAATAAC 


3600 


ATAGGTTATA 


AGACTGTGAG TAGCCCAATC 


TATTACCATC 


TGACCAACCG 


CCGTGTCACC 


3660 


GAAAACTTCG 


TAGATACAAG TGGCGCCAAA 


ATCACGCCAC 


CAAGTAATTT 


CACCCAAGGG 


3720 


AAACAAACG6 


TCATTAACAG TGATCCTTAC 


ACGTTCCAAC 


AAAGTGGTTT 


TTTACCCGAG 


3780 


ACCTACAAAG 


TTGGCACGAA ATCTTACCGA 


TTCAAAGGCT 


GGTACAAAGG 


GAAAACCAAA 


3B40 


ACCGAGCCTT 


TGGCCACCAC TAAAACACCT 


AGCTATAAAG 


TCACGTATGA TGACAATGAT 


3900 


GATTTGACGG 


TGGTCTATGA GGAGTTTTCA 


GGGTACGAGC 


TGCCTGCTTC 


GACCAATCAA 


3960 


TTTGGCTTTG 


TGGATGAAGC GACGAACAAA 


TTAATTGCCC 


CCGACCAAGT 


GCAGATGAAG 


4020 


TATAATCTTA 


CTTTAAATGA AAATAATAAA 


AAAACAGTAA 


TGAGCAGTAA 


CTTAACGGGG 


4080 


ACAGATACAG 


CGACACT3AA AAACTTGTCC 


GTGCCTGTCA 


ACTATTTTGA 


ACAATATCGC 


4140 


GTCAATACGT 


TTTATGGCGC GAGTGACATT 


ACGTTTACAT 


TGCCCAAACG 


GTACAAATCA 


4200 


ATCAATATTA 


CCAAATCAGA TGGCAAAACC 


GACCCAGCTT 


TTCCTCTTCC 


TAAAATCTAT 


4260 


AATATAGATC 


AAGTAGAAAT GTCACACATG 


CCTGTGACCA 


CTTATAACAA 


GTTGAAACAG 


4320 


CTGTCGGGCC 


AAACGTTTGG CTTTAATGCT 


TTAGCCGATC 


AACCTGAATT 


TTATACGAAA 


4380 


ACGTTATTTG 


GGACAGAGTC TGGCATCGAT 


GACCCAGTCA 


ATTATTATAC 


AATGAGTGGC 


4440 


CCTGTTTACT 


ATTATTTAGA AAACCGCAAA 


GTCACCGAGA 


aCTTCGTAGA CACCAACGGC 


4500 


GCTAAAATCA 


CACCGCCAAC AGGTTTCACC 


CAAGGT7\AAA AAACGGTGAT 


TACAAGCGAC 


4560 


GCCTACACTT 


TCAAACAAGC AGGCACCTTA 


CCAGACACTT 


ACACAACAGG 


CGGTAAGACC 


4620 


TACAAGTTCA 


AAGGTTGGTA CAAAGGCAAG 


TCCATACTCA ACACATTGAC 


AACTACCAAA 


4680 


GCGCCAAGTT 


ATCAAGTGAC CTACGATGAC 


AATGATGATT 


TGAATGTGGT 


GTATGAAGAA 


4740 


GAAACAGTTA 


CGACAGTGTA TCCATCAGTC 


GATATGAACT 


TTGTGAATGA 


AAAAGGCGGG 


4800 


GCTTTCACAC 


CGGCGTTAAC TTTTAGTGGT 


AAGTACTATG 


CGCAAAGTAC 


GAGTGCGTAC 


4860 


TTAAGAACCG 


ATTTATATGA CGTGACCTCA 


AAAAATAATG 


GTAATGGGCA 


ATATACGGTA 


4920 
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AGTATTAATA 


ATGGTAGTAT 


GCCATTGTCC 


CAAGAATTAT 


TGAAAAAATA 


TAATAATGGA 


. 4980 


CAACCAATCA 


GTGCTACCAA 


CAGATTACAG 


TTTAATGTTG 


ATAAATTAGC 


CATCGACCAA 


5040 


CAACTAAAAT 


ATGTTGACAG 


CATTCAATTA 


GACACAGCTC 


7VAAGTAGCAA 


TCTGAAATCC 


5100 


TATAGATATG 


TGTACACGAA 


CAATAGCTCA 


CTGGTTTTCG 


ACCCAAATGT 


AGCACCAGCA 


5160 


GAGGTTGACC 


TTAGTTCAGA 


ATCTCTTAAC 


TTGCTTAATT 


TTGATTCAGA 


TGGCACCTAT 


5220 


TTTTCTAATG 


CAAATAATAG 


ACTTTTTTAC 


ACGCATTTAG 


GATATAGTGG 


CACACCAGGA 


5280 


GTTAACTATC 


TTCTCGTAAT 


GTTTCTTTTT 


AACGCCAAAc 


CTGCGGATAA 


GTCAAAACTT 


5340 


GTCTACAAAG 


TCACTCGCAA 


ACAAGTCACC 


GAAAACTTCG 


TGGATGTCAA 


CGGTGCCAAA 


5400 


ATCACTGCAC 


CAACAGGCTT 


CACCCAAGGT 


AACCAAGTAC 


CAATGAACAG 


TAACACCTTC 


5460 


AAGTACACAG 


CGGCAAAAGC 


TTTACCAGCG 


ACGTATACTA 


CAGGTGGCAA 


AGTCTATACG 


5520 


TTCCAAGGGT 


GGTATAAAGG 


GAAAACCAAG 


CCAAGTACGT 


TGAACAAAAC 


AACAACTCCA 


5580 


ACGTTCAATG 


CGACCTTTGA 


TGGCAATGAC 


GATATGACCG 


CCATGTATAA 


GGAAGAAATA 


5640 


CCAACAGCTA 


GTGTCACATT 


AACTCGACCA 


AAAGAAGTGA 


TTGATACGAA 


TACCAATGTA 


5700 


ATCTGGACAA 


CAACGATCAC 


GAATACTAGC 


AAAGCACCCT 


TACAAAATCT 


CACCTTGAAA 


5760 


AAAGGGCCCA 


ATTGGTCAGC 


TGGTCTGACG 


ATCCCGACCT 


TTATGGAAGT 


GACACCAGAA 


5820 


GGAGAAACGA 


CAAAaTCAAT 


CCCAGTAAAT 


AGTACACTTT 


GGACAGAGGG 


GGTTCCTTTA 


5880 


CCAAATGCC6 


TTCcTATCGG 


CAAAAAAGTT 


TCAGTTGCTT 


TCACAACTCG 


CGCAACAGGG 


5940 


AAACCAAACA 


CTGTTTTGAA 


AGCAGAAGTT 


GTAGTATTTG 


GTGGTATTAA 


AGATAGTACA 


6000 


GTGGATAACT 


TCGTGAGAAT 


TCGTCCAAAT 


GATCAAGAAG 


TAGTCACACC 


AACGACCGAA 


6060 


GGCTTCATCA 


GTGTGCCAAC 


CTTCGACTTC 


GGCCAAGTGG 


GCGTTGCAGG 


AACTAAGCAA 


6120 


XAACACAGCT 


TGAAACAAGC 


CGCGGATTAC 


TACGGTAACG 


GCACACGGAA 


TCCGTATCTG 


6180 


CGGATTAAGA 


AAACGCAACC 


CAATTGGAGC 


TTAACAGCGC 


AACTGTCACA 


ACCAAAATCA 


6240 


GCGACAGACA 


GCTTGCCTAC 


AGCGACCCGC 


TTATTATTAG 


GGGCGGCGCC 


TGTCTCTAGC 


6300 


TTTACCAATT 


ACAATCAACC 


AACCGAGTTG 


AAAAATACGG 


TCGGTACCAC 


GAGTGCCATT 


6360 


AGCTTAACAG 


CCAACAACAC 


AGCAACGAGT 


ATTATTGCCA 


ACAAGC/U^TT 


CACAGGTAGT 


6420 


AATGTTTATC 


AGTTGGACTT 


CACCTTCAAT 


AATGTCAAAC 


TTGAAGTGCC 


AGCCAATCAA 


6480 


GGTGTTAAAG 


GGCAACAATA 


CAAGGCCGCA 


GTTACATGGA 


ACCTAGTTAC 


AGGTCCTTAA 


6540 


AATAGTTAAG 


TTATCATGAG 


AAAAGCTCAA 


AATGTTAGTG 


GTTATCACTA 


ACATTTTGAG 


6600 


CyTTTTGGTT 


TAAATATTAA 


GGAAAAAACA 


GTTGGAGTTC 


ATCAAAAGAG 


CGTTTAAATC 


6660 


CTTGATTTTC 


GCTAAAATAA 


CACTTATTTT 


CAAAGGGAAC 


AGAGGCGCTC 


AGCCTTGAAA 


6720 


GGATAGGAAT 


CACTAAAAAA 


CGGCACTCAA 


TATGTCAAAA 


TTTGAAATTT 


CAAGCTGTGT 


6780 


GTTCTTTGGT 


AAAATAwATA 


wAAAAATGCT 


AGTTATCAGT 


ATCGAT7KATA 


ACAGGATACT 


6840 


GATTAAGAAA 


GGACTTTATA 


GAGACTATAG 


ATTGAATTTT 


TACATAGAAA 


GAAGGAGCAA 


6900 
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GATGAAGCGA 


GTAAATTGGA 


AAAGATGGCT 


AGTTGTTGGG 


TTAAGTTGTT 


CTTTGTTCAT 


6960 


GGATTCAGTG 


GTTGGTGTGA 


CTGTGTTAGC 


GGAAACGATT 


ACTGGGGCGA 


CGGAGCAAGG 


7020 


AGTAGCAACA 


TCTCAGTCGA 


GTGACGAAgC 


GAGCCAGACG 


aCGCAAACAA 


CCGaAGAGTC 


7080 


ACAGGCAACG 


GTCGCTAGTG 


AAGCGAAAAC 


AGTACCGCCA 


CAGGAAACGG 


CAAGAATTGC 


7140 


TTCTCGAGCG 


ATTGGTTATT 


CTTCTGTGGA 


AGGGCGCGAG 


ATTCCCTTTT 


TCTTTGTGGA 


7200 


GGAAGACGGG 


ACGTTGTTTG 


ATCCCGACCG 


AATTACGATG 


GCGGTCAATC 


TTTCCACGTT 


7260 


TTCGTTTTAT 


GAAGAGAAAT 


TACAACGAAC 


CCCCCTTGAG 


CCCACCACTG 


TGAATGGCGG 


7320 


AAAGTTACTG 


TCTATTCCAA 


CGTCACCAGC 


TTTTT^TAT 


GATACAAATA 


ACCAGAATCC 


7380 


AAGTAATATT 


TATGGCGTTT 


CTGAAGTGTC 


GTTTACTATT 


CCTAAGGAGT 


ATCAAAGCCT 


7440 


GGACATTCGA 


CCAAGTACGT 


TTTATACAGG 


AGACACTACG 


CAATATCCAG 


TGCCAACGGT 


7500 


TTTTGCGAAC 


GTTGGGGGCA 


AAGTGACGAA 


CTATGTGGGC 


GCCAATGCGG 


AGACGGAATT 


7560 


AGAGTTAACC 


AATGAAAAAA 


TGCCCAATAA 


GCTGACGTTT 


GGTCCTAAAA 


AGACGTTTAA 


7620 


ATATACGGTA 


GCTACGGCAC 


CAGGAGGCGT 


TACGTATGCG 


CTGACCTATT 


TTTATGGAGA 


7680 


TGTCGGCGGT 


CCAACTAGTT 


CGCACCAAAG 


ACGAGGAACA 


GCGGGTCCTG 


TGTATTATTA 


7740 


TTTAACAAAG 


CGGCGTGTCA 


CGGAAAAATT 


TGAGAATCCC 


GCAGGCGGGG 


CGATTCCTGC 


7800 


GCCAGAAGGT 


TATACGCAGG 


ATAAGAAAAC 


CATTGTAACA 


GGGGAGGATT 


TTACTTTTAC 


7860 


CCAAGAAGGC 


ACCTTGCCTG 


AACGTTACAC 


AGGCAGTGAT 


GGGAAGACGT 


ATTTATTTAA 


7920 


AGGTTGGTAC 


AAAGGGAATG 


CGAAACCTAG 


CACGTTGGAA 


ACCACCAAAA 


CGCCTAGTTA 


7980 


TGCGGTGACC 


TATGATGACA 


ATGACGATTT 


GCATGTGGTC 


TATGAAGAAG 


CAGTGATGAA 


8040 


AACCTATACG 


TTGCCAGCGA 


GAGAAGCTTT 


GTTCGGCTAT 


GTTGATGAGC 


AAGGAAACTT 


8100 


GATTAATCCC 


GCCAAGTTTA 


AGCTAAGTGC 


GACCATGGGT 


GAAAGTGACG 


GAGCCACAGG 


8160 


GGAAATGACG 


ACTTTTCCCA 


CT^TTGATGG 


AATCGATATG 


CCAGCAAGTC 


AATTAAAGAA 


8220 


ATTAGCCATC 


CCGCAAAAAG 


TCTACACACG 


CCCAGACGAT 


GGGACAATCG 


TAACTTATGG 


8280 


CCCGCAAGAA 


GTGAGTGTTG 


AAATTCCTAA 


GTATTACCAG 


ACGATTTCGA 


TTTCACCAAC 


8340 


TACTGCGTAT 


ACAGGGGATA 


AAACCAAGTA 


TCCAGTACCA 


AATGAAGTGC 


GCCGTGGCAT 


8400 


CGAAAACCCC 


GACAACATTG 


TTAGTAGTTT 


aGTGGGAAmC 


mCTGCGTATA 


ACTTGACCCA 


8460 


AAAAAGTGCC 


ACACGCTATA 


CTGCCCGCCG 


TTCTTACTGG 


AmGTGGGGCC 


CCACGAAGAC 


8520 


ACTTTACTCA 


ATGAGTATCT 


ATTCAGGAAC 


TGCTGGGGGC 


AACTATAATT 


TATCGACCCC 


8580 


TGATGGCACC 


ATTTATTATT 


ACTTAGAAAA 


TCGGCGGGTC 


ACTGAACATT 


TTGTAGACGA 


8640 


AAGTGGCGCA 


AAAATCACGC 


CACCAACTGG 


CTTTACACAA 


GGAAATCAGC 


TAGTGGTGGA 


8700 


CAGTGAAAAC 


TATGTCTACA 


CTGTCGCAAA 


AGCTTTGCCG 


AAGATCTACC 


AAGCTGGTGA 


8760 


AAAAACCTAT 


ATCTTCCAAG 


GCTGGTTTAA 


AGGCAAAACC 


AAGCCAGCAA 


CATTAAAGAC 


8820 


GACAACGACC 


CCAAGTTTTA 


CACCAACTTT 


TAATGATGAG 


GACGACATGA 


CCGCTGTGTA 


8880 
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CCAAGAAGCG 


ATTCCCACCG 


CGGaAcTAAC 


GTTAACAGGT 


GCCGTTGACA 


TAATCGAAAA 


. 8940 


TGGCGCCACA 


ATGGATTACT 


GGGAGGCGCT 


ACTGAAGAAC 


ACAGGCGAAg 


cGCCGTTAAC 


9000 


CACCATTAAA 


ATCAAGCCAA 


CGGCAACTTG 


GGCGGCTGGC 


ATCGGCGCAC 


CCAACACGAT 


9060 


ATTTGTACAA 


GGAACGGGTC 


AAAACACCAA 


AGCTTTTCCT 


GTCACCAAAG 


AACAATGGAC 


9120 


GACCGGTGCA 


GGAGTGTCCA 


TCACGTTGGA 


TCAGCCTTTA 


CCAGCTGGCG 


GTCAATTAAA 


9180 


AATGAACTTA 


TTAGGAACCG 


CCGTTACAGG 


AAATCCTGGT 


CAAGTTTTAA 


CCGCTGATGT 


9240 


TGAAGTAACG 


GGCAACTTTG 


GCAGTTTAAC 


TGCCAAAGAT 


ACGGTCCGTA 


TTAAAGACTT 


9300 


AGATCAAGAA 


ATTACGAGTC 


CTGACGGCGA 


CGGCTTTATT 


AGTACCCCGA 


CATTTGATTT 


9360 


TGGTAAACTA 


GCAATTTCAG 


GAAGTAAGCA 


ACAATATGGT 


TTGAAGAAGG 


CCGCAGATTA 


9420 


CTACGGCAAT 


GGCACrCGCA 


ACCCTTATTT 


ACGCCTGAAT 


ACTAGCCAAG 


CCAATTGGAG 


9480 


TTTAACGGCC 


CAGcTATCGC 


AACCAAAATC 


AGCCACAGAC 


AGCTTGCCAA 


CAACGACCCG 


9540 


CTTGTTGCTA 


GGAACGGCCG 


CTGCTGCCAG 


CTTTACCGAT 


TACAACCAAC 


CAACAGAAAC 


9600 


CAGGACACCA 


CTTGGCAAGA 


CCAGCACCGT 


GACTTTAACC 


GCCGACAATA 


CCGCAACAGC 


9660 


GGTGGTCGCA 


AACCAACAGT 


TCACAGGCAG 


TGACGTCTAT 


CAGTTGGACT 


TCACGTTTGC 


9720 


TAACATCAAA 


CTAGAAGTGC 


CAGCCAACCA 


AGGTATGGCT 


GGCCAACAAT 


ACCAAGCCGC 


9780 


CGTCACGTGG 


AATTTAGTGA 


CTGGCCCCTA 


AAATAAAAAT 


GCCCGAGCCT 


TAGAAAAATA 


9840 


AGAGGCTCGG 


GCATTTTTAT 


TGATGGTCTT 


TTGATGGTCG 


TTGTGCTATA 


ATAGCATTGA 


9900 


AAAGAGAGAT 


ATGCTGTAAC 


ATACCTCTCT 


AGTGTAGAGC 


CGTTTAAGAC 


GGTGACCGAT 


9960 


TTTGTTACAA 


AAAATAACCG 


TACTCAGTCA 


AAGTAAACGG 


TTATTTTTTC 


TTGTCTTCTT 


10020 


TAACAACCTT 


TAGGATTCCG 


AAGATTAAAG 


TCGCGATAAA 


AGAACCGAAA 


CTAATCATCA 


10080 


GAGCTAAAGC 


TTCTGCGATA 


GACAAAAGGC 


CTCTCCTTTC 


ATAGATTTTA 


GTACTTACAT 


10140 


TCATAAGCAC 


CACCACCTTT 


CAGATTGGAT 


AGCCACCGCC 


ATAACTTCTC 


TACACTTTTA 


10200 


TTATACCATT 


TAGAACTAAA 


AAATAATTAG 


TTTTCAATAT 


ATTTTTTTAG 


GAAATTGTGC 


10260 


TATAATAGCA 


ATGAAAAGAG 


AGATATGCTT 


CAACATACCT 


CTCTGATGTA 


GAGCCGTTTA 


10320 


AGACGGTGAC 


CTTTTTAGTT 


ACAAAAAATA 


ACCGTACTCA 


GTCAAAGTTG 


ACGGTTATTT 


10380 


TTTATTGTCA 


TTTTTAAGCT 


TTAAAAGATT 


TTCCTGTTCC 


CTAATTGTTA 


AGAATAGAGA 


10440 


AGAAAAGCTC 


AAAATGTTAG 


TTCTGATTAC 


TAACATTTTG 


AGTTTTTTTC 


AATAAATATC 


10500 


AAGAAAAATC 


AGCGGAAATT 


AGAGAGAAAA 


TGGTGAATTC 


CACTGATTTC 


CAGGCTGTTT 


10560 


TTTACCAAAA 


AACAGT6GCT 


CCTCCTTGGT 


ATAAAAGGAA 


TCATTGGAAA 


TATCAAGTTG 


10620 


TCGTATCAAA 


ATTTAAAATT 


TCAGTGGGGC 


CTTTTTTTGA 


TAAAATGAAA 


ATCATACAAA 


10680 


AAATCATACA 


TAGGACAATA 


AAAATAACAA 


TAAAAAGTTA 


CATAAATU^G 


GAG7UVTGGCA 


10740 


TGAATCAAAA 


AAGAAACCGT 


TTATTTGCAA 


TCGCTTTGTG 


TGGTTCTTTA 


CTAATCAGTT 


10800 


CATTAAATGG 


CATCCCAGCA 


ATTGCTGAAA 


CCATGGCGGC 


CCCAGATGTT 


CAAGGGGAGG 


10860 
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ACCGCCAAAC 


AGTAGTCCCG 


TCAGAGGGGG 


TTCAAGGACG 


GGCGCCCGTT 


TCTTTTCCAG 


. 10920 


AGGCAGATAC 


GGAAGTnAGc 


GTACCGCCAG 


CTGATGAAGA 


GCTACCAGAA 


CAGACTGAAC 


10980 


AAGTAGTCGG 


GCAAGTTGAA 


GCGTCAGTGA 


CGAGTCCAAT 


AATAGTTAAT 


ACACCAGAAT 


11040 


CACTGCGTGC GCAgCGAGCA GTGCAGACAT 


TCGATATTCC 


AGAGAAGACG 


TACAAAATGT 


11100 


CGTTTTTAGA 


TGAAAGTGGT 


CGTTTTATTG 


ATCCAGCGAA 


AGTTTCGATT 


ACTGGGGATA 


11160 


TTATGAAGTT 


TACTATGCAA 


GGCGCTTTGA 


CGAAAGTCGG 


GGCCGTGGCC 


ACTAATAATG 


11220 


TGGGAACCAC 


GAAGCAATTG 


ACGGTGCCGA 


AAGTAGCTTT 


GTCTGGGGAT 


ACAGCCATAA 


11280 


AGAGTTATGA 


TTTTTCCGGC 


ATCCAAAATG 


AAAAGATTAC 


TTTGCCGCGG 


TTCTATAATG 


11340 


TTCCGAAAAT 


CACGCCCAGT 


GCGAATTACA 


CTGGACAAAT 


CTATCCAATG 


GCAGTGACGA 


11400 


AACAGGTGAA 


TGTTACAGGA 


GAAGTATCGA 


ATGAGGAAAC 


CACAGGCGCG 


AGGCTGAATA 


11460 


TGTTTGTATC 


AGGTATCCCT 


AACCAGTTTG 


TCTTTTCTCG 


TGCTAGTTGG 


GACAATGATC 


11520 


CAGCGGGAAA 


TATGTTTGCG 


CGTGGCTTTA 


ACATGGCTAC 


TGCAGGCGGC 


ACACCAGCTT 


11580 


ATCTGACTAC 


GGATGATCAC 


ATTTACTATT 


ATGTGCCTAA 


CAAACGGGTT 


AACATTTATT 


11640 


ATAATGACAT 


GACCCCGAAT 


GGCTTACCAG 


CGTATCCTCC 


AGGGTATAAT 


GCCAGTCTTT 


11700 


CAAAAACAGT AGTTGATTCA 


GAAAATTTTC ACTATAAAGC 


GCCAAAAGCT 


TTTCCAGAAT 


11760 


TCTTTAGTTC 


TGGGAATCGT 


CACTTCCAGT 


TTAAAGGCTG 


GTATAAAGGG 


CCTGCAAGAC 


11820 


CAGCGGATCC 


AAAAGCCGTT 


AAGTTAGAAA 


CCTCATTGAC 


GCCAGAGTTC 


GATGTCACTT 


11880 


ACGATGGTGC 


CGATAATTTA 


TATGTTTTTT 


ATGATGAAGT 


GAAAGAAATG 


ACCACTACTA 


11940 


TTCCAGAAGT 


TACCTATAAG 


TTTGGCTTTG 


TTGATGAAAA 


AGGGGCGCTT 


GTTGCGCCAA 


12000 


CGAATGTAGA 


CATTCAGGCG 


AACTTAACAA 


CTGTCGAGGA 


TAAGGTGGTC 


CAGAAGGTGG 


12060 


GGGCTATTAC 


AGGCGCTAAT 


GTAGGCAATT 


TAAAACAGTT 


AACCGTTCCT 


AGCCAAACAC 


12120 


TAACCTATGT 


ACCTAAAATT 


AAAGTTGCTT 


CTTCTGGGGT 


AACCGATTTT 


CGGTTGACTA 


12180 


TTCCCAAGCG 


CTATAAAATT 


CCAACAGTCA 


CACCTGGTTC 


ATTCTACACT 


GGCTCAACGA 


12240 


CCGCCTATCC 


TTTGGCGACA 


ACGCTGTTAA 


GGCATATTAG 


TGGTCAGGCA GATGAACGGA 


12300 


TAACAACGGA 


TGGCACTCGC 


TATAGTTTAT 


ATAATTTACC 


GACGCCTCAT 


AGCTATCGTA 


12360 


TGTACAGAAG TAGTTGGCAA CCTGACGTTA CCAACACGGG ATTTAGTGCA ACTTTAAACA 


12420 


TGGCGACGAC 


CGAGATTTAT 


CCAGCTTATT 


TCACGACCGA 


TAACACGATG 


TACTATTTCT 


12480 


TAGAAAACCG 


CCGCGTCACT 


GAACACTATG 


TTAATGAAGC 


AGGTGCTGAG 


GTACCGATGC 


12540 


CTACAGGCTT 


CACGCAAGGG 


AATCAAACGG 


TGATTGACAG 


TGATACTTAT 


CATTTTAAAT 


12600 


TGGCAAAGGA ATTGCCGTTT 


AGTTATTTTT 


TAAACAATAA AGCATACCGA 


TTTAAAGGCT 


12660 


GGTATAAAGG 


CAAAACCAAA 


CCTAAGGTAT 


TGGAAACGAG 


TCGAACGCCT 


GAGTATGATA 


12720 


CAACTTTCGA 


TGATAACGAT 


GATTTAACCG 


TTGTTTACGA 


AGAAATTAAA 


TACGGCGGAA 


12780 


ATGCGGTGAC 


ATTTGGTTTT 


GTTGGGGAAG 


ATGGGCAATT 


GCTTCAGCCG 


ACAGGGTTTC 


12840 
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AGGTGACTAC 


GGATATTGTT 


GAAACAATCG 


ATGGGCTGAG 


TACGGTTTTA 


AGTAATGTTT 


. 12900 


CAGCTGTCGA 


CAGTACAGGC 


AACGTTAAAA 


CTTTGACGAT 


CCCGCAAAAG 


GAATATGTAT 


12960 


TGACTCCCCA 


GATGAATTTC 


TATGGAACAA 


AAAATTCATT 


GATTACGATT 


CCAAGACAAT 


13020 


ACCAAGAAAT 


TTCTTTTACC 


AAACCTGCCA 


ACTACCAAGG 


ACTAGATTAT 


CCAGTTGCAG 


13080 


GGCGTGTTGA 


GAATATGTTT 


AGTGGTAATC 


TGTATGTTGA 


GGGGGCACAA 


CCACCTTACT 


13140 


TTTCTGCTTC 


GAAAGTCAAA 


GCCAATCAAT 


ACAAAATAAT 


TGAGTTTCAT 


TGGAAAGAAG 


13200 


AGCCTACACA 


AAGTTTCTCT 


AGTTTGTACA 


AAGCTACGTT 


AATTCGAAGT 


AGCGTGGCAC 


13260 


CAACAGGCGT 


AGAAATGTAT 


CCTAATAAGC 


CAATCTATTA 


CTATGCGACC 


AATCGCCGAG 


13320 


TGACTGAAAA 


CTTTGTAGAT 


AAATCTGGCG 


CAAAAATCAC 


GCCACCTCAA GGATTTACGC 


13380 


AAGG6AATCA 


AGTCACAGTC 


ACCAGTGATC 


CTTTTACTTA 


TACTGCTTCA 


AAAGCTTTAC 


13440 


CAAGCGTTTA 


TGCGGCTGGG 


GCAAAAACCT 


ATAAGTTTGT 


TGGTTGGTAT 


AAAGGAACCA 


13500 


CCAAGCCAAC 


GACCTTAAAA 


ACAACGGCTA 


CTCCTAATTA 


TTTGGTGGAT 


TTTGATGATA 


13560 


ATGACGATAT 


GACGGCG6TC 


TATGAAGAAG 


AAACGCCGAC 


TGCTGCTTTA 


ACTTTGACCA 


13620 


GCACCAACCG 


TGTCGTGAAT 


AATGACGATA 


GTGTAGATTG 


GGTGGCAACA 


CTGAAAAATA 


13680 


CTAGCCTGGC 


ACCACTTAAA 


ACGCT6ACAG 


TGAAACCAGC 


GACTACTTGG 


CCTGCAGGAA 


13740 


TTGGTACGCC 


GACGTCACTT 


TCTGTCCAAT 


TGGACGGACA 


AGCACCGAAA ATTTATCCAG 


13800 


TAACGGCGAC 


TACTTGGTCT 


GAAGGAATCA 


GCTTaACAGG 


TCTTGAAATA 


CCGGCAGGTC 


13860 


AAACAGCCAA 


CGTGACGCTT 


GTTGGCACCA 


AAATTTCAGG 


AACATCAGAC 


CAACGCTTGA 


13920 


CGGCAACCCT 


TGATGTTACA 


GGAAATTTTG 


CGACAGTGAG 


CGCCGCGGAT 


GCGGTACGTC 


13980 


TGACAGATAC 


AACCCAAGGA 


ACAATTACAC 


CAACAGAAGG 


CTTTATCAGC 


GTGCCAACCT 


14040 


TTGATTTTGG 


AAAAATGAAT 


ATTGCTAGTA 


AAACGCAACA 


ATCAGGCTTG 


AAAAAAGCGG 


14100 


CCGATTACTA 


CGAAAATGGT 


ACACGCAATC 


CTTACTTGCG 


AATTAAGAAA AATCAACCGA 


14160 


ATTGGCAGAT 


GACAGC6CAA 


CTATCTCAAC 


CGAAAGCAAC 


CACAGATAGT 


CTGCCAACAG 


14220 


CGACTCGTTT 


ATTGTTAGGT 


CCAGCGAATG 


TTTCAAGTTT 


TACAAATTAT 


AACGAAGCAA 


14280 


CCGAACAAAT 


CAAAGCAGTT 


G6TAAGACTA 


GTTCTTTGAG 


CTTAACGGCG 


AATAATGTGG 


14340 


CAACCAGTGT 


TGTCGCAAAC 


CAACAGTTCA 


CAGGTAGTGA 


CGTCTACCAA 


CTGGATTTTA 


14400 


CGTTTGAGAA 


CATTAAATTA 


GAAGTACCAG 


CCAACCAAGG 


AACCAAAGGA 


CAACAGTACA 


14460 


ATGCTGCGGT 


CACTTGGAAT 


TTGGTGACTG 


GTCCCTAAAA 


TAAAAATGCC 


CGAGCCTTGA 


14520 


AATAGGAGGC 


TCAGGCATTT 


TTTATTGATG 


GTCTTTTTAG 


GGTTGTTGTG 


CTATAATGAT 


14580 


AAAG7\AAAGA 


GAGATATGCT 


TCCAACATAT 


CTCTCTGATG 


TAGAGCCGTT 


TAAGACGGTG 


14640 


ACCAATTTTA 


TTATTTAAAA 


ATAACCGTGC 


TTGGTCAAAG 


TAGACGGTTA 


TTTTTTCTTG 


14700 


TCATTTTTAA 


GCAATTTCAC 


AATCAGCGCA 


ATCAAAGCAA 


TGGTAAACAT 


ACCAAACCCA 


14760 


AGAATTGTCT 


GAATTGTTTC 


ATATGCTGAC 


AAAAGGCCTC 


TCCTTTCTGT 


GAATTTCGGA 


14820 



